| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
| |
llvm-svn: 155686
|
| |
|
|
|
|
|
|
| |
instructions.
- However, it does support dmb, dsb, isb, mrs, and msr.
rdar://11331541
llvm-svn: 155685
|
| |
|
|
|
|
|
|
|
|
| |
The base address for the PC-relative load is Align(PC,4), so it's the
address of the word containing the 16-bit instruction, not the address
of the instruction itself. Ugh.
rdar://11314619
llvm-svn: 155659
|
| |
|
|
|
|
|
|
| |
the FeatureLeaForSP feature bit when llvm auto detects Intel Atom.
Patch by Andy Zhang
llvm-svn: 155655
|
| |
|
|
|
|
|
| |
On some cores it's a bad idea for performance to mix VFP and NEON instructions
and since these patterns are NEON anyway, the NEON load should be used.
llvm-svn: 155630
|
| |
|
|
| |
llvm-svn: 155626
|
| |
|
|
|
|
| |
corei7-avx, core-avx-i, and core-avx2 cpu names.
llvm-svn: 155618
|
| |
|
|
|
|
|
|
|
|
| |
the feature set of v7a. This comes about if the user specifies something like
-arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as
uxtab in this case.
rdar://11318438
llvm-svn: 155601
|
| |
|
|
|
|
| |
right-shifted by #32. These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation.
llvm-svn: 155565
|
| |
|
|
|
|
| |
only targets that want the function get it. This prevents other targets from getting an unused function warning.
llvm-svn: 155538
|
| |
|
|
|
|
| |
ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code.
llvm-svn: 155537
|
| |
|
|
| |
llvm-svn: 155522
|
| |
|
|
|
|
|
|
|
|
|
| |
When an instruction match is found, but the subtarget features it
requires are not available (missing floating point unit, or thumb vs arm
mode, for example), issue a diagnostic that identifies what the feature
mismatch is.
rdar://11257547
llvm-svn: 155499
|
| |
|
|
|
|
|
|
| |
r154362 was supposed to delete this bit, but obviously didn't.
rdar://11305594
llvm-svn: 155465
|
| |
|
|
|
|
|
| |
Remove the v2f64 patterns because it does not match any vbroadcast
instruction.
llvm-svn: 155461
|
| |
|
|
|
|
|
| |
immediate. We can't use it here because the shuffle code does not check that
the lower part of the word is identical to the upper part.
llvm-svn: 155440
|
| |
|
|
|
|
| |
its vector
llvm-svn: 155439
|
| |
|
|
|
|
|
|
|
| |
using the pattern (vbroadcast (i32load src)). In some cases, after we generate
this pattern new users are added to the load node, which prevent the selection
of the blend pattern. This commit provides fallback patterns which perform
in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1).
llvm-svn: 155437
|
| |
|
|
| |
llvm-svn: 155429
|
| |
|
|
| |
llvm-svn: 155428
|
| |
|
|
| |
llvm-svn: 155399
|
| |
|
|
|
|
| |
the integer type is i8 (commonly used in graphics).
llvm-svn: 155397
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
on X86 Atom. Some of our tests failed because the tail merging part of
the BranchFolding pass was creating new basic blocks which did not
contain live-in information. When the anti-dependency code in the Post-RA
scheduler ran, it would sometimes rename the register containing
the function return value because the fact that the return value was
live-in to the subsequent block had been lost. To fix this, it is necessary
to run the RegisterScavenging code in the BranchFolding pass.
This patch makes sure that the register scavenging code is invoked
in the X86 subtarget only when post-RA scheduling is being done.
Post RA scheduling in the X86 subtarget is only done for Atom.
This patch adds a new function to the TargetRegisterClass to control
whether or not live-ins should be preserved during branch folding.
This is necessary in order for the anti-dependency optimizations done
during the PostRASchedulerList pass to work properly when doing
Post-RA scheduling for the X86 in general and for the Intel Atom in particular.
The patch adds and invokes the new function trackLivenessAfterRegAlloc()
instead of using the existing requiresRegisterScavenging().
It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of
requiresRegisterScavenging(). It changes the all the targets that
implemented requiresRegisterScavenging() to also implement
trackLivenessAfterRegAlloc().
It adds an assertion in the Post RA scheduler to make sure that post RA
liveness information is available when it is needed.
It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order
to avoid running into the added assertion.
Finally, this patch restores the use of anti-dependency checking
(which was turned off temporarily for the 3.1 release) for
Intel Atom in the Post RA scheduler.
Patch by Andy Zhang!
Thanks to Jakob and Anton for their reviews.
llvm-svn: 155395
|
| |
|
|
| |
llvm-svn: 155393
|
| |
|
|
| |
llvm-svn: 155392
|
| |
|
|
| |
llvm-svn: 155387
|
| |
|
|
|
|
|
|
|
|
|
| |
test suite failures. The failures occur at each stage, and only get
worse, so I'm reverting all of them.
Please resubmit these patches, one at a time, after verifying that the
regression test suite passes. Never submit a patch without running the
regression test suite.
llvm-svn: 155372
|
| |
|
|
| |
llvm-svn: 155367
|
| |
|
|
| |
llvm-svn: 155366
|
| |
|
|
| |
llvm-svn: 155365
|
| |
|
|
|
|
| |
just the switch. Saves a little bit of binary size.
llvm-svn: 155339
|
| |
|
|
|
|
| |
128-bit versus 256-bit vectors. Be explicit about both sizes and use llvm_unreachable. Similar changes to getLegalSplat.
llvm-svn: 155337
|
| |
|
|
| |
llvm-svn: 155336
|
| |
|
|
|
|
| |
shuffle operand has a different type than the the shuffle result since it can never happen.
llvm-svn: 155333
|
| |
|
|
| |
llvm-svn: 155332
|
| |
|
|
| |
llvm-svn: 155331
|
| |
|
|
| |
llvm-svn: 155330
|
| |
|
|
|
|
| |
an ConstantNode SDValue. getConstant was almost always called just before only to have the functions take it apart and build a new ConstantSDNode.
llvm-svn: 155325
|
| |
|
|
| |
llvm-svn: 155321
|
| |
|
|
|
|
| |
getUNDEF instead of requerying. Use &Mask[0] instead of Mask.data().
llvm-svn: 155320
|
| |
|
|
| |
llvm-svn: 155319
|
| |
|
|
|
|
| |
128-bit vectors to use either CONCAT_VECTORS or a helper function. CONCAT_VECTORS will itself be lowered to the same pattern as before. The helper function is needed for concats of BUILD_VECTORs since getNode(CONCAT_VECTORS) will just return a large BUILD_VECTOR and we may be trying to lower large BUILD_VECTORS when this occurs.
llvm-svn: 155318
|
| |
|
|
|
|
| |
Found by valgrind.
llvm-svn: 155313
|
| |
|
|
| |
llvm-svn: 155309
|
| |
|
|
| |
llvm-svn: 155307
|
| |
|
|
|
|
| |
of a hardcoded number.
llvm-svn: 155294
|
| |
|
|
| |
llvm-svn: 155291
|
| |
|
|
|
|
|
|
| |
on MSVC.
Thanks to Andy Gibbs, to report the issue.
llvm-svn: 155287
|
| |
|
|
| |
llvm-svn: 155286
|
| |
|
|
| |
llvm-svn: 155281
|