| Commit message (Collapse) | Author | Age | Files | Lines |
... | |
|
|
|
|
|
| |
Move ARM callee-saved lists into ARMCallingConv.td.
llvm-svn: 148357
|
|
|
|
|
|
| |
Add a trivial implementation of the getCallPreservedMask() hook.
llvm-svn: 148347
|
|
|
|
| |
llvm-svn: 148338
|
|
|
|
| |
llvm-svn: 148334
|
|
|
|
| |
llvm-svn: 148322
|
|
|
|
|
|
| |
displacement.
llvm-svn: 148321
|
|
|
|
| |
llvm-svn: 148312
|
|
|
|
| |
llvm-svn: 148301
|
|
|
|
|
|
|
|
| |
In CanXFormVExtractWithShuffleIntoLoad we assumed that EXTRACT_VECTOR_ELT can be later handled by the DAGCombiner.
However, in some cases on AVX, the EXTRACT_VECTOR_ELT is legalized to EXTRACT_SUBVECTOR + EXTRACT_VECTOR_ELT, which
currently is not handled by the DAGCombiner. In this patch I added a check that we only extract from the XMM part.
llvm-svn: 148298
|
|
|
|
| |
llvm-svn: 148295
|
|
|
|
|
|
| |
More short term hackery until we have a way to configure passes that work on LiveIntervals.
llvm-svn: 148289
|
|
|
|
|
|
| |
Probably could use better handling in DAG combine or getNode. Fixes PR11772.
llvm-svn: 148285
|
|
|
|
|
|
|
|
| |
account for all enumeration values explicitly.
(This time I believe I've checked all the -Wreturn-type warnings from GCC & added the couple of llvm_unreachables necessary to silence them. If I've missed any, I'll happily fix them as soon as I know about them)
llvm-svn: 148262
|
|
|
|
|
|
| |
No test case: output assembly will be identical.
llvm-svn: 148261
|
|
|
|
| |
llvm-svn: 148240
|
|
|
|
| |
llvm-svn: 148239
|
|
|
|
| |
llvm-svn: 148233
|
|
|
|
|
|
| |
unused variables).
llvm-svn: 148230
|
|
|
|
|
|
|
|
|
|
|
| |
We know that the blend instructions only use the MSB, so if the mask is
sign-extended then we can convert it into a SHL instruction. This is a
common pattern because the type-legalizer sign-extends the i1 type which
is used by the LLVM-IR for the condition.
Added a new optimization in SimplifyDemandedBits for SIGN_EXTEND_INREG -> SHL.
llvm-svn: 148225
|
|
|
|
|
|
| |
CodeGen.
llvm-svn: 148218
|
|
|
|
|
|
| |
f128mem instead of f256mem.
llvm-svn: 148196
|
|
|
|
|
|
| |
alignment on 256-bit AVX2 instructions.
llvm-svn: 148194
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
live across BBs before register allocation. This miscompiled 197.parser
when a cmp + b are optimized to a cbnz instruction even though the CPSR def
is live-in a successor.
cbnz r6, LBB89_12
...
LBB89_12:
ble LBB89_1
The fix consists of two parts. 1) Teach LiveVariables that some unallocatable
registers might be liveouts so don't mark their last use as kill if they are.
2) ARM constantpool island pass shouldn't form cbz / cbnz if the conditional
branch does not kill CPSR.
rdar://10676853
llvm-svn: 148168
|
|
|
|
| |
llvm-svn: 148167
|
|
|
|
|
|
|
|
|
|
| |
The QQ and QQQQ registers are not 'real', they are pseudo-registers used
to model some vld and vst instructions.
This makes the call clobber lists longer, but I intend to get rid of
those soon.
llvm-svn: 148151
|
|
|
|
| |
llvm-svn: 148134
|
|
|
|
| |
llvm-svn: 148131
|
|
|
|
|
|
| |
prevent a register copy. Similar to SHUFPS, but requires the mask to be converted.
llvm-svn: 148112
|
|
|
|
|
|
| |
vs SSE1.
llvm-svn: 148109
|
|
|
|
|
|
| |
ones if AVX2 is enabled. This gives the ExeDepsFix pass a chance to choose FP vs int as appropriate. Also use v8i32 as the type for getZeroVector if AVX2 is enabled. This is consistent with SSE2 using prefering v4i32.
llvm-svn: 148108
|
|
|
|
|
|
| |
v4i64 and v8i32.
llvm-svn: 148106
|
|
|
|
| |
llvm-svn: 148105
|
|
|
|
|
|
| |
PR11750.
llvm-svn: 148101
|
|
|
|
|
|
| |
horizontal add/sub if AVX2 is enabled. This caused an assert to fail for non 128/256-bit vectors when done before type legalizing. Fixes PR11749.
llvm-svn: 148096
|
|
|
|
| |
llvm-svn: 148077
|
|
|
|
|
|
|
| |
The registers are placed into the saved registers list in the reverse order,
which is why the original loop was written to loop backwards.
llvm-svn: 148064
|
|
|
|
|
|
|
| |
lc: X86ISelLowering.cpp:6480: llvm::SDValue llvm::X86TargetLowering::LowerVECTOR_SHUFFLE(llvm::SDValue, llvm::SelectionDAG&) const: Assertion `V1.getOpcode() != ISD::UNDEF&& "Op 1 of shuffle should not be undef"' failed.
Added a test.
llvm-svn: 148044
|
|
|
|
|
|
|
| |
This patch uses tcb_spare field in the tcb structure to store info.
Patch by Jyun-Yan You.
llvm-svn: 148041
|
|
|
|
|
|
|
| |
Uses the pvArbitrary slot of the TIB, which is reserved for applications. We
only support frames with a static size.
llvm-svn: 148040
|
|
|
|
|
|
| |
We are using one parser to parse att as well as intel style syntax.
llvm-svn: 148032
|
|
|
|
|
|
|
|
|
| |
Restore the (obviously wrong) behavior from before r147938 without relying on
undefined behavior. Add a fat FIXME note.
This should fix nightly tester failures.
llvm-svn: 148030
|
|
|
|
|
|
|
|
| |
is on the boundary of two 128-bit vectors.
The attached testcase was stuck in an endless loop.
llvm-svn: 148027
|
|
|
|
|
|
| |
more set bits set than 31 or 63.
llvm-svn: 148024
|
|
|
|
|
|
| |
In att style asm syntax memory operand size is derived from suffix attached with mnemonic. In intel style asm syntax it is part of memory operand hence predicate method check is required to select appropriate instruction.
llvm-svn: 148006
|
|
|
|
|
|
| |
This is a work in progress.
llvm-svn: 148002
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
same pattern. We already had this pattern is a few places, but others
tried to make a rough approximation of an actual DAG structure. As not
everywhere went to this trouble, nothing could rely on this being done.
In fact, I've checked all references to these node Ids, and the ones
that are using the topo-sort properties are actually satisfied with
a strict-weak-ordering. The requirement appears to be that Use >= Def.
I've added a big blurb of comments to this bit of the transform to
clarify why the order is so important for the next reader of the code.
I'm starting with this change as it is very small, and trivially
reverted if something breaks or the >= above really does need to be >.
If that proves the case, we can hide the problem by reverting this
patch, but the problem exists elsewhere as well, and so a more
comprehensive solution will be needed.
llvm-svn: 148001
|
|
|
|
| |
llvm-svn: 147966
|
|
|
|
|
|
|
|
| |
This uses TLS slot 90, which actually belongs to JavaScriptCore. We only support
frames with static size
Patch by Brian Anderson.
llvm-svn: 147960
|
|
|
|
|
|
| |
Patch by Brian Anderson.
llvm-svn: 147958
|
|
|
|
|
|
|
|
|
| |
hoped this would revive one of the llvm-gcc selfhost build bots, but it
didn't so it doesn't appear that my transform is the culprit.
If anyone else is seeing failures, please let me know!
llvm-svn: 147957
|