| Commit message (Collapse) | Author | Age | Files | Lines |
| ... | |
| |
|
|
| |
llvm-svn: 147758
|
| |
|
|
| |
llvm-svn: 147752
|
| |
|
|
| |
llvm-svn: 147748
|
| |
|
|
|
|
| |
Darwin doesn't do static, and ELF targets only support static.
llvm-svn: 147740
|
| |
|
|
| |
llvm-svn: 147739
|
| |
|
|
|
|
| |
is discouraged now.
llvm-svn: 147738
|
| |
|
|
| |
llvm-svn: 147734
|
| |
|
|
|
|
|
|
|
| |
This enables basic local CSE, giving us 20% smaller code for
consumer-typeset in -O0 builds.
<rdar://problem/10658692>
llvm-svn: 147720
|
| |
|
|
|
|
|
|
| |
file error checking. Use that to error on an unfinished cfi_startproc.
The error is not nice, but is already better than a segmentation fault.
llvm-svn: 147717
|
| |
|
|
|
|
|
| |
exposed with an upcoming change will would delete the copy to return register
because there is no use! It's amazing anything works.
llvm-svn: 147715
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
This eliminates a lot of constant pool entries for -O0 builds of code
with many global variable accesses.
This speeds up -O0 codegen of consumer-typeset by 2x because the
constant island pass no longer has to look at thousands of constant pool
entries.
<rdar://problem/10629774>
llvm-svn: 147712
|
| |
|
|
|
|
| |
Fixes rdar://10614894
llvm-svn: 147704
|
| |
|
|
|
|
| |
Experiments show this to be a small speedup for modern ARM cores.
llvm-svn: 147689
|
| |
|
|
| |
llvm-svn: 147685
|
| |
|
|
| |
llvm-svn: 147679
|
| |
|
|
| |
llvm-svn: 147676
|
| |
|
|
| |
llvm-svn: 147675
|
| |
|
|
| |
llvm-svn: 147602
|
| |
|
|
|
|
|
|
|
| |
instruction combining of sequences generated by ptestz/ptestc intrinsics to ptest+jcc pair for SSE and AVX.
Testing: passed 'make check' including LIT tests for all sequences being handled (both SSE and AVX)
Reviewers: Evan Cheng, David Blaikie, Bruno Lopes, Elena Demikhovsky, Chad Rosier, Anton Korobeynikov
llvm-svn: 147601
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
efficient.
This small bit of ASM code is sufficient to do what the old algorithm did:
movq %rax, %xmm0
punpckldq (c0), %xmm0 // c0: (uint4){ 0x43300000U, 0x45300000U, 0U, 0U }
subpd (c1), %xmm0 // c1: (double2){ 0x1.0p52, 0x1.0p52 * 0x1.0p32 }
#ifdef __SSE3__
haddpd %xmm0, %xmm0
#else
pshufd $0x4e, %xmm0, %xmm1
addpd %xmm1, %xmm0
#endif
It's arguably faster. One caveat, the 'haddpd' instruction isn't very fast on
all processors.
<rdar://problem/7719814>
llvm-svn: 147593
|
| |
|
|
|
|
|
|
|
|
|
|
| |
Now that canRealignStack() understands frozen reserved registers, it is
safe to use it for aligned spill instructions.
It will only return true if the registers reserved at the beginning of
register allocation allow for dynamic stack realignment.
<rdar://problem/10625436>
llvm-svn: 147579
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Once register allocation has started the reserved registers are frozen.
Fix the ARM canRealignStack() hook to respect the frozen register state.
Now the hook returns false if register allocation was started with frame
pointer elimination enabled.
It also returns false if register allocation started without a reserved
base pointer, and stack realignment would require a base pointer. This
bug was breaking oggenc on armv6.
No test case, an upcoming patch will use this functionality to realign
the stack for spill slots when possible.
llvm-svn: 147578
|
| |
|
|
| |
llvm-svn: 147553
|
| |
|
|
| |
llvm-svn: 147541
|
| |
|
|
| |
llvm-svn: 147519
|
| |
|
|
|
|
|
|
|
| |
versions derive from them.
- JALR64 is not needed since N64 does not emit jal.
- Add template parameter to BranchLink that sets the rt field.
- Fix the set of temporary registers for O32 and N64.
llvm-svn: 147518
|
| |
|
|
|
|
| |
is Mips64.
llvm-svn: 147516
|
| |
|
|
| |
llvm-svn: 147513
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
(x > y) ? x : y
=>
(x >= y) ? x : y
So for something like
(x - y) > 0 : (x - y) ? 0
It will be
(x - y) >= 0 : (x - y) ? 0
This makes is possible to test sign-bit and eliminate a comparison against
zero. e.g.
subl %esi, %edi
testl %edi, %edi
movl $0, %eax
cmovgl %edi, %eax
=>
xorl %eax, %eax
subl %esi, $edi
cmovsl %eax, %edi
rdar://10633221
llvm-svn: 147512
|
| |
|
|
| |
llvm-svn: 147495
|
| |
|
|
|
|
|
|
|
| |
This patch caused a miscompilation of oggenc because a frame pointer was
suddenly needed halfway through register allocation.
<rdar://problem/10625436>
llvm-svn: 147487
|
| |
|
|
| |
llvm-svn: 147485
|
| |
|
|
|
|
|
| |
then a vxorps + vinsertf128 pair if the original vector came from a load.
rdar://10594409
llvm-svn: 147481
|
| |
|
|
|
|
|
|
| |
If anybody has strong feelings about 'default: assert(0 && "blah")' vs
'default: llvm_unreachable("blah")', feel free to regularize the instances of
each in this file.
llvm-svn: 147459
|
| |
|
|
| |
llvm-svn: 147453
|
| |
|
|
|
|
| |
converting the indexing loops to unsigned to be consistent across functions.
llvm-svn: 147430
|
| |
|
|
|
|
| |
Also make it return false if there's not even a load at all. This makes the code better match the code in DAGCombiner that it tries to match. These two changes prevent some cases where vector_shuffles were making it to instruction selection and causing the older shuffle selection code to be triggered. Also needed to fix a bad pattern that this change exposed. This is the first step towards getting rid of the old shuffle selection support. No test cases yet because there's no way to tell whether a shuffle was handled in the legalize stage or at instruction selection.
llvm-svn: 147428
|
| |
|
|
|
|
| |
instructions only look at the highest bit.
llvm-svn: 147426
|
| |
|
|
| |
llvm-svn: 147411
|
| |
|
|
|
|
| |
is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers.
llvm-svn: 147409
|
| |
|
|
| |
llvm-svn: 147404
|
| |
|
|
| |
llvm-svn: 147394
|
| |
|
|
| |
llvm-svn: 147393
|
| |
|
|
|
|
| |
a load from being selected.
llvm-svn: 147392
|
| |
|
|
| |
llvm-svn: 147383
|
| |
|
|
|
|
|
|
|
|
|
| |
Implement encoder methods getJumpTargetOpValue and getBranchTargetOpValue
for jmptarget and brtarget Mips tablegen operand types in the code emitter
for old-style JIT. Rename the pc relative relocation for branches - new
name is Mips::reloc_mips_pc16.
Patch by Sasa Stankovic
llvm-svn: 147382
|
| |
|
|
|
|
| |
removing from Bulldozer CPU types since it would enable AVX code generation implicitly. Also make SSE4A imply SSE3. Without some level of SSE implied, XMM registers wouldn't be legal.
llvm-svn: 147369
|
| |
|
|
| |
llvm-svn: 147368
|
| |
|
|
| |
llvm-svn: 147367
|
| |
|
|
|
|
| |
of having the W bit set for XOP instructons. Removes ORing W-bits in the encoder and will similarly simplify the disassembler implementation.
llvm-svn: 147366
|