| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
| |
llvm-svn: 60499
|
|
|
|
|
|
|
| |
an area where eventually it would be good to use target-dependent
information.
llvm-svn: 60498
|
|
|
|
| |
llvm-svn: 60497
|
|
|
|
|
|
| |
examining non-anti-dependence edges.
llvm-svn: 60496
|
|
|
|
| |
llvm-svn: 60495
|
|
|
|
| |
llvm-svn: 60494
|
|
|
|
|
|
|
| |
parts, and add target-independent code to add/preserve
MachineMemOperands.
llvm-svn: 60488
|
|
|
|
| |
llvm-svn: 60487
|
|
|
|
|
|
|
| |
the frame reference. This will help post-RA scheduling determine
that spills to distinct stack slots are independent.
llvm-svn: 60486
|
|
|
|
| |
llvm-svn: 60484
|
|
|
|
|
|
| |
a new node if the node was actually remapped.
llvm-svn: 60482
|
|
|
|
|
|
| |
Print a single parameter .file directive if we have an ELF target.
llvm-svn: 60480
|
|
|
|
| |
llvm-svn: 60479
|
|
|
|
| |
llvm-svn: 60478
|
|
|
|
|
|
| |
'dyn_cast'.
llvm-svn: 60477
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
1) have it fold "br undef", which does occur with
surprising frequency as jump threading iterates.
2) teach j-t to delete dead blocks. This removes the successor
edges, reducing the in-edges of other blocks, allowing
recursive simplification.
3) Fold things like:
br COND, BBX, BBY
BBX:
br COND, BBZ, BBW
which also happens because jump threading iterates.
llvm-svn: 60470
|
|
|
|
| |
llvm-svn: 60469
|
|
|
|
| |
llvm-svn: 60468
|
|
|
|
|
|
|
| |
unconditionally delete the block. All likely clients will
do the checking anyway.
llvm-svn: 60464
|
|
|
|
|
|
| |
DeleteBlockIfDead method.
llvm-svn: 60463
|
|
|
|
|
|
|
|
|
|
|
|
| |
foldMemoryOperand how to "fold" them, by converting them into constant-pool
loads. When they aren't folded, they use xorps/cmpeqd, but for example when
register pressure is high, they may now be folded as memory operands, which
reduces register pressure.
Also, mark V_SET0 isAsCheapAsAMove so that two-address-elimination will
remat it instead of copying zeros around (V_SETALLONES was already marked).
llvm-svn: 60461
|
|
|
|
|
|
|
|
|
| |
is set but mayLoad is not set. Fix all the problems this turned up.
Change code to not use isSimpleLoad instead of mayLoad unless it
really wants isSimpleLoad.
llvm-svn: 60459
|
|
|
|
| |
llvm-svn: 60458
|
|
|
|
|
|
| |
than just i32.
llvm-svn: 60455
|
|
|
|
|
|
|
|
|
| |
delegates to the regular x86-32 convention which handles byval, but only
after it handles a few cases, and it's necessary to handle byval before
handling those cases. This fixes PR3122 (and rdar://6400815), llvm-gcc
miscompiling LLVM.
llvm-svn: 60453
|
|
|
|
|
|
|
|
|
|
|
| |
1. ppcf128 select is expanded to f64 select's.
2. f64 select operand 0 is an i1 truncate, it's promoted to i32 zero_extend.
3. f64 select is updated. It's changed back to a "NewNode" and being re-analyzed.
4. f64 select operands are being processed. Operand 0 is a "NewNode". It's being expunged out of ReplacedValues map.
5. ExpungeNode tries to remap f64 select and notice it's a "NewNode" and assert.
Duncan, please take a look. Thanks.
llvm-svn: 60443
|
|
|
|
| |
llvm-svn: 60442
|
|
|
|
|
|
| |
consistent with the way it's generally done in other places.
llvm-svn: 60439
|
|
|
|
|
|
|
|
|
|
| |
- Incorporate Tilmann Scheller's ISD::TRUNCATE custom lowering patch
- Update SPU calling convention info, even if it's not used yet (but can be
at some point or another)
- Ensure that any-extended f32 loads are custom lowered, especially when
they're promoted for use in printf.
llvm-svn: 60438
|
|
|
|
| |
llvm-svn: 60434
|
|
|
|
|
|
| |
splitting.
llvm-svn: 60433
|
|
|
|
| |
llvm-svn: 60432
|
|
|
|
| |
llvm-svn: 60431
|
|
|
|
| |
llvm-svn: 60429
|
|
|
|
| |
llvm-svn: 60409
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
straight-forward implementation. This does not require any extra
alias analysis queries beyond what we already do for non-local loads.
Some programs really really like load PRE. For example, SPASS triggers
this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc.
The biggest limitation to the implementation is that it does not split
critical edges. This is a huge killer on many programs and should be
addressed after the initial patch is enabled by default.
The implementation of this should incidentally speed up rejection of
non-local loads because it avoids creating the repl densemap in cases
when it won't be used for fully redundant loads.
This is currently disabled by default.
Before I turn this on, I need to fix a couple of miscompilations in
the testsuite, look at compile time performance numbers, and look at
perf impact. This is pretty close to ready though.
llvm-svn: 60408
|
|
|
|
| |
llvm-svn: 60407
|
|
|
|
| |
llvm-svn: 60406
|
|
|
|
| |
llvm-svn: 60405
|
|
|
|
| |
llvm-svn: 60404
|
|
|
|
| |
llvm-svn: 60403
|
|
|
|
| |
llvm-svn: 60402
|
|
|
|
| |
llvm-svn: 60401
|
|
|
|
|
|
|
|
|
| |
constant. If X is a constant, then this is folded elsewhere.
- Added a note to Target/README.txt to indicate that we'd like to implement
this when we're able.
llvm-svn: 60399
|
|
|
|
| |
llvm-svn: 60398
|
|
|
|
|
|
|
|
| |
- No need to do a swap on a canonicalized pattern.
No functionality change.
llvm-svn: 60397
|
|
|
|
| |
llvm-svn: 60395
|
|
|
|
|
|
|
| |
a new value numbering set after splitting a critical edge. This increases
the number of instances of PRE on 403.gcc from ~60 to ~570.
llvm-svn: 60393
|
|
|
|
| |
llvm-svn: 60392
|
|
|
|
| |
llvm-svn: 60391
|