| Commit message | Author | Age | Files | Lines |
|
check more intelligent. This speeds up mem2reg from 5.29s to
0.79s on a synthetic testcase with tons of predecessors and
phi nodes.
llvm-svn: 46767
|
llvm-svn: 45418
|
Also cleaned up some comments in source files.
llvm-svn: 43674
|
Add a new DenseMapInfo::isEqual method to allow clients to redefine
the equality predicate used when probing the hash table.
llvm-svn: 42042
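To illustrate why a client-defined equality predicate is useful when probing, here is a minimal sketch of a traits-parameterized hash set. It is not LLVM's DenseMap: `TinySet`, `CaseInsensitiveInfo`, and the `lower` and `probe` helpers are hypothetical names chosen for this example, and the table is a toy with fixed capacity.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical traits type: supplies the empty-key sentinel, the hash,
// and the equality predicate used while probing.
struct CaseInsensitiveInfo {
  static std::string getEmptyKey() { return std::string(1, '\0'); }

  static char lower(char C) {
    return static_cast<char>(std::tolower(static_cast<unsigned char>(C)));
  }

  // Equal keys must hash equally, so the hash also ignores case.
  static std::size_t getHashValue(const std::string &S) {
    std::size_t H = 5381;
    for (char C : S)
      H = H * 33 + static_cast<unsigned char>(lower(C));
    return H;
  }

  static bool isEqual(const std::string &A, const std::string &B) {
    if (A.size() != B.size())
      return false;
    for (std::size_t I = 0; I != A.size(); ++I)
      if (lower(A[I]) != lower(B[I]))
        return false;
    return true;
  }
};

// Toy fixed-capacity set with linear probing (no growth, no erase).
// The key equal to InfoT::getEmptyKey() is reserved and must not be inserted.
template <typename KeyT, typename InfoT> class TinySet {
  std::vector<KeyT> Buckets;
  std::size_t NumEntries = 0;

  // Returns the bucket holding Key, or the empty bucket where it would go.
  std::size_t probe(const KeyT &Key) const {
    const KeyT Empty = InfoT::getEmptyKey();
    std::size_t I = InfoT::getHashValue(Key) % Buckets.size();
    while (!InfoT::isEqual(Buckets[I], Empty) &&
           !InfoT::isEqual(Buckets[I], Key))
      I = (I + 1) % Buckets.size();
    return I;
  }

public:
  explicit TinySet(std::size_t Capacity = 64)
      : Buckets(Capacity, InfoT::getEmptyKey()) {}

  // Returns true if Key was newly inserted.
  bool insert(const KeyT &Key) {
    assert(NumEntries + 1 < Buckets.size() && "toy table is full");
    std::size_t I = probe(Key);
    if (!InfoT::isEqual(Buckets[I], InfoT::getEmptyKey()))
      return false; // an equal key is already present
    Buckets[I] = Key;
    ++NumEntries;
    return true;
  }

  bool contains(const KeyT &Key) const {
    return !InfoT::isEqual(Buckets[probe(Key)], InfoT::getEmptyKey());
  }

  std::size_t size() const { return NumEntries; }
};
```

With these traits, inserting "Alloca" and then "ALLOCA" into a `TinySet<std::string, CaseInsensitiveInfo>` leaves a single entry, because probing compares keys through `isEqual` rather than `operator==`.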
|
setjmp/longjmp properly.
This fixes PR1520.
llvm-svn: 41461
|
In the old way, we computed and inserted phi nodes for the whole IDF of
the definitions of the alloca, then computed which ones were dead and
removed them.
In the new method, we first compute the region where the value is live,
and use that information to only insert phi nodes that are live. This
eliminates the need to compute liveness later, and stops the algorithm
from inserting a bunch of phis which it then later removes.
This speeds up the testcase in PR1432 from 2.00s to 0.15s (14x) in a
release build and 6.84s->0.50s (14x) in a debug build.
llvm-svn: 40825
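The liveness-first approach described above can be sketched roughly as follows. This is an illustration, not the code from the commit: the `CFG` struct, the precomputed dominance frontiers, and the function names are all assumptions made for the example. Live-in blocks are found by walking backwards from blocks that read the value, and a phi is placed in a dominance-frontier block only if the value is live on entry to it.

```cpp
#include <cassert>
#include <set>
#include <vector>

using BlockSet = std::set<int>;

// Hypothetical CFG summary: predecessors and a precomputed dominance
// frontier for each block, both indexed by block number.
struct CFG {
  std::vector<std::vector<int>> Preds;
  std::vector<std::vector<int>> DomFrontier;
};

// Blocks where the value is live on entry. ReadFirstBlocks are the blocks
// that load the value before storing to it (or without storing at all).
BlockSet computeLiveInBlocks(const CFG &G, const BlockSet &DefBlocks,
                             const BlockSet &ReadFirstBlocks) {
  BlockSet LiveIn;
  std::vector<int> Worklist(ReadFirstBlocks.begin(), ReadFirstBlocks.end());
  while (!Worklist.empty()) {
    int BB = Worklist.back();
    Worklist.pop_back();
    if (!LiveIn.insert(BB).second)
      continue; // already processed
    for (int P : G.Preds[BB])
      if (!DefBlocks.count(P)) // a store in P means this walk stops at P
        Worklist.push_back(P);
  }
  return LiveIn;
}

// Iterated dominance frontier of the definitions, restricted to live-in
// blocks, so no dead phi is ever created.
BlockSet placeLivePhis(const CFG &G, const BlockSet &DefBlocks,
                       const BlockSet &ReadFirstBlocks) {
  BlockSet LiveIn = computeLiveInBlocks(G, DefBlocks, ReadFirstBlocks);
  BlockSet Phis;
  std::vector<int> Worklist(DefBlocks.begin(), DefBlocks.end());
  while (!Worklist.empty()) {
    int BB = Worklist.back();
    Worklist.pop_back();
    for (int F : G.DomFrontier[BB]) {
      if (!LiveIn.count(F))
        continue; // a phi here would be dead
      if (Phis.insert(F).second)
        Worklist.push_back(F); // the new phi is itself a definition
    }
  }
  return Phis;
}

// Diamond 0 -> {1, 2} -> 3 -> 4, used as a small example.
CFG makeDiamond() {
  CFG G;
  G.Preds = {{}, {0}, {0}, {1, 2}, {3}};
  G.DomFrontier = {{}, {3}, {3}, {}, {}};
  return G;
}
```

On the diamond, with stores in blocks 1 and 2 and a load in block 4, a phi is placed only in block 3. With no loads at all, no phi is placed, whereas inserting over the whole IDF would create one in block 3 and remove it afterwards.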
|
llvm-svn: 40824
|
measurable speedup.
llvm-svn: 40823
|
to the worklist, and handling the last one with a 'tail call'. This speeds
up PR1432 from 2.0578s to 2.0012s (2.8%)
llvm-svn: 40822
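The beginning of this message is cut off, so the following is only a guess at the general shape of the technique: an iterative CFG walk that pushes all successors but the last onto a worklist and continues directly with the last one, which plays the role of the 'tail call'. The function name and the successor-list representation are assumptions for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Visits every block reachable from Entry once and returns the visit order.
// Succs[B] lists the successors of block B.
std::vector<int> visitOrder(const std::vector<std::vector<int>> &Succs,
                            int Entry) {
  std::vector<bool> Visited(Succs.size(), false);
  std::vector<int> Order;
  std::vector<int> Worklist{Entry};
  while (!Worklist.empty()) {
    int BB = Worklist.back();
    Worklist.pop_back();
    // Inner loop: keep going with the last successor instead of pushing it
    // and popping it again right away.
    while (!Visited[BB]) {
      Visited[BB] = true;
      Order.push_back(BB);
      const std::vector<int> &S = Succs[BB];
      if (S.empty())
        break;
      for (std::size_t I = 0; I + 1 < S.size(); ++I)
        Worklist.push_back(S[I]); // defer all but the last successor
      BB = S.back();              // the 'tail call'
    }
  }
  return Order;
}
```

The saving is one push and one pop per visited block that has successors, which is consistent with the small improvement reported.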
|
mem2reg from 2.0742->2.0522s on PR1432.
llvm-svn: 40821
|
llvm-svn: 40820
|
llvm-svn: 40819
|
faster than with the 'local to a block' fastpath. This speeds
up PR1432 from 2.1232s to 2.0686s (2.6%)
llvm-svn: 40818
|
to increment NumLocalPromoted, and didn't actually delete the
dead alloca, leading to an extra iteration of mem2reg.
llvm-svn: 40817
|
llvm-svn: 40816
|
stored value was a non-instruction value. Doh.
This increases the number of single-store allocas from 8982 to 9026, and
speeds up mem2reg on the testcase in PR1432 from 2.17 to 2.13s.
llvm-svn: 40813
|
and the alloca so they don't get reprocessed.
This speeds up PR1432 from 2.20s to 2.17s.
llvm-svn: 40812
|
1. Check for revisiting a block before checking domination, which is faster.
2. If the stored value isn't an instruction, we don't have to check for domination.
3. If we have a value used in the same block more than once, make sure to remove the
block from the UsingBlocks vector. Not doing so forces us to go through the slow
path for the alloca.
The combination of these improvements increases the number of allocas on the fastpath
from 8935 to 8982 on PR1432. This speeds it up from 2.90s to 2.20s (31%)
llvm-svn: 40811
|
testcase in PR1432 from 6.33s to 2.90s (2.22x)
llvm-svn: 40810
|
a using block from the list if we handle it. Not doing this caused us
to not be able to promote (with the fast path) allocas which have uses (whoops).
This increases the number of allocas hitting this fastpath from 4042 to 8935 on the
testcase in PR1432, speeding up mem2reg by 2.6x
llvm-svn: 40809
|
method.
llvm-svn: 40806
|
llvm-svn: 40805
|
llvm-svn: 40804
|
in PR1432 by 6%
llvm-svn: 40803
|
llvm-svn: 40802
|
This allows a faster immediate dominator walk.
llvm-svn: 37500
|
llvm-svn: 36444
|
llvm-svn: 36441
|
llvm-svn: 36299
|
llvm-svn: 36271
|
llvm-svn: 35370
|
llvm-svn: 35053
|
llvm-svn: 35052
|
std::map of std::vector's (ouch!). This speeds up mem2reg by 10% on 176.gcc.
llvm-svn: 33974
|
BBNumbers. Instead of using a bi-directional mapping, just use a single
densemap. This speeds up mem2reg on 176.gcc by 8%, from 1.3489 to
1.2485s.
llvm-svn: 33940
|
but paves the way to eliminate BBNumbers.
llvm-svn: 33938
|
bug is fixed.
llvm-svn: 33932
|
llvm-svn: 33930
|
llvm-svn: 33929
|
llvm-svn: 33928
|
llvm-svn: 33927
|
llvm-svn: 29911
|
dropped. This shrinks libllvmgcc.dylib another 67K
llvm-svn: 28975
|
nondeterminism being bad) could cause some trivial missed optimizations (dead
phi nodes being left around for later passes to clean up).
With this, llvm-gcc4 now bootstraps and correctly compares. I don't know
why I never tried to do it before... :)
llvm-svn: 27984
|
has a single def. In this case, look for uses that are dominated by the def
and attempt to rewrite them to directly use the stored value.
This speeds up mem2reg on these values and reduces the number of phi nodes
inserted. This should address PR665.
llvm-svn: 24411
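As a rough illustration of the single-def case, the sketch below decides which loads are dominated by the one store and could therefore use the stored value directly. It is not the code from the commit: the `Access` struct, the immediate-dominator array, and both function names are hypothetical, and the real pass applies further checks (for example on the stored value itself).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical position of a memory access: block number plus the
// instruction's index within that block.
struct Access {
  int Block;
  int Index;
};

// True if block A dominates block B, given each block's immediate dominator
// (IDom[entry] == -1). Walks up the dominator tree from B.
bool dominates(const std::vector<int> &IDom, int A, int B) {
  for (int N = B; N != -1; N = IDom[N])
    if (N == A)
      return true;
  return false;
}

// For each load, reports whether the single store dominates it. Loads in the
// store's own block qualify only if they come after the store.
std::vector<bool> loadsDominatedByStore(const std::vector<int> &IDom,
                                        Access Store,
                                        const std::vector<Access> &Loads) {
  std::vector<bool> Result;
  for (const Access &L : Loads) {
    if (L.Block == Store.Block)
      Result.push_back(L.Index > Store.Index);
    else
      Result.push_back(dominates(IDom, Store.Block, L.Block));
  }
  return Result;
}
```

Loads marked `true` need no phi node at all. Loads marked `false` are the ones that still have to go through the general algorithm.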
|
llvm-svn: 24410
|
llvm-svn: 22667
|
BasicBlock's removePredecessor routine. This requires shuffling around
the definition and implementation of hasConstantValue from Utils.h,cpp into
Instructions.h,cpp
llvm-svn: 22664
|
llvm-svn: 22523
|
The optimization for locally used allocas was not safe for allocas that
were read before they were written. This change disables that optimization
in that case.
llvm-svn: 22318
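The hazard can be shown with a small sketch of store-to-load forwarding inside one block. This is an assumed model, not the code from the commit: `MemOp` and `forwardStoresInBlock` are hypothetical names, and values are plain integers. Forwarding is refused when the first access is a load, since no earlier store in the block supplies a value for it.

```cpp
#include <cassert>
#include <vector>

// Hypothetical memory operation on one alloca, in program order within a
// single block. For stores, Value is the stored value; for loads, Value is
// filled in by forwarding.
struct MemOp {
  bool IsStore;
  int Value;
};

// Forwards each load to the most recent store in the block. Returns false and
// leaves Ops untouched if the alloca is read before it is written.
bool forwardStoresInBlock(std::vector<MemOp> &Ops) {
  if (!Ops.empty() && !Ops.front().IsStore)
    return false; // read before write: the fast path is not safe here

  int Current = 0; // always overwritten by the first store before any use
  for (MemOp &Op : Ops) {
    if (Op.IsStore)
      Current = Op.Value;
    else
      Op.Value = Current;
  }
  return true;
}
```

A caller that gets `false` back would fall through to the general promotion path for that alloca.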
|