|  | Commit message | Author | Age | Files | Lines | 
|---|
| | 
| 
| 
| | llvm-svn: 78363 | 
| | 
| 
| 
| | llvm-svn: 77635 | 
| | 
| 
| 
| | llvm-svn: 77605 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | a Twine, e.g., for names).
 - I am a little ambivalent about this; we don't want the string conversion of
   utostr, but using overloaded '+' with mixed string and integer arguments is
   sketchy. On the other hand, this particular usage is something of an idiom.
llvm-svn: 77579 | 
| | 
| 
| 
| | llvm-svn: 76702 | 
| | 
| 
| 
| | llvm-svn: 74878 | 
| | 
| 
| 
| | llvm-svn: 74807 | 
| | 
| 
| 
| 
| 
| | separate back() and pop_back() calls.
llvm-svn: 71089 | 
| | 
| 
| 
| 
| 
| | incoming edges for a block with many predecessors.
llvm-svn: 69312 | 
| | 
| 
| 
| 
| 
| | debug intrinsics correctly.
llvm-svn: 66225 | 
| | 
| 
| 
| | llvm-svn: 59454 | 
| | 
| 
| 
| 
| 
| 
| 
| | promotion.
 - Eliminate uses after free and simplify tests.
Devang: Please check that this is still doing what you intended.
llvm-svn: 58887 | 
| | 
| 
| 
| | llvm-svn: 58830 | 
| | 
| 
| 
| | llvm-svn: 58826 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | LargeBlockInfo, we can now dramatically simplify their implementation
and speed them up at the same time.  Now the code has time proportional
to the number of uses of the alloca, not the size of the block.
This also eliminates code that tried to batch up different allocas which
are used in the same blocks, and eliminates the 'retry list' logic which
was baroque and is now unnecessary.  In addition to being a speedup for crazy
cases, this is also a nice cleanup:
PromoteMemoryToRegister.cpp |  270 +++++++++++++++-----------------------------
 1 file changed, 96 insertions(+), 174 deletions(-)
llvm-svn: 58229 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | a trivial dense map.  Use this in RewriteSingleStoreAlloca to
avoid aggressively rescanning blocks over and over again.  This
fixes PR2925, speeding up mem2reg on the testcase in that bug
from 4.56s to 0.02s in a debug build on my machine.
llvm-svn: 58227 | 
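The idea of avoiding repeated block rescans with a small map can be illustrated outside LLVM. The following is a minimal Python sketch, not the code from this commit: it assumes the map caches each instruction's position within its block, and the names `BlockOrderCache`, `index_of`, `forget`, and `store_precedes_load` are hypothetical. Blocks are modeled as lists of unique, hashable instruction objects.

```python
class BlockOrderCache:
    """Hypothetical stand-in for a per-block instruction-order cache."""

    def __init__(self):
        self._index = {}   # instruction -> position within its block
        self.scans = 0     # number of full block scans performed (for illustration)

    def index_of(self, block, inst):
        # Scan the block only when the instruction has not been numbered yet;
        # a single scan numbers every instruction in the block.
        if inst not in self._index:
            self.scans += 1
            for position, other in enumerate(block):
                self._index[other] = position
        return self._index[inst]

    def forget(self, inst):
        # Drop a cached entry when an instruction is deleted.
        self._index.pop(inst, None)


def store_precedes_load(cache, block, store, load):
    # Ordering query answered from the cache instead of walking the block.
    return cache.index_of(block, store) < cache.index_of(block, load)


block = ["store0", "load1", "load2"]
cache = BlockOrderCache()
first = store_precedes_load(cache, block, "store0", "load1")
second = store_precedes_load(cache, block, "store0", "load2")
```

After both queries the block has been scanned once, so the cost of further ordering queries depends on the number of queries rather than on the size of the block.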
| | 
| 
| 
| 
| 
| 
| 
| | Specifically, introduction of XXX::Create methods
for Users that have a potentially variable number of
Uses.
llvm-svn: 49277 | 
| | 
| 
| 
| 
| 
| | successors. This makes it support nounwind.
llvm-svn: 48320 | 
| | 
| 
| 
| 
| 
| 
| 
| | check more intelligent.  This speeds up mem2reg from 5.29s to 
0.79s on a synthetic testcase with tons of predecessors and
phi nodes.
llvm-svn: 46767 | 
| | 
| 
| 
| | llvm-svn: 45418 | 
| | 
| 
| 
| 
| 
| | Also cleaned up some comments in source files.
llvm-svn: 43674 | 
| | 
| 
| 
| 
| 
| 
| | Add a new DenseMapInfo::isEqual method to allow clients to redefine
the equality predicate used when probing the hash table.
llvm-svn: 42042 | 
| | 
| 
| 
| 
| 
| 
| 
| | setjmp/longjmp properly.
This fixes PR1520.
llvm-svn: 41461 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | In the old way, we computed and inserted phi nodes for the whole IDF of 
the definitions of the alloca, then computed which ones were dead and
removed them.
In the new method, we first compute the region where the value is live,
and use that information to only insert phi nodes that are live.  This
eliminates the need to compute liveness later, and stops the algorithm
from inserting a bunch of phis which it then later removes.
This speeds up the testcase in PR1432 from 2.00s to 0.15s (14x) in a
release build and 6.84s->0.50s (14x) in a debug build.
llvm-svn: 40825 | 
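The "new method" described in this message (compute where the value is live first, then insert only live phi nodes) can be sketched on a toy CFG. This is a hedged Python illustration under stated assumptions, not the code from the commit: the CFG is a dict of successor lists, every block is reachable, the entry block has no predecessors, `def_blocks` are blocks that store to the alloca, and `use_blocks` are blocks that load from it before any store in the same block. All function names are hypothetical.

```python
def dominators(succs, entry):
    # Simple iterative dominator sets; returns predecessor lists and immediate dominators.
    nodes = list(succs)
    preds = {n: [] for n in nodes}
    for n, targets in succs.items():
        for t in targets:
            preds[t].append(n)
    dom = {n: set(nodes) for n in nodes}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for n in nodes:
            if n == entry:
                continue
            new = {n} | set.intersection(*(dom[p] for p in preds[n]))
            if new != dom[n]:
                dom[n] = new
                changed = True
    # The closest strict dominator is the one with the largest dominator set.
    idom = {n: max(dom[n] - {n}, key=lambda d: len(dom[d]))
            for n in nodes if n != entry}
    return preds, idom


def dominance_frontiers(preds, idom):
    df = {n: set() for n in preds}
    for n, ps in preds.items():
        if len(ps) < 2:
            continue
        for p in ps:
            runner = p
            while runner != idom[n]:
                df[runner].add(n)
                runner = idom[runner]
    return df


def live_in_blocks(preds, def_blocks, use_blocks):
    # Walk backwards from the using blocks; a storing predecessor kills liveness.
    live, work = set(), list(use_blocks)
    while work:
        b = work.pop()
        if b in live:
            continue
        live.add(b)
        work.extend(p for p in preds[b] if p not in def_blocks)
    return live


def place_phis(succs, entry, def_blocks, use_blocks):
    preds, idom = dominators(succs, entry)
    df = dominance_frontiers(preds, idom)
    live = live_in_blocks(preds, def_blocks, use_blocks)
    phis, work = set(), list(def_blocks)
    while work:
        b = work.pop()
        for f in df[b]:
            # Only place a phi where the value is live on entry to the block.
            if f in live and f not in phis:
                phis.add(f)
                work.append(f)
    return phis


diamond = {"entry": ["a", "b"], "a": ["merge"], "b": ["merge"],
           "merge": ["exit"], "exit": []}
with_load = place_phis(diamond, "entry", {"a", "b"}, {"merge"})
without_load = place_phis(diamond, "entry", {"a", "b"}, set())
```

In the diamond example, a phi is placed in `merge` only when `merge` actually loads the value. With no load, nothing is inserted, whereas the old approach described above would insert a phi there and delete it later.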
| | 
| 
| 
| | llvm-svn: 40824 | 
| | 
| 
| 
| 
| 
| | measurable speedup.
llvm-svn: 40823 | 
| | 
| 
| 
| 
| 
| 
| | to the worklist, and handling the last one with a 'tail call'.  This speeds
up PR1432 from 2.0578s to 2.0012s (2.8%)
llvm-svn: 40822 | 
| | 
| 
| 
| 
| 
| | mem2reg from 2.0742->2.0522s on PR1432.
llvm-svn: 40821 | 
| | 
| 
| 
| | llvm-svn: 40820 | 
| | 
| 
| 
| | llvm-svn: 40819 | 
| | 
| 
| 
| 
| 
| 
| | faster than with the 'local to a block' fastpath.  This speeds
up PR1432 from 2.1232 to 2.0686s (2.6%)
llvm-svn: 40818 | 
| | 
| 
| 
| 
| 
| 
| | to increment NumLocalPromoted, and didn't actually delete the
dead alloca, leading to an extra iteration of mem2reg.
llvm-svn: 40817 | 
| | 
| 
| 
| | llvm-svn: 40816 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | stored value was a non-instruction value.  Doh.
This increases the number of single-store allocas from 8982 to 9026, and
speeds up mem2reg on the testcase in PR1432 from 2.17 to 2.13s.
llvm-svn: 40813 | 
| | 
| 
| 
| 
| 
| 
| 
| | and the alloca so they don't get reprocessed.
This speeds up PR1432 from 2.20s to 2.17s.
llvm-svn: 40812 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | 1. Check for revisiting a block before checking domination, which is faster.
  2. If the stored value isn't an instruction, we don't have to check for domination.
  3. If we have a value used in the same block more than once, make sure to remove the
     block from the UsingBlocks vector.  Not doing so forces us to go through the slow
     path for the alloca.
The combination of these improvements increases the number of allocas on the fastpath
from 8935 to 8982 on PR1432.  This speeds it up from 2.90s to 2.20s (31%)
llvm-svn: 40811 | 
| | 
| 
| 
| 
| 
| | testcase in PR1432 from 6.33s to 2.90s (2.22x)
llvm-svn: 40810 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | a using block from the list if we handle it.  Not doing this caused us
to not be able to promote (with the fast path) allocas which have uses (whoops).
This increases the # allocas hitting this fastpath from 4042 to 8935 on the
testcase in PR1432, speeding up mem2reg by 2.6x
llvm-svn: 40809 | 
| | 
| 
| 
| 
| 
| | method.
llvm-svn: 40806 | 
| | 
| 
| 
| | llvm-svn: 40805 | 
| | 
| 
| 
| | llvm-svn: 40804 | 
| | 
| 
| 
| 
| 
| | in PR1432 by 6%
llvm-svn: 40803 | 
| | 
| 
| 
| | llvm-svn: 40802 | 
| | 
| 
| 
| 
| 
| | This allows a faster immediate dominator walk.
llvm-svn: 37500 | 
| | 
| 
| 
| | llvm-svn: 36444 | 
| | 
| 
| 
| | llvm-svn: 36441 | 
| | 
| 
| 
| | llvm-svn: 36299 | 
| | 
| 
| 
| | llvm-svn: 36271 | 
| | 
| 
| 
| | llvm-svn: 35370 | 
| | 
| 
| 
| | llvm-svn: 35053 |