|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | captured. This allows the tracker to look at the specific use, which may be
especially interesting for function calls.
Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does
not iterate until a fixpoint and does not guarantee that it produces the same
result regardless of iteration order. The new implementation builds up a graph
of how arguments are passed from function to function, and uses a bottom-up walk
on the argument-SCCs to assign nocapture. This gets us nocapture more often, and
does so rather efficiently and independent of iteration order.
llvm-svn: 147327 | 
| | 
| 
| 
| | llvm-svn: 145047 | 
| | 
| 
| 
| | llvm-svn: 145014 | 
| | 
| 
| 
| 
| 
| 
| 
| | Suggested in code review by Eli.
That code in InstCombine looks kinda suspicious.
llvm-svn: 145013 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | and stores capture) to permit the caller to see each capture point and decide
whether to continue looking.
Use this inside memdep to do an analysis that basicaa won't do. This lets us
solve another devirtualization case, fixing PR8908!
llvm-svn: 144580 | 
| | 
| 
| 
| 
| 
| 
| 
| | dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases.
Patch by Xiaoyi Guo.
llvm-svn: 141896 | 
| | 
| 
| 
| | llvm-svn: 137650 | 
| | 
| 
| 
| | llvm-svn: 135375 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | block.  This prevents (at least in some cases) O(N^2) runtime in passes like DSE.
The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it).
rdar://9471075
llvm-svn: 133111 | 
| | 
| 
| 
| 
| 
| | dependence for the given instruction exists in the given block".  This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions.
llvm-svn: 133031 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | redundant with partially-aliasing loads.
When computing what portion of a clobbering load value is needed,
it doesn't consider phi-translation which may have occurred
between the clobbing load and the redundant load.
llvm-svn: 132631 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | instead of the first instruction in the block.  This is a bit of a hack; "Clobber" isn't really the right marking in the first place.  memdep doesn't really have any way of properly expressing "unanalyzable" at the moment.  Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though.
In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated).
The included testcase tests both this commit and r132434.
Part two of rdar://9429882.  (r132434 was mislabeled.)
llvm-svn: 132442 | 
| | 
| 
| 
| 
| 
| 
| 
| | is is deemed unanalyzable (and we execute one of the "goto PredTranslationFailure" statements), make sure we don't put information about the predecessors of that block into the returned data structures; this can lead to, among other things, extraneous results (which will confuse passes using memdep).  Fixes an assert in GVN compiling ruby. Part of rdar://problem/9521954 .
Testcase coming up soon.
llvm-svn: 132434 | 
| | 
| 
| 
| | llvm-svn: 131437 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | wider load would allow elimination of subsequent loads, and when the wider
load is still a native integer type.  This eliminates a ton of loads on 
various benchmarks involving struct fields, though it is somewhat hobbled
by clang not being very aggressive about field alignment.
This is yet another step along the way towards resolving PR6627.
llvm-svn: 130390 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | an earlier load could be widened to encompass a later load.  For example,
if we see:
  X = load i8* P, align 4
  Y = load i8* (P+3), align 1
and we have a 32-bit native integer type, we can widen the former load
to i32 which then makes the second load redundant.  GVN can't actually
do anything with this load/load relation yet, so this isn't testable, but 
it is the next step to resolving PR6627, and a fairly general class of 
"merge neighboring loads" missed optimizations.
llvm-svn: 130250 | 
| | 
| 
| 
| | llvm-svn: 130248 | 
| | 
| 
| 
| 
| 
| | work-in-progress that is not progressing, and it has issues.
llvm-svn: 130247 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | return it as a clobber.  This allows GVN to do smart things.
Enhance GVN to be smart about the case when a small load is clobbered
by a larger overlapping load.  In this case, forward the value.  This
allows us to compile stuff like this:
int test(void *P) {
  int tmp = *(unsigned int*)P;
  return tmp+*((unsigned char*)P+1);
}
into:
_test:                                  ## @test
	movl	(%rdi), %ecx
	movzbl	%ch, %eax
	addl	%ecx, %eax
	ret
which has one load.  We already handled the case where the smaller
load was from a must-aliased base pointer.
llvm-svn: 130180 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | with BasicAA's DecomposeGEPExpression, which recently began
using a TargetData. This fixes PR8968, though the testcase
is awkward to reduce.
Also, update several off GetUnderlyingObject's users
which happen to have a TargetData handy to pass it in.
llvm-svn: 124134 | 
| | 
| 
| 
| 
| 
| 
| 
| | the cause of our gcc bootstrap miscompare."
It didn't.
llvm-svn: 123215 | 
| | 
| 
| 
| 
| 
| | gcc bootstrap miscompare.
llvm-svn: 123207 | 
| | 
| 
| 
| 
| 
| 
| | new gcc warning that complains on self-assignments and
self-initializations.
llvm-svn: 122458 | 
| | 
| 
| 
| 
| 
| 
| | function so that it can live in Analysis instead of
VMCore.
llvm-svn: 121885 | 
| | 
| 
| 
| | llvm-svn: 121723 | 
| | 
| 
| 
| | llvm-svn: 120381 | 
| | 
| 
| 
| 
| 
| 
| 
| | pointer (TD is passed to PHITransAddr).
I wonder why this didn't explode earlier.
llvm-svn: 119944 | 
| | 
| 
| 
| | llvm-svn: 119927 | 
| | 
| 
| 
| 
| 
| | and vaarg instructions.
llvm-svn: 118845 | 
| | 
| 
| 
| 
| 
| | these points.
llvm-svn: 118752 | 
| | 
| 
| 
| 
| 
| | it, so that it doesn't appear to be a known size.
llvm-svn: 118748 | 
| | 
| 
| 
| 
| 
| | the reverse map too. This fixes seflhost build errors.
llvm-svn: 118729 | 
| | 
| 
| 
| 
| 
| | for a given instruction into a helper function.
llvm-svn: 118723 | 
| | 
| 
| 
| 
| 
| | type is insufficient for, or incompatible with, the current query.
llvm-svn: 118721 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | references. For example, this allows gvn to eliminate the load in
this example:
  void foo(int n, int* p, int *q) {
    p[0] = 0;
    p[1] = 1;
    if (n) {
      *q = p[0];
    }
  }
llvm-svn: 118714 | 
| | 
| 
| 
| 
| 
| | from constant memory don't alias any stores.
llvm-svn: 117636 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | exposes an initializeMyPassFunction(), which
must be called in the pass's constructor.  This function uses static dependency declarations to recursively initialize
the pass's dependencies.
Clients that only create passes through the createFooPass() APIs will require no changes.  Clients that want to use the
CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h
before parsing commandline arguments.
I have tested this with all standard configurations of clang and llvm-gcc on Darwin.  It is possible that there are problems
with the static dependencies that will only be visible with non-standard options.  If you encounter any crash in pass
registration/creation, please send the testcase to me directly.
llvm-svn: 116820 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | perform initialization without static constructors AND without explicit initialization
by the client.  For the moment, passes are required to initialize both their
(potential) dependencies and any passes they preserve.  I hope to be able to relax
the latter requirement in the future.
llvm-svn: 116334 | 
| | 
| 
| 
| | llvm-svn: 115996 | 
| | 
| 
| 
| | llvm-svn: 114588 | 
| | 
| 
| 
| | llvm-svn: 113144 | 
| | 
| 
| 
| | llvm-svn: 113135 | 
| | 
| 
| 
| | llvm-svn: 110460 | 
| | 
| 
| 
| | llvm-svn: 110410 | 
| | 
| 
| 
| 
| 
| 
| 
| | address of the static
ID member as the sole unique type identifier.  Clean up APIs related to this change.
llvm-svn: 110396 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | response from getModRefInfo is not useful here. Instead, check for identical
calls only in the NoModRef case.
Reapply r110270, and strengthen it to compensate for the memdep changes.
When both calls are readonly, there is no dependence between them.
llvm-svn: 110382 | 
| | 
| 
| 
| 
| 
| | are unknown.
llvm-svn: 110090 | 
| | 
| 
| 
| 
| 
| 
| | add instead a CallSite(Value* V) constructor that is consistent with ImmutableCallSize
and use that one in client code
llvm-svn: 109553 | 
| | 
| 
| 
| 
| 
| | problem in CallSiteBase is fixed
llvm-svn: 109547 | 
| | 
| 
| 
| | llvm-svn: 109508 |