|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| ... |  | 
| | 
| 
| 
| 
| 
| | instead of making getDependencyFrom do it.
llvm-svn: 60647 | 
| | 
| 
| 
| | llvm-svn: 60644 | 
| | 
| 
| 
| 
| 
| | I do.
llvm-svn: 60643 | 
| | 
| 
| 
| 
| 
| 
| 
| | emphasize the scanning and make it more similar to 
getDependencyFrom
 
llvm-svn: 60642 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | clobber with the current implementation.  Instead of returning
a "precise clobber" just return a fuzzy one.  This doesn't 
matter to any clients anyway and should speed up analysis time
very very slightly.
llvm-svn: 60641 | 
| | 
| 
| 
| 
| 
| | original impl was correct and noone actually makes the query anyway.
llvm-svn: 60639 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | 1. Merge the 'None' result into 'Normal', making loads
   and stores return their dependencies on allocations as Normal.
2. Split the 'Normal' result into 'Clobber' and 'Def' to
   distinguish between the cases when memdep knows the value is
   produced from when we just know if may be changed.
3. Move some of the logic for determining whether readonly calls
   are CSEs into memdep instead of it being in GVN.  This still
   leaves verification that the arguments are hte same to GVN to
   let it know about value equivalences in different contexts.
4. Change memdep's call/call dependency analysis to use 
   getModRefInfo(CallSite,CallSite) instead of doing something 
   very weak.  This only really matters for things like DSA, but
   someday maybe we'll have some other decent context sensitive
   analyses :)
5. This reimplements the guts of memdep to handle the new results.
6. This simplifies GVN significantly:
   a) readonly call CSE is slightly simpler
   b) I eliminated the "getDependencyFrom" chaining for load 
      elimination and load CSE doesn't have to worry about 
      volatile (they are always clobbers) anymore.
   c) GVN no longer does any 'lastLoad' caching, leaving it to 
      memdep.
7. The logic in DSE is simplified a bit and sped up.  A potentially
   unsafe case was eliminated.
llvm-svn: 60607 | 
| | 
| 
| 
| 
| 
| | like binary operators.
llvm-svn: 60600 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | vector instead of a densemap.  This shrinks the memory usage of this thing
substantially (the high water mark) as well as making operations like
scanning it faster.  This speeds up memdep slightly, gvn goes from
3.9376 to 3.9118s on 403.gcc
This also splits out the statistics for the cached non-local case to
differentiate between the dirty and clean cached case.  Here's the stats
for 403.gcc:
  6153 memdep - Number of dirty cached non-local responses
169336 memdep - Number of fully cached non-local responses
162428 memdep - Number of uncached non-local responses
yay for caching :)
llvm-svn: 60313 | 
| | 
| 
| 
| 
| 
| | redundant with MemDepResult, and MemDepResult has a nicer interface.
llvm-svn: 60308 | 
| | 
| 
| 
| 
| 
| 
| | getAnalysis<>.  getAnalysis<> is apparently extremely expensive.
Doing this speeds up GVN on 403.gcc by 16%!
llvm-svn: 60304 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | ReverseLocalDeps when we update it.  This fixes a regression test
failure from my last commit.
Second, for each non-local cached information structure, keep a bit that
indicates whether it is dirty or not.  This saves us a scan over the whole
thing in the common case when it isn't dirty.
llvm-svn: 60274 | 
| | 
| 
| 
| | llvm-svn: 60272 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | instead of containing them by value.  This increases the density
(!) of NonLocalDeps as well as making the reallocation case 
faster.  This speeds up gvn on 403.gcc by 2% and makes room for
future improvements.
I'm not super thrilled with having to explicitly manage the new/delete
of the map, but it is necesary for the next change.
llvm-svn: 60271 | 
| | 
| 
| 
| | llvm-svn: 60268 | 
| | 
| 
| 
| 
| 
| 
| 
| | If we see that a load depends on the allocation of its memory with no
intervening stores, we now return a 'None' depedency instead of "Normal".
This tweaks GVN to do its optimization with the new result.
llvm-svn: 60267 | 
| | 
| 
| 
| 
| 
| 
| | method that returns its result as a DepResultTy instead of as a
MemDepResult.  This reduces conversion back and forth.
llvm-svn: 60266 | 
| | 
| 
| 
| 
| 
| | the file, no functionality change.
llvm-svn: 60265 | 
| | 
| 
| 
| 
| 
| | comments about what this class does.
llvm-svn: 60264 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | dependencies.  The basic situation was this: consider if we had:
  store1
  ...
  store2
  ...
  store3
Where memdep thinks that store3 depends on store2 and store2 depends 
on store1.  The problem happens when we delete store2: The code in 
question was updating dep info for store3 to be store1.  This is a
spiffy optimization, but is not safe at all, because aliasing isn't
transitive.  This bug isn't exposed today with DSE because DSE will only
zap store2 if it is identifical to store 3, and in this case, it is 
safe to update it to depend on store1.  However, memcpyopt is not so
fortunate, which is presumably why the "dropInstruction" code used to
exist.
Since this doesn't actually provide a speedup in practice, just rip the
code out.
llvm-svn: 60263 | 
| | 
| 
| 
| 
| 
| | Fix a subtle iterator invalidation bug I introduced in the last commit.
llvm-svn: 60258 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | an entry in the nonlocal deps map, don't reset entries
referencing that instruction to [dirty, null], instead, set
them to [dirty,next] where next is the instruction after the
deleted one.  Use this information in the non-local deps
code to avoid rescanning entire blocks.
This speeds up GVN slightly by avoiding pointless work.  On
403.gcc this makes GVN 1.5% faster. 
llvm-svn: 60256 | 
| | 
| 
| 
| 
| 
| 
| | a smallvector instead of a DenseMap.  This speeds up GVN by 5%
on 403.gcc.
llvm-svn: 60255 | 
| | 
| 
| 
| 
| 
| | no functionality/code change.
llvm-svn: 60254 | 
| | 
| 
| 
| 
| 
| 
| | formulation that is faster and doesn't require nonLazyHelper.
Much less code.
llvm-svn: 60253 | 
| | 
| 
| 
| | llvm-svn: 60242 | 
| | 
| 
| 
| | llvm-svn: 60241 | 
| | 
| 
| 
| 
| 
| 
| | Put a some code back to handle buggy behavior that GVN expects: it wants
loads to depend on each other, and accesses to depend on their allocations.
llvm-svn: 60240 | 
| | 
| 
| 
| 
| 
| 
| | Use getTypeStoreSize instead of ABITypeSize for in-memory size
in a couple places.
llvm-svn: 60238 | 
| | 
| 
| 
| 
| 
| 
| | former does caching, the later doesn't.  This dramatically simplifies
the logic in getDependency and getDependencyFrom.
llvm-svn: 60234 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Document the Dirty value more precisely, use it for the uninitialized
DepResultTy value.  Change reverse mappings to be from an instruction*
instead of DepResultTy, and stop tracking other forms.  This makes it more
clear that we only care about the instruction cases.
Eliminate a DepResultTy,bool pair by using Dirty in the local case as well,
shrinking the map and simplifying the code.
This speeds up GVN by ~3% on 403.gcc.
llvm-svn: 60232 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | query.  This makes it crystal clear what cases can escape from MemDep that
the clients have to handle.  This also gives the clients a nice simplified
interface to it that is easy to poke at.
This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType
private, yay.
llvm-svn: 60231 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | of a pointer/int pair instead of a manually bitmangled pointer.
This forces clients to think a little more about checking the 
appropriate pieces and will be useful for internal 
implementation improvements later.
I'm not particularly happy with this.  After going through this
I don't think that the clients of memdep should be exposed to
the internal type at all.  I'll fix this in a subsequent commit.
This has no functionality change.
llvm-svn: 60230 | 
| | 
| 
| 
| 
| 
| 
| | properly updates the reverse dependency map when it installs updated 
dependencies for instructions that depend on the removed instruction.
llvm-svn: 60222 | 
| | 
| 
| 
| 
| 
| | no functionality change.
llvm-svn: 60219 | 
| | 
| 
| 
| | llvm-svn: 60218 | 
| | 
| 
| 
| 
| 
| | This shows the root problem behind PR3141.
llvm-svn: 60216 | 
| | 
| 
| 
| 
| 
| 
| 
| | but it doesn't make any sense at all.
Also make the method const, private, and fit in 80 cols while we're at it.
llvm-svn: 60215 | 
| | 
| 
| 
| | llvm-svn: 60211 | 
| | 
| 
| 
| | llvm-svn: 56116 | 
| | 
| 
| 
| 
| 
| 
| 
| | circumstances we could end up remapping a dependee to the same instruction 
that we're trying to remove.  Handle this properly by just falling back to
a conservative solution.
llvm-svn: 54132 | 
| | 
| 
| 
| 
| 
| | unreachable blocks.
llvm-svn: 53032 | 
| | 
| 
| 
| 
| 
| 
| 
| | unreachable.
This fixes PR2503, though we should also fix other passes not to emit this kind of code.
llvm-svn: 52946 | 
| | 
| 
| 
| 
| 
| | entries.  This fixes PR2397.
llvm-svn: 51846 | 
| | 
| 
| 
| | llvm-svn: 51845 | 
| | 
| 
| 
| 
| 
| | instruction.  This fixes some Ada miscompiles reported in PR2324.
llvm-svn: 51069 | 
| | 
| 
| 
| 
| 
| 
| | several things that were neither in an anonymous namespace nor static
but not intended to be global.
llvm-svn: 51017 | 
| | 
| 
| 
| | llvm-svn: 50696 | 
| | 
| 
| 
| | llvm-svn: 49842 | 
| | 
| 
| 
| | llvm-svn: 49504 |