|  | Commit message | Author | Age | Files | Lines | 
|---|
| | 
| | llvm-svn: 56116 | 
| | 
| | circumstances we could end up remapping a dependee to the same instruction 
that we're trying to remove.  Handle this properly by just falling back to
a conservative solution.
llvm-svn: 54132 | 
| | 
| | unreachable blocks.
llvm-svn: 53032 | 
| | 
| | unreachable.
This fixes PR2503, though we should also fix other passes not to emit this kind of code.
llvm-svn: 52946 | 
| | 
| | entries.  This fixes PR2397.
llvm-svn: 51846 | 
| | 
| | llvm-svn: 51845 | 
| | 
| | instruction.  This fixes some Ada miscompiles reported in PR2324.
llvm-svn: 51069 | 
| | 
| | several things that were neither in an anonymous namespace nor static
but not intended to be global.
llvm-svn: 51017 | 
| | 
| | llvm-svn: 50696 | 
| | 
| | llvm-svn: 49842 | 
| | 
| | llvm-svn: 49504 | 
| | 
| | wrong order.
llvm-svn: 49499 | 
| | 
| | not the end.
llvm-svn: 48999 | 
| | 
| | llvm-svn: 48579 | 
| | 
| | llvm-svn: 48554 | 
| | 
| | bugs fixed.  This now passes PPC bootstrap.
llvm-svn: 47026 | 
| | 
| | 50 predecessors. Added command line option to play with this threshold.
llvm-svn: 46790 | 
| | 
| | llvm-svn: 46738 | 
| | 
| | dereferencing the end
of one of its internal maps.
llvm-svn: 46541 | 
| | 
| | llvm-svn: 45418 | 
| | 
| | some (disabled) debugging code
to make such problems easier to diagnose in the future, written by Duncan Sands.
llvm-svn: 44695 | 
| | 
| | into alias analysis.  This meant updating the API
which now has versions of the getModRefBehavior,
doesNotAccessMemory and onlyReadsMemory methods
which take a callsite parameter.  These should be
used unless the callsite is not known, since in
general they can do a better job than the versions
that take a function.  Also, users should no longer
call the version of getModRefBehavior that takes
both a function and a callsite.  To reduce the
chance of misuse it is now protected.
llvm-svn: 44487 | 
| | 
| | llvm-svn: 44324 | 
| | 
| | are redundant.
llvm-svn: 44323 | 
| | 
| | The meaning of getTypeSize was not clear - clarifying it is important
now that we have x86 long double and arbitrary precision integers.
The issue with long double is that it requires 80 bits, and this is
not a multiple of its alignment.  This gives a primitive type for
which getTypeSize differed from getABITypeSize.  For arbitrary precision
integers it is even worse: there is the minimum number of bits needed to
hold the type (e.g. 36 for an i36), the maximum number of bits that will
be overwritten when storing the type (40 bits for i36) and the ABI size
(i.e. the storage size rounded up to a multiple of the alignment; 64 bits
for i36).
This patch removes getTypeSize (not really - it is still there but
deprecated to allow for a gradual transition).  Instead there is:
(1) getTypeSizeInBits - a number of bits that suffices to hold all
values of the type.  For a primitive type, this is the minimum number
of bits.  For an i36 this is 36 bits.  For x86 long double it is 80.
This corresponds to gcc's TYPE_PRECISION.
(2) getTypeStoreSizeInBits - the maximum number of bits that is
written when storing the type (or read when reading it).  For an
i36 this is 40 bits, for an x86 long double it is 80 bits.  This
is the size alias analysis is interested in (getTypeStoreSize
returns the number of bytes).  There doesn't seem to be anything
corresponding to this in gcc.
(3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded
up to a multiple of the alignment.  For an i36 this is 64, for an
x86 long double this is 96 or 128 depending on the OS.  This is the
spacing between consecutive elements when you form an array out of
this type (getABITypeSize returns the number of bytes).  This is
TYPE_SIZE in gcc.
Since successive elements in a SequentialType (arrays, pointers
and vectors) need to be aligned, the spacing between them will be
given by getABITypeSize.  This means that the size of an array
is the length times the getABITypeSize.  It also means that GEP
computations need to use getABITypeSize when computing offsets.
Furthermore, if an alloca allocates several elements at once then
these too need to be aligned, so the size of the alloca has to be
the number of elements multiplied by getABITypeSize.  Logically
speaking this doesn't have to be the case when allocating just
one element, but it is simpler to also use getABITypeSize in this
case.  So allocas and mallocs should use getABITypeSize.  Finally,
since gcc's only notion of size is that given by getABITypeSize, if
you want to output assembler etc. the same as gcc then getABITypeSize
is the size you want.
Since a store will overwrite no more than getTypeStoreSize bytes,
and a read will read no more than that many bytes, this is the
notion of size appropriate for alias analysis calculations.
In this patch I have corrected all type size uses except some of
those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard
cases).  I will get around to auditing these too at some point,
but I could do with some help.
Finally, I made one change which I think wise but others might
consider pointless and suboptimal: in an unpacked struct the
amount of space allocated for a field is now given by the ABI
size rather than getTypeStoreSize.  I did this because every
other place that reserves memory for a type (eg: alloca) now
uses getABITypeSize, and I didn't want to make an exception
for unpacked structs, i.e. I did it to make things more uniform.
This only affects structs containing long doubles and arbitrary
precision integers.  If someone wants to pack these types more
tightly they can always use a packed struct.
llvm-svn: 43620 | 
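
The three notions of size described in the commit message above (llvm-svn 43620) can be illustrated with plain arithmetic. The following is a minimal sketch, not LLVM's implementation: `roundUp`, `TypeSizes`, `computeSizes` and `arraySizeInBytes` are hypothetical names introduced here, and the ABI alignments passed in are assumptions chosen to reproduce the figures quoted in the message (64 bits for i36; 32 or 128 bits for x86 long double, depending on the OS).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers for illustration only; this is not LLVM's API.

// Round `value` up to the next multiple of `multiple`.
inline std::uint64_t roundUp(std::uint64_t value, std::uint64_t multiple) {
    assert(multiple > 0 && "alignment must be positive");
    return (value + multiple - 1) / multiple * multiple;
}

struct TypeSizes {
    std::uint64_t sizeInBits;      // minimum bits that hold every value of the type
    std::uint64_t storeSizeInBits; // bits written by a store (or read by a load)
    std::uint64_t abiSizeInBits;   // spacing between consecutive array elements
};

// `bits` is the type's precision; `abiAlignBits` is its assumed ABI
// alignment in bits (expected to be a multiple of 8).
inline TypeSizes computeSizes(std::uint64_t bits, std::uint64_t abiAlignBits) {
    TypeSizes s;
    s.sizeInBits = bits;
    // A store touches whole bytes, so round up to a multiple of 8 bits.
    s.storeSizeInBits = roundUp(bits, 8);
    // The ABI size is the store size rounded up to the alignment.
    s.abiSizeInBits = roundUp(s.storeSizeInBits, abiAlignBits);
    return s;
}

// Array elements are spaced by the ABI size, so the array occupies
// length times the ABI size in bytes.
inline std::uint64_t arraySizeInBytes(std::uint64_t length, const TypeSizes &elem) {
    return length * (elem.abiSizeInBits / 8);
}
```

Under these assumptions, `computeSizes(36, 64)` gives 36, 40 and 64 bits for an i36, and `computeSizes(80, 32)` and `computeSizes(80, 128)` give ABI sizes of 96 and 128 bits for x86 long double, matching the numbers in the message. An array of ten i36 elements would then occupy `arraySizeInBytes(10, computeSizes(36, 64))`, i.e. 80 bytes, whereas alias analysis would only consider the 5 bytes (40 bits) that a single store touches.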
| | 
| | modest
speedup for GVN.
llvm-svn: 42185 | 
| | 
| | eventually
help non-local memdep caching.
llvm-svn: 42137 | 
| | 
| | llvm-svn: 41833 | 
| | 
| | on 401.bzip2.
llvm-svn: 41792 | 
| | 
| | time performance win in most cases.
llvm-svn: 41126 | 
| | 
| | llvm-svn: 40961 | 
| | 
| | llvm-svn: 40953 | 
| | 
| | llvm-svn: 40950 | 
| | 
| | llvm-svn: 40946 | 
| | 
| | on 403.gcc from ~15s to ~10s.
llvm-svn: 40884 | 
| | 
| | This brings GVN to parity with GCSE+LoadVN.
llvm-svn: 40882 | 
| | 
| | llvm-svn: 40746 | 
| | 
| | exposed.
llvm-svn: 40692 | 
| | 
| | no guarantee that an instruction returned by getDependency exists in
the maps.
llvm-svn: 40647 | 
| | 
| | use up the entire 32-bit address space.
llvm-svn: 40596 | 
| | 
| | llvm-svn: 40542 | 
| | 
| | almost the same things from LCSSA.
llvm-svn: 40540 | 
| | 
| | llvm-svn: 40495 | 
| | 
| | Note: This has not yet been thoroughly tested.  Use at your own risk.
llvm-svn: 40489 | 
| | 
| | NOTE: This has only been cursorily tested.  Expected improvements soon.
llvm-svn: 40476 | 
| | 
| | flag when determining what to do with dependencies.
llvm-svn: 40079 | 
| | 
| | dead stores on 400.perlbench.
llvm-svn: 39929 | 
| | 
| | llvm-svn: 39769 | 
| | 
| | llvm-svn: 38511 | 
| | 
| | llvm-svn: 38510 |