| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
| |
Patch by: Michel Dänzer
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 173053
|
|
|
|
|
|
|
|
|
| |
Patch by: Michel Dänzer
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 173052
|
|
|
|
|
|
|
|
|
| |
Patch by: Michel Dänzer
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 173051
|
|
|
|
| |
llvm-svn: 173040
|
|
|
|
|
|
|
|
|
|
|
|
| |
is free. The whole CodeMetrics API should probably be reworked more, but
this is enough to allow deleting the duplicate code there for computing
whether an instruction is free.
All of the passes using this have been updated to pull in TTI and hand
it to the CodeMetrics stuff. Further, a dead CodeMetrics API
(analyzeFunction) is nuked for lack of users.
llvm-svn: 173036
|
|
|
|
|
|
|
|
|
|
| |
analysis. How cute that it wasn't previously. ;]
Part of this confusion stems from the flattened header file tree. Thanks
to Benjamin for pointing out the goof on IRC, and we're considering
un-flattening the headers, so speak now if that would bug you.
llvm-svn: 173033
|
|
|
|
|
|
|
|
| |
old CodeMetrics system. TTI has the specific advantage of being
extensible and customizable by targets to reflect target-specific cost
metrics.
llvm-svn: 173032
|
|
|
|
|
|
|
|
|
|
|
| |
depend on and use other analyses (as long as they're either immutable
passes or CGSCC passes of course -- nothing in the pass manager has been
fixed here). Leverage this to thread TargetTransformInfo down through
the inline cost analysis.
No functionality changed here, this just threads things through.
llvm-svn: 173031
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
a dynamic analysis done on each call to the routine. However, now it can
use the standard pass infrastructure to reference other analyses,
instead of a silly setter method. This will become more interesting as
I teach it about more analysis passes.
This updates the two inliner passes to use the inline cost analysis.
Doing so highlights how utterly redundant these two passes are. Either
we should find a cheaper way to do always inlining, or we should merge
the two and just fiddle with the thresholds to get the desired behavior.
I'm leaning increasingly toward the latter as it would also remove the
Inliner sub-class split.
llvm-svn: 173030
|
|
|
|
|
|
| |
Formatting fixes brought to you by clang-format.
llvm-svn: 173029
|
|
|
|
|
|
| |
functionality changed.
llvm-svn: 173028
|
|
|
|
| |
llvm-svn: 173010
|
|
|
|
| |
llvm-svn: 173009
|
|
|
|
| |
llvm-svn: 173008
|
|
|
|
| |
llvm-svn: 173006
|
|
|
|
| |
llvm-svn: 173005
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
lowered cost.
Currently, this is a direct port of the logic implementing
isInstructionFree in CodeMetrics. The hope is that the interface can be
improved (f.ex. supporting un-formed instruction queries) and the
implementation abstracted so that as we have test cases and target
knowledge we can expose increasingly accurate heuristics to clients.
I'll start switching existing consumers over and kill off the routine in
CodeMetrics in subsequent commits.
llvm-svn: 172998
|
|
|
|
| |
llvm-svn: 172995
|
|
|
|
|
|
| |
this file despite it not matching coding standards.
llvm-svn: 172994
|
|
|
|
| |
llvm-svn: 172992
|
|
|
|
| |
llvm-svn: 172990
|
|
|
|
| |
llvm-svn: 172987
|
|
|
|
| |
llvm-svn: 172986
|
|
|
|
| |
llvm-svn: 172985
|
|
|
|
|
|
|
|
| |
It is not possible to distinguish 3r instructions from 2r / rus instructions
using only the fixed bits. Therefore if an instruction doesn't match the
2r / rus format try to decode it as a 3r instruction before returning Fail.
llvm-svn: 172984
|
|
|
|
| |
llvm-svn: 172971
|
|
|
|
| |
llvm-svn: 172969
|
|
|
|
|
|
|
|
|
| |
The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends.
This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical.
Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume
that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model.
llvm-svn: 172968
|
|
|
|
|
|
|
| |
We ignore the cpu frontend and focus on pipeline utilization. We do this because we
don't have a good way to estimate the loop body size at the IR level.
llvm-svn: 172964
|
|
|
|
|
|
| |
eagerly calling it.
llvm-svn: 172953
|
|
|
|
|
|
|
|
| |
new advance() APIs,
simplifying things and making a bunch of details more private to BitstreamCursor.
llvm-svn: 172947
|
|
|
|
| |
llvm-svn: 172941
|
|
|
|
|
|
|
|
| |
method to easy
the transition.
llvm-svn: 172940
|
|
|
|
| |
llvm-svn: 172936
|
|
|
|
|
|
| |
from the SVOp passed in.
llvm-svn: 172935
|
|
|
|
| |
llvm-svn: 172933
|
|
|
|
| |
llvm-svn: 172931
|
|
|
|
| |
llvm-svn: 172930
|
|
|
|
|
|
| |
functions in this file.
llvm-svn: 172927
|
|
|
|
|
|
|
| |
We weren't encoding boolean constants correctly due to modeling boolean as a
signed type & then sign extending an i1 up to a byte & getting 255.
llvm-svn: 172926
|
|
|
|
|
|
|
|
| |
through a BitstreamCursor that produce it: advance() and
advanceSkippingSubblocks(), representing the two most common ways clients
want to walk through bitcode.
llvm-svn: 172919
|
|
|
|
|
|
|
| |
has past the point of making sense. Lets tidy things up: first step, moving
a ton of big functions out of line.
llvm-svn: 172904
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This separates the check for "too few elements to run the vector loop" from the
"memory overlap" check, giving a lot nicer code and allowing to skip the memory
checks when we're not going to execute the vector code anyways. We still leave
the decision of whether to emit the memory checks as branches or setccs, but it
seems to be doing a good job. If ugly code pops up we may want to emit them as
separate blocks too. Small speedup on MultiSource/Benchmarks/MallocBench/espresso.
Most of this is legwork to allow multiple bypass blocks while updating PHIs,
dominators and loop info.
llvm-svn: 172902
|
|
|
|
| |
llvm-svn: 172894
|
|
|
|
|
|
| |
be for legal types. Keeps compiler from generating unneeded checks and handling for extended types.
llvm-svn: 172893
|
|
|
|
|
|
| |
includes.
llvm-svn: 172891
|
|
|
|
|
|
|
|
|
|
|
| |
but I cannot reproduce the problem and have scrubed my sources and
even tested with llvm-lit -v --vg.
Formatting fixes. Mostly long lines and
blank spaces at end of lines.
Contributer: Jack Carter
llvm-svn: 172882
|
|
|
|
| |
llvm-svn: 172877
|
|
|
|
|
|
| |
vinsertf128) is faster than using a single vmovups instruction.
llvm-svn: 172868
|
|
|
|
| |
llvm-svn: 172866
|