|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| | 
| 
| 
| | llvm-svn: 188932 | 
| | 
| 
| 
| | llvm-svn: 188844 | 
| | 
| 
| 
| | llvm-svn: 188831 | 
| | 
| 
| 
| 
| 
| 
| 
| | Also fix it calculating the wrong value. The struct index
is not a ConstantInt, so it was being interpreted as an array
index.
llvm-svn: 188713 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | This fixes SCEVExpander so that it does not create multiple distinct induction
variables for duplicate PHI entries. Specifically, given some code like this:
do.body6:                                         ; preds = %do.body6, %do.body6, %if.then5
  %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ]
...
Note that it is legal to have multiple entries for a basic block so long as the
associated value is the same. So the above input is okay, but expanding an
AddRec in this loop could produce code like this:
do.body6:                                         ; preds = %do.body6, %do.body6, %if.then5
  %indvar = phi i64 [ %indvar.next, %do.body6 ], [ %indvar.next1, %do.body6 ], [ 0, %if.then5 ]
  %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ]
...
  %indvar.next = add i64 %indvar, 1
  %indvar.next1 = add i64 %indvar, 1
And this is not legal because there are two PHI entries for %do.body6 each with
a distinct value.
Unfortunately, I don't have an in-tree test case.
llvm-svn: 188614 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | to find loops if the From and To instructions were in the same block.
Refactor the code a little now that we need to fill to start the CFG-walking
algorithm with more than one starting basic block sometimes.
Special thanks to Andrew Trick for catching an error in my understanding of
natural loops in code review.
llvm-svn: 188236 | 
| | 
| 
| 
| 
| 
| 
| | e.g. Use Ty->getPointerElementType()
instead of cast<PointerType>(Ty)->getElementType()
llvm-svn: 188223 | 
| | 
| 
| 
| | llvm-svn: 188219 | 
| | 
| 
| 
| | llvm-svn: 188140 | 
| | 
| 
| 
| 
| 
| 
| | Inlining between functions with different values of sanitize_* attributes
leads to over- or under-sanitizing, which is always bad.
llvm-svn: 187967 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | All libm floating-point rounding functions, except for round(), had their own
ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm
adding ISD::FROUND so that round() can be custom lowered as well.
For the most part, this is straightforward. I've added an intrinsic
and a matching ISD node just like those for nearbyint() and friends. The
SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed
fround).
This will be used by the PowerPC backend in a follow-up commit.
llvm-svn: 187926 | 
| | 
| 
| 
| | llvm-svn: 187806 | 
| | 
| 
| 
| 
| 
| | Remove assertion that the verifier should catch.
llvm-svn: 187692 | 
| | 
| 
| 
| | llvm-svn: 187635 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | This fix is very lightweight. The same fix already existed for AddRec
but was missing for NAry expressions.
This is obviously an improvement and I'm unsure how to test compile
time problems.
Patch by Xiaoyi Guo!
llvm-svn: 187475 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | instructions
Call into ComputeMaskedBits to figure out which bits are set on both add
operands and determine if the value is a power-of-two-or-zero or not.
llvm-svn: 187445 | 
| | 
| 
| 
| | llvm-svn: 187284 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | Adds unit tests for it too.
Split BasicBlockUtils into an analysis-half and a transforms-half, and put the
analysis bits into a new Analysis/CFG.{h,cpp}. Promote isPotentiallyReachable
into llvm::isPotentiallyReachable and move it into Analysis/CFG.
llvm-svn: 187283 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | conditions
Merge consecutive if-regions if they contain identical statements.
Both transformations reduce number of branches.  The transformation
is guarded by a target-hook, and is currently enabled only for +R600,
but the correctness has been tested on X86 target using a variety of
CPU benchmarks.
Patch by: Mei Ye
llvm-svn: 187278 | 
| | 
| 
| 
| 
| 
| | deallocation functions.
llvm-svn: 186798 | 
| | 
| 
| 
| | llvm-svn: 186774 | 
| | 
| 
| 
| | llvm-svn: 186758 | 
| | 
| 
| 
| 
| 
| 
| | the comment. No functionality change. This change broken out of
http://llvm-reviews.chandlerc.com/D996 .
llvm-svn: 186558 | 
| | 
| 
| 
| | llvm-svn: 186371 | 
| | 
| 
| 
| 
| 
| | size.
llvm-svn: 186274 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | The great thing about the SCEVAddRec No-Wrap flag (unlike nsw/nuw) is
that is can be preserved while normalizing (reassociating and
factoring).
The bad thing is that is can't be tranfered back to IR, which is one
of the reasons I don't like the concept of SCEVExpander.
Sorry, I can't think of a direct way to test this, which is why these
were FIXMEs for so long. I just think it's a good time to finally
clean it up.
llvm-svn: 186273 | 
| | 
| 
| 
| 
| 
| | Fixes PR16600.
llvm-svn: 186272 | 
| | 
| 
| 
| 
| 
| | Fixes PR16605.
llvm-svn: 186229 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Address calculation for gather/scather in vectorized code can incur a
significant cost making vectorization unbeneficial. Add infrastructure to add
cost.
Tests and cost model for targets will be in follow-up commits.
radar://14351991
llvm-svn: 186187 | 
| | 
| 
| 
| 
| 
| | Thank Nick for figuring out these problems.
llvm-svn: 186146 | 
| | 
| 
| 
| 
| 
| | size.
llvm-svn: 186098 | 
| | 
| 
| 
| 
| 
| | No functionality change.
llvm-svn: 186095 | 
| | 
| 
| 
| | llvm-svn: 186065 | 
| | 
| 
| 
| | llvm-svn: 185973 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | ScalarEvolution::getSignedRange uses ComputeNumSignBits from ValueTracking on
ashr instructions. ComputeNumSignBits can return zero, but this case was not
handled correctly by the code in getSignedRange which was calling:
  APInt::getSignedMinValue(BitWidth).ashr(NS - 1)
with NS = 0, resulting in an assertion failure in APInt::ashr.
Now, we just return the conservative result (as with NS == 1).
Another bug found by llvm-stress.
llvm-svn: 185955 | 
| | 
| 
| 
| 
| 
| 
| | (add nsw x, (and x, y)) isn't a power of two if x is zero, it's zero
(add nsw x, (xor x, y)) isn't a power of two if y has bits set that aren't set in x
llvm-svn: 185954 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | The symptom is seg-fault, and the root cause is that a SCEV contains a SCEVUnknown
which has null-pointer to a llvm::Value.
 This is how the problem take place:
 ===================================
  1). In the pristine input IR, there are two relevant instrutions Op1 and Op2, 
     Op1's corresponding SCEV (denoted as SCEV(op1)) is a SCEVUnknown, and
     SCEV(Op2) contains SCEV(Op1).  None of these instructions are dead.
     Op1 : V1 = ...
     ...
     Op2 : V2 = ... // directly or indirectly (data-flow) depends on Op1
    
  2) Optimizer (LSR in my case) generates an instruction holding the equivalent
     value of Op1, making Op1 dead. 
     Op1': V1' = ...
     Op1: V1 = ... ; now dead)
     Op2 : V2 = ... //Now deps on Op1', but the SCEV(Op2) still contains SCEV(Op1)
  3) Op1 is deleted, and call-back function is called to reset 
     SCEV(Op1) to indicate it is invalid. However, SCEV(Op2) is not 
     invalidated as well.
  4) Following pass get the cached, invalid SCEV(Op2), and try to manipulate it,
     and cause segfault. 
 The fix:
 ========
 It seems there is no clean yet inexpensive fix. I write to dev-list
soliciting good solution, unforunately no ack. So, I decide to fix this 
problem in a brute-force way:
  When ScalarEvolution::getSCEV is called, check if the cached SCEV 
contains a invalid SCEVUnknow, if yes, remove the cached SCEV, and
re-evaluate the SCEV from scratch.
  I compile buch of big *.c and *.cpp, fortunately, I don't see any increase
in compile time.
 Misc:
=====
 The reduced test-case has 2357 lines of code+other-stuff, too big to commit.
 rdar://14283433
llvm-svn: 185843 | 
| | 
| 
| 
| 
| 
| | pointer arguments.
llvm-svn: 185776 | 
| | 
| 
| 
| | llvm-svn: 185748 | 
| | 
| 
| 
| 
| 
| 
| | functions. Make the function attributes pass add it to known library functions
and when it can deduce it.
llvm-svn: 185735 | 
| | 
| 
| 
| 
| 
| | specifying the vector size.
llvm-svn: 185606 | 
| | 
| 
| 
| 
| 
| | specifying the vector size.
llvm-svn: 185540 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | X is a power of two
This allows us to simplify urem instructions involving the add+xor to
turn into simpler math.
llvm-svn: 185272 | 
| | 
| 
| 
| | llvm-svn: 185187 | 
| | 
| 
| 
| 
| 
| 
| 
| | The Builtin attribute is an attribute that can be placed on function call site that signal that even though a function is declared as being a builtin,
rdar://problem/13727199
llvm-svn: 185049 | 
| | 
| 
| 
| 
| 
| 
| | This is a band-aid to fix the most severe regressions we're seeing from basing
spill decisions on block frequencies, until we have a better solution.
llvm-svn: 184835 | 
| | 
| 
| 
| 
| 
| | functionality change.
llvm-svn: 183709 | 
| | 
| 
| 
| | llvm-svn: 183175 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Fixes rdar:14036816, PR16130.
There is an opportunity to compute precise trip counts for 'or'
expressions and multi-exit loops.
rdar:14038809: Optimize trip count computation for multi-exit loops.
To do this we need to record the fact that ExitLimit assumes NSW. When
it does not we can safely assume that the loop trip count is the
minimum ExitLimt across all subexpressions and loop exits.
llvm-svn: 183060 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | Account for the cost of scaling factor in Loop Strength Reduce when rating the
formulae. This uses a target hook.
The default implementation of the hook is: if the addressing mode is legal, the
scaling factor is free.
<rdar://problem/13806271>
llvm-svn: 183045 |