bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[IR] Remove terminatepad	David Majnemer	2015-12-14	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	It turns out that terminatepad gives little benefit over a cleanuppad which calls the termination function. This is not sufficient to implement fully generic filters but MSVC doesn't support them which makes terminatepad a little over-designed. Depends on D15478. Differential Revision: http://reviews.llvm.org/D15479 llvm-svn: 255522
*	[IR] Reformulate LLVM's EH funclet IR	David Majnemer	2015-12-12	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While we have successfully implemented a funclet-oriented EH scheme on top of LLVM IR, our scheme has some notable deficiencies: - catchendpad and cleanupendpad are necessary in the current design but they are difficult to explain to others, even to seasoned LLVM experts. - catchendpad and cleanupendpad are optimization barriers. They cannot be split and force all potentially throwing call-sites to be invokes. This has a noticable effect on the quality of our code generation. - catchpad, while similar in some aspects to invoke, is fairly awkward. It is unsplittable, starts a funclet, and has control flow to other funclets. - The nesting relationship between funclets is currently a property of control flow edges. Because of this, we are forced to carefully analyze the flow graph to see if there might potentially exist illegal nesting among funclets. While we have logic to clone funclets when they are illegally nested, it would be nicer if we had a representation which forbade them upfront. Let's clean this up a bit by doing the following: - Instead, make catchpad more like cleanuppad and landingpad: no control flow, just a bunch of simple operands; catchpad would be splittable. - Introduce catchswitch, a control flow instruction designed to model the constraints of funclet oriented EH. - Make funclet scoping explicit by having funclet instructions consume the token produced by the funclet which contains them. - Remove catchendpad and cleanupendpad. Their presence can be inferred implicitly using coloring information. N.B. The state numbering code for the CLR has been updated but the veracity of it's output cannot be spoken for. An expert should take a look to make sure the results are reasonable. Reviewers: rnk, JosephTremoulet, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D15139 llvm-svn: 255422
*	[ValueTracking] Remove untested / unreachable code, NFC	Sanjoy Das	2015-11-11	1	-18/+5
\| \| \| \| \| \| \| \|	Right now isTruePredicate is only ever called with Pred == ICMP_SLE or ICMP_ULE, and the ICMP_SLT and ICMP_ULT cases are dead. This change removes the untested dead code so that the function is not misleading. llvm-svn: 252676
*	[ValueTracking] Teach isImpliedCondition a new bitwise trick	Sanjoy Das	2015-11-10	1	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change teaches isImpliedCondition to prove things like (A \| 15) < L ==> (A \| 14) < L if the low 4 bits of A are known to be zero. Depends on D14391 Reviewers: majnemer, reames, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14392 llvm-svn: 252673
*	[ValueTracking] Use m_APInt instead of m_ConstantInt, NFC	Sanjoy Das	2015-11-10	1	-7/+8
\| \| \| \| \| \| \| \|	This change would add functionality if isImpliedCondition worked on vector types; but since it bail out on vector predicates this change is an NFC. llvm-svn: 252672
*	[ValueTracking] Recognize that and(x, add (x, -1)) clears the low bit	Philip Reames	2015-11-10	1	-0/+16
\| \| \| \| \| \| \| \| \| \|	This is a cleaned up version of a patch by John Regehr with permission. Originally found via the souper tool. If we add an odd number to x, then bitwise-and the result with x, we know that the low bit of the result must be zero. Either it was zero in x originally, or the add cleared it in the temporary value. As a result, one of the two values anded together must have the bit cleared. Differential Revision: http://reviews.llvm.org/D14315 llvm-svn: 252629
*	[ValueTracking] Add parameters to isImpliedCondition; NFC	Sanjoy Das	2015-11-06	1	-8/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change makes the `isImpliedCondition` interface similar to the rest of the functions in ValueTracking (in that it takes a DataLayout, AssumptionCache etc.). This is an NFC, intended to make a later diff less noisy. Depends on D14369 Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14391 llvm-svn: 252333
*	[ValueTracking] De-pessimize isImpliedCondition around unsigned compares	Sanjoy Das	2015-11-06	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently `isImpliedCondition` will optimize "I +_nuw C < L ==> I < L" only if C is positive. This is an unnecessary restriction -- the implication holds even if `C` is negative. Reviewers: reames, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14369 llvm-svn: 252332
*	[ValueTracking] Add a framework for encoding implication rules	Sanjoy Das	2015-11-06	1	-21/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change adds a framework for adding more smarts to `isImpliedCondition` around inequalities. Informally, `isImpliedCondition` will now try to prove "A < B ==> C < D" by proving "C <= A && B <= D", since then it follows "C <= A < B <= D". While this change is in principle NFC, I could not think of a way to not handle cases like "i +_nsw 1 < L ==> i < L +_nsw 1" (that ValueTracking did not handle before) while keeping the change understandable. I've added tests for these cases. Reviewers: reames, majnemer, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14368 llvm-svn: 252331
*	[ValueTracking] Expose `implies` via ValueTracking, NFC	Sanjoy Das	2015-10-28	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This will allow a later patch to `JumpThreading` use this functionality. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13971 llvm-svn: 251488
*	[ValueTracking] Use !range metadata more aggressively in KnownBits	Sanjoy Das	2015-10-28	1	-9/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach `computeKnownBitsFromRangeMetadata` to use `!range` metadata more aggressively. Reviewers: majnemer, nlewycky, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14100 llvm-svn: 251487
*	[ValueTracking] Don't special case wrapped ConstantRanges; NFCI	Sanjoy Das	2015-10-27	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use `getUnsignedMax` directly instead of special casing a wrapped ConstantRange. The previous code would have been "buggy" (and this would have been a semantic change) if LLVM allowed !range metadata to denote full ranges. E.g. in %val = load i1, i1* %ptr, !range !{i1 1, i1 1} ;; == full set ValueTracking would conclude that the high bit (IOW the only bit) in %val was zero. Since !range metadata does not allow empty or full ranges, this change is just a minor stylistic improvement. llvm-svn: 251380
*	[ValueTracking] Extend r251146 to catch a fairly common case	James Molloy	2015-10-26	1	-2/+22
\| \| \| \| \| \| \| \| \| \| \| \| \|	Even though we may not know the value of the shifter operand, it's possible we know the shifter operand is non-zero. This can allow us to infer more known bits - for example: %1 = load %p !range {1, 5} %2 = shl %q, %1 We don't know %1, but we do know that it is nonzero so %2[0] is known zero, and importantly %2 is known non-zero. Calling isKnownNonZero is nontrivially expensive so use an Optional to run it lazily and cache its result. llvm-svn: 251294
*	Use all_of to simplify control flow. NFC.	Benjamin Kramer	2015-10-24	1	-8/+2
\| \| \| \|	llvm-svn: 251202
*	Extract out getConstantRangeFromMetadata; NFC	Sanjoy Das	2015-10-24	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \|	The loop idiom creating a ConstantRange is repeated twice in the codebase, time to give it a name and a home. The loop is also repeated in `rangeMetadataExcludesValue`, but using `getConstantRangeFromMetadata` there would not be an NFC -- the range returned by `getConstantRangeFromMetadata` may contain a value that none of the subranges did. llvm-svn: 251180
*	Handle non-constant shifts in computeKnownBits, and use computeKnownBits for ↵	Hal Finkel	2015-10-23	1	-34/+111
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	constant folding in InstCombine/Simplify First, the motivation: LLVM currently does not realize that: ((2072 >> (L == 0)) >> 7) & 1 == 0 where L is some arbitrary value. Whether you right-shift 2072 by 7 or by 8, the lowest-order bit is always zero. There are obviously several ways to go about fixing this, but the generic solution pursued in this patch is to teach computeKnownBits something about shifts by a non-constant amount. Previously, we would give up completely on these. Instead, in cases where we know something about the low-order bits of the shift-amount operand, we can combine (and together) the associated restrictions for all shift amounts consistent with that knowledge. As a further generalization, I refactored all of the logic for all three kinds of shifts to have this capability. This works well in the above case, for example, because the dynamic shift amount can only be 0 or 1, and thus we can say a lot about the known bits of the result. This brings us to the second part of this change: Even when we know all of the bits of a value via computeKnownBits, nothing used to constant-fold the result. This introduces the necessary code into InstCombine and InstSimplify. I've added it into both because: 1. InstCombine won't automatically pick up the associated logic in InstSimplify (InstCombine uses InstSimplify, but not via the API that passes in the original instruction). 2. Putting the logic in InstCombine allows the resulting simplifications to become part of the iterative worklist 3. Putting the logic in InstSimplify allows the resulting simplifications to be used by everywhere else that calls SimplifyInstruction (inlining, unrolling, and many others). And this requires a small change to our definition of an ephemeral value so that we don't break the rest case from r246696 (where the icmp feeding the @llvm.assume, is also feeding a br). Under the old definition, the icmp would not be considered ephemeral (because it is used by the br), but this causes the assume to remove itself (in addition to simplifying the branch structure), and it seems more-useful to prevent that from happening. llvm-svn: 251146
*	[ValueTracking] Add a new predicate: isKnownNonEqual()	James Molloy	2015-10-22	1	-0/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	isKnownNonEqual(A, B) returns true if it can be determined that A != B. At the moment it only knows two facts, that a non-wrapping add of nonzero to a value cannot be that value: A + B != A [where B != 0, addition is nsw or nuw] and that contradictory known bits imply two values are not equal. This patch also hooks this up to InstSimplify; InstSimplify had a peephole for the first fact but not the second so this teaches InstSimplify a new trick too (alas no measured performance impact!) llvm-svn: 251012
*	Silencing a -Wtype-limits warning; an unsigned value will always be >= 0; NFC.	Aaron Ballman	2015-10-15	1	-1/+1
\| \| \| \|	llvm-svn: 250404
*	Tighten known bits for ctpop based on zero input bits	Philip Reames	2015-10-14	1	-2/+12
\| \| \| \| \| \| \| \| \| \|	This is a cleaned up patch from the one written by John Regehr based on the findings of the Souper superoptimizer. The basic idea here is that input bits that are known zero reduce the maximum count that the intrinsic could return. We know that the number of bits required to represent a particular count is at most log2(N)+1. Differential Revision: http://reviews.llvm.org/D13253 llvm-svn: 250338
*	[asan] Disabling speculative loads under asan. Patch by Mike Aizatsky	Kostya Serebryany	2015-10-14	1	-1/+5
\| \| \| \|	llvm-svn: 250259
*	Analysis: Remove implicit ilist iterator conversions	Duncan P. N. Exon Smith	2015-10-10	1	-9/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove implicit ilist iterator conversions from LLVMAnalysis. I came across something really scary in `llvm::isKnownNotFullPoison()` which relied on `Instruction::getNextNode()` being completely broken (not surprising, but scary nevertheless). This function is documented (and coded to) return `nullptr` when it gets to the sentinel, but with an `ilist_half_node` as a sentinel, the sentinel check looks into some other memory and we don't recognize we've hit the end. Rooting out these scary cases is the reason I'm removing the implicit conversions before doing anything else with `ilist`; I'm not at all surprised that clients rely on badness. I found another scary case -- this time, not relying on badness, just bad (but I guess getting lucky so far) -- in `ObjectSizeOffsetEvaluator::compute_()`. Here, we save out the insertion point, do some things, and then restore it. Previously, we let the iterator auto-convert to `Instruction`, and then set it back using the `Instruction` version: Instruction PrevInsertPoint = Builder.GetInsertPoint(); / Logic that may change insert point */ if (PrevInsertPoint) Builder.SetInsertPoint(PrevInsertPoint); The check for `PrevInsertPoint` doesn't protect correctly against bad accesses. If the insertion point has been set to the end of a basic block (i.e., `SetInsertPoint(SomeBB)`), then `GetInsertPoint()` returns an iterator pointing at the list sentinel. The version of `SetInsertPoint()` that's getting called will then call `PrevInsertPoint->getParent()`, which explodes horribly. The only reason this hasn't blown up is that it's fairly unlikely the builder is adding to the end of the block; usually, we're adding instructions somewhere before the terminator. llvm-svn: 249925
*	ValueTracking: use getAlignment in isAligned	Artur Pilipenko	2015-10-09	1	-14/+1
\| \| \| \| \| \| \| \|	Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13517 llvm-svn: 249841
*	[ValueTracking] teach computeKnownBits that a fabs() clears sign bits	Sanjay Patel	2015-10-08	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	This was requested in D13076: if we're going to canonicalize to fabs(), ValueTracking should know that fabs() clears sign bits. In this patch (as in D13076), we're not handling vectors yet even though computeKnownBits' fabs() case itself should be vector-ready via the splat in this patch. Fixing this will require follow-on patches to correct other logic that uses 'getScalarType'. Differential Revision: http://reviews.llvm.org/D13222 llvm-svn: 249701
*	Teach computeKnownBits to use new align attribute/metadata	Artur Pilipenko	2015-10-07	1	-3/+12
\| \| \| \| \| \| \| \|	Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13470 llvm-svn: 249557
*	Extend known bits to understand @llvm.bswap	Philip Reames	2015-10-06	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	This is a cleaned up patch from the one written by John Regehr based on the findings of the Souper superoptimizer. When writing tests, I was surprised to find that instsimplify apparently doesn't know how to collapse bit test sequences based purely on known bits. This required me to split my tests across both instsimplify and instcombine. Differential Revision: http://reviews.llvm.org/D13250 llvm-svn: 249453
*	Refactor computeKnownBits alignment handling code	Artur Pilipenko	2015-09-30	1	-53/+38
\| \| \| \| \| \| \| \|	Reviewed By: reames, hfinkel Differential Revision: http://reviews.llvm.org/D12958 llvm-svn: 248892
*	[ValueTracking] Lower dom-conditions-dom-blocks and dom-conditions-max-uses ↵	Igor Laevsky	2015-09-29	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	thresholds On some of our benchmarks this change shows about 50% compile time improvement without any noticeable performance difference. Differential Revision: http://reviews.llvm.org/D13248 llvm-svn: 248801
*	[ValueTracking] Teach isKnownNonZero about monotonically increasing PHIs	James Molloy	2015-09-29	1	-0/+20
\| \| \| \| \| \| \| \|	If a PHI starts at a non-negative constant, monotonically increases (only adds of a constant are supported at the moment) and that add does not wrap, then the PHI is known never to be zero. llvm-svn: 248796
*	Introduce !align metadata for load instruction	Artur Pilipenko	2015-09-28	1	-0/+5
\| \| \| \| \| \| \| \|	Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D12853 llvm-svn: 248721
*	more space; NFC	Sanjay Patel	2015-09-25	1	-0/+1
\| \| \| \|	llvm-svn: 248609
*	[ValueTracking] Teach isKnownNonZero a new trick	James Molloy	2015-09-24	1	-0/+17
\| \| \| \| \| \| \| \|	If the shifter operand is a constant, and all of the bits shifted out are known to be zero, then if X is known non-zero at least one non-zero bit must remain. llvm-svn: 248508
*	Fix for pr24866	Philip Reames	2015-09-21	1	-1/+8
\| \| \| \| \| \|	Turns out that not every basic block is guaranteed to have a node within the DominatorTree. This is really hard to trigger, but the test case from the PR managed to do so. There's active discussion continuing about what documentation and/or invariants needed cleaned up. llvm-svn: 248216
*	Support align attribute for return values	Artur Pilipenko	2015-09-18	1	-0/+2
\| \| \| \| \| \| \| \|	Reviewed By: reames Differential Revision: http://reviews.llvm.org/D12844 llvm-svn: 247984
*	fix typo; NFC	Sanjay Patel	2015-09-17	1	-1/+1
\| \| \| \|	llvm-svn: 247938
*	[InstCombineCalls] Use isKnownNonNullAt() to check nullness of passing ↵	Chen Li	2015-09-14	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	arguments at callsite Summary: This patch replaces isKnownNonNull() with isKnownNonNullAt() when checking nullness of passing arguments at callsite. In this way it can handle cases where the argument does not have nonnull attribute but has a dominating null check from the CFG. It also adds assertions in isKnownNonNull() and isKnownNonNullFromDominatingCondition() to make sure the value checked is pointer type (as defined in LLVM document). These assertions might trip failures in things which are not covered under llvm/test, but fixes should be pretty obvious. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12779 llvm-svn: 247587
*	[WinEH] Add cleanupendpad instruction	Joseph Tremoulet	2015-09-03	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add a `cleanupendpad` instruction, used to mark exceptional exits out of cleanups (for languages/targets that can abort a cleanup with another exception). The `cleanupendpad` instruction is similar to the `catchendpad` instruction in that it is an EH pad which is the target of unwind edges in the handler and which itself has an unwind edge to the next EH action. The `cleanupendpad` instruction, similar to `cleanupret` has a `cleanuppad` argument indicating which cleanup it exits. The unwind successors of a `cleanuppad`'s `cleanupendpad`s must agree with each other and with its `cleanupret`s. Update WinEHPrepare (and docs/tests) to accomodate `cleanupendpad`. Reviewers: rnk, andrew.w.kaylor, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12433 llvm-svn: 246751
*	[ValueTracking] Look through casts when both operands are casts.	James Molloy	2015-09-02	1	-5/+17
\| \| \| \| \| \| \| \| \| \| \|	We only looked through casts when one operand was a constant. We can also look through casts when both operands are non-constant, but both are in fact the same cast type. For example: %1 = icmp ult i8 %a, %b %2 = zext i8 %a to i32 %3 = zext i8 %b to i32 %4 = select i1 %1, i32 %2, i32 %3 llvm-svn: 246678
*	Revert r246232 and r246304.	David Majnemer	2015-08-28	1	-4/+39
\| \| \| \| \| \| \| \| \|	This reverts isSafeToSpeculativelyExecute's use of ReadNone until we split ReadNone into two pieces: one attribute which reasons about how the function reasons about memory and another attribute which determines how it may be speculated, CSE'd, trap, etc. llvm-svn: 246331
*	[CodeGen] isInTailCallPosition didn't consider readnone tailcalls	David Majnemer	2015-08-28	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	A readnone tailcall may still have a chain of computation which follows it that would invalidate a tailcall lowering. Don't skip the analysis in such cases. This fixes PR24613. llvm-svn: 246304
*	[ValueTracking] readnone CallInsts are fair game for speculation	David Majnemer	2015-08-27	1	-39/+3
\| \| \| \| \| \| \| \| \| \|	Any call which is side effect free is trivially OK to speculate. We already had similar logic in EarlyCSE and GVN but we were missing it from isSafeToSpeculativelyExecute. This fixes PR24601. llvm-svn: 246232
*	isKnownNonNull needs to consider globals in non-zero address spaces.	Pete Cooper	2015-08-27	1	-2/+5
\| \| \| \| \| \| \| \| \|	Globals in address spaces other than one may have 0 as a valid address, so we should not assume that they can be null. Reviewed by Philip Reames. llvm-svn: 246137
*	[ValueTracking] computeOverflowForSignedAdd and isKnownNonNegative	Jingyue Wu	2015-08-20	1	-0/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Refactor, NFC Extracts computeOverflowForSignedAdd and isKnownNonNegative from NaryReassociate to ValueTracking in case others need it. Reviewers: reames Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D11313 llvm-svn: 245591
*	Take alignment into account in isSafeToSpeculativelyExecute and ↵	Artur Pilipenko	2015-08-17	1	-35/+79
\| \| \| \| \| \| \| \| \| \|	isSafeToLoadUnconditionally. Reviewed By: hfinkel, sanjoy, MatzeB Differential Revision: http://reviews.llvm.org/D9791 llvm-svn: 245223
*	[ValueTracking] Tweak a comment slightly	James Molloy	2015-08-12	1	-2/+2
\| \| \| \| \| \|	Hal asked for this change in D11146, but I missed it when I committed originally. llvm-svn: 244754
*	Add support for floating-point minnum and maxnum	James Molloy	2015-08-11	1	-33/+130
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The select pattern recognition in ValueTracking (as used by InstCombine and SelectionDAGBuilder) only knew about integer patterns. This teaches it about minimum and maximum operations. matchSelectPattern() has been extended to return a struct containing the existing Flavor and a new enum defining the pattern's behavior when given one NaN operand. C minnum() is defined to return the non-NaN operand in this case, but the idiomatic C "a < b ? a : b" would return the NaN operand. ARM and AArch64 at least have different instructions for these different cases. llvm-svn: 244580
*	Fix some comment typos.	Benjamin Kramer	2015-08-08	1	-3/+3
\| \| \| \|	llvm-svn: 244402
*	[Reassociation] Fix miscompile for va_arg arguments.	Quentin Colombet	2015-08-06	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	iisUnmovableInstruction() had a list of instructions hardcoded which are considered unmovable. The list lacked (at least) an entry for the va_arg and cmpxchg instructions. Fix this by introducing a new Instruction::mayBeMemoryDependent() instead of maintaining another instruction list. Patch by Matthias Braun <matze@braunis.de>. Differential Revision: http://reviews.llvm.org/D11577 rdar://problem/22118647 llvm-svn: 244244
*	New EH representation for MSVC compatibility	David Majnemer	2015-07-31	1	-1/+7
\| \| \| \| \| \| \| \| \| \|	This introduces new instructions neccessary to implement MSVC-compatible exception handling support. Most of the middle-end and none of the back-end haven't been audited or updated to take them into account. Differential Revision: http://reviews.llvm.org/D11097 llvm-svn: 243766
*	[SCEV] Apply NSW and NUW flags via poison value analysis	Jingyue Wu	2015-07-28	1	-0/+161
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Make Scalar Evolution able to propagate NSW and NUW flags from instructions to SCEVs in some cases. This is based on reasoning about when poison from instructions with these flags would trigger undefined behavior. This gives a 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX. There does not seem to be clear agreement about when poison should be considered to propagate through instructions. In this analysis, poison propagates only in cases where that should be uncontroversial. This change makes LSR able to create induction variables for expressions like &ptr[i + offset] for loops like this: for (int i = 0; i < limit; ++i) { sum += ptr[i + offset]; } Here ptr is a 64 bit pointer and offset is a 32 bit integer. For NVPTX, LSR currently creates an induction variable for i + offset instead, which is not as fast. Improving this situation is what brings the 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX. There are more details in this discussion on llvmdev. June: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-June/thread.html#87234 July: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-July/thread.html#87392 Patch by Bjarke Roune Reviewers: eliben, atrick, sanjoy Subscribers: majnemer, hfinkel, jingyue, meheff, llvm-commits Differential Revision: http://reviews.llvm.org/D11212 llvm-svn: 243460
*	IR: Do not consider available_externally linkage to be linker-weak.	Peter Collingbourne	2015-07-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	From the linker's perspective, an available_externally global is equivalent to an external declaration (per isDeclarationForLinker()), so it is incorrect to consider it to be a weak definition. Also clean up some logic in the dead argument elimination pass and clarify its comments to better explain how its behavior depends on linkage, introduce GlobalValue::isStrongDefinitionForLinker() and start using it throughout the optimizers and backend. Differential Revision: http://reviews.llvm.org/D10941 llvm-svn: 241413