bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[SLP] Fixed test for extra arguments in horizontal reductions.	Alexey Bataev	2017-01-26	1	-3/+5
\| \| \| \|	llvm-svn: 293153
*	[PM] Simplify the new PM interface to the loop unroller and expose two	Chandler Carruth	2017-01-26	10	-8/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	factory functions for the two modes the loop unroller is actually used in in-tree: simplified full-unrolling and the entire thing including partial unrolling. I've also wired these up to nice names so you can express both of these being in a pipeline easily. This is a precursor to actually enabling these parts of the O2 pipeline. Differential Revision: https://reviews.llvm.org/D28897 llvm-svn: 293136
*	[LoopUnroll] Properly update loopinfo for runtime unrolling by 2	Michael Kuperstein	2017-01-26	2	-2/+44
\| \| \| \| \| \| \| \| \| \| \|	Even when we don't create a remainder loop (that is, when we unroll by 2), we may duplicate nested loops into the remainder. This is complicated by the fact the remainder may itself be either inserted into an outer loop, or at the top level. In the latter case, we may need to create new top-level loops. Differential Revision: https://reviews.llvm.org/D29156 llvm-svn: 293124
*	[NewGVN] Skip uses in unreachable blocks.	Davide Italiano	2017-01-26	1	-0/+37
\| \| \| \| \| \| \| \|	Otherwise we ask for a domtree node that's not there, and we crash. Differential Revision: https://reviews.llvm.org/D29145 llvm-svn: 293122
*	LowerTypeTests: Ignore external globals with type metadata.	Peter Collingbourne	2017-01-26	1	-0/+14
\| \| \| \| \| \|	Thanks to Davide Italiano for finding the problem and providing a test case. llvm-svn: 293119
*	[ValueTracking] Implement SignBitMustBeZero correctly for sqrt.	Justin Lebar	2017-01-26	1	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously we assumed that the result of sqrt(x) always had 0 as its sign bit. But sqrt(-0) == -0. Reviewers: hfinkel, efriedma, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28928 llvm-svn: 293115
*	[SLP] Extra test for functionality with extra args.	Alexey Bataev	2017-01-25	1	-2/+4
\| \| \| \|	llvm-svn: 293076
*	[Guards] Introduce loop-predication pass	Artur Pilipenko	2017-01-25	3	-0/+796
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces guard based loop predication optimization. The new LoopPredication pass tries to convert loop variant range checks to loop invariant by widening checks across loop iterations. For example, it will convert for (i = 0; i < n; i++) { guard(i < len); ... } to for (i = 0; i < n; i++) { guard(n - 1 < len); ... } After this transformation the condition of the guard is loop invariant, so loop-unswitch can later unswitch the loop by this condition which basically predicates the loop by the widened condition: if (n - 1 < len) for (i = 0; i < n; i++) { ... } else deoptimize This patch relies on an NFC change to make ScalarEvolution::isMonotonicPredicate public (revision 293062). Reviewed By: sanjoy Differential Revision: https://reviews.llvm.org/D29034 llvm-svn: 293064
*	[InstCombine] Canonicalize guards for NOT OR condition	Artur Pilipenko	2017-01-25	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \|	This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: apilipenko Differential Revision: https://reviews.llvm.org/D29075 Patch by Maxim Kazantsev. llvm-svn: 293061
*	[InstCombine][SSE] Add support for PACKSS/PACKUS constant folding	Simon Pilgrim	2017-01-25	1	-38/+18
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28949 llvm-svn: 293060
*	[InstCombine] Canonicalize guards for AND condition	Artur Pilipenko	2017-01-25	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \|	This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: apilipenko Differential Revision: https://reviews.llvm.org/D29074 Patch by Maxim Kazantsev. llvm-svn: 293058
*	[InstCombine] Allow InstrCombine to remove one of adjacent guards if they ↵	Artur Pilipenko	2017-01-25	1	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	are equivalent This is a partial fix for Bug 31520 - [guards] canonicalize guards in instcombine Reviewed By: majnemer, apilipenko Differential Revision: https://reviews.llvm.org/D29071 Patch by Maxim Kazantsev. llvm-svn: 293056
*	[SLP] Improve horizontal vectorization for non-power-of-2 number of	Alexey Bataev	2017-01-25	1	-98/+72
\| \| \| \| \| \| \| \| \| \| \| \| \|	instructions. If number of instructions in horizontal reduction list is not power of 2 then only PowerOf2Floor(NumberOfInstructions) last elements are actually vectorized, other instructions remain scalar. Patch tries to vectorize the remaining elements either. Differential Revision: https://reviews.llvm.org/D28959 llvm-svn: 293042
*	Mark @llvm.powi.* as safe to speculatively execute.	whitequark	2017-01-25	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Floating point intrinsics in LLVM are generally not speculatively executed, since most of them are defined to behave the same as libm functions, which set errno. However, the @llvm.powi.* intrinsics do not correspond to any libm function, and lacks any defined error handling semantics in LangRef. It most certainly does not alter errno. llvm-svn: 293041
*	[X86] enable memory interleaving for X86\SLM arch.	Mohammed Agabaria	2017-01-25	1	-0/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28547 llvm-svn: 293040
*	[SimplifyCFG] Do not sink and merge inline-asm instructions.	Akira Hatanaka	2017-01-25	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conservatively disable sinking and merging inline-asm instructions as doing so can potentially create arguments that cannot satisfy the inline-asm constraints. For example, SimplifyCFG used to do the following transformation: (before) if.then: %0 = call i32 asm "rorl $2, $0", "=&r,0,n"(i32 %r6, i32 8) br label %if.end if.else: %1 = call i32 asm "rorl $2, $0", "=&r,0,n"(i32 %r6, i32 6) br label %if.end (after) %.sink = select i1 %tobool, i32 6, i32 8 %0 = call i32 asm "rorl $2, $0", "=&r,0,n"(i32 %r6, i32 %.sink) This would result in a crash in the backend since only immediate integer operands are permitted for constraint "n". rdar://problem/30110806 Differential Revision: https://reviews.llvm.org/D29111 llvm-svn: 293025
*	[InstCombine] Added regression test to narrow-swich.ll	Gerolf Hoflehner	2017-01-25	1	-0/+42
\| \| \| \|	llvm-svn: 293018
*	[PM] Teach LoopUnroll to update the LPM infrastructure as it unrolls	Chandler Carruth	2017-01-25	17	-1/+174
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	loops. We do this by reconstructing the newly added loops after the unroll completes to avoid threading pass manager details through all the mess of the unrolling infrastructure. I've enabled some extra assertions in the LPM to try and catch issues here and enabled a bunch of unroller tests to try and make sure this is sane. Currently, I'm manually running loop-simplify when needed. That should go away once it is folded into the LPM infrastructure. Differential Revision: https://reviews.llvm.org/D28848 llvm-svn: 293011
*	[coroutines] Spill the result of the invoke instruction correctly	Gor Nishanov	2017-01-25	1	-0/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When we decide that the result of the invoke instruction need to be spilled, we need to insert the spill into a block that is on the normal edge coming out of the invoke instruction. (Prior to this change the code would insert the spill immediately after the invoke instruction, which breaks the IR, since invoke is a terminator instruction). In the following example, we will split the edge going into %cont and insert the spill there. ``` %r = invoke double @print(double 0.0) to label %cont unwind label %pad cont: %0 = call i8 @llvm.coro.suspend(token none, i1 false) switch i8 %0, label %suspend [i8 0, label %resume i8 1, label %cleanup] resume: call double @print(double %r) ``` Reviewers: majnemer Reviewed By: majnemer Subscribers: mehdi_amini, llvm-commits, EricWF Differential Revision: https://reviews.llvm.org/D29102 llvm-svn: 293006
*	Explicitly promote indirect calls before sample profile annotation.	Dehao Chen	2017-01-24	2	-0/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In iterative sample pgo where profile is collected from PGOed binary, we may see indirect call targets promoted and inlined in the profile. Before profile annotation, we need to make this happen in order to annotate correctly on IR. This patch explicitly promotes these indirect calls and inlines them before profile annotation. Reviewers: xur, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29040 llvm-svn: 292979
*	Remove the load hoisting code of MLSM, it is completely subsumed by GVNHoist	Daniel Berlin	2017-01-24	2	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: GVNHoist performs all the optimizations that MLSM does to loads, in a more general way, and in a faster time bound (MLSM is N^3 in most cases, N^4 in a few edge cases). This disables the load portion. Note that the way ld_hoist_st_sink.ll is written makes one think that the loads should be moved to the while.preheader block, but 1. Neither MLSM nor GVNHoist do it (they both move them to identical places). 2. MLSM couldn't possibly do it anyway, as the while.preheader block is not the head of the diamond, while.body is. (GVNHoist could do it if it was legal). 3. At a glance, it's not legal anyway because the in-loop load conflict with the in-loop store, so the loads must stay in-loop. I am happy to update the test to use update_test_checks so that checking is tighter, just was going to do it as a followup. Note that i can find no particular benefit to the store portion on any real testcase/benchmark i have (even size-wise). If we really still want it, i am happy to commit to writing a targeted store sinker, just taking the code from the MemorySSA port of MergedLoadStoreMotion (which is N^2 worst case, and N most of the time). We can do what it does in a much better time bound. We also should be both hoisting and sinking stores, not just sinking them, anyway, since whether we should hoist or sink to merge depends basically on luck of the draw of where the blockers are placed. Nonetheless, i have left it alone for now. Reviewers: chandlerc, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29079 llvm-svn: 292971
*	[InstSimplify] try to eliminate icmp Pred (add nsw X, C1), C2	Sanjay Patel	2017-01-24	1	-21/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I was surprised to see that we're missing icmp folds based on 'add nsw' in InstCombine, but we should handle the InstSimplify cases first because that could make the InstCombine code simpler. Here are Alive-based proofs for the logic: Name: add_neg_constant Pre: C1 < 0 && (C2 > ((1<<(width(C1)-1)) + C1)) %a = add nsw i7 %x, C1 %b = icmp sgt %a, C2 => %b = false Name: add_pos_constant Pre: C1 > 0 && (C2 < ((1<<(width(C1)-1)) + C1 - 1)) %a = add nsw i6 %x, C1 %b = icmp slt %a, C2 => %b = false Name: nuw Pre: C1 u>= C2 %a = add nuw i11 %x, C1 %b = icmp ult %a, C2 => %b = false Differential Revision: https://reviews.llvm.org/D29053 llvm-svn: 292952
*	[PH] Replace uses of AssertingVH from members of analysis results with	Chandler Carruth	2017-01-24	2	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a lazy-asserting PoisoningVH. AssertVH is fundamentally incompatible with cache-invalidation of analysis results. The invaliadtion happens after the AssertingVH has already fired. Instead, use a PoisoningVH that will assert if the dangling handle is ever used rather than merely be assigned or destroyed. This patch also removes all of the (numerous) doomed attempts to work around this fundamental incompatibility. It is a pretty significant simplification IMO. The most interesting change is in the Inliner where we still do some clearing because we don't want to rely on the coarse grained invalidation strategy of the containing pass manager. However, I prefer the approach that contains this logic to the cleanup phase of the Inliner, and I think we could enhance the CGSCC analysis management layer to make this even better in the future if desired. The rest is straight cleanup. I've also added a test for one of the harder cases to work around: when a module analysis contains many AssertingVHes pointing at functions. Differential Revision: https://reviews.llvm.org/D29006 llvm-svn: 292928
*	[SLP] Additional test for checking that instruction with extra args is	Alexey Bataev	2017-01-24	1	-0/+57
\| \| \| \| \| \|	not reconstructed. llvm-svn: 292911
*	Update domtree incrementally in loop peeling.	Serge Pavlov	2017-01-24	1	-1/+1
\| \| \| \| \| \| \| \| \|	With this change dominator tree remains in sync after each step of loop peeling. Differential Revision: https://reviews.llvm.org/D29029 llvm-svn: 292895
*	SimplifyLibCalls: Replace more unary libcalls with intrinsics	Matt Arsenault	2017-01-23	3	-123/+276
\| \| \| \|	llvm-svn: 292855
*	[LoopUnroll] First form LCSSA, then loop-simplify	Michael Kuperstein	2017-01-23	1	-0/+55
\| \| \| \| \| \| \| \| \| \| \| \| \|	Running non-LCSSA-preserving LoopSimplify followed by LCSSA on (roughly) the same loop is incorrect, since LoopSimplify may break LCSSA arbitrarily higher in the loop nest. Instead, run LCSSA first, and then run LCSSA-preserving LoopSimplify on the result. This fixes PR31718. Differential Revision: https://reviews.llvm.org/D29055 llvm-svn: 292854
*	[InstSimplify] add tests to show missing folds from 'icmp (add nsw)'; NFC	Sanjay Patel	2017-01-23	1	-0/+169
\| \| \| \|	llvm-svn: 292841
*	[SLP] Additional test with extra args in horizontal reductions.	Alexey Bataev	2017-01-23	1	-0/+63
\| \| \| \|	llvm-svn: 292821
*	[MemorySSA] Add new tests for invariant.groups	Piotr Padlewski	2017-01-23	1	-0/+182
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Next round of extra tests for MSSA. I have a prototype invariant.group handling implementation that fixes all the FIXMEs, and I think it will be easier to see what is the difference if I firstly post this, and then only fix fixits. Reviewers: george.burgess.iv, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29022 llvm-svn: 292797
*	[InstCombine][X86] Add MULDQ/MULUDQ constant folding support	Simon Pilgrim	2017-01-23	1	-24/+12
\| \| \| \|	llvm-svn: 292793
*	[InstCombine][X86] MULDQ/MULUDQ undef -> zero	Simon Pilgrim	2017-01-23	1	-6/+6
\| \| \| \| \| \|	Match generic mul behaviour so that <X x i64> multiply and muldq/muludq pattern act the same llvm-svn: 292784
*	[SLP] Additional test for SLP vectorizer with 31 reduction elements.	Alexey Bataev	2017-01-23	1	-0/+196
\| \| \| \|	llvm-svn: 292783
*	[InstCombine][SSE] Tests showing missed opportunities to constant fold ↵	Simon Pilgrim	2017-01-23	1	-0/+58
\| \| \| \| \| \|	PMULDQ/PMULUDQ llvm-svn: 292782
*	This test apparently requires an x86 target and is failing on numerous	Chandler Carruth	2017-01-23	1	-0/+48
\| \| \| \| \| \| \| \| \| \| \|	bots ever since d0k fixed the CHECK lines so that it did something at all. It isn't actually testing SCEV directly but LSR, so move it into LSR and the x86-specific tree of tests that already exists there. Target dependence is common and unavoidable with the current design of LSR. llvm-svn: 292774
*	[PM] Replace the hard invalidate in JumpThreading for LVI with correct	Chandler Carruth	2017-01-23	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	invalidation of deleted functions in GlobalDCE. This was always testing a bug really triggered in GlobalDCE. Right now we have analyses with asserting value handles into IR. As long as those remain, when deleting an IR unit, we cannot wait for the normal invalidation scheme to kick in even though it was designed to work correctly in the face of these kinds of deletions. Instead, the pass needs to directly handle invalidating the analysis results pointing at that IR unit. I've tought the Inliner about this and this patch teaches GlobalDCE. This will handle the asserting VH case in the existing test as well as other issues of the same fundamental variety. I've moved the test into the GlobalDCE directory and added a comment explaining what is going on. Note that we cannot simply require LVI here because LVI is too lazy. llvm-svn: 292773
*	[PM] Add a dedicated test case for the issue fixed in r292770.	Chandler Carruth	2017-01-23	1	-0/+33
\| \| \| \| \| \| \| \|	While this is covered by a clang test case, we should have something locally to LLVM that immediately checks the inliner doesn't leave analyses to dangling IR bodies. llvm-svn: 292772
*	Fix some broken CHECK lines.	Benjamin Kramer	2017-01-22	5	-7/+7
\| \| \| \| \| \|	The colon is important. llvm-svn: 292761
*	[PM] Fix a really nasty bug introduced when adding PGO support to the	Chandler Carruth	2017-01-22	1	-0/+111
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	new PM's inliner. The bug happens when we refine an SCC after having computed a proxy for the FunctionAnalysisManager, and then proceed to compute fresh analyses for functions in the new SCC using the manager provided by the old SCC's proxy. And when we manage to mutate a function in this new SCC in a way that invalidates those analyses. This can be... challenging to reproduce. I've managed to contrive a set of functions that trigger this and added a test case, but it is a bit brittle. I've directly checked that the passes run in the expected ways to help avoid the test just becoming silently irrelevant. This gets the new PM back to passing the LLVM test suite after the PGO improvements landed. llvm-svn: 292757
*	[MemorySSA] Remove deprecated comment from test	Piotr Padlewski	2017-01-21	1	-3/+0
\| \| \| \|	llvm-svn: 292733
*	[MemorySSA] Fix invariant.group test and add new	Piotr Padlewski	2017-01-21	1	-2/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This test had a bug: !llvm.invariant.group instead of !invariant.group. Also add some new test for future development. All tests passes, when MSSA will support invariant.group only the lines with FIXIT should be changed. Reviewers: dberlin, george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28969 llvm-svn: 292730
*	[InstCombine] use m_APInt to allow ashr folds for vectors with splat constants	Sanjay Patel	2017-01-21	1	-6/+2
\| \| \| \| \| \| \|	We may be able to assert that no shl-shl or lshr-lshr pairs ever get here because we should have already handled those in foldShiftedShift(). llvm-svn: 292726
*	[InstCombine] add tests for ashr-ashr; NFC	Sanjay Patel	2017-01-21	1	-6/+47
\| \| \| \|	llvm-svn: 292724
*	[PM] Teach the loop PM to run LoopSimplify prior to the loop pipeline.	Chandler Carruth	2017-01-21	2	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the last remaining core feature of the loop pass pipeline in the new PM and removes the last of the really egregious hacks in the LICM tests. Sadly, this requires really substantial changes in the unittests in order to provide and maintain simplified loops. This is particularly hard because for example LoopSimplify will try to fold undef branches to an ideal direction and simplify the loop accordingly. Differential Revision: https://reviews.llvm.org/D28766 llvm-svn: 292709
*	MergeFunctions: Preserve debug info in thunks, under option ↵	Anmol P. Paralkar	2017-01-21	1	-0/+223
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	-mergefunc-preserve-debug-info Summary: Under option -mergefunc-preserve-debug-info we: - Do not create a new function for a thunk. - Retain the debug info for a thunk's parameters (and associated instructions for the debug info) from the entry block. Note: -debug will display the algorithm at work. - Create debug-info for the call (to the shared implementation) made by a thunk and its return value. - Erase the rest of the function, retaining the (minimally sized) entry block to create a thunk. - Preserve a thunk's call site to point to the thunk even when both occur within the same translation unit, to aid debugability. Note that this behaviour differs from the underlying -mergefunc implementation which modifies the thunk's call site to point to the shared implementation when both occur within the same translation unit. Reviewers: echristo, eeckstein, dblaikie, aprantl, friss Reviewed By: aprantl Subscribers: davide, fhahn, jfb, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D28075 llvm-svn: 292702
*	[ConstantFold] Remove test checking that we don't constant-fold sqrt(-2).	Justin Lebar	2017-01-21	1	-9/+0
\| \| \| \| \| \| \|	This depended on libm's errno behavior (we constant fold iff libm's sqrt(-2) does not set errno) and was breaking on mac. llvm-svn: 292701
*	[ConstantFolding] Constant-fold llvm.sqrt(x) like other intrinsics.	Justin Lebar	2017-01-21	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently we return undef, but we're in the process of changing the LangRef so that llvm.sqrt behaves like the other math intrinsics, matching the return value of the standard libcall but not setting errno. This change is legal even without the LangRef change because currently calling llvm.sqrt(x) where x is negative is spec'ed to be UB. But in practice it's also safe because we're simply constant-folding fewer inputs: Inputs >= -0 get constant-folded as before, but inputs < -0 now aren't constant-folded, because ConstantFoldFP aborts if the host math function raises an fp exception. Reviewers: hfinkel, efriedma, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28929 llvm-svn: 292692
*	[InstCombine] auto-generate checks; NFC	Sanjay Patel	2017-01-20	2	-34/+75
\| \| \| \|	llvm-svn: 292682
*	LowerTypeTests: Simplify; always create SizeM1 with type IntPtrTy, move ↵	Peter Collingbourne	2017-01-20	3	-6/+6
\| \| \| \| \| \|	initialization out of if statement. llvm-svn: 292674
*	Add indirect call promotion to SamplePGO	Dehao Chen	2017-01-20	2	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds metadata for indirect call promotion in the sample profile loader. Reviewers: xur, davidxl, dnovillo Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28923 llvm-svn: 292672