bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[InstCombine] allow icmp (bit-manipulation-intrinsic(), C) folds for vectors	Sanjay Patel	2016-07-23	1	-8/+3
\| \| \| \|	llvm-svn: 276523
*	[Profile] Use explicit flag to enable IR PGO	Xinliang David Li	2016-07-23	1	-1/+2
\| \| \| \| \| \| \| \|	Patch by Jake VanAdrighem Differential Revision: http://reviews.llvm.org/D22607 llvm-svn: 276516
*	auto-generate checks	Sanjay Patel	2016-07-23	1	-32/+62
\| \| \| \|	llvm-svn: 276501
*	[LoopDataPrefetch] Include hotness of region in opt remark	Adam Nemet	2016-07-22	1	-0/+82
\| \| \| \|	llvm-svn: 276488
*	add tests for icmp vector folds	Sanjay Patel	2016-07-22	2	-1/+50
\| \| \| \|	llvm-svn: 276482
*	[SLPVectorizer] Vectorize reverse-order loads in horizontal reductions	Michael Kuperstein	2016-07-22	1	-0/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When vectorizing a tree rooted at a store bundle, we currently try to sort the stores before building the tree, so that the stores can be vectorized. For other trees, the order of the root bundle - which determines the order of all other bundles - is arbitrary. That is bad, since if a leaf bundle of consecutive loads happens to appear in the wrong order, we will not vectorize it. This is partially mitigated when the root is a binary operator, by trying to build a "reversed" tree when that's considered profitable. This patch extends the workaround we have for binops to trees rooted in a horizontal reduction. This fixes PR28474. Differential Revision: https://reviews.llvm.org/D22554 llvm-svn: 276477
*	add tests for icmp vector folds	Sanjay Patel	2016-07-22	1	-0/+24
\| \| \| \|	llvm-svn: 276476
*	add tests for icmp vector folds	Sanjay Patel	2016-07-22	1	-0/+24
\| \| \| \|	llvm-svn: 276475
*	add tests for icmp vector folds	Sanjay Patel	2016-07-22	2	-6/+69
\| \| \| \|	llvm-svn: 276472
*	update to use FileCheck and auto-generate checks	Sanjay Patel	2016-07-22	1	-161/+338
\| \| \| \|	llvm-svn: 276466
*	add tests for icmp vector folds	Sanjay Patel	2016-07-22	2	-0/+26
\| \| \| \|	llvm-svn: 276464
*	Recommit - [DSE]Enhance shorthening MemIntrinsic based on OverlapIntervals	Jun Bum Lim	2016-07-22	3	-0/+67
\| \| \| \| \| \| \| \| \| \| \|	Recommiting r275571 after fixing crash reported in PR28270. Now we erase elements of IOL in deleteDeadInstruction(). Original Summary: This change use the overlap interval map built from partial overwrite tracking to perform shortening MemIntrinsics. Add test cases which was missing opportunities before. llvm-svn: 276452
*	add tests for vector bit manipulation intrinsics	Sanjay Patel	2016-07-22	1	-2/+40
\| \| \| \|	llvm-svn: 276451
*	Invariant start/end intrinsics overloaded for address space	Anna Thomas	2016-07-22	3	-10/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: apilipenko, reames Subscribers: llvm-commits llvm-svn: 276447
*	Don't remove side effecting instructions due to ConstantFoldInstruction	David Majnemer	2016-07-22	3	-41/+53
\| \| \| \| \| \| \| \| \|	Just because we can constant fold the result of an instruction does not imply that we can delete the instruction. It may have side effects. This fixes PR28655. llvm-svn: 276389
*	[IRCE] Don't misuse CHECK-LABEL; NFC	Sanjoy Das	2016-07-22	5	-30/+31
\| \| \| \|	llvm-svn: 276373
*	[IRCE] Add an option to skip profitability checks	Sanjoy Das	2016-07-22	1	-0/+31
\| \| \| \| \| \| \| \|	If `-irce-skip-profitability-checks` is passed in, IRCE will kick in in all cases where it is legal for it to kick in. This flag is intended to help diagnose and analyse performance issues. llvm-svn: 276372
*	GVH-hoist: only clone GEPs (PR28606)	Sebastian Pop	2016-07-21	2	-2/+52
\| \| \| \| \| \| \| \| \|	Do not clone stored values unless they are GEPs that are special cased to avoid hoisting them without hoisting their associated ld/st. Differential revision: https://reviews.llvm.org/D22652 llvm-svn: 276358
*	[PM] Port NaryReassociate to the new PM	Wei Mi	2016-07-21	5	-0/+5
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D22648 llvm-svn: 276349
*	[InstSimplify] don't crash handling a pointer or aggregate type	Sanjay Patel	2016-07-21	1	-0/+13
\| \| \| \|	llvm-svn: 276345
*	[InstSimplify] recognize trunc + icmp sgt/slt variants of select ↵	Sanjay Patel	2016-07-21	1	-41/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	simplifications (PR28466) rL245171 exposed a hole in InstSimplify that manifested in a strange way in PR28466: https://llvm.org/bugs/show_bug.cgi?id=28466 It's possible to use trunc + icmp sgt/slt in place of an and + icmp eq/ne, so we need to recognize that pattern to eliminate selects that are choosing between some value and some bitmasked version of that value. Note that there is significant room for improvement (refactoring) and enhancement (more patterns, possibly in InstCombine rather than here). Differential Revision: https://reviews.llvm.org/D22537 llvm-svn: 276341
*	[OptDiag,LDist] Convert remaining opt remarks to use the new API	Adam Nemet	2016-07-21	1	-0/+6
\| \| \| \|	llvm-svn: 276340
*	[LV] Move vector int induction update to end of latch	Matthew Simpson	2016-07-21	3	-14/+15
\| \| \| \| \| \| \| \| \| \| \|	This patch moves the update instruction for vectorized integer induction phi nodes to the end of the latch block. This ensures consistent placement of all induction updates across all the kinds of int inductions we create (scalar, splat vector, or vector phi). Differential Revision: https://reviews.llvm.org/D22416 llvm-svn: 276339
*	add vector tests and a simpler version of the negative tests	Sanjay Patel	2016-07-21	1	-3/+48
\| \| \| \|	llvm-svn: 276328
*	Revert "Invariant start/end intrinsics overloaded for address space"	Anna Thomas	2016-07-21	3	-22/+10
\| \| \| \| \| \|	This reverts commit r276316. llvm-svn: 276320
*	Invariant start/end intrinsics overloaded for address space	Anna Thomas	2016-07-21	3	-10/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any adddress space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: tstellarAMD, reames, apilipenko Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22519 llvm-svn: 276316
*	[GVNHoist] Preserve optimization hints which agree	David Majnemer	2016-07-21	1	-0/+44
\| \| \| \| \| \| \|	If we have optimization hints with agree with each other along different paths, preserve them. llvm-svn: 276248
*	[GVNHoist] Don't wrongly preserve TBAA	David Majnemer	2016-07-21	1	-0/+29
\| \| \| \| \| \| \|	We hoisted loads/stores without taking into account which can cause miscompiles. llvm-svn: 276240
*	[OptDiag,LV] Add hotness attribute to applied-optimization remarks	Adam Nemet	2016-07-21	1	-4/+4
\| \| \| \| \| \| \|	Test coverage is provided by modifying the function in the FP-math testcase that we are allowed to vectorize. llvm-svn: 276223
*	[InstCombine] LogicOpc (zext X), C --> zext (LogicOpc X, C) (PR28476)	Sanjay Patel	2016-07-21	7	-54/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The benefits of this change include: 1. Remove DeMorgan-matching code that was added specifically to work-around the missing transform in http://reviews.llvm.org/rL248634. 2. Makes the DeMorgan transform work for vectors too. 3. Fix PR28476: https://llvm.org/bugs/show_bug.cgi?id=28476 Extending this transform to other casts and other associative operators may be useful too. See https://reviews.llvm.org/D22421 for a prerequisite for doing that though. Differential Revision: https://reviews.llvm.org/D22271 llvm-svn: 276221
*	[OptDiag,LV] Add hotness attribute to the derived analysis remarks	Adam Nemet	2016-07-20	1	-0/+113
\| \| \| \| \| \| \| \|	This includes FPCompute and Aliasing. Testcase is based on no_fpmath.ll. llvm-svn: 276211
*	[InstSimplify][InstCombine] don't crash when folding vector selects of icmp	Sanjay Patel	2016-07-20	2	-0/+34
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D22602 llvm-svn: 276209
*	[NVPTX] Enable the load-store vectorizer on nvptx.	Justin Lebar	2016-07-20	1	-16/+16
\| \| \| \| \| \| \| \| \| \|	Reviewers: tra Subscribers: jholewinski, arsenm, asbirlea Differential Revision: https://reviews.llvm.org/D22592 llvm-svn: 276196
*	[OptDiag,LV] Add hotness attribute to analysis remarks	Adam Nemet	2016-07-20	1	-0/+201
\| \| \| \| \| \| \| \|	The earlier change added hotness attribute to missed-optimization remarks. This follows up with the analysis remarks (the ones explaining the reason for the missed optimization). llvm-svn: 276192
*	[GVNHoist] Don't hoist PHI nodes	David Majnemer	2016-07-20	1	-0/+42
\| \| \| \| \| \| \| \| \|	We hoisted PHIs without respecting their special insertion point in the block, leading to verfier errors. This fixes PR28626. llvm-svn: 276181
*	[SCCP] Zap multiple return values.	Davide Italiano	2016-07-20	2	-2/+26
\| \| \| \| \| \| \| \| \|	We can replace the return values with undef if we replaced all the call uses with a constant/undef. Differential Revision: https://reviews.llvm.org/D22336 llvm-svn: 276174
*	[LSV] Don't move stores across may-load instrs, and loosen restrictions on ↵	Justin Lebar	2016-07-20	1	-33/+194
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	moving loads. Summary: Previously we wouldn't move loads/stores across instructions that had side-effects, where that was defined as may-write or may-throw. But this is not sufficiently restrictive: Stores can't safely be moved across instructions that may load. This patch also adds a DEBUG check that all instructions in our chain are either loads or stores. Reviewers: asbirlea Subscribers: llvm-commits, jholewinski, arsenm, mzolotukhin Differential Revision: https://reviews.llvm.org/D22547 llvm-svn: 276171
*	[LSV] Vectorize up to side-effecting instructions.	Justin Lebar	2016-07-20	2	-0/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously if we had a chain that contained a side-effecting instruction, we wouldn't vectorize it at all. Now we'll vectorize everything that comes before the side-effecting instruction. Reviewers: asbirlea Subscribers: arsenm, jholewinski, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22536 llvm-svn: 276170
*	minimize tests and auto-generate checks	Sanjay Patel	2016-07-20	1	-117/+73
\| \| \| \|	llvm-svn: 276147
*	Revert "[InstCombine] Enable cast-folding in logic(cast(icmp), cast(icmp))"	Benjamin Kramer	2016-07-20	1	-70/+0
\| \| \| \| \| \| \| \|	Makes InstCombine infloop when compiling v8. This reverts commit r275989 and r276105. llvm-svn: 276106
*	[InstCombine] Provide more test cases for cast-folding [NFC]	Tobias Grosser	2016-07-20	1	-3/+35
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In r275989 we enabled the folding of `logic(cast(icmp), cast(icmp))` to `cast(logic(icmp, icmp))`. Here we add more test cases to assure this folding works for all logical operations `and`/`or`/`xor`. Reviewers: grosser Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22561 Contributed-by: Matthias Reisinger llvm-svn: 276105
*	[X86][SSE] Add cost model values for CTPOP of vectors	Simon Pilgrim	2016-07-20	1	-35/+144
\| \| \| \| \| \| \| \|	This patch adds costs for the vectorized implementations of CTPOP, the default values were seriously underestimating the cost of these and was encouraging vectorization on targets where serialized use of POPCNT would be much better. Differential Revision: https://reviews.llvm.org/D22456 llvm-svn: 276104
*	Forgot to add a test for r276008.	David Majnemer	2016-07-20	1	-0/+18
\| \| \| \|	llvm-svn: 276082
*	[LV] Add hotness attribute to missed-optimization remarks	Adam Nemet	2016-07-20	1	-0/+213
\| \| \| \| \| \| \|	The new OptimizationRemarkEmitter analysis pass is hooked up to both new and old PM passes. llvm-svn: 276080
*	Revert "Revert r275883 and r275891. They seem to cause PR28608."	Michael Zolotukhin	2016-07-20	3	-0/+198
\| \| \| \| \| \| \|	This reverts commit r276064, and thus reapplies r275891 and r275883 with a fix for PR28608. llvm-svn: 276077
*	[LSV] Don't assume that loads/stores appear in address order in the BB.	Justin Lebar	2016-07-20	1	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: getVectorizablePrefix previously didn't work properly in the face of aliasing loads/stores. It unwittingly assumed that the loads/stores appeared in the BB in address order. If they didn't, it would do the wrong thing. Reviewers: asbirlea, tstellarAMD Subscribers: arsenm, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D22535 llvm-svn: 276072
*	Revert r275883 and r275891. They seem to cause PR28608.	Sean Silva	2016-07-19	2	-163/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Revert "[LoopSimplify] Update LCSSA after separating nested loops." This reverts commit r275891. Revert "[LCSSA] Post-process PHI-nodes created by SSAUpdate when constructing LCSSA form." This reverts commit r275883. llvm-svn: 276064
*	[PM] Port LoopUnroll.	Sean Silva	2016-07-19	1	-0/+1
\| \| \| \| \| \| \| \| \|	We just set PreserveLCSSA to always true since we don't have an analogous method `mustPreserveAnalysisID(LCSSA)`. Also port LoopInfo verifier pass to test LoopUnrollPass. llvm-svn: 276063
*	[LSV] Insert stores at the right point.	Justin Lebar	2016-07-19	1	-2/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously, the insertion point for stores was the last instruction in Chain before calling getVectorizablePrefixEndIdx. Thus if getVectorizablePrefixEndIdx didn't return Chain.size(), we still would insert at the last instruction in Chain. This patch changes our internal API a bit in an attempt to make it less prone to this sort of error. As a result, we end up recalculating the Chain's boundary instructions, but I think worrying about the speed hit of this is a premature optimization right now. Reviewers: asbirlea, tstellarAMD Subscribers: mzolotukhin, arsenm, llvm-commits Differential Revision: https://reviews.llvm.org/D22534 llvm-svn: 276056
*	[LSV] Add detail to correct-order.ll test.	Justin Lebar	2016-07-19	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This helps keep us honest -- there were a number of ways we could screw up and still have passed this test. Reviewers: asbirlea Subscribers: llvm-commits, arsenm Differential Revision: https://reviews.llvm.org/D22531 llvm-svn: 276053