bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[TBAAVerifier] Be stricter around verifying scalar nodes	Sanjoy Das	2016-12-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This fixes the issue exposed in PR31393, where we weren't trying sufficiently hard to diagnose bad TBAA metadata. This does reduce the variety in the error messages we print out, but I think the tradeoff of verifying more, simply and quickly overrules the need for more helpful error messags here. llvm-svn: 290713
*	[PM] Teach MemDep to invalidate its result object when its cached	Chandler Carruth	2016-12-27	1	-0/+76
\| \| \| \| \| \| \| \|	analysis handles become invalid. Add a test case for its invalidation logic. llvm-svn: 290620
*	[PM] Add more dedicated testing to cover the invalidation logic added to	Chandler Carruth	2016-12-27	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	BasicAA in r290603. I've kept the basic testing in the new PM test file as that also covers the AAManager invalidation logic. If/when there is a good place for broader AA testing it could move there. This test is somewhat unsatisfying as I can't get it to fail even with ASan outside of explicit checks of the invalidation. Apparently we don't yet have any test coverage of the BasicAA code paths using either the domtree or loopinfo -- I made both of them always be null and check-llvm passed. llvm-svn: 290612
*	[AliasAnalysis] Teach BasicAA about memcpy.	Bryant Wong	2016-12-25	3	-22/+22
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D27034 llvm-svn: 290526
*	[X86][SSE] Improve lowering of vXi64 multiplies	Simon Pilgrim	2016-12-21	1	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As mentioned on PR30845, we were performing our vXi64 multiplication as: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi, 32)+ psllqi(AhiBlo, 32); when we could avoid one of the upper shifts with: AloBlo = pmuludq(a, b); AloBhi = pmuludq(a, psrlqi(b, 32)); AhiBlo = pmuludq(psrlqi(a, 32), b); return AloBlo + psllqi(AloBhi + AhiBlo, 32); This matches the lowering on gcc/icc. Differential Revision: https://reviews.llvm.org/D27756 llvm-svn: 290267
*	[ConstantFolding] Fix vector GEPs harder	Michael Kuperstein	2016-12-21	1	-0/+21
\| \| \| \| \| \| \| \| \| \|	For vector GEPs, CastGEPIndices can end up in an infinite recursion, because we compare the vector type to the scalar pointer type, find them different, and then try to cast a type to itself. Differential Revision: https://reviews.llvm.org/D28009 llvm-svn: 290260
*	Add files I seem to have dropped in my revert (r290086).	Daniel Jasper	2016-12-19	1	-0/+22
\| \| \| \| \| \|	Sorry! llvm-svn: 290087
*	Revert @llvm.assume with operator bundles (r289755-r289757)	Daniel Jasper	2016-12-19	2	-3/+3
\| \| \| \| \| \| \|	This creates non-linear behavior in the inliner (see more details in r289755's commit thread). llvm-svn: 290086
*	[AArch64] Guard Misaligned 128-bit store penalty by subtarget feature	Matthew Simpson	2016-12-15	1	-6/+12
\| \| \| \| \| \| \| \| \|	This patch checks that the SlowMisaligned128Store subtarget feature is set when penalizing such stores in getMemoryOpCost. Differential Revision: https://reviews.llvm.org/D27677 llvm-svn: 289845
*	[CostModel][X86] Updated reverse shuffle costs	Simon Pilgrim	2016-12-15	1	-32/+56
\| \| \| \|	llvm-svn: 289819
*	[CostModel] Fix long standing bug with reverse shuffle mask detection	Simon Pilgrim	2016-12-15	1	-0/+31
\| \| \| \| \| \|	Incorrect 'undef' mask index matching meant that broadcast shuffles could be detected as reverse shuffles llvm-svn: 289811
*	[CostModel][X86] Add tests for reverse shuffle costs	Simon Pilgrim	2016-12-15	1	-0/+143
\| \| \| \|	llvm-svn: 289800
*	Remove the AssumptionCache	Hal Finkel	2016-12-15	1	-22/+0
\| \| \| \| \| \| \| \| \|	After r289755, the AssumptionCache is no longer needed. Variables affected by assumptions are now found by using the new operand-bundle-based scheme. This new scheme is more computationally efficient, and also we need much less code... llvm-svn: 289756
*	Make processing @llvm.assume more efficient by using operand bundles	Hal Finkel	2016-12-15	2	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There was an efficiency problem with how we processed @llvm.assume in ValueTracking (and other places). The AssumptionCache tracked all of the assumptions in a given function. In order to find assumptions relevant to computing known bits, etc. we searched every assumption in the function. For ValueTracking, that means that we did O(#assumes * #values) work in InstCombine and other passes (with a constant factor that can be quite large because we'd repeat this search at every level of recursion of the analysis). Several of us discussed this situation at the last developers' meeting, and this implements the discussed solution: Make the values that an assume might affect operands of the assume itself. To avoid exposing this detail to frontends and passes that need not worry about it, I've used the new operand-bundle feature to add these extra call "operands" in a way that does not affect the intrinsic's signature. I think this solution is relatively clean. InstCombine adds these extra operands based on what ValueTracking, LVI, etc. will need and then those passes need only search the users of the values under consideration. This should fix the computational-complexity problem. At this point, no passes depend on the AssumptionCache, and so I'll remove that as a follow-up change. Differential Revision: https://reviews.llvm.org/D27259 llvm-svn: 289755
*	[Verifier] Add verification for TBAA metadata	Sanjoy Das	2016-12-11	10	-12/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change adds some verification in the IR verifier around struct path TBAA metadata. Other than some basic sanity checks (e.g. we get constant integers where we expect constant integers), this checks: - That by the time an struct access tuple `(base-type, offset)` is "reduced" to a scalar base type, the offset is `0`. For instance, in C++ you can't start from, say `("struct-a", 16)`, and end up with `("int", 4)` -- by the time the base type is `"int"`, the offset better be zero. In particular, a variant of this invariant is needed for `llvm::getMostGenericTBAA` to be correct. - That there are no cycles in a struct path. - That struct type nodes have their offsets listed in an ascending order. - That when generating the struct access path, you eventually reach the access type listed in the tbaa tag node. Reviewers: dexonsmith, chandlerc, reames, mehdi_amini, manmanren Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D26438 llvm-svn: 289402
*	ConstantFolding: Don't crash when encountering vector GEP	Keno Fischer	2016-12-08	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	ConstantFolding tried to cast one of the scalar indices to a vector type. Instead, use the vector type only for the first index (which is the only one allowed to be a vector) and use its scalar type otherwise. Fixes PR31250. Reviewers: majnemer Differential Revision: https://reviews.llvm.org/D27389 llvm-svn: 289073
*	[AArch64] Correct the check of signed 9-bit imm in isLegalAddressingMode()	Haicheng Wu	2016-12-07	1	-1/+97
\| \| \| \| \| \| \| \|	In the addressing mode, signed 9-bit imm is [-256, 255], not [-512, 511]. Differential Revision: https://reviews.llvm.org/D27480 llvm-svn: 288876
*	[TTI/CostModel] Correct the way getGEPCost() calls isLegalAddressingMode()	Haicheng Wu	2016-12-03	2	-11/+207
\| \| \| \| \| \| \| \|	Fix a bug when we call isLegalAddressingMode() from getGEPCost(). Differential Revision: https://reviews.llvm.org/D27357 llvm-svn: 288569
*	[ppc] Correctly compute the cost of loading 32/64 bit memory into VSR	Guozhi Wei	2016-12-03	1	-0/+19
\| \| \| \| \| \| \| \|	VSX has instructions lxsiwax/lxsdx that can load 32/64 bit value into VSX register cheaply. That patch makes it known to memory cost model, so the vectorization of the test case in pr30990 is beneficial. Differential Revision: https://reviews.llvm.org/D26713 llvm-svn: 288560
*	[SLP] Fixed cost model for horizontal reduction.	Alexey Bataev	2016-12-01	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently when cost of scalar operations is evaluated the vector type is used for scalar operations. Patch fixes this issue and fixes evaluation of the vector operations cost. Several test showed that vector cost model is too optimistic. It allowed vectorization of 8 or less add/fadd operations, though scalar code is faster. Actually, only for 16 or more operations vector code provides better performance. Differential Revision: https://reviews.llvm.org/D26277 llvm-svn: 288398
*	[SLP] Additional tests with the cost of vector operations.	Alexey Bataev	2016-12-01	1	-0/+2
\| \| \| \|	llvm-svn: 288377
*	Revert "[SLP] Additional tests with the cost of vector operations."	Alexey Bataev	2016-12-01	1	-2/+0
\| \| \| \| \| \|	This reverts commit a61718435fc4118c82f8aa6133fd81f803789c1e. llvm-svn: 288371
*	[SLP] Additional tests with the cost of vector operations.	Alexey Bataev	2016-12-01	1	-0/+2
\| \| \| \|	llvm-svn: 288369
*	[InstSimplify] allow integer vector types to use computeKnownBits	Sanjay Patel	2016-11-27	1	-15/+3
\| \| \| \| \| \| \| \|	Note that the non-splat lshr+lshr test folded, but that does not work in general. Something is missing or wrong in computeKnownBits as the non-splat shl+shl test still shows. llvm-svn: 288005
*	add tests to show missing analysis; NFC	Sanjay Patel	2016-11-27	1	-0/+71
\| \| \| \|	llvm-svn: 287998
*	[X86][AVX512] Add support for v2i64 fptosi/fptoui/sitofp/uitofp on ↵	Simon Pilgrim	2016-11-24	3	-9/+15
\| \| \| \| \| \| \| \|	AVX512DQ-only targets Use 512-bit instructions with subvector insertion/extraction like we do in a number of similar circumstances llvm-svn: 287882
*	[X86][AVX512] Add support for v4i64 fptosi/fptoui/sitofp/uitofp on ↵	Simon Pilgrim	2016-11-23	2	-4/+6
\| \| \| \| \| \| \| \|	AVX512DQ-only targets Use 512-bit instructions with subvector insertion/extraction like we do in a number of similar circumstances llvm-svn: 287762
*	[CostModel][X86] Add missing AVX512DQ v8i64 fptosi/sitofp costs	Simon Pilgrim	2016-11-23	1	-3/+3
\| \| \| \|	llvm-svn: 287760
*	[CostModel][X86] Add v2f32 -> v2i64 fptosi/fptoui cost tests	Simon Pilgrim	2016-11-23	2	-0/+14
\| \| \| \|	llvm-svn: 287756
*	[CostModel][X86] Updated sitofp/uitofp scalar/vector cost tests	Simon Pilgrim	2016-11-22	2	-1379/+492
\| \| \| \| \| \| \| \|	Better coverage of all legal types + special cases. Removed old fptoui tests which are all handled in fptoui.ll llvm-svn: 287678
*	Fix known zero bits for addrspacecast.	Yaxun Liu	2016-11-21	1	-0/+24
\| \| \| \| \| \| \| \| \| \|	Currently LLVM assumes that a pointer addrspacecasted to a different addr space is equivalent to trunc or zext bitwise, which is not true. For example, in amdgcn target, when a null pointer is addrspacecasted from addr space 4 to 0, its value is changed from i64 0 to i32 -1. This patch teaches LLVM not to assume known bits of addrspacecast instruction to its operand. Differential Revision: https://reviews.llvm.org/D26803 llvm-svn: 287545
*	[AVX-512] Support FCOPYSIGN for v16f32 and v8f64	Craig Topper	2016-11-18	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This extends FCOPYSIGN support to 512-bit vectors. I've also added tests to show what the 128-bit and 256-bit cases look like with broadcast loads. Reviewers: delena, zvi, RKSimon, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26791 llvm-svn: 287298
*	[CostModel][X86] Added mul costs for vXi8 vectors	Simon Pilgrim	2016-11-14	1	-16/+18
\| \| \| \| \| \|	More realistic v16i8/v32i8/v64i8 MUL costs - we have to extend to vXi16, use PMULLW and then truncate the result llvm-svn: 286838
*	[X86][AVX] Fixed v16i16/v32i8 ADD/SUB costs on AVX1 subtargets	Simon Pilgrim	2016-11-14	1	-8/+8
\| \| \| \| \| \| \| \|	Add explicit v16i16/v32i8 ADD/SUB costs, matching the costs of v4i64/v8i32 - they were missing for some reason. This has side effects on the LV max bandwidth tests (AVX1 now prefers 128-bit vectors vs AVX2 which still prefers 256-bit) llvm-svn: 286832
*	IR: Introduce inrange attribute on getelementptr indices.	Peter Collingbourne	2016-11-10	1	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the inrange keyword is present before any index, loading from or storing to any pointer derived from the getelementptr has undefined behavior if the load or store would access memory outside of the bounds of the element selected by the index marked as inrange. This can be used, e.g. for alias analysis or to split globals at element boundaries where beneficial. As previously proposed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2016-July/102472.html Differential Revision: https://reviews.llvm.org/D22793 llvm-svn: 286514
*	[RegionInfo] Add three tests that include infinite loops	Tobias Grosser	2016-11-10	3	-0/+71
\| \| \| \| \| \| \| \|	These examples are variations that were inspired from a small subgraph taken from paper.ll which are interesting as they show certain issues with infinite loops. llvm-svn: 286450
*	[BasicAA] Teach BasicAA to handle the inaccessiblememonly and ↵	Andrew Kaylor	2016-11-08	1	-16/+51
\| \| \| \| \| \| \| \|	inaccessiblemem_or_argmemonly attributes Differential Revision: https://reviews.llvm.org/D26382 llvm-svn: 286294
*	[VectorLegalizer] Expansion of CTLZ using CTPOP when possible	Simon Pilgrim	2016-11-08	1	-16/+16
\| \| \| \| \| \| \| \| \| \|	This patch avoids scalarization of CTLZ by instead expanding to use CTPOP (ref: "Hacker's Delight") when the necessary operations are available. This also adds the necessary cost models for X86 SSE2 targets (the main beneficiary) to ensure vectorization only happens when its useful. Differential Revision: https://reviews.llvm.org/D25910 llvm-svn: 286233
*	[AliasSetTracker] Make AST smarter about assume intrinsics that don't ↵	Chad Rosier	2016-11-07	1	-0/+19
\| \| \| \| \| \| \| \|	actually affect memory. Differential Revision: https://reviews.llvm.org/D26252 llvm-svn: 286108
*	Improved cost model for FDIV and FSQRT, by Andrew Tischenko	Alexey Bataev	2016-10-31	1	-82/+82
\| \| \| \| \| \| \| \| \| \|	There is a bug describing poor cost model for floating point operations: Bug 29083 - [X86][SSE] Improve costs for floating point operations. This patch is the second one in series of patches dealing with cost model. Differential Revision: https://reviews.llvm.org/D25722 llvm-svn: 285564
*	[Loads] Fix crash in is isDereferenceableAndAlignedPointer()	Tom Stellard	2016-10-28	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We were trying to add APInt values with different bit sizes after visiting an addrspacecast instruction which changed the bit width of the pointer. Reviewers: majnemer, hfinkel Subscribers: hfinkel, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24774 llvm-svn: 285407
*	[X86][AVX512] Fix MUL v8i64 costs on non-AVX512DQ targets	Simon Pilgrim	2016-10-27	1	-2/+2
\| \| \| \|	llvm-svn: 285329
*	[X86][AVX512DQ] Improve lowering of MUL v2i64 and v4i64	Simon Pilgrim	2016-10-27	1	-4/+13
\| \| \| \| \| \| \| \| \| \|	With DQI but without VLX, lower v2i64 and v4i64 MUL operations with v8i64 MUL (vpmullq). Updated cost table accordingly. Differential Revision: https://reviews.llvm.org/D26011 llvm-svn: 285304
*	Revert "[AliasSetTracker] Make AST smarter about intrinsics that don't ↵	Chad Rosier	2016-10-26	1	-54/+0
\| \| \| \| \| \| \| \| \| \| \|	actually affect memory." This reverts commit r285191. LICM appears to rely on the Alias Set Tracker hitting lifetime markers to prevent code from being moved outside of the original scope. llvm-svn: 285227
*	[AliasSetTracker] Make AST smarter about intrinsics that don't actually ↵	Chad Rosier	2016-10-26	1	-0/+54
\| \| \| \| \| \| \| \|	affect memory. Differential Revision: https://reviews.llvm.org/D25969 llvm-svn: 285191
*	Fix regression from my recent GlobalsAA fix.	Eli Friedman	2016-10-24	1	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \|	There are two fixes here: one, AnalyzeUsesOfPointer can't return false until it has checked all the uses of the pointer. Two, if a global uses another global, we have to assume the address of the first global escapes. Fixes https://llvm.org/bugs/show_bug.cgi?id=30707 . Differential Revision: https://reviews.llvm.org/D25798 llvm-svn: 285034
*	[CostModel][X86] Added tests for current integer signed/unsigned remainder costs	Simon Pilgrim	2016-10-23	1	-0/+116
\| \| \| \|	llvm-svn: 284940
*	[X86][SSE] Add SSE41/AVX1 costs for vector shifts.	Simon Pilgrim	2016-10-23	3	-109/+109
\| \| \| \| \| \|	We were defaulting to SSE2 costs which weren't taking into account the availability of PBLENDW/PBLENDVB to improve merging of per-element shift results. llvm-svn: 284939
*	[CostModel][X86] Added tests for current integer trunc costs	Simon Pilgrim	2016-10-23	1	-0/+141
\| \| \| \|	llvm-svn: 284938
*	[BasicAA] Fix - missed alias in GEP expressions	Gerolf Hoflehner	2016-10-22	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \|	In BasicAA GEP operand values get adjusted ("wrap-around") based on the pointersize. Otherwise, in non-64b modes, AA could report false negatives. However, a wrap-around is valid only for a fully evaluated expression. It had been introduced to fix an alias problem in http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160118/326163.html. This commit restricts the wrap-around to constant gep operands only where the value is known at compile-time. llvm-svn: 284908