bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	function names start with a lowercase letter; NFC	Sanjay Patel	2016-02-01	14	-324/+324
\| \| \| \|	llvm-svn: 259425
*	[InstCombine] simplify masked scatter/gather intrinsics with zero masks	Sanjay Patel	2016-02-01	1	-4/+22
\| \| \| \| \| \| \| \| \| \| \|	A masked scatter with a zero mask means there's no store. A masked gather with a zero mask means the passthru arg is returned. This is a continuation of: http://reviews.llvm.org/rL259369 http://reviews.llvm.org/rL259392 llvm-svn: 259421
*	[InstCombine] simplify masked store intrinsics with all ones or zeros masks	Sanjay Patel	2016-02-01	1	-1/+21
\| \| \| \| \| \| \| \| \| \|	A masked store with a zero mask means there's no store. A masked store with an allOnes mask means it's a normal vector store. This is a continuation of: http://reviews.llvm.org/rL259369 llvm-svn: 259392
*	[InstCombine] Don't transform (X+INT_MAX)>=(Y+INT_MAX) -> (X<=Y)	David Majnemer	2016-02-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	This miscompile came about because we tried to use a transform which was only appropriate for xor operators when addition was present. This fixes PR26407. llvm-svn: 259375
*	[InstCombine] simplify masked load intrinsics with all ones or zeros masks	Sanjay Patel	2016-02-01	1	-0/+30
\| \| \| \| \| \| \| \| \|	A masked load with a zero mask means there's no load. A masked load with an allOnes mask means it's a normal vector load. Differential Revision: http://reviews.llvm.org/D16691 llvm-svn: 259369
*	add helper function for minnum/maxnum ; NFC	Sanjay Patel	2016-01-31	1	-74/+80
\| \| \| \|	llvm-svn: 259326
*	use range-based for loop; NFC	Sanjay Patel	2016-01-31	1	-3/+3
\| \| \| \|	llvm-svn: 259325
*	fix formatting; NFC	Sanjay Patel	2016-01-31	1	-13/+13
\| \| \| \|	llvm-svn: 259324
*	simplify; NFC	Sanjay Patel	2016-01-31	1	-8/+5
\| \| \| \|	llvm-svn: 259323
*	InstCombine: fabs(x) * fabs(x) -> x * x	Matt Arsenault	2016-01-30	1	-4/+15
\| \| \| \|	llvm-svn: 259295
*	Avoid overly large SmallPtrSet/SmallSet	Matthias Braun	2016-01-30	1	-1/+1
\| \| \| \| \| \| \|	These sets perform linear searching in small mode so it is never a good idea to use SmallSize/N bigger than 32. llvm-svn: 259283
*	function names start with a lower case letter ; NFC	Sanjay Patel	2016-01-29	1	-25/+25
\| \| \| \|	llvm-svn: 259264
*	fix formatting; NFC	Sanjay Patel	2016-01-29	1	-4/+8
\| \| \| \|	llvm-svn: 259262
*	[InstCombine] avoid an insertelement transformation that induces the ↵	Sanjay Patel	2016-01-29	1	-1/+17
\| \| \| \| \| \| \| \| \| \| \|	opposite extractelement fold (PR26354) We would infinite loop because we created a shufflevector that was wider than needed and then failed to combine that with the insertelement. When subsequently visiting the extractelement from that shuffle, we see that it's unnecessary, delete it, and trigger another visit to the insertelement. llvm-svn: 259236
*	less indenting; NFCI	Sanjay Patel	2016-01-28	1	-107/+109
\| \| \| \|	llvm-svn: 259002
*	Remove autoconf support	Chris Bieneman	2016-01-26	1	-15/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch is provided in preparation for removing autoconf on 1/26. The proposal to remove autoconf on 1/26 was discussed on the llvm-dev thread here: http://lists.llvm.org/pipermail/llvm-dev/2016-January/093875.html "I felt a great disturbance in the [build system], as if millions of [makefiles] suddenly cried out in terror and were suddenly silenced. I fear something [amazing] has happened." - Obi Wan Kenobi Reviewers: chandlerc, grosbach, bob.wilson, tstellarAMD, echristo, whitequark Subscribers: chfast, simoncook, emaste, jholewinski, tberghammer, jfb, danalbert, srhines, arsenm, dschuff, jyknight, dsanders, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16471 llvm-svn: 258861
*	[InstCombine, SCCP] Consolidate code used to remove instructions	David Majnemer	2016-01-24	1	-18/+3
\| \| \| \| \| \| \| \| \|	InstCombine and SCCP both want to remove dead code in a very particular way but using identical means to do so. Share the code between the two. No functionality change is intended. llvm-svn: 258653
*	AMDGPU: Rename intrinsics to use amdgcn prefix	Matt Arsenault	2016-01-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The intrinsic target prefix should match the target name as it appears in the triple. This is not yet complete, but gets most of the important ones. llvm.AMDGPU.* intrinsics used by mesa and libclc are still handled for compatability for now. llvm-svn: 258557
*	[opaque pointer types] [NFC] FindAvailableLoadedValue: take LoadInst instead ↵	Eduard Burtescu	2016-01-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	of just the pointer. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16422 llvm-svn: 258477
*	don't repeat function names in comments; NFC	Sanjay Patel	2016-01-20	1	-19/+14
\| \| \| \|	llvm-svn: 258360
*	80-cols; NFC	Sanjay Patel	2016-01-20	1	-2/+2
\| \| \| \|	llvm-svn: 258323
*	remove outdated comment; NFC	Sanjay Patel	2016-01-19	1	-4/+0
\| \| \| \|	llvm-svn: 258147
*	[opaque pointer types] [NFC] GEP: replace get(Pointer)ElementType uses with ↵	Eduard Burtescu	2016-01-19	2	-16/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	get{Source,Result}ElementType. Summary: GEPOperator: provide getResultElementType alongside getSourceElementType. This is made possible by adding a result element type field to GetElementPtrConstantExpr, which GetElementPtrInst already has. GEP: replace get(Pointer)ElementType uses with get{Source,Result}ElementType. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16275 llvm-svn: 258145
*	combine clauses with same output ; NFCI	Sanjay Patel	2016-01-18	1	-8/+3
\| \| \| \|	llvm-svn: 258062
*	use m_OneUse ; NFCI	Sanjay Patel	2016-01-18	1	-4/+2
\| \| \| \|	llvm-svn: 258059
*	fix variable names, typos ; NFC	Sanjay Patel	2016-01-18	1	-36/+36
\| \| \| \|	llvm-svn: 258058
*	fix typo; NFC	Sanjay Patel	2016-01-18	1	-1/+1
\| \| \| \|	llvm-svn: 258057
*	[opaque pointer types] [breaking-change] [NFC] SimplifyGEPInst: take the ↵	Manuel Jacob	2016-01-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	source element type of the GEP as an argument. Patch by Eduard Burtescu. Reviewers: dblaikie, mjacob Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16281 llvm-svn: 258024
*	GlobalValue: use getValueType() instead of getType()->getPointerElementType().	Manuel Jacob	2016-01-16	2	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: mjacob Subscribers: jholewinski, arsenm, dsanders, dblaikie Patch by Eduard Burtescu. Differential Revision: http://reviews.llvm.org/D16260 llvm-svn: 257999
*	Re-commit r257064, after it was reverted in r257340.	Silviu Baranga	2016-01-15	1	-3/+320
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This contains a fix for the issue that caused the revert: we no longer assume that we can insert instructions after the instruction that produces the base pointer. We previously assumed that this would be ok, because the instruction produces a value and therefore is not a terminator. This is false for invoke instructions. We will now insert these new instruction directly at the location of the users. Original commit message: [InstCombine] Look through PHIs, GEPs, IntToPtrs and PtrToInts to expose more constants when comparing GEPs Summary: When comparing two GEP instructions which have the same base pointer and one of them has a constant index, it is possible to only compare indices, transforming it to a compare with a constant. This removes one use for the GEP instruction with the constant index, can reduce register pressure and can sometimes lead to removing the comparisson entirely. InstCombine was already doing this when comparing two GEPs if the base pointers were the same. However, in the case where we have complex pointer arithmetic (GEPs applied to GEPs, PHIs of GEPs, conversions to or from integers, etc) the value of the original base pointer will be hidden to the optimizer and this transformation will be disabled. This change detects when the two sides of the comparison can be expressed as GEPs with the same base pointer, even if they don't appear as such in the IR. The transformation will convert all the pointer arithmetic to arithmetic done on indices and all the relevant uses of GEPs to GEPs with a common base pointer. The GEP comparison will be converted to a comparison done on indices. Reviewers: majnemer, jmolloy Subscribers: hfinkel, jevinskie, jmolloy, aadg, llvm-commits Differential Revision: http://reviews.llvm.org/D15146 llvm-svn: 257897
*	Change isSafeToLoadUnconditionally arguments order. Separated from ↵	Artur Pilipenko	2016-01-15	1	-2/+2
\| \| \| \| \| \|	http://reviews.llvm.org/D10920. llvm-svn: 257894
*	[InstCombine] Rewrite bswap/bitreverse handling completely.	James Molloy	2016-01-15	1	-179/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are several requirements that ended up with this design; 1. Matching bitreversals is too heavyweight for InstCombine and doesn't really need to be done so early. 2. Bitreversals and byteswaps are very related in their matching logic. 3. We want to implement support for matching more advanced bswap/bitreverse patterns like partial bswaps/bitreverses. 4. Bswaps are best matched early in InstCombine. The result of these is that a new utility function is created in Transforms/Utils/Local.h that can be configured to search for bswaps, bitreverses or both. InstCombine uses it to find only bswaps, CGP uses it to find only bitreversals. We can then extend the matching logic in one place only. llvm-svn: 257875
*	function names start with a lower case letter ; NFC	Sanjay Patel	2016-01-12	2	-6/+6
\| \| \| \|	llvm-svn: 257496
*	Revert r257164 - it has caused spec2k6 failures in LTO mode	Silviu Baranga	2016-01-11	1	-322/+3
\| \| \| \|	llvm-svn: 257340
*	InstCombineCompares.cpp: Fix a warning. [-Wbraced-scalar-init]	NAKAMURA Takumi	2016-01-08	1	-1/+1
\| \| \| \|	llvm-svn: 257167
*	Re-commit r257064, this time with a fixed assert	Silviu Baranga	2016-01-08	1	-3/+322
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In setInsertionPoint if the value is not a PHI, Instruction or Argument it should be a Constant, not a ConstantExpr. Original commit message: [InstCombine] Look through PHIs, GEPs, IntToPtrs and PtrToInts to expose more constants when comparing GEPs Summary: When comparing two GEP instructions which have the same base pointer and one of them has a constant index, it is possible to only compare indices, transforming it to a compare with a constant. This removes one use for the GEP instruction with the constant index, can reduce register pressure and can sometimes lead to removing the comparisson entirely. InstCombine was already doing this when comparing two GEPs if the base pointers were the same. However, in the case where we have complex pointer arithmetic (GEPs applied to GEPs, PHIs of GEPs, conversions to or from integers, etc) the value of the original base pointer will be hidden to the optimizer and this transformation will be disabled. This change detects when the two sides of the comparison can be expressed as GEPs with the same base pointer, even if they don't appear as such in the IR. The transformation will convert all the pointer arithmetic to arithmetic done on indices and all the relevant uses of GEPs to GEPs with a common base pointer. The GEP comparison will be converted to a comparison done on indices. Reviewers: majnemer, jmolloy Subscribers: hfinkel, jevinskie, jmolloy, aadg, llvm-commits Differential Revision: http://reviews.llvm.org/D15146 llvm-svn: 257164
*	[InstCombine] insert a new shuffle in a safe place (PR25999)	Sanjay Patel	2016-01-08	1	-10/+7
\| \| \| \| \| \| \| \|	Limit this transform to a basic block and guard against PHIs. Hopefully, this fixes the remaining failures in PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999 llvm-svn: 257133
*	Revert r257064. It caused failures in some sanitizer tests.	Silviu Baranga	2016-01-07	1	-322/+3
\| \| \| \|	llvm-svn: 257069
*	Fix build after r257064: we should be returning false, not nullptr	Silviu Baranga	2016-01-07	1	-2/+2
\| \| \| \|	llvm-svn: 257067
*	[InstCombine] Look through PHIs, GEPs, IntToPtrs and PtrToInts to expose ↵	Silviu Baranga	2016-01-07	1	-3/+322
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	more constants when comparing GEPs Summary: When comparing two GEP instructions which have the same base pointer and one of them has a constant index, it is possible to only compare indices, transforming it to a compare with a constant. This removes one use for the GEP instruction with the constant index, can reduce register pressure and can sometimes lead to removing the comparisson entirely. InstCombine was already doing this when comparing two GEPs if the base pointers were the same. However, in the case where we have complex pointer arithmetic (GEPs applied to GEPs, PHIs of GEPs, conversions to or from integers, etc) the value of the original base pointer will be hidden to the optimizer and this transformation will be disabled. This change detects when the two sides of the comparison can be expressed as GEPs with the same base pointer, even if they don't appear as such in the IR. The transformation will convert all the pointer arithmetic to arithmetic done on indices and all the relevant uses of GEPs to GEPs with a common base pointer. The GEP comparison will be converted to a comparison done on indices. Reviewers: majnemer, jmolloy Subscribers: hfinkel, jevinskie, jmolloy, aadg, llvm-commits Differential Revision: http://reviews.llvm.org/D15146 llvm-svn: 257064
*	fix typo; NFC	Sanjay Patel	2016-01-06	1	-1/+1
\| \| \| \|	llvm-svn: 256883
*	[InstCombine] insert a new shuffle before its uses (PR26015)	Sanjay Patel	2016-01-05	1	-8/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Although this solves the test case in PR26015: https://llvm.org/bugs/show_bug.cgi?id=26015 And may solve PR25999: https://llvm.org/bugs/show_bug.cgi?id=25999 ...I suspect this is not the best solution. I think we want to insert the new shuffle just ahead of the earliest ExtractElementInst that we're replacing, but I don't know how that should be implemented. Differential Revision: http://reviews.llvm.org/D15878 llvm-svn: 256857
*	[Statepoints] Refactor GCRelocateOperands into an intrinsic wrapper. NFC.	Manuel Jacob	2016-01-05	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This commit renames GCRelocateOperands to GCRelocateInst and makes it an intrinsic wrapper, similar to e.g. MemCpyInst. Also, all users of GCRelocateOperands were changed to use the new intrinsic wrapper instead. Reviewers: sanjoy, reames Subscribers: reames, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D15762 llvm-svn: 256811
*	[InstructionCombining] prepareICWorklistFromFunction halts in infinite loop ↵	Chen Li	2016-01-04	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	with instructions of token type Summary: This patch fixes a bug in prepareICWorklistFromFunction, where the loop becomes infinite with instructions of token type. The patch checks if the instruction is token type, and if so it updates EndInst with the current instruction. Reviewers: reames, majnemer Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D15859 llvm-svn: 256792
*	fix formatting; NFC	Sanjay Patel	2015-12-30	1	-8/+8
\| \| \| \|	llvm-svn: 256645
*	[InstCombine] transform more extract/insert pairs into shuffles (PR2109)	Sanjay Patel	2015-12-24	1	-3/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an extension of the shuffle combining from r203229: http://reviews.llvm.org/rL203229 The idea is to widen a short input vector with undef elements so the existing shuffle transform for extract/insert can kick in. The motivation is to finally solve PR2109: https://llvm.org/bugs/show_bug.cgi?id=2109 For that example, the IR becomes: %1 = bitcast <2 x i32>* %P to <2 x float>* %ld1 = load <2 x float>, <2 x float>* %1, align 8 %2 = shufflevector <2 x float> %ld1, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> %i2 = shufflevector <4 x float> %A, <4 x float> %2, <4 x i32> <i32 0, i32 1, i32 4, i32 5> ret <4 x float> %i2 And x86 SSE output improves from: movq (%rdi), %xmm1 ## xmm1 = mem[0],zero movdqa %xmm1, %xmm2 shufps $229, %xmm2, %xmm2 ## xmm2 = xmm2[1,1,2,3] shufps $48, %xmm0, %xmm1 ## xmm1 = xmm1[0,0],xmm0[3,0] shufps $132, %xmm1, %xmm0 ## xmm0 = xmm0[0,1],xmm1[0,2] shufps $32, %xmm0, %xmm2 ## xmm2 = xmm2[0,0],xmm0[2,0] shufps $36, %xmm2, %xmm0 ## xmm0 = xmm0[0,1],xmm2[2,0] retq To the almost optimal: movhpd (%rdi), %xmm0 Note: There's a tension in the existing transform related to generating arbitrary shufflevector masks. We avoid that in other places in InstCombine because we're scared that codegen can't handle strange masks, but it looks like we're ok with producing those here. I purposely chose weird insert/extract indexes for the regression tests to see the effect in these cases. For PowerPC+Altivec, AArch64, and X86+SSE/AVX, I think the codegen is equal or better for these examples. Differential Revision: http://reviews.llvm.org/D15096 llvm-svn: 256394
*	[OperandBundles] Have InstCombine play nice with operand bundles	David Majnemer	2015-12-23	1	-4/+6
\| \| \| \| \| \| \|	Don't assume a call's use corresponds to an argument operand, it might correspond to a bundle operand. llvm-svn: 256327
*	[InstCombine] Fix indentation. NFC.	Craig Topper	2015-12-21	1	-2/+2
\| \| \| \|	llvm-svn: 256131
*	[InstCombine] Extend peephole DSE to handle unordered atomics	Philip Reames	2015-12-17	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \|	This extends the same line of reasoning used in EarlyCSE w/http://reviews.llvm.org/D15352 to the DSE implementation in InstCombine. Key points: * We only remove unordered or simple stores. * The loads producing values consumed by dead stores don't influence whether the store is dead. Differential Revision: http://reviews.llvm.org/D15354 llvm-svn: 255932
*	[InstCombine] Adding "\n" to debug output. NFC.	Weiming Zhao	2015-12-17	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: [InstCombine] Adding '\n' to debug output. NFC. Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org> Reviewers: apazos, majnemer, weimingz Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15403 llvm-svn: 255920