bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	Fix for PR32740 - Invalid floating type, unreachable between r300969 and r301029	Artur Pilipenko	2017-04-22	1	-2/+5
	The bug was introduced by r301018 "[InstCombine] fadd double (sitofp x), y check that the promotion is valid". The patch didn't expect that fadd can operate on vectors, not only scalars. Add vector support along with the test. llvm-svn: 301070
*	[InstCombine] revert r300977 and r301021	Sanjay Patel	2017-04-21	1	-14/+4
	This can cause an inf-loop. Investigating... llvm-svn: 301035
*	[InstCombine] use isSubsetOf() for efficiency	Sanjay Patel	2017-04-21	1	-1/+1
	C | ~D == -1
	~(C | ~D) == 0
	~C & D == 0
	D & ~C == 0
	D.isSubsetOf(C)
	llvm-svn: 301021
*	[InstCombine] fadd double (sitofp x), y check that the promotion is valid	Artur Pilipenko	2017-04-21	1	-22/+38
	When doing these transformations, check that the result of the integer addition is representable in the FP type:
	(fadd double (sitofp x), fpcst) --> (sitofp (add int x, intcst))
	(fadd double (sitofp x), (sitofp y)) --> (sitofp (add int x, y))
	This is a fix for https://bugs.llvm.org//show_bug.cgi?id=27036 Reviewed By: andrew.w.kaylor, scanon, spatel Differential Revision: https://reviews.llvm.org/D31182 llvm-svn: 301018
*	[InstCombine] prefer xor with -1 because 'not' is easier to understand (PR32706)	Sanjay Patel	2017-04-21	1	-4/+14
	This matches the demanded bits behavior in the DAG and should fix: https://bugs.llvm.org/show_bug.cgi?id=32706 Differential Revision: https://reviews.llvm.org/D32255 llvm-svn: 300977
*	[InstCombine] Remove the zextOrTrunc from ShrinkDemandedConstant.	Craig Topper	2017-04-20	1	-4/+2
	The demanded mask and the constant should always be the same width for all callers today. Also stop copying the demanded mask as it's passed in. We should avoid allocating memory unless we are going to do something. The final AND to create the new constant will take care of it. llvm-svn: 300927
*	[InstCombine] function names start with lower-case letter; NFC	Sanjay Patel	2017-04-20	2	-3/+3
	Forgot to make this fix with the signature change in r300911. llvm-svn: 300912
*	[InstCombine] allow shl+shr demanded bits folds with splat constants	Sanjay Patel	2017-04-20	2	-22/+17
	llvm-svn: 300911
*	[InstCombine] allow shl demanded bits folds with splat constants	Sanjay Patel	2017-04-20	1	-2/+4
	More fixes are needed to enable the helper SimplifyShrShlDemandedBits(). llvm-svn: 300898
*	[InstCombine] Use APInt::intersects and APInt::isSubsetOf to improve a few more places in SimplifyDemandedBits.	Craig Topper	2017-04-20	1	-4/+4
	llvm-svn: 300896
*	[InstCombine] allow ashr/lshr demanded bits folds with splat constants	Sanjay Patel	2017-04-20	1	-11/+14
	llvm-svn: 300888
*	[InstCombine] Use APInt::isSubsetOf to simplify some code in SimplifyDemandedBits. NFC	Craig Topper	2017-04-20	1	-37/+27
	This allows us to use fewer temporary APInts for And and Invert operations. llvm-svn: 300885
*	[InstCombine] Remove redundant code from SimplifyDemandedBits handling for Or.	Craig Topper	2017-04-20	1	-18/+0
	The code above it is equivalent if you work through the bitwise math. llvm-svn: 300876
*	[APInt] Rename getSignBit to getSignMask	Craig Topper	2017-04-20	7	-27/+27
	getSignBit is a static function that creates an APInt with only the sign bit set. getSignMask seems like a better name to convey its functionality. In fact several places use it and then store in an APInt named SignMask. Differential Revision: https://reviews.llvm.org/D32108 llvm-svn: 300856
*	[APInt] Add isSubsetOf method that can check if one APInt is a subset of another without creating temporary APInts	Craig Topper	2017-04-20	1	-1/+1
	This question comes up in many places in SimplifyDemandedBits. This makes it easy to ask without allocating additional temporary APInts. The BitVector class provides similar functionality through its (IMHO badly named) test(const BitVector&) method, though its output polarity is reversed. I've provided one example use case in this patch. I plan to do more as a follow up. Differential Revision: https://reviews.llvm.org/D32258 llvm-svn: 300851
*	In SimplifyDemandedUseBits, use computeKnownBits directly to handle Constants	Craig Topper	2017-04-20	1	-15/+4
	Currently we don't explicitly process ConstantDataSequential, ConstantAggregateZero, ConstantVector, or Undef before applying the Depth limit. Instead they occur after the depth check in the non-instruction path. For the constant types that we do handle, the code is replicated from computeKnownBits. This patch fixes the missing constant handling and reduces the amount of code by just using computeKnownBits directly for any type of Constant. Differential Revision: https://reviews.llvm.org/D32123 llvm-svn: 300849
*	Simplify test for sret attribute in instcombine	Reid Kleckner	2017-04-19	1	-12/+6
	This change is correct because the verifier requires that at most one argument be marked 'sret'. NFC, removes a use of AttributeList slot APIs. llvm-svn: 300784
*	[APInt] Cast calls to add/sub/mul overflow methods to void if only their overflow bool out param is used.	Craig Topper	2017-04-19	1	-2/+2
	This is preparation for a clang change to improve the [[nodiscard]] warning to not be ignored on methods that return a class marked [[nodiscard]] that are defined in the class itself. See D32207. We should consider adding wrapper methods to APInt that return the overflow flag directly and discard the APInt result. This would eliminate the void casts and the need to create a bool before the call to pass to the out param. llvm-svn: 300758
*	[InstCombine] Reduce visitLoadInst() code duplication. NFCI.	Davide Italiano	2017-04-19	1	-20/+18
	llvm-svn: 300717
*	Add a getPointerOperandType() helper to LoadInst and StoreInst; NFC	Sanjoy Das	2017-04-18	1	-1/+1
	I will use this in a later change. llvm-svn: 300613
*	[APInt] Use lshrInPlace to replace lshr where possible	Craig Topper	2017-04-18	3	-8/+9
	This patch uses lshrInPlace to replace code where the object that lshr is called on is being overwritten with the result. This adds an lshrInPlace(const APInt &) version as well. Differential Revision: https://reviews.llvm.org/D32155 llvm-svn: 300566
*	[InstCombine] Matchers work with both ConstExpr and Instructions.	Davide Italiano	2017-04-17	1	-2/+2
	So, `cast<Instruction>` is not guaranteed to succeed. Change the code so that we create a new constant and use it in the newly created instruction, as it's done in other places in InstCombine. OK'ed by Sanjay/Craig. Fixes PR32686. llvm-svn: 300495
*	Introduce APInt::isSignBitSet/isSignBitClear. Use isSignBitSet in place of isNegative in known bits tracking.	Craig Topper	2017-04-17	1	-4/+4
	This makes statements like KnownZero.isNegative() (which means the value we're tracking is positive) less confusing. llvm-svn: 300457
*	AMDGPU: SimplifyDemandedElts for image intrinsics	Matt Arsenault	2017-04-17	1	-3/+80
	Causes some VGPR usage improvements in shaderdb, but introduces some SGPR spilling regressions due to random scheduling changes later. llvm-svn: 300453
*	[InstCombine] Simplify 1/X for vectors.	Craig Topper	2017-04-17	1	-16/+15
	llvm-svn: 300439
*	[InstCombine] Add support for vector srem->urem.	Craig Topper	2017-04-17	1	-7/+5
	llvm-svn: 300437
*	[InstCombine] Add support for turning vector sdiv into udiv.	Craig Topper	2017-04-17	1	-18/+16
	llvm-svn: 300435
*	[InstCombine][ValueTracking] When computing known bits for Srem make sure we don't compute known bits for the LHS twice.	Craig Topper	2017-04-16	1	-2/+2
	If we already called computeKnownBits for the RHS being a constant power of 2, we've already computed everything we can and should just stop. I think previously we would still recurse if we had determined the result was negative or had not determined the sign bit at all. llvm-svn: 300432
*	[InstCombine] In SimplifyDemandedUseBits, don't bother to mask known bits of constants with DemandedMask.	Craig Topper	2017-04-16	1	-3/+3
	Just because we didn't demand them doesn't mean they aren't known. llvm-svn: 300430
*	[X86][X86 intrinsics] Folding cmp(sub(a,b),0) into cmp(a,b) optimization	Michael Zuckerman	2017-04-16	1	-0/+31
	This patch adds a new optimization (folding cmp(sub(a,b),0) into cmp(a,b)) to the instCombineCall pass. It was written specifically for X86 CMP intrinsics. Differential Revision: https://reviews.llvm.org/D31398 llvm-svn: 300422
*	[InstCombine] allow (X != C1 && X != C2) and similar patterns to match splat vector constants	Sanjay Patel	2017-04-15	1	-19/+19
	llvm-svn: 300402
*	[InstCombine] MakeAnd/Or/Xor handling to reuse previous APInt computations	Craig Topper	2017-04-14	1	-36/+46
	When checking if we should return a constant, we create some temporary APInts to see if we know all bits. But the exact computations we do are needed in several other locations in the same code. This patch moves them to named temporaries so we can reuse them. Ideally we'd write directly to KnownZero/One, but we currently seem to only write those variables after all the simplification checks and I didn't want to change that with this patch. Differential Revision: https://reviews.llvm.org/D32094 llvm-svn: 300376
*	[IR] Make paramHasAttr to use arg indices instead of attr indices	Reid Kleckner	2017-04-14	1	-1/+1
	This avoids the confusing 'CS.paramHasAttr(ArgNo + 1, Foo)' pattern. Previously we were testing return value attributes with index 0, so I introduced hasReturnAttr() for that use case. llvm-svn: 300367
*	[InstCombine] (X != C1 && X != C2) --> (X | (C1 ^ C2)) != C2	Sanjay Patel	2017-04-14	1	-36/+65
	...when C1 differs from C2 by one bit and C1 <u C2: http://rise4fun.com/Alive/Vuo And move related folds to a helper function. This reduces code duplication and will make it easier to remove the scalar-only restriction as a follow-up step. llvm-svn: 300364
*	[InstCombine] Support folding a subtract with a constant LHS into a phi node	Craig Topper	2017-04-14	8	-28/+44
	We currently only support folding a subtract into a select but not a PHI. This fixes that. I had to fix an assumption in FoldOpIntoPhi that assumed the PHI node was always in operand 0. Now we pass it in like we do for FoldOpIntoSelect. But we still require some dancing to find the Constant when we create the BinOp or ConstantExpr. This code is similar to what we do for selects. Since I touched all call sites, this also renames FoldOpIntoPhi to foldOpIntoPhi to match coding standards. Differential Revision: https://reviews.llvm.org/D31686 llvm-svn: 300363
*	[InstCombine] Refactor SimplifyUsingDistributiveLaws to more explicitly skip code when LHS/RHS aren't BinaryOperators	Craig Topper	2017-04-14	1	-30/+33
	Currently this code always makes 2 or 3 calls to tryFactorization regardless of whether the LHS/RHS are BinaryOperators. We make 3 calls when both operands are BinaryOperators with the same opcode. Or, surprisingly, when neither are BinaryOperators. This is because getBinOpsForFactorization returns Instruction::BinaryOpsEnd when the operand is not a BinaryOperator. If both LHS and RHS are not BinaryOperators then they both have an Opcode of Instruction::BinaryOpsEnd. When this happens we rely on tryFactorization to early out due to A/B/C/D being null. Similar behavior occurs for the other calls: we rely on getBinOpsForFactorization having made A/B or C/D null to get tryFactorization to early out. We also rely on these null checks to check the result of getIdentityValue and early out for it. This patch refactors this to pull these checks up to SimplifyUsingDistributiveLaws so we don't rely on BinaryOpsEnd as a sentinel or this A/B/C/D null behavior. I think this makes this code easier to reason about. Should also give a tiny performance improvement for cases where the LHS or RHS isn't a BinaryOperator. Differential Revision: https://reviews.llvm.org/D31913 llvm-svn: 300353
*	[InstCombine] Use APInt::setSignBit and APInt::isNegative(). NFC	Craig Topper	2017-04-14	1	-3/+3
	llvm-svn: 300305
*	[IR] Make getParamAttributes take argument numbers, not ArgNo+1	Reid Kleckner	2017-04-13	1	-16/+19
	Add hasParamAttribute() and use it instead of hasAttribute(ArgNo+1, Kind) everywhere. The fact that the AttributeList index for an argument is ArgNo+1 should be a hidden implementation detail. NFC llvm-svn: 300272
*	[InstCombine] Use APInt::getBitsSetFrom instead of inverting the result of getLowBitsSet. NFC	Craig Topper	2017-04-13	1	-4/+2
	llvm-svn: 300265
*	Revert accidentally-committed files in r300252.	Richard Smith	2017-04-13	1	-403/+0
	llvm-svn: 300253
*	Remove all allocation and divisions from GreatestCommonDivisor	Richard Smith	2017-04-13	1	-0/+403
	Switch from Euclid's algorithm to Stein's algorithm for computing GCD. This avoids the (expensive) APInt division operation in favour of bit operations. Remove all memory allocation from within the GCD loop by tweaking our `lshr` implementation so it can operate in-place. Differential Revision: https://reviews.llvm.org/D31968 llvm-svn: 300252
*	[InstCombine] Fix !prof metadata preservation for invokes	Reid Kleckner	2017-04-13	1	-18/+16
	Summary: Bug noticed by inspection. Extend the test to handle invokes as well as calls, and rewrite it to not depend on the inliner and other passes. Also simplify the call site replacement code with CallSite, similar to what I did to dead arg elimination and arg promotion (rL300235 and rL300229). Reviewers: danielcdh, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32041 llvm-svn: 300251
*	[InstCombine] fold X == 0 || X == -1 to one compare (PR32524)	Sanjay Patel	2017-04-13	1	-1/+5
	This is effectively a retry of: https://reviews.llvm.org/rL299851 but now we have tests and an assert to make sure the bug that was exposed with that attempt will not happen again. I'll fix the code duplication and missing sibling fold next, but I want to make this change as small as possible to reduce risk since I messed it up last time. This should fix: https://bugs.llvm.org/show_bug.cgi?id=32524 llvm-svn: 300236
*	[InstCombine] Simplify attribute code with new AttributeList::get NFC	Reid Kleckner	2017-04-13	1	-31/+20
	llvm-svn: 300230
*	[InstCombine] use similar ops for related folds; NFCI	Sanjay Patel	2017-04-13	1	-10/+9
	It's less efficient to produce 'ule' than 'ult' since we know we're going to canonicalize to 'ult', but we shouldn't have duplicated code for these folds. As a trade-off, this was a pretty terrible way to make a '2'. :)
	if (LHSC == SubOne(RHSC)) AddC = ConstantExpr::getSub(AddOne(RHSC), LHSC);
	The next steps are to share the code to fix PR32524 and add the missing 'and' fold that was left out when PR14708 was fixed: https://bugs.llvm.org/show_bug.cgi?id=14708 llvm-svn: 300222
*	[InstCombine] fix assert to not always be true	Sanjay Patel	2017-04-13	1	-1/+1
	llvm-svn: 300202
*	[IR] Take func, ret, and arg attrs separately in AttributeList::get	Reid Kleckner	2017-04-13	1	-11/+7
	This seems like a much more natural API, based on Derek Schuff's comments on r300015. It further hides the implementation detail of AttributeList that function attributes come last and appear at index ~0U, which is easy for the user to screw up. git diff says it saves code as well: 97 insertions(+), 137 deletions(-) This also makes it easier to change the implementation, which I want to do next. llvm-svn: 300153
*	[InstCombine] Teach SimplifyMultipleUseDemandedBits to handle And/Or/Xor known bits using the LHS/RHS known bits it already acquired without recursing back into computeKnownBits.	Craig Topper	2017-04-12	1	-11/+46
	This replicates the known bits and constant creation code from the single use case for these instructions and adds it here. The computeKnownBits and constant creation code for other instructions is now in the default case of the opcode switch. llvm-svn: 300094
*	[InstCombine] Remove unreachable code for turning an And where all demanded bits on both sides are known to be zero into a constant 0.	Craig Topper	2017-04-12	1	-4/+0
	We already handled a superset check that included the known ones too and folded to a constant that may include ones. But it can also handle the case of no ones. llvm-svn: 300093
*	[InstCombine] fix wrong undef handling when converting select to shuffle	Sanjay Patel	2017-04-12	1	-2/+4
	As discussed in: https://bugs.llvm.org/show_bug.cgi?id=32486 ...the canonicalization of vector select to shufflevector does not hold up when undef elements are present in the condition vector. Try to make the undef handling clear in the code and the LangRef. Differential Revision: https://reviews.llvm.org/D31980 llvm-svn: 300092
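
Note on r301021 / r300851 (isSubsetOf): the chain of equivalent forms in the r301021 message can be checked on plain machine integers. The sketch below is only an illustration. It uses `std::uint64_t` rather than `APInt`, and the helper names `isSubsetOf64`, `orNotIsAllOnes`, and `formsAgreeOnAllBytes` are hypothetical, not part of LLVM.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical fixed-width stand-in for APInt::isSubsetOf:
// true when every bit set in `d` is also set in `c`, i.e. (D & ~C) == 0.
inline bool isSubsetOf64(std::uint64_t d, std::uint64_t c) {
  return (d & ~c) == 0;
}

// The form the commit message starts from: (C | ~D) == -1 (all ones).
inline bool orNotIsAllOnes(std::uint64_t c, std::uint64_t d) {
  return (c | ~d) == ~std::uint64_t{0};
}

// Exhaustively compares the two forms for all pairs of 8-bit patterns.
inline bool formsAgreeOnAllBytes() {
  for (std::uint64_t c = 0; c < 256; ++c)
    for (std::uint64_t d = 0; d < 256; ++d)
      if (orNotIsAllOnes(c, d) != isSubsetOf64(d, c))
        return false;
  return true;
}
```

The subset form needs only one AND-with-complement and a zero test, which is presumably why it avoids the temporaries mentioned in r300851.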
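
Note on r300364: the fold (X != C1 && X != C2) --> (X | (C1 ^ C2)) != C2 is stated for constants that differ by one bit with C1 <u C2. A minimal sketch that brute-forces the claim over 8-bit values follows. All function names are hypothetical, and this is not the InstCombine implementation.

```cpp
#include <cassert>
#include <cstdint>

// True when a and b differ in exactly one bit position.
inline bool differByOneBit(std::uint8_t a, std::uint8_t b) {
  unsigned x = static_cast<unsigned>(a ^ b);
  return x != 0 && (x & (x - 1)) == 0;
}

// The original pattern: two compares joined by a logical and.
inline bool originalPattern(std::uint8_t x, std::uint8_t c1, std::uint8_t c2) {
  return x != c1 && x != c2;
}

// The folded pattern: one or plus one compare.
// Precondition (from the commit message): one-bit difference and c1 <u c2.
inline bool foldedPattern(std::uint8_t x, std::uint8_t c1, std::uint8_t c2) {
  assert(differByOneBit(c1, c2) && c1 < c2);
  return static_cast<std::uint8_t>(x | (c1 ^ c2)) != c2;
}

// Checks the fold for every valid (c1, c2) pair and every 8-bit x.
inline bool foldHoldsForAllBytes() {
  for (unsigned c1 = 0; c1 < 256; ++c1)
    for (unsigned c2 = c1 + 1; c2 < 256; ++c2) {
      if (!differByOneBit(static_cast<std::uint8_t>(c1),
                          static_cast<std::uint8_t>(c2)))
        continue;
      for (unsigned x = 0; x < 256; ++x)
        if (originalPattern(x, c1, c2) != foldedPattern(x, c1, c2))
          return false;
    }
  return true;
}
```

The fold works because setting the single differing bit in X maps both C1 and C2 onto C2, and nothing else.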
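
Note on r300252: the message describes replacing Euclid's algorithm with Stein's (binary) GCD so that only shifts, compares, and subtractions are needed. The following is a sketch of that general technique on `std::uint64_t`, assuming nothing about the actual APInt code. The name `steinGcd` is hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <utility>

// Binary GCD (Stein's algorithm): no division, only shifts and subtraction.
inline std::uint64_t steinGcd(std::uint64_t a, std::uint64_t b) {
  if (a == 0) return b;
  if (b == 0) return a;

  // Factor out the powers of two common to both operands.
  unsigned shift = 0;
  while (((a | b) & 1) == 0) {
    a >>= 1;
    b >>= 1;
    ++shift;
  }

  // Make a odd; remaining factors of two in a are not shared with b.
  while ((a & 1) == 0) a >>= 1;

  while (b != 0) {
    // Strip factors of two from b; they cannot divide the odd a.
    while ((b & 1) == 0) b >>= 1;
    // Keep a <= b, then subtract: gcd(a, b) == gcd(a, b - a).
    if (a > b) std::swap(a, b);
    b -= a;
  }

  // Restore the common power of two.
  return a << shift;
}
```

Every shift here updates a value in place, which mirrors the commit's point about removing allocation from the loop once `lshr` can operate in-place.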
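
Note on r300236: the subject says X == 0 || X == -1 folds to one compare but does not spell out the resulting form. One well-known way to express it is an unsigned range check, (X + 1) <u 2. The sketch below assumes that form and verifies it for all 8-bit values. The function names are hypothetical and the code is not taken from the patch.

```cpp
#include <cassert>
#include <cstdint>

// The two-compare pattern from the commit subject.
inline bool isZeroOrAllOnes(std::int8_t x) {
  return x == 0 || x == -1;
}

// Assumed single-compare form: (X + 1) <u 2, with wrapping 8-bit arithmetic.
inline bool isZeroOrAllOnesFolded(std::int8_t x) {
  std::uint8_t biased =
      static_cast<std::uint8_t>(static_cast<std::uint8_t>(x) + 1u);
  return biased < 2;
}

// Exhaustively compares both forms over the full int8_t range.
inline bool compareFoldHoldsForAllBytes() {
  for (int v = -128; v <= 127; ++v) {
    std::int8_t x = static_cast<std::int8_t>(v);
    if (isZeroOrAllOnes(x) != isZeroOrAllOnesFolded(x))
      return false;
  }
  return true;
}
```

Adding 1 maps -1 to 0 and 0 to 1, so both accepted values land in the unsigned range [0, 2) and every other value lands outside it.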