bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[InstSimplify] Remove unreachable default from SimplifyBinOp.	Craig Topper	2017-04-06	1	-22/+1
\| \| \| \| \| \|	We have dedicated handlers for every opcode so nothing can get here anymore. The switch doesn't get detected as fully covered because Opcode is an unsigned. Casting to Instruction::BinaryOps still doesn't detect it because BinaryOpsEnd is in the enum and 1 past the last opcode. llvm-svn: 299687
*	[InstSimplify] Teach SimplifyMulInst to recognize vectors of i1 as And. Not ↵	Craig Topper	2017-04-06	1	-1/+1
\| \| \| \| \| \|	just scalar i1. llvm-svn: 299665
*	[InstSimplify] Teach SimplifyAddInst and SimplifySubInst that vectors of i1 ↵	Craig Topper	2017-04-06	1	-2/+2
\| \| \| \| \| \|	can be treated as Xor too. llvm-svn: 299626
*	[LAA] Correctly return a half-open range in expandBounds	James Molloy	2017-04-05	1	-1/+4
\| \| \| \| \| \| \| \| \| \|	This is a latent bug that's been hanging around for a while. For a loop-invariant pointer, expandBounds would return the range {Ptr, Ptr}, but this was interpreted as a half-open range, not a closed range. So we ended up planting incorrect bounds checks. Even worse, they were tautological, so we ended up incorrectly executing the optimized loop. llvm-svn: 299526
*	InstSimplify: Add a hook for shufflevector	Zvi Rackover	2017-04-03	1	-0/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add a hook for simplification of shufflevector's with the following rules: - Constant folding - NFC, as it was already being done by the default handler. - If only one of the operands is constant, constant fold the shuffle if the mask does not select elements from the variable operand - to show the hook is firing and affecting the test-cases. Reviewers: RKSimon, craig.topper, spatel, sanjoy, nlopes, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31525 llvm-svn: 299393
*	[CodeGenPrep] move aarch64-type-promotion to CGP	Jun Bum Lim	2017-04-03	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Move the aarch64-type-promotion pass within the existing type promotion framework in CGP. This change also support forking sexts when a new sext is required for promotion. Note that change is based on D27853 and I am submitting this out early to provide a better idea on D27853. Reviewers: jmolloy, mcrosier, javed.absar, qcolombet Reviewed By: qcolombet Subscribers: llvm-commits, aemerson, rengolin, mcrosier Differential Revision: https://reviews.llvm.org/D28680 llvm-svn: 299379
*	[APInt] Move isMask and isShiftedMask out of APIntOps and into the APInt ↵	Craig Topper	2017-04-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	class. Implement them without memory allocation for multiword This moves the isMask and isShiftedMask functions to be class methods. They now use the MathExtras.h function for single word size and leading/trailing zeros/ones or countPopulation for the multiword size. The previous implementation made multiple temorary memory allocations to do the bitwise arithmetic operations to match the MathExtras.h implementation. Differential Revision: https://reviews.llvm.org/D31565 llvm-svn: 299362
*	[InstSimplify] add constant folding for fdiv/frem	Sanjay Patel	2017-04-01	1	-70/+49
\| \| \| \| \| \|	Also, add a helper function so we don't have to repeat this code for each binop. llvm-svn: 299309
*	fix formatting; NFC	Sanjay Patel	2017-04-01	1	-34/+35
\| \| \| \|	llvm-svn: 299307
*	[APInt] Remove the mul/urem/srem/udiv/sdiv functions from the APIntOps ↵	Craig Topper	2017-04-01	1	-1/+1
\| \| \| \| \| \|	namespace. Replace the few usages with calls to the class methods. NFC llvm-svn: 299292
*	[APInt] Remove shift functions from APIntOps namespace. Replace the few ↵	Craig Topper	2017-03-31	1	-4/+4
\| \| \| \| \| \|	users with the APInt class methods. NFCI llvm-svn: 299248
*	[ScalarEvolution] Re-enable Predicate implication from operations	Max Kazantsev	2017-03-31	1	-16/+171
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The patch rL298481 was reverted due to crash on clang-with-lto-ubuntu build. The reason of the crash was type mismatch between either a or b and RHS in the following situation: LHS = sext(a +nsw b) > RHS. This is quite rare, but still possible situation. Normally we need to cast all {a, b, RHS} to their widest type. But we try to avoid creation of new SCEV that are not constants to avoid initiating recursive analysis that can take a lot of time and/or cache a bad value for iterations number. To deal with this, in this patch we reject this case and will not try to analyze it if the type of sum doesn't match with the type of RHS. In this situation we don't need to create any non-constant SCEVs. This patch also adds an assertion to the method IsProvedViaContext so that we could fail on it and not go further into range analysis etc (because in some situations these analyzes succeed even when the passed arguments have wrong types, what should not normally happen). The patch also contains a fix for a problem with too narrow scope of the analysis caused by wrong usage of predicates in recursive invocations. The regression test on the said failure: test/Analysis/ScalarEvolution/implied-via-addition.ll Reviewers: reames, apilipenko, anna, sanjoy Reviewed By: sanjoy Subscribers: mzolotukhin, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D31238 llvm-svn: 299205
*	Spelling mistakes in comments. NFCI.	Simon Pilgrim	2017-03-31	1	-1/+1
\| \| \| \|	llvm-svn: 299197
*	ModuleSummaryAnalysis: Use a more precise #include. NFC.	Peter Collingbourne	2017-03-31	1	-1/+1
\| \| \| \|	llvm-svn: 299142
*	[InstSimplify] Use m_SignBit instead of calling getSignBit and using ↵	Craig Topper	2017-03-30	1	-4/+2
\| \| \| \| \| \|	m_Specific. NFCI llvm-svn: 299121
*	[InstSimplify] Use APInt::isMaxSignedValue() instead of comparing with ↵	Craig Topper	2017-03-30	1	-1/+1
\| \| \| \| \| \|	~APInt::getSignBit. NFC llvm-svn: 299120
*	Revert r298711 "[InstCombine] Provide a way to calculate KnownZero/One for ↵	Craig Topper	2017-03-24	1	-35/+21
\| \| \| \| \| \| \| \|	Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits" Tsan bot is failing. llvm-svn: 298745
*	[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in ↵	Craig Topper	2017-03-24	1	-21/+35
\| \| \| \| \| \| \| \|	SimplifyDemandedUseBits without recursing into ComputeKnownBits SimplifyDemandedUseBits for Add/Sub already recursed down LHS and RHS for simplifying bits. If that didn't provide any simplifications we fall back to calling computeKnownBits which will recurse again. Instead just take the known bits for LHS and RHS we already have and call into a new function in ValueTracking that can calculate the known bits given the LHS/RHS bits. llvm-svn: 298711
*	Revert "[ScalarEvolution] Re-enable Predicate implication from operations"	Max Kazantsev	2017-03-24	1	-159/+16
\| \| \| \| \| \| \| \|	This reverts commit rL298690 Causes failures on clang. llvm-svn: 298693
*	[ScalarEvolution] Re-enable Predicate implication from operations	Max Kazantsev	2017-03-24	1	-16/+159
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The patch rL298481 was reverted due to crash on clang-with-lto-ubuntu build. The reason of the crash was type mismatch between either a or b and RHS in the following situation: LHS = sext(a +nsw b) > RHS. This is quite rare, but still possible situation. Normally we need to cast all {a, b, RHS} to their widest type. But we try to avoid creation of new SCEV that are not constants to avoid initiating recursive analysis that can take a lot of time and/or cache a bad value for iterations number. To deal with this, in this patch we reject this case and will not try to analyze it if the type of sum doesn't match with the type of RHS. In this situation we don't need to create any non-constant SCEVs. This patch also adds an assertion to the method IsProvedViaContext so that we could fail on it and not go further into range analysis etc (because in some situations these analyzes succeed even when the passed arguments have wrong types, what should not normally happen). The patch also contains a fix for a problem with too narrow scope of the analysis caused by wrong usage of predicates in recursive invocations. The regression test on the said failure: test/Analysis/ScalarEvolution/implied-via-addition.ll llvm-svn: 298690
*	[ValueTracking] Use uint64_t for CarryIn in computeKnownBitsAddSub instead ↵	Craig Topper	2017-03-24	1	-2/+2
\| \| \| \| \| \|	of a creating a temporary APInt. NFC llvm-svn: 298688
*	[ValueTracking] Convert more places to use ↵	Craig Topper	2017-03-24	1	-7/+6
\| \| \| \| \| \|	setHighBits/setLowBits/setSignBit. NFCI llvm-svn: 298683
*	Use isFunctionHotInCallGraph to set the function section prefix.	Dehao Chen	2017-03-23	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The current prefix based function layout algorithm only looks at function's entry count, which is not sufficient. A function should be grouped together if its entry count or any call edge count is hot. Reviewers: davidxl, eraman Reviewed By: eraman Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31225 llvm-svn: 298656
*	[LVIPrinterPass] Print LVI info for function arguments	Anna Thomas	2017-03-23	1	-0/+12
\| \| \| \| \| \| \| \| \|	Using AssemblyAnnotationWriter for LVI printer prints for instructions and basic blocks. So, we explicitly need to print LVI info for the arguments of the function (these are values and not instructions). llvm-svn: 298640
*	Model ashr(shl(x, n), m) as mul(x, 2^(n-m)) when n > m	Zhaoshi Zheng	2017-03-23	1	-19/+46
\| \| \| \| \| \| \| \| \| \| \| \| \|	Given below case: %y = shl %x, n %z = ashr %y, m when n = m, SCEV models it as sext(trunc(x)). This patch tries to handle the case where n > m by using sext(mul(trunc(x), 2^(n-m)))) as the SCEV expression. llvm-svn: 298631
*	revert test commit r298629	Zhaoshi Zheng	2017-03-23	1	-4/+0
\| \| \| \|	llvm-svn: 298630
*	test commit	Zhaoshi Zheng	2017-03-23	1	-0/+4
\| \| \| \|	llvm-svn: 298629
*	[ValueTracking] Use APInt::isNegative instead of using operator[BitWidth-1]. ↵	Craig Topper	2017-03-23	1	-7/+7
\| \| \| \| \| \|	NFCI llvm-svn: 298584
*	[ValueTracking] Use setAllBits/setSignBit/setLowBits/setHighBits. NFCI	Craig Topper	2017-03-23	1	-20/+22
\| \| \| \|	llvm-svn: 298583
*	[LVI] Add an LVI printer pass to capture test LVI cache after transformations	Anna Thomas	2017-03-22	2	-6/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Adding a printer pass for printing the LVI cache values after transformations that use LVI. This will help us in identifying cases where LVI invariants are violated, or transforms that leave LVI in an incorrect state. Right now, I have added two test cases to show that the printer pass is working. I will be adding more test cases in a later change, once this change is checked in upstream. Reviewers: reames, dberlin, sanjoy, apilipenko Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30790 llvm-svn: 298542
*	Revert "[ScalarEvolution] Predicate implication from operations"	Max Kazantsev	2017-03-22	1	-147/+16
\| \| \| \| \| \| \| \|	This reverts commit rL298481 Fails clang-with-lto-ubuntu build. llvm-svn: 298489
*	[ValueTracking] Make sure we keep range metadata information when ↵	Craig Topper	2017-03-22	1	-2/+2
\| \| \| \| \| \|	calculating known bits for calls to bitreverse intrinsic. llvm-svn: 298488
*	[ValueTracking] use setLowBits/setHighBits/setBitsFrom to replace \|= ↵	Craig Topper	2017-03-22	1	-16/+13
\| \| \| \| \| \|	getHighBits/getLowBits. NFCI llvm-svn: 298486
*	[ScalarEvolution] Predicate implication from operations	Max Kazantsev	2017-03-22	1	-16/+147
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch allows SCEV predicate analysis to prove implication of some expression predicates from context predicates related to arguments of those expressions. It introduces three new rules: For addition: (A >X && B >= 0) \|\| (B >= 0 && A > X) ===> (A + B) > X. For division: (A > X) && (0 < B <= X + 1) ===> (A / B > 0). (A > X) && (-B <= X < 0) ===> (A / B >= 0). Using these rules, SCEV is able to prove facts like "if X > 1 then X / 2 > 0". They can also be combined with the same context, to prove more complex expressions like "if X > 1 then X/2 + 1 > 1". Diffirential Revision: https://reviews.llvm.org/D30887 Reviewed by: sanjoy llvm-svn: 298481
*	Let llvm.objectsize be conservative with null pointers	George Burgess IV	2017-03-21	2	-17/+25
\| \| \| \| \| \| \| \| \| \| \|	This adds a parameter to @llvm.objectsize that makes it return conservative values if it's given null. This fixes PR23277. Differential Revision: https://reviews.llvm.org/D28494 llvm-svn: 298430
*	Do not inline hot callsites for samplepgo in thinlto compile phase.	Dehao Chen	2017-03-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Because SamplePGO passes will be invoked twice in ThinLTO build: once at compile phase, the other at backend. We want to make sure the IR at the 2nd phase matches the hot part in profile, thus we do not want to inline hot callsites in the first phase. Reviewers: tejohnson, eraman Reviewed By: tejohnson Subscribers: mehdi_amini, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31201 llvm-svn: 298428
*	Use ProfileSummary:getProfileCount to get ScaledCount for ModuleSummary	Dehao Chen	2017-03-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: ModuleSummary should use the standard interface of ProfileSummary::getProfileCount. Reviewers: eraman, tejohnson Reviewed By: tejohnson Subscribers: tejohnson, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D31154 llvm-svn: 298404
*	Rename AttributeSet to AttributeList	Reid Kleckner	2017-03-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This class is a list of AttributeSetNodes corresponding the function prototype of a call or function declaration. This class used to be called ParamAttrListPtr, then AttrListPtr, then AttributeSet. It is typically accessed by parameter and return value index, so "AttributeList" seems like a more intuitive name. Rename AttributeSetImpl to AttributeListImpl to follow suit. It's useful to rename this class so that we can rename AttributeSetNode to AttributeSet later. AttributeSet is the set of attributes that apply to a single function, argument, or return value. Reviewers: sanjoy, javed.absar, chandlerc, pete Reviewed By: pete Subscribers: pete, jholewinski, arsenm, dschuff, mehdi_amini, jfb, nhaehnle, sbc100, void, llvm-commits Differential Revision: https://reviews.llvm.org/D31102 llvm-svn: 298393
*	[ConstantFolding] Fix to prevent constant folding having to repeatedly scan ↵	David Green	2017-03-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	operands. NFCI After the loop unroll threshold was increased in r295538, very large constant expressions can be created. This prevents them from having to be recursively scanned, leading to a compile time blow-up. Differential Revision: https://reviews.llvm.org/D30689 llvm-svn: 298356
*	[SCEV] Fix trip multiple calculation	Eli Friedman	2017-03-20	1	-10/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If loop bound containing calculations like min(a,b), the Scalar Evolution API getSmallConstantTripMultiple returns 4294967295 "-1" as the trip multiple. The problem is that, SCEV use -1 * umax to represent umin. The multiple constant -1 was returned, and the logic of guarding against huge trip counts was skipped. Because -1 has 32 active bits. The fix attempt to factor more general cases. First try to get the greatest power of two divisor of trip count expression. In case overflow happens, the trip count expression is still divisible by the greatest power of two divisor returned. Returns 1 if not divisible by 2. Patch by Huihui Zhang <huihuiz@codeaurora.org> Differential Revision: https://reviews.llvm.org/D30840 llvm-svn: 298301
*	Use MutableArrayRef for APFloat::convertToInteger	Simon Pilgrim	2017-03-20	1	-3/+3
\| \| \| \| \| \|	As discussed on D31074, use MutableArrayRef for destination integer buffers to help assert before stack overflows happen. llvm-svn: 298253
*	Extract FindAvailablePtrLoadStore out of FindAvailableLoadedValue. NFCI	Xin Tong	2017-03-19	1	-14/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extract FindAvailablePtrLoadStore out of FindAvailableLoadedValue. Prepare for upcoming change which will do phi-translation for load on phi pointer in jump threading SimplifyPartiallyRedundantLoad. This is in preparation for https://reviews.llvm.org/D30543 Reviewers: efriedma, sanjoy, davide, dberlin Reviewed By: davide Subscribers: junbuml, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D30524 llvm-svn: 298216
*	[Analysis] bitreverse(undef) returns undef	Brian Gesiak	2017-03-19	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The reverse of an artbitrary bitpattern is also an arbitrary bitpattern. Reviewers: trentxintong, arsenm, majnemer Reviewed By: majnemer Subscribers: majnemer, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D31118 llvm-svn: 298201
*	[ValueTracking] Remove deadish code from computeKnownBitsAddSub.	Craig Topper	2017-03-18	1	-24/+0
\| \| \| \| \| \|	The code assigned to KnownZero, but later code unconditionally assigned over it. I'm pretty sure the later code can handle the same cases and more equally well. llvm-svn: 298190
*	[ValueTracking] Add APInt::setSignBit and use it to replace ORing with ↵	Craig Topper	2017-03-18	1	-6/+6
\| \| \| \| \| \|	getSignBit which will malloc if the bit width is larger than 64. llvm-svn: 298180
*	[SCEV] Use const Loop L instead of Loop L. NFC	Eli Friedman	2017-03-17	1	-6/+7
\| \| \| \| \| \| \| \|	Use const pointer in the trip count and trip multiple calculations. Patch by Huihui Zhang <huihuiz@codeaurora.org> llvm-svn: 298161
*	[SCEV] Compute affine range in another way to avoid bitwidth extending.	Michael Zolotukhin	2017-03-16	1	-49/+90
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This approach has two major advantages over the existing one: 1. We don't need to extend bitwidth in our computations. Extending bitwidth is a big issue for compile time as we often end up working with APInts wider than 64bit, which is a slow case for APInt. 2. When we zero extend a wrapped range, we lose some information (we replace the range with [0, 1 << src bit width)). Thus, avoiding such extensions better preserves information. Correctness testing: I ran 'ninja check' with assertions that the new implementation of getRangeForAffineAR gives the same results as the old one (this functionality is not present in this patch). There were several failures - I inspected them manually and found out that they all are caused by the fact that we're returning more accurate results now (see bullet (2) above). Without such assertions 'ninja check' works just fine, as well as SPEC2006. Compile time testing: CTMark/Os: - mafft/pairlocalalign -16.98% - tramp3d-v4/tramp3d-v4 -12.72% - lencod/lencod -11.51% - Bullet/bullet -4.36% - ClamAV/clamscan -3.66% - 7zip/7zip-benchmark -3.19% - sqlite3/sqlite3 -2.95% - SPASS/SPASS -2.74% - Average -5.81% Performance testing: The changes are expected to be neutral for runtime performance. Reviewers: sanjoy, atrick, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30477 llvm-svn: 297992
*	[ValueTracking] Out of range shifts might be undef	Oliver Stannard	2017-03-14	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If it is possible for the RHS of a shift operation to be greater than or equal to the bit-width, then the result might be undef, and we can't report any known bits. In some cases, this was allowing a transformation in instcombine which widened an undef value from i1 to i32, increasing the range of values that a function could return. Differential revision: https://reviews.llvm.org/D30781 llvm-svn: 297724
*	[TargetTransformInfo] getIntrinsicInstrCost() scalarization estimation improved	Jonas Paulsson	2017-03-14	2	-9/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	getIntrinsicInstrCost() used to only compute scalarization cost based on types. This patch improves this so that the actual arguments are checked when they are available, in order to handle only unique non-constant operands. Tests updates: Analysis/CostModel/X86/arith-fp.ll Transforms/LoopVectorize/AArch64/interleaved_cost.ll Transforms/LoopVectorize/ARM/interleaved_cost.ll The improvement in getOperandsScalarizationOverhead() to differentiate on constants made it necessary to update the interleaved_cost.ll tests even though they do not relate to intrinsics. Review: Hal Finkel https://reviews.llvm.org/D29540 llvm-svn: 297705
*	[LVI] Add Datalayout to the class LazyValueInfo since all its Impls require ↵	Anna Thomas	2017-03-12	1	-1/+1
\| \| \| \| \| \|	it. NFC llvm-svn: 297583