bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[Constants] If we already have a ConstantInt*, prefer to use ↵	Craig Topper	2017-07-06	1	-5/+5
	isZero/isOne/isMinusOne instead of isNullValue/isOneValue/isAllOnesValue inherited from Constant. NFCI Going through the Constant methods requires redetermining that the Constant is a ConstantInt and then calling isZero/isOne/isMinusOne. llvm-svn: 307292
*	[InstCombine] Don't create extra ConstantInt objects in foldSelectICmpAnd. NFCI	Craig Topper	2017-07-06	1	-19/+17
	Instead just use APInt objects and only create a ConstantInt at the end if we need it for the Offset. llvm-svn: 307270
*	Revert of r306525: "Canonicalize clamp of float types to minmax"	Nikolai Bozhenov	2017-06-30	1	-10/+3
	llvm-svn: 306815
*	[InstCombine] Canonicalize clamp of float types to minmax in fast mode.	Nikolai Bozhenov	2017-06-28	1	-3/+10
	Summary: This commit allows matchSelectPattern to recognize clamp of float arguments in the presence of FMF the same way as already done for integers. This case is a little different though. With integers, given the min/max pattern is recognized, DAGBuilder starts selecting MIN/MAX "automatically". That is not the case for float, because for them only full FMINNAN/FMINNUM/FMAXNAN/FMAXNUM ISD nodes exist and they do care about NaNs. On the other hand, some backends (e.g. X86) have only FMIN/FMAX nodes that do not care about NaNs, and the former NAN/NUM nodes are illegal, thus selection is not happening. So I decided to do this kind of transformation in IR (InstCombiner) instead of complicating the logic in the backend. Reviewers: spatel, jmolloy, majnemer, efriedma, craig.topper Reviewed By: efriedma Subscribers: hiraditya, javed.absar, n.bozhenov, llvm-commits Patch by Andrei Elovikov <andrei.elovikov@intel.com> Differential Revision: https://reviews.llvm.org/D33186 llvm-svn: 306525
*	[InstCombine] canonicalize icmp predicate feeding select	Sanjay Patel	2017-06-27	1	-0/+17
	This canonicalization was suggested in D33172 as a way to make InstCombine behavior more uniform. We have this transform for icmp+br, so unless there's some reason that icmp+select should be treated differently, we should do the same thing here. The benefit comes from increasing the chances of creating identical instructions. This is shown in the tests in logical-select.ll (PR32791). InstCombine doesn't fold those directly, but EarlyCSE can simplify the identical cmps, and then InstCombine can fold the selects together. The possible regression for the tests in select.ll raises questions about poison/undef: http://lists.llvm.org/pipermail/llvm-dev/2017-May/113261.html ...but that transform is just as likely to be triggered by this canonicalization as it is to be missed, so we're just pointing out a commutation deficiency in the pattern matching: https://reviews.llvm.org/rL228409 Differential Revision: https://reviews.llvm.org/D34242 llvm-svn: 306435
*	[InstCombine] Teach foldSelectICmpAndOr to recognize (select (icmp slt ↵	Craig Topper	2017-06-22	1	-11/+38
	(trunc (X)), 0), Y, (or Y, C2)) Summary: InstCombine likes to turn (icmp eq (and X, C1), 0) into (icmp slt (trunc (X)), 0) sometimes. This breaks foldSelectICmpAndOr's ability to recognize (select (icmp eq (and X, C1), 0), Y, (or Y, C2))->(or (shl (and X, C1), C3), Y). This patch tries to recover this. I had to flip around some of the early out checks so that I could create a new And instruction during the compare processing without it possibly never getting used. Reviewers: spatel, majnemer, davide Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34184 llvm-svn: 306029
*	[InstCombine] Don't let folding (select (icmp eq (and X, C1), 0), Y, (or Y, ↵	Craig Topper	2017-06-21	1	-4/+16
	C2)) create more instructions than it removes Summary: Previously this folding had no checks to see if it was going to result in fewer instructions. This was pointed out during the review of D34184. This patch adds code to count how many instructions it's going to create vs. how many it's going to remove so we can make a proper decision. Reviewers: spatel, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34437 llvm-svn: 305926
*	[InstCombine] Pass a proper context instruction to all of the calls into ↵	Craig Topper	2017-06-09	1	-1/+2
	InstSimplify Summary: This matches the behavior we already had for compares and makes us consistent everywhere. Reviewers: dberlin, hfinkel, spatel Reviewed By: dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33604 llvm-svn: 305049
*	[InstCombine][InstSimplify] Use APInt::isNullValue/isOneValue to reduce ↵	Craig Topper	2017-06-07	1	-2/+2
	compiled code for comparing APInts with 0 and 1. NFC These methods are specifically optimized to only count leading zeros without an additional uint64_t compare. llvm-svn: 304876
*	InstCombine: Use the new SimplifyQuery versions of Simplify*. Use ↵	Daniel Berlin	2017-04-26	1	-2/+1
	AssumptionCache, DominatorTree, TargetLibraryInfo everywhere. llvm-svn: 301464
*	[ValueTracking] Introduce a KnownBits struct to wrap the two APInts for ↵	Craig Topper	2017-04-26	1	-4/+5
	computeKnownBits This patch introduces a new KnownBits struct that wraps the two APInts used by computeKnownBits. This allows us to treat them as more of a unit. Initially I've just altered the signatures of computeKnownBits and InstCombine's simplifyDemandedBits to pass a KnownBits reference instead of two separate APInt references. I'll do similar to the SelectionDAG version of computeKnownBits/simplifyDemandedBits as a separate patch. I've added a constructor that allows initializing both APInts to the same bit width with a starting value of 0. This reduces the repeated pattern of initializing both APInts. One place default constructed the APInts so I added a default constructor for those cases. Going forward I would like to add more methods that will work on the pairs. For example trunc, zext, and sext occur on both APInts together in several places. We should probably add a clear method that can be used to clear both pieces. Maybe a method to check for conflicting information. A method to return (Zero|One) so we don't write it out everywhere. Maybe a method for (Zero|One).isAllOnesValue() to determine if all bits are known. I'm sure there are many other methods we can come up with. Differential Revision: https://reviews.llvm.org/D32376 llvm-svn: 301432
*	[APInt] Rename getSignBit to getSignMask	Craig Topper	2017-04-20	1	-1/+1
	getSignBit is a static function that creates an APInt with only the sign bit set. getSignMask seems like a better name to convey its functionality. In fact several places use it and then store in an APInt named SignMask. Differential Revision: https://reviews.llvm.org/D32108 llvm-svn: 300856
*	[InstCombine] Support folding a subtract with a constant LHS into a phi node	Craig Topper	2017-04-14	1	-2/+2
	We currently only support folding a subtract into a select but not a PHI. This fixes that. I had to fix an assumption in FoldOpIntoPhi that assumed the PHI node was always in operand 0. Now we pass it in like we do for FoldOpIntoSelect. But we still require some dancing to find the Constant when we create the BinOp or ConstantExpr. This code is similar to what we do for selects. Since I touched all call sites, this also renames FoldOpIntoPhi to foldOpIntoPhi to match coding standards. Differential Revision: https://reviews.llvm.org/D31686 llvm-svn: 300363
*	[InstCombine] fix wrong undef handling when converting select to shuffle	Sanjay Patel	2017-04-12	1	-2/+4
	As discussed in: https://bugs.llvm.org/show_bug.cgi?id=32486 ...the canonicalization of vector select to shufflevector does not hold up when undef elements are present in the condition vector. Try to make the undef handling clear in the code and the LangRef. Differential Revision: https://reviews.llvm.org/D31980 llvm-svn: 300092
*	[InstCombine] avoid breaking up bitcasted vector min/max patterns (PR32306)	Sanjay Patel	2017-03-16	1	-0/+10
	As the related tests show, we're not canonicalizing to this form for scalars or vectors yet, but this solves the immediate problem in: https://bugs.llvm.org/show_bug.cgi?id=32306 llvm-svn: 297989
*	[InstCombine] canonicalize non-obvious forms of integer min/max	Sanjay Patel	2017-02-21	1	-17/+24
	This is part of trying to clean up our handling of min/max patterns in IR. By converting these to canonical form, we're more likely to recognize them because there are various places in InstCombine that don't use matchSelectPattern or m_SMax and friends. The backend fixups referenced in the now deleted TODO comment were added with: https://reviews.llvm.org/rL291392 https://reviews.llvm.org/rL289738 If there's any codegen fallout from this change, we should be able to address it in DAGCombiner or target-specific lowering. llvm-svn: 295758
*	[InstCombine] Do not exercise nested max/min pattern on abs	Anna Thomas	2017-02-21	1	-1/+3
	Summary: This is a fix for an assertion failure in `getInverseMinMaxSelectPattern` when ABS is passed in as a select pattern. We should not be invoking the simplification rule for ABS(MIN(~x,y)) or ABS(MAX(~x,y)) combinations. Added a test case which would cause an assertion failure without the patch. Reviewers: sanjoy, majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30051 llvm-svn: 295719
*	Use InstCombine's builder in foldSelectCttzCtlz instead of creating a new one.	Amaury Sechet	2017-01-24	1	-3/+2
	Summary: As per title. This will add the instructions we are interested in to the worklist. Reviewers: mehdi_amini, majnemer, andreadb Differential Revision: https://reviews.llvm.org/D29081 llvm-svn: 292957
*	Fix formatting in foldSelectCttzCtlz. NFC	Amaury Sechet	2017-01-24	1	-1/+1
	llvm-svn: 292934
*	[InstCombine] if the condition of a select may be known via assumes, ↵	Sanjay Patel	2017-01-13	1	-0/+14
	eliminate the select This is a limited solution for PR31512: https://llvm.org/bugs/show_bug.cgi?id=31512 The motivation is that we will need to increase usage of llvm.assume and/or metadata to solve PR28430: https://llvm.org/bugs/show_bug.cgi?id=28430 ...and this kind of simplification is needed to take advantage of that extra information. The 'not' test case would be handled by: https://reviews.llvm.org/D28485 Differential Revision: https://reviews.llvm.org/D28337 llvm-svn: 291915
*	Revert @llvm.assume with operator bundles (r289755-r289757)	Daniel Jasper	2016-12-19	1	-1/+1
	This creates non-linear behavior in the inliner (see more details in r289755's commit thread). llvm-svn: 290086
*	Remove the AssumptionCache	Hal Finkel	2016-12-15	1	-1/+1
	After r289755, the AssumptionCache is no longer needed. Variables affected by assumptions are now found by using the new operand-bundle-based scheme. This new scheme is more computationally efficient, and also we need much less code... llvm-svn: 289756
*	add optional param to copy metadata when creating selects; NFC	Sanjay Patel	2016-11-26	1	-7/+3
	There are other spots where we can use this; we're currently dropping metadata in some places, and there are proposed changes where we will want to propagate metadata. IRBuilder's CreateSelect() already has a parameter like this, so this change makes the regular 'Create' API line up with that. llvm-svn: 287976
*	[InstCombine] canonicalize min/max constant to select's false value	Sanjay Patel	2016-11-21	1	-0/+42
	This is a first step towards canonicalization and improved folding/codegen for integer min/max as discussed here: http://lists.llvm.org/pipermail/llvm-dev/2016-November/106868.html Here, we're just matching the simplest min/max patterns and adjusting the icmp predicate while swapping the select operands. I've included FIXME tests in test/Transforms/InstCombine/select_meta.ll so it's easier to see how this might be extended (corresponds to the TODO comment in the code). That's also why I'm using matchSelectPattern() rather than a simpler check; once the backend is patched, we can just remove some of the restrictions to allow the obfuscated min/max patterns in the FIXME tests to be matched. Differential Revision: https://reviews.llvm.org/D26525 llvm-svn: 287585
*	[InstCombine] use dyn_cast rather than isa+cast; NFC	Sanjay Patel	2016-11-11	1	-2/+2
	Follow-up to r286664 cleanup as suggested by Eli. Thanks! llvm-svn: 286671
*	[InstCombine] clean up foldSelectOpOp(); NFC	Sanjay Patel	2016-11-11	1	-10/+4
	llvm-svn: 286664
*	[InstCombine] fix profitability equation for max-of-nots transform	Sanjay Patel	2016-11-09	1	-7/+6
	As the test change shows, we can increase the critical path by adding a 'not' instruction, so make sure that we're actually removing an instruction if we do this transform. This transform could also cause us to miss folds of min/max pairs. llvm-svn: 286315
*	[InstCombine] reduce indentation; NFC	Sanjay Patel	2016-11-08	1	-23/+20
	llvm-svn: 286314
*	[InstCombine] allow splat vector folds in adjustMinMax() (retry r285732)	Sanjay Patel	2016-11-07	1	-14/+12
	This was reverted at r285866 because there was a crash handling a scalar select of vectors. I added a check for that pattern and a test case based on the example provided in the post-commit thread for r285732. llvm-svn: 286113
*	Revert "[InstCombine] allow splat vector folds in adjustMinMax()"	Greg Bedwell	2016-11-02	1	-10/+14
	This reverts commit r285732. This change introduced a new assertion failure in the following testcase at -O2: typedef short __v8hi __attribute__((__vector_size__(16))); __v8hi foo(__v8hi &V1, __v8hi &V2, unsigned mask) { __v8hi Result = V1; if (mask & 0x80) Result[0] = V2[0]; return Result; } llvm-svn: 285866
*	[InstCombine] allow splat vector folds in adjustMinMax()	Sanjay Patel	2016-11-01	1	-14/+10
	llvm-svn: 285732
*	[InstCombine] clean up adjustMinMax(); NFCI	Sanjay Patel	2016-11-01	1	-92/+87
	1. Change param names for readability 2. Change pointer param to ref 3. Early exit to reduce indent 4. Change switch to if/else llvm-svn: 285718
*	[InstCombine] add helper function for adjustMinMax(); NFCI	Sanjay Patel	2016-11-01	1	-6/+19
	This is just a cut and paste; clean-up and enhancements to follow. llvm-svn: 285715
*	[InstCombine] re-use bitcasted compare operands in selects (PR28001)	Sanjay Patel	2016-10-29	1	-0/+50
	These mixed bitcast patterns show up with SSE/AVX intrinsics because we bitcast function parameters to <2 x i64>. The bitcasts obfuscate the expected min/max forms as shown in PR28001: https://llvm.org/bugs/show_bug.cgi?id=28001#c6 Differential Revision: https://reviews.llvm.org/D25943 llvm-svn: 285495
*	[InstCombine] fix foldSPFofSPF() to handle vector splats	Sanjay Patel	2016-10-27	1	-22/+18
	llvm-svn: 285345
*	fix formatting; NFC	Sanjay Patel	2016-10-25	1	-13/+13
	llvm-svn: 285078
*	[InstCombine] fold select X, (ext X), C	Sanjay Patel	2016-10-07	1	-1/+21
	If we're going to canonicalize IR towards select of constants, try harder to create those. Also, don't lose the metadata. This is actually 4 related transforms in one patch:
	// select X, (sext X), C --> select X, -1, C
	// select X, (zext X), C --> select X, 1, C
	// select X, C, (sext X) --> select X, C, 0
	// select X, C, (zext X) --> select X, C, 0
	Differential Revision: https://reviews.llvm.org/D25126 llvm-svn: 283575
*	[InstCombine] allow non-splat folds of select cond (ext X), C	Sanjay Patel	2016-09-30	1	-38/+33
	llvm-svn: 282906
*	[InstCombine] fix function names; NFC	Sanjay Patel	2016-09-29	1	-38/+38
	Also, make foldSelectExtConst() a member of InstCombiner, remove unnecessary parameters from its interface, and group visitSelectInst helpers together in the header file. llvm-svn: 282796
*	fix formatting; NFC	Sanjay Patel	2016-09-29	1	-11/+9
	llvm-svn: 282737
*	[InstCombine] canonicalize vector select with constant vector condition to ↵	Sanjay Patel	2016-09-16	1	-0/+39
	shuffle As discussed on llvm-dev ( http://lists.llvm.org/pipermail/llvm-dev/2016-August/104210.html ): turn a vector select with constant condition operand into a shuffle as a canonicalization step. Shuffles may be easier to reason about in conjunction with other shuffles and insert/extract. Possible known (minor?) regressions from this change are filed as: https://llvm.org/bugs/show_bug.cgi?id=28530 https://llvm.org/bugs/show_bug.cgi?id=28531 https://llvm.org/bugs/show_bug.cgi?id=30371 If something terrible happens to perf after this commit, feel free to revert until a backend fix is in place. Differential Revision: https://reviews.llvm.org/D24279 llvm-svn: 281787
*	fix formatting; NFC	Sanjay Patel	2016-09-06	1	-19/+14
	llvm-svn: 280727
*	[Profile] Propagate branch metadata properly in instcombine	Xinliang David Li	2016-08-25	1	-11/+15
	Differential Revision: http://reviews.llvm.org/D23590 llvm-svn: 279693
*	[InstCombine] try to fold (select C, (sext A), B) into logical ops	Nicolai Haehnle	2016-08-05	1	-0/+56
	Summary: Turn (select C, (sext A), B) into (sext (select C, A, B')) when A is i1 and B is a compatible constant, also for zext instead of sext. This will then be further folded into logical operations. The transformation would be valid for non-i1 types as well, but other parts of InstCombine prefer to have sext from non-i1 as an operand of select. Motivated by the shader compiler frontend in Mesa for AMDGPU, which emits i32 for boolean operations. With this change, the boolean logic is fully recovered. Reviewers: majnemer, spatel, tstellarAMD Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22747 llvm-svn: 277801
*	InstCombine: Replace some never-null pointers with references. NFC	Justin Bogner	2016-08-05	1	-1/+1
	llvm-svn: 277792
*	[InstSimplify][InstCombine] don't crash when folding vector selects of icmp	Sanjay Patel	2016-07-20	1	-1/+4
	Differential Revision: https://reviews.llvm.org/D22602 llvm-svn: 276209
*	save type in local var; NFCI	Sanjay Patel	2016-07-07	1	-10/+11
	llvm-svn: 274760
*	[InstCombine] enhance (select X, C1, C2 --> ext X) to handle vectors	Sanjay Patel	2016-07-06	1	-22/+28
	By replacing dyn_cast of ConstantInt with m_Zero/m_One/m_AllOnes, we allow these transforms for splat vectors. Differential Revision: http://reviews.llvm.org/D21899 llvm-svn: 274696
*	[InstCombine] use more specific pattern matchers; NFCI	Sanjay Patel	2016-07-06	1	-12/+10
	Follow-up from r274465: we don't need to capture the value in these cases, so just match the constant that we're looking for. m_One/m_Zero work with vector splats as well as scalars. llvm-svn: 274670
*	[InstCombine] enable vector select of bools -> logic folds	Sanjay Patel	2016-07-03	1	-5/+8
	llvm-svn: 274465