bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[ValueTracking] Look through casts when determining non-nullness	Johannes Doerfert	2019-01-26	1	-7/+7
\| \| \| \| \| \| \| \| \| \|	Bitcast and certain Ptr2Int/Int2Ptr instructions will not alter the value of their operand and can therefore be looked through when we determine non-nullness. Differential Revision: https://reviews.llvm.org/D54956 llvm-svn: 352293
*	[InstCombine] Fold (min/max ~X, Y) -> ~(max/min X, ~Y) when Y is freely ↵	Craig Topper	2018-09-22	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	invertible Summary: This restores the combine that was reverted in r341883. The infinite loop from the failing test no longer occurs due to changes from r342163. Reviewers: spatel, dmgreen Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52070 llvm-svn: 342797
*	[InstCombine] Partially revert rL341674 due to PR38897.	Alina Sbirlea	2018-09-10	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Revert min/max changes in rL341674 due to high compile times causing timeouts (PR38897). Checking in to unblock failing builds. Patch available for post-commit review and re-revert once resolved. Working on a smaller reproducer for PR38897. Reviewers: craig.topper, spatel Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D51897 llvm-svn: 341883
*	[InstCombine] Fold (min/max ~X, Y) -> ~(max/min X, ~Y) when Y is freely ↵	Craig Topper	2018-09-07	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	invertible If the ~X wasn't able to simplify above the max/min, we might be able to simplify it by moving it below the max/min. I had to modify the ~(min/max ~X, Y) transform to prevent getting stuck in a loop when we saw the new ~(max/min X, ~Y) before the ~Y had been folded away to remove the new not. Differential Revision: https://reviews.llvm.org/D51398 llvm-svn: 341674
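The fold named in this commit relies on bitwise `not` reversing the ordering of integers, so a `not` can be moved across a min/max if the min/max is flipped. The following is a minimal sketch that checks the identity exhaustively for 8-bit values. It is not LLVM code: the helper names (`not_signed`, `not_unsigned`, `fold_holds`) are hypothetical, and Python integers stand in for `i8`.

```python
from itertools import product

def not_signed(x: int) -> int:
    # Two's complement bitwise not of a signed value: ~x == -x - 1.
    return -x - 1

def not_unsigned(x: int, bits: int = 8) -> int:
    # Bitwise not of an unsigned value of the given width.
    return (1 << bits) - 1 - x

def fold_holds(values, bitwise_not) -> bool:
    # Check min(~X, Y) == ~max(X, ~Y) and max(~X, Y) == ~min(X, ~Y)
    # for every pair drawn from `values`.
    for x, y in product(values, repeat=2):
        if min(bitwise_not(x), y) != bitwise_not(max(x, bitwise_not(y))):
            return False
        if max(bitwise_not(x), y) != bitwise_not(min(x, bitwise_not(y))):
            return False
    return True

SIGNED_I8 = range(-128, 128)
UNSIGNED_I8 = range(0, 256)
```

Calling `fold_holds(SIGNED_I8, not_signed)` covers the smin/smax forms and `fold_holds(UNSIGNED_I8, not_unsigned)` covers umin/umax. The sketch only shows that the rewrite preserves the value. Whether it is profitable depends on `~Y` folding away, which is the "freely invertible" condition in the subject line.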
*	llvm: Add support for "-fno-delete-null-pointer-checks"	Manoj Gupta	2018-07-09	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Support for this option is needed for building the Linux kernel. This is a very frequently requested feature by kernel developers. More details : https://lkml.org/lkml/2018/4/4/601 GCC option description for -fdelete-null-pointer-checks: Assume that programs cannot safely dereference null pointers, and that no code or data element resides at address zero. -fno-delete-null-pointer-checks is the inverse of this, implying that null pointer dereferencing is not undefined. This feature is implemented in LLVM IR in this CL as the function attribute "null-pointer-is-valid"="true" in IR (Under review at D47894). The CL updates several passes that assumed null pointer dereferencing is undefined to not optimize when the "null-pointer-is-valid"="true" attribute is present. Reviewers: t.p.northover, efriedma, jyknight, chandlerc, rnk, srhines, void, george.burgess.iv Reviewed By: efriedma, george.burgess.iv Subscribers: eraman, haicheng, george.burgess.iv, drinkcat, theraven, reames, sanjoy, xbolva00, llvm-commits Differential Revision: https://reviews.llvm.org/D47895 llvm-svn: 336613
*	[InstCombine] move tests for select with bit-test of condition; NFC	Sanjay Patel	2018-04-24	1	-215/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These are all but 1 of the select-of-constant tests that appear to be transformed within foldSelectICmpAnd() and the block above it predicated by decomposeBitTestICmp(). As discussed in D45862 (and can be seen in several tests here), we probably want to stop doing those transforms because they can increase the instruction count without benefitting other passes or codegen. The 1 test not included here is a urem test where the bit hackery allows us to remove a urem. To preserve killing that urem, we should do some stronger known-bits analysis or pattern matching of 'urem x, (select-of-pow2-constants)'. llvm-svn: 330768
*	[InstCombine] regenerate checks; NFC	Sanjay Patel	2018-04-24	1	-430/+573
\| \| \| \| \| \| \| \| \| \|	The first step in fixing problems raised in D45862 is to make the problems visible. Now we can more easily see/update cases where selects have been turned into multiple instructions with no apparent improvement in analysis or benefits for other passes (vectorization). llvm-svn: 330731
*	[PatternMatch] enhance m_One() to ignore undef elements in vectors	Sanjay Patel	2018-02-17	1	-1/+1
\| \| \| \|	llvm-svn: 325437
*	[InstSimplify, InstCombine] add tests with vector undef elts; NFC	Sanjay Patel	2018-02-17	1	-6/+15
\| \| \| \| \| \|	These would fold if the m_One pattern matcher accounted for undef elts. llvm-svn: 325436
*	[InstCombine] Simplify binops that are only used by a select and are fed by ↵	Craig Topper	2017-11-15	1	-0/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a select with the same condition. Summary: This patch optimizes a binop sandwiched between 2 selects with the same condition. Since we know it is only used by the select, we can propagate the appropriate input value from the earlier select. As I'm writing this I realize I may need to avoid doing this for division in case the select was protecting a divide by zero? Reviewers: spatel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39999 llvm-svn: 318267
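The shape of this transform, and the division concern raised in the summary, can be modelled outside LLVM. The sketch below is an illustration only: `before_fold` and `after_fold` are hypothetical names, the binop is evaluated unconditionally to mimic IR semantics, and the sketch makes no claim about how the actual patch treats division.

```python
import operator

def before_fold(cond, a, b, y, z, binop):
    # Models: select cond, (binop (select cond, a, b), y), z
    inner = a if cond else b
    result = binop(inner, y)  # the binop runs regardless of cond, as in IR
    return result if cond else z

def after_fold(cond, a, y, z, binop):
    # Models: select cond, (binop a, y), z
    # The outer select only uses the binop when cond is true, and in that
    # case the inner select would have produced `a`.
    result = binop(a, y)
    return result if cond else z

def divide_by(divisor, numerator):
    # A binop whose first operand is the divisor, to model the hazard.
    return numerator // divisor
```

For `operator.add` the two forms agree on every input. With `divide_by`, `before_fold(False, 0, 1, 10, 99, divide_by)` returns `99` because the inner select supplies the safe divisor `1`, while `after_fold(False, 0, 10, 99, divide_by)` divides by zero. That is the case the summary worries about.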
*	Fix some misc. -enable-var-scope violations	Matt Arsenault	2017-11-13	1	-1/+1
\| \| \| \|	llvm-svn: 318006
*	[InstCombine] auto-generate complete checks; NFC	Sanjay Patel	2017-10-02	1	-39/+54
\| \| \| \|	llvm-svn: 314712
*	[InstCombine] Teach select01 helper of foldSelectIntoOp to handle vector splats	Craig Topper	2017-08-28	1	-0/+47
\| \| \| \| \| \| \| \|	We were handling some vectors in foldSelectIntoOp, but not if the operand of the bin op was any kind of vector constant. This patch fixes it to treat vector splats the same as scalars. Differential Revision: https://reviews.llvm.org/D37232 llvm-svn: 311940
*	[InstCombine] Make folding (X >s -1) ? C1 : C2 --> ((X >>s 31) & (C2 - C1)) ↵	Craig Topper	2017-08-16	1	-0/+51
\| \| \| \| \| \| \| \| \| \|	+ C1 support splat vectors This also uses decomposeBitTestICmp to decode the compare. Differential Revision: https://reviews.llvm.org/D36781 llvm-svn: 311044
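The scalar form of this fold can be checked with a small model of 32-bit arithmetic. This is a sketch under stated assumptions, not the InstCombine implementation: `wrap_i32`, `select_form`, and `folded_form` are hypothetical helpers, and Python's `>>` on negative integers is used as the arithmetic shift.

```python
INT_MIN = -(2**31)
INT_MAX = 2**31 - 1

def wrap_i32(v: int) -> int:
    # Reduce an arbitrary integer to a signed 32-bit value (two's complement).
    v &= 0xFFFFFFFF
    return v - 2**32 if v >= 2**31 else v

def select_form(x: int, c1: int, c2: int) -> int:
    # (X >s -1) ? C1 : C2
    return c1 if x > -1 else c2

def folded_form(x: int, c1: int, c2: int) -> int:
    # ((X >>s 31) & (C2 - C1)) + C1
    sign_mask = x >> 31            # 0 for non-negative x, -1 (all ones) otherwise
    diff = wrap_i32(c2 - c1)       # constant difference, wrapped like i32
    return wrap_i32((sign_mask & diff) + c1)
```

When `x` is non-negative the mask is `0` and the result is `c1`. When `x` is negative the mask is all ones, so the expression yields `(c2 - c1) + c1`, which wraps back to `c2` even for extreme constants such as `c1 = INT_MAX`, `c2 = INT_MIN`.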
*	[InstCombine] Add test case for PR33721.	Craig Topper	2017-07-11	1	-0/+7
\| \| \| \|	llvm-svn: 307621
*	[InstCombine] canonicalize icmp predicate feeding select	Sanjay Patel	2017-06-27	1	-12/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This canonicalization was suggested in D33172 as a way to make InstCombine behavior more uniform. We have this transform for icmp+br, so unless there's some reason that icmp+select should be treated differently, we should do the same thing here. The benefit comes from increasing the chances of creating identical instructions. This is shown in the tests in logical-select.ll (PR32791). InstCombine doesn't fold those directly, but EarlyCSE can simplify the identical cmps, and then InstCombine can fold the selects together. The possible regression for the tests in select.ll raises questions about poison/undef: http://lists.llvm.org/pipermail/llvm-dev/2017-May/113261.html ...but that transform is just as likely to be triggered by this canonicalization as it is to be missed, so we're just pointing out a commutation deficiency in the pattern matching: https://reviews.llvm.org/rL228409 Differential Revision: https://reviews.llvm.org/D34242 llvm-svn: 306435
*	fix trivial typos in comment, NFC	Hiroshi Inoue	2017-06-24	1	-2/+2
\| \| \| \| \| \|	dereferencable -> dereferenceable llvm-svn: 306210
*	[InstCombine] fix wrong undef handling when converting select to shuffle	Sanjay Patel	2017-04-12	1	-3/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	As discussed in: https://bugs.llvm.org/show_bug.cgi?id=32486 ...the canonicalization of vector select to shufflevector does not hold up when undef elements are present in the condition vector. Try to make the undef handling clear in the code and the LangRef. Differential Revision: https://reviews.llvm.org/D31980 llvm-svn: 300092
*	[InstCombine] canonicalize non-obvious forms of integer min/max	Sanjay Patel	2017-02-21	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is part of trying to clean up our handling of min/max patterns in IR. By converting these to canonical form, we're more likely to recognize them because there are various places in InstCombine that don't use matchSelectPattern or m_SMax and friends. The backend fixups referenced in the now deleted TODO comment were added with: https://reviews.llvm.org/rL291392 https://reviews.llvm.org/rL289738 If there's any codegen fallout from this change, we should be able to address it in DAGCombiner or target-specific lowering. llvm-svn: 295758
*	[InstCombine] fix operand-complexity-based canonicalization (PR28296)	Sanjay Patel	2017-02-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code comments didn't match the code logic, and we didn't actually distinguish the fake unary (not/neg/fneg) operators from arguments. Adding another level to the weighting scheme provides more structure and can help simplify the pattern matching in InstCombine and other places. I fixed regressions that would have shown up from this change in: rL290067 rL290127 But that doesn't mean there are no pattern-matching logic holes left; some combines may just be missing regression tests. Should fix: https://llvm.org/bugs/show_bug.cgi?id=28296 Differential Revision: https://reviews.llvm.org/D27933 llvm-svn: 294049
*	[ValueTracking] recognize a 'not' of an assumed condition as false	Sanjay Patel	2017-01-17	1	-3/+2
\| \| \| \| \| \| \| \|	Also, add the corresponding match to the AssumptionCache's 'Affected Values' list. Differential Revision: https://reviews.llvm.org/D28485 llvm-svn: 292239
*	[InstCombine] if the condition of a select may be known via assumes, ↵	Sanjay Patel	2017-01-13	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	eliminate the select This is a limited solution for PR31512: https://llvm.org/bugs/show_bug.cgi?id=31512 The motivation is that we will need to increase usage of llvm.assume and/or metadata to solve PR28430: https://llvm.org/bugs/show_bug.cgi?id=28430 ...and this kind of simplification is needed to take advantage of that extra information. The 'not' test case would be handled by: https://reviews.llvm.org/D28485 Differential Revision: https://reviews.llvm.org/D28337 llvm-svn: 291915
*	[InstCombine] auto-generate checks for select+bitwise logic tests; NFC	Sanjay Patel	2016-11-30	1	-259/+0
\| \| \| \|	llvm-svn: 288254
*	[InstCombine] move min/max tests to min/max test file; NFC	Sanjay Patel	2016-11-08	1	-142/+0
\| \| \| \|	llvm-svn: 286256
*	[InstCombine] move/fix tests for adjusted min/max	Sanjay Patel	2016-11-01	1	-104/+0
\| \| \| \| \| \| \|	I think the former 'test50' had a typo making it functionally equivalent to the former 'test49'; changed the predicate to provide more coverage. llvm-svn: 285706
*	[ValueTracking] recognize more variants of smin/smax	Sanjay Patel	2016-10-29	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	Try harder to detect obfuscated min/max patterns: the initial pattern was added with D9352 / rL236202. There was a bug fix for PR27137 at rL264996, but I think we can do better by folding the corresponding smax pattern and commuted variants. The codegen tests demonstrate the effect of ValueTracking on the backend via SelectionDAGBuilder. We can't expose these differences minimally in IR because we don't have smin/smax intrinsics for IR. Differential Revision: https://reviews.llvm.org/D26091 llvm-svn: 285499
*	[InstCombine] move/add tests for smin/smax folds	Sanjay Patel	2016-10-28	1	-25/+0
\| \| \| \|	llvm-svn: 285414
*	[InstCombine] fix foldSPFofSPF() to handle vector splats	Sanjay Patel	2016-10-27	1	-8/+6
\| \| \| \|	llvm-svn: 285345
*	[InstCombine] add vector tests for foldSPFofSPF to show missing folds	Sanjay Patel	2016-10-27	1	-0/+33
\| \| \| \|	llvm-svn: 285340
*	[InstCombine] auto-generate checks for min/max tests	Sanjay Patel	2016-10-27	1	-28/+40
\| \| \| \|	llvm-svn: 285336
*	[ValueTracking] fix matchSelectPattern to allow vector splat folds of ↵	Sanjay Patel	2016-10-27	1	-8/+1
\| \| \| \| \| \|	min/max/abs/nabs llvm-svn: 285303
*	[InstCombine] add tests for missing folds of vector abs/nabs/min/max	Sanjay Patel	2016-10-27	1	-3/+20
\| \| \| \|	llvm-svn: 285299
*	[InstCombine] regenerate some checks	Sanjay Patel	2016-10-24	1	-75/+86
\| \| \| \|	llvm-svn: 285036
*	[InstCombine] canonicalize vector select with constant vector condition to ↵	Sanjay Patel	2016-09-16	1	-0/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	shuffle As discussed on llvm-dev ( http://lists.llvm.org/pipermail/llvm-dev/2016-August/104210.html ): turn a vector select with constant condition operand into a shuffle as a canonicalization step. Shuffles may be easier to reason about in conjunction with other shuffles and insert/extract. Possible known (minor?) regressions from this change are filed as: https://llvm.org/bugs/show_bug.cgi?id=28530 https://llvm.org/bugs/show_bug.cgi?id=28531 https://llvm.org/bugs/show_bug.cgi?id=30371 If something terrible happens to perf after this commit, feel free to revert until a backend fix is in place. Differential Revision: https://reviews.llvm.org/D24279 llvm-svn: 281787
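The canonicalization described here maps each lane of a constant condition vector to an index into the concatenation of the two select operands. The sketch below models that mapping on Python lists. The function names are hypothetical, the semantics follow the LangRef description of `select` and `shufflevector`, and undef condition elements (the subject of a later fix in this log) are deliberately not modelled.

```python
def select_cond_to_shuffle_mask(cond):
    # Lane i takes element i of the first operand when the condition bit is
    # true, otherwise element i of the second operand, which sits at index
    # n + i in the concatenated shuffle input.
    n = len(cond)
    return [i if bit else n + i for i, bit in enumerate(cond)]

def vector_select(cond, a, b):
    # Lane-wise select on equal-length vectors.
    return [x if bit else y for bit, x, y in zip(cond, a, b)]

def shufflevector(a, b, mask):
    # Pick elements from the concatenation of a and b by index.
    concat = list(a) + list(b)
    return [concat[m] for m in mask]
```

For example, the condition `[True, False, True, False]` becomes the mask `[0, 5, 2, 7]`, and shuffling with that mask gives the same lanes as the original select.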
*	[InstCombine] LogicOpc (zext X), C --> zext (LogicOpc X, C) (PR28476)	Sanjay Patel	2016-07-21	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The benefits of this change include: 1. Remove DeMorgan-matching code that was added specifically to work-around the missing transform in http://reviews.llvm.org/rL248634. 2. Makes the DeMorgan transform work for vectors too. 3. Fix PR28476: https://llvm.org/bugs/show_bug.cgi?id=28476 Extending this transform to other casts and other associative operators may be useful too. See https://reviews.llvm.org/D22421 for a prerequisite for doing that though. Differential Revision: https://reviews.llvm.org/D22271 llvm-svn: 276221
*	[InstSimplify][InstCombine] don't crash when folding vector selects of icmp	Sanjay Patel	2016-07-20	1	-0/+23
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D22602 llvm-svn: 276209
*	regenerate checks	Sanjay Patel	2016-07-19	1	-10/+15
\| \| \| \|	llvm-svn: 276042
*	[InstCombine] enable vector select of bools -> logic folds	Sanjay Patel	2016-07-03	1	-8/+11
\| \| \| \|	llvm-svn: 274465
*	add vector bool select tests and regenerate checks for scalar bool select tests	Sanjay Patel	2016-07-03	1	-59/+139
\| \| \| \|	llvm-svn: 274460
*	[InstCombine] allow more than one use for vector bitcast folding with selects	Sanjay Patel	2016-06-17	1	-73/+120
The motivating example for this transform is similar to D20774 where bitcasts interfere with a single cmp/select sequence, but in this case we have 2 uses of each bitcast to produce min and max ops:

    define void @minmax_bc_store(<4 x float> %a, <4 x float> %b, <4 x float>* %ptr1, <4 x float>* %ptr2) {
      %cmp = fcmp olt <4 x float> %a, %b
      %bc1 = bitcast <4 x float> %a to <4 x i32>
      %bc2 = bitcast <4 x float> %b to <4 x i32>
      %sel1 = select <4 x i1> %cmp, <4 x i32> %bc1, <4 x i32> %bc2
      %sel2 = select <4 x i1> %cmp, <4 x i32> %bc2, <4 x i32> %bc1
      %bc3 = bitcast <4 x float>* %ptr1 to <4 x i32>*
      store <4 x i32> %sel1, <4 x i32>* %bc3
      %bc4 = bitcast <4 x float>* %ptr2 to <4 x i32>*
      store <4 x i32> %sel2, <4 x i32>* %bc4
      ret void
    }

With this patch, we move the selects up to use the input args which allows getting rid of all of the bitcasts:

    define void @minmax_bc_store(<4 x float> %a, <4 x float> %b, <4 x float>* %ptr1, <4 x float>* %ptr2) {
      %cmp = fcmp olt <4 x float> %a, %b
      %sel1.v = select <4 x i1> %cmp, <4 x float> %a, <4 x float> %b
      %sel2.v = select <4 x i1> %cmp, <4 x float> %b, <4 x float> %a
      store <4 x float> %sel1.v, <4 x float>* %ptr1, align 16
      store <4 x float> %sel2.v, <4 x float>* %ptr2, align 16
      ret void
    }

The asm for x86 SSE then improves from:

    movaps %xmm0, %xmm2
    cmpltps %xmm1, %xmm2
    movaps %xmm2, %xmm3
    andnps %xmm1, %xmm3
    movaps %xmm2, %xmm4
    andnps %xmm0, %xmm4
    andps %xmm2, %xmm0
    orps %xmm3, %xmm0
    andps %xmm1, %xmm2
    orps %xmm4, %xmm2
    movaps %xmm0, (%rdi)
    movaps %xmm2, (%rsi)

To:

    movaps %xmm0, %xmm2
    minps %xmm1, %xmm2
    maxps %xmm0, %xmm1
    movaps %xmm2, (%rdi)
    movaps %xmm1, (%rsi)

The TODO comments show that we're limiting this transform only to vectors and only to bitcasts because we need to improve other transforms or risk creating worse codegen.

Differential Revision: http://reviews.llvm.org/D21190 llvm-svn: 273011
*	[InstCombine] Fix incorrect rule from rL236202	Sanjoy Das	2016-03-31	1	-0/+18
\| \| \| \| \| \| \|	The rule for SMIN introduced in rL236202 doesn't work as advertised: the check for Pred == ICmpInst::ICMP_SGT was missing. llvm-svn: 264996
*	Push isDereferenceableAndAlignedPointer down into isSafeToLoadUnconditionally	Artur Pilipenko	2016-01-17	1	-0/+27
\| \| \| \| \| \| \| \|	Reviewed By: reames Differential Revision: http://reviews.llvm.org/D16226 llvm-svn: 258010
*	Take alignment into account in isSafeToLoadUnconditionally	Artur Pilipenko	2015-06-25	1	-0/+17
\| \| \| \| \| \| \| \|	Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D10475 llvm-svn: 240636
*	Reapply 239795 - [InstCombine] Propagate non-null facts to call parameters	Philip Reames	2015-06-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The original change broke clang side tests. I will be submitting those momentarily. This change includes post commit feedback on the original change from Pete Cooper. Original Submission comments: If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239849
*	Revert 239795	Philip Reames	2015-06-16	1	-1/+1
\| \| \| \| \| \|	I forgot to update some clang test cases. I'll fix and resubmit tomorrow. llvm-svn: 239800
*	[InstCombine] Propagate non-null facts to call parameters	Philip Reames	2015-06-16	1	-1/+1
\| \| \| \| \| \| \| \|	If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239795
*	[InstCombine] Don't miscompile select to poison	David Majnemer	2015-06-06	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we have (select a, b, c), it is sometimes valid to simplify this to a single select operand. However, doing so is only valid if the computation doesn't inject poison into the computation. It might be helpful to consider the following example: (select (icmp ne %i, INT_MAX), (add nsw %i, 1), INT_MIN) The select is equivalent to (add %i, 1) but not (add nsw %i, 1). Self hosting on x86_64 revealed that this occurs very, very rarely so bailing out is hopefully pretty reasonable. llvm-svn: 239215
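The example in this message can be made concrete with a toy model of `nsw` arithmetic. In the sketch below, `POISON` is a hypothetical sentinel standing in for LLVM's poison value, and the helper names are illustrative only. It shows why the select may be replaced by a wrapping add but not by an `add nsw`.

```python
INT_MAX = 2**31 - 1
INT_MIN = -(2**31)
POISON = "poison"  # hypothetical sentinel, not an LLVM API

def add_wrapping(a: int, b: int) -> int:
    # Plain i32 add: wraps around on overflow.
    s = (a + b) & 0xFFFFFFFF
    return s - 2**32 if s >= 2**31 else s

def add_nsw(a: int, b: int):
    # i32 add with the nsw flag: signed overflow produces poison.
    s = a + b
    return s if INT_MIN <= s <= INT_MAX else POISON

def original_select(i: int):
    # (select (icmp ne %i, INT_MAX), (add nsw %i, 1), INT_MIN)
    return add_nsw(i, 1) if i != INT_MAX else INT_MIN
```

For every input, `original_select(i)` equals `add_wrapping(i, 1)`, including `i == INT_MAX`, where both give `INT_MIN`. Replacing the select with `add_nsw(i, 1)` would instead yield poison at `INT_MAX`, which is the miscompile the commit avoids.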
*	Revert "[InstCombine] Rephrase fix to SimplifyWithOpReplaced"	Renato Golin	2015-06-05	1	-10/+0
\| \| \| \| \| \| \| \| \|	This reverts commit r239141. This commit was an attempt to reintroduce a previous patch that broke many self-hosting bots with clang timeouts, but it still has slowdown issues, at least on ARM, increasing the compilation time (stage 2, clang's) by 5x. llvm-svn: 239175
*	[InstCombine] Rephrase fix to SimplifyWithOpReplaced	David Majnemer	2015-06-05	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I don't have the IR which is causing the build bot breakage but I can postulate as to why they are timing out: 1. SimplifyWithOpReplaced was stripping flags from the simplified value. 2. visitSelectInstWithICmp was overriding SimplifyWithOpReplaced because its simplification wasn't correct. 3. InstCombine would revisit the add instruction and note that it can rederive the flags. 4. By modifying the value, we chose to revisit instructions which reuse the value. One of the instructions is the original select, causing LLVM to never reach fixpoint. Instead, strip the flags only when we are sure we are going to perform the simplification. llvm-svn: 239141
*	Revert "[InstCombine] Don't miscompile safe increment idiom"	Daniel Jasper	2015-06-05	1	-10/+0
\| \| \| \| \| \| \| \| \|	This is breaking a lot of build bots and is causing very long-running compiles (infinite loops)? Likely, we shouldn't return nullptr? llvm-svn: 239139