bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	revert r314698 - [InstCombine] remove one-use restriction for icmp (shr ↵	Sanjay Patel	2017-10-05	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \|	exact X, C1), C2 --> icmp X, (C2<<C1) There is a bot failure that appears to be related to this change: http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15-selfhost-neon/builds/2117 ...so reverting to confirm that and attempting to keep the bot green while investigating. llvm-svn: 314984
*	[InstCombine] Fix a vector splat handling bug in transformZExtICmp.	Craig Topper	2017-10-05	1	-3/+1
\| \| \| \| \| \| \| \| \| \|	We were using an i1 type and then zero extending to a vector. Instead just create the 0/1 directly as a ConstantInt with the correct type. No need to ask ConstantExpr to zero extend for us. This bug is a bit tricky to hit because it requires us to visit a zext of an icmp that would normally be simplified to true/false, but that icmp hasnt' been visited yet. In the test case this zext and icmp were created by visiting a udiv and due to worklist ordering we got to the zext first. Fixes PR34841. llvm-svn: 314971
*	Revert r314928 to investigate thinLTO bootstrap failure	Xinliang David Li	2017-10-05	2	-234/+0
\| \| \| \|	llvm-svn: 314961
*	[InstCombine] Improve support for ashr in foldICmpAndShift	Craig Topper	2017-10-04	1	-9/+12
\| \| \| \| \| \| \| \|	We can support ashr similar to lshr, if we know that none of the shifted in bits are used. In that case SimplifyDemandedBits would normally convert it to lshr. But that conversion doesn't happen if the shift has additional users. Differential Revision: https://reviews.llvm.org/D38521 llvm-svn: 314945
*	Recommit r314561 after fixing msan build failure	Xinliang David Li	2017-10-04	2	-0/+234
\| \| \| \| \| \| \|	(trial 2) Incoming val defined by terminator instruction which also requires bitcasts can not be handled. llvm-svn: 314928
*	[InstCombine] Use isSignBitCheck to simplify an if statement. Directly ↵	Craig Topper	2017-10-03	1	-12/+8
\| \| \| \| \| \| \| \|	create new sign bit compares instead of manipulating the constant. NFCI Since we no longer had the direct constant compares, manipulating the constant seemeded less clear. llvm-svn: 314830
*	[InstCombine] Change a bunch of methods to take APInts by reference instead ↵	Craig Topper	2017-10-03	2	-134/+134
\| \| \| \| \| \| \| \|	of pointer. This allows us to remove a bunch of dereferences and only have a few dereferences at the call sites. llvm-svn: 314762
*	[InstCombine] Replace an equality compare of two APInt pointers with a ↵	Craig Topper	2017-10-03	1	-1/+1
\| \| \| \| \| \| \| \|	compare of the APInts themselves. Apparently this works by virtue of the fact that the pointers are pointers to the APInts stored inside of the ConstantInt objects. But I really don't think we should be relying on that. llvm-svn: 314761
*	[InstCombine] remove one-use restriction for icmp (shr exact X, C1), C2 --> ↵	Sanjay Patel	2017-10-02	1	-6/+6
\| \| \| \| \| \|	icmp X, (C2<<C1) llvm-svn: 314698
*	Update getMergedLocation to check the instruction type and merge properly.	Dehao Chen	2017-10-02	3	-16/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the merged instruction is call instruction, we need to set the scope to the closes common scope between 2 locations, otherwise it will cause trouble when the call is getting inlined. Reviewers: dblaikie, aprantl Reviewed By: dblaikie, aprantl Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D37877 llvm-svn: 314694
*	[InstCombine] Use APInt for all the math in foldICmpDivConstant	Craig Topper	2017-10-01	1	-95/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This currently uses ConstantExpr to do its math, but as noted in a TODO it can all be done directly on APInt. Reviewers: spatel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38440 llvm-svn: 314640
*	Revert r314579: "Recommi r314561 after fixing over-debug assertion".	Daniel Jasper	2017-10-01	2	-225/+0
\| \| \| \| \| \| \| \| \| \| \| \|	And follow-up r314585. Leads to segfaults. I'll forward reproduction instructions to the patch author. Also, for a recommit, still add the original patch description. Otherwise, it becomes really tedious to find out what a patch actually does. The fact that it is a recommit with a fix is somewhat secondary. llvm-svn: 314622
*	Fix buildbot failure -- tighten type check for matching phi	Xinliang David Li	2017-09-30	1	-1/+1
\| \| \| \|	llvm-svn: 314585
*	Recommi r314561 after fixing over-debug assertion	Xinliang David Li	2017-09-30	2	-0/+225
\| \| \| \|	llvm-svn: 314579
*	Revert 314561 due to debug build assertion failure	Xinliang David Li	2017-09-29	2	-223/+0
\| \| \| \|	llvm-svn: 314563
*	Eliminate PHI (int typed) which has only one use by intptr	Xinliang David Li	2017-09-29	2	-0/+223
\| \| \| \| \| \| \| \| \| \|	This patch will eliminate redundant intptr/ptrtoint that pessimizes analyses such as SCEV, AA and will make optimization passes such as auto-vectorization more powerful. Differential revision: http://reviews.llvm.org/D37832 llvm-svn: 314561
*	Revert r314017 '[InstCombine] Simplify check for RHS being a splat constant ↵	Craig Topper	2017-09-27	1	-16/+22
\| \| \| \| \| \| \| \|	in foldICmpUsingKnownBits by just checking Op1Min==Op1Max rather than going through m_APInt.' This reverts r314017 and similar code added in later commits. It seems to not work for pointer compares and is causing a bot failure for the last several days. llvm-svn: 314360
*	[InstCombine] Gating select arithmetic optimization.	Chad Rosier	2017-09-27	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \|	These changes faciliate positive behavior for arithmetic based select expressions that match its translation criteria, keeping code size gated to neutral or improved scenarios. Patch by Michael Berg <michael_c_berg@apple.com>! Differential Revision: https://reviews.llvm.org/D38263 llvm-svn: 314320
*	[InstCombine] Remove one use restriction on the shift for calls to ↵	Craig Topper	2017-09-26	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \|	foldICmpAndShift. If this transformation succeeds, we're going to remove our dependency on the shift by rewriting the and. So it doesn't matter how many uses the shift has. This distributes the one use check to other transforms in foldICmpAndConstConst that do need it. Differential Revision: https://reviews.llvm.org/D38206 llvm-svn: 314233
*	[InstCombine] Move an optimization from foldICmpAndConstConst to ↵	Craig Topper	2017-09-25	1	-16/+10
\| \| \| \| \| \| \| \| \| \|	foldICmpUsingKnownBits All this optimization cares about is knowing how many low bits of LHS is known to be zero and whether that means that the result is 0 or greater than the RHS constant. It doesn't matter where the zeros in the low bits came from. So we don't need to specifically look for an AND. Instead we can use known bits. Differential Revision: https://reviews.llvm.org/D38195 llvm-svn: 314153
*	[InstCombine] remove extract-of-select vector transform (2nd try)	Sanjay Patel	2017-09-25	1	-33/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The 1st attempt at this: https://reviews.llvm.org/rL314117 was reverted at: https://reviews.llvm.org/rL314118 because of bot fails for clang tests that were checking optimized IR. That should be fixed with: https://reviews.llvm.org/rL314144 ...so try again. Original commit message: The transform to convert an extract-of-a-select-of-vectors was added at: https://reviews.llvm.org/rL194013 And a question about the validity of this transform was raised in the review: https://reviews.llvm.org/D1539: ...but not answered AFAICT> Most of the motivating cases in that patch are now handled by other combines. These are the tests that were added with the original commit, but they are not regressing even after we remove the transform in this patch. The diffs we see after removing this transform cause us to avoid increasing the instruction count, so we don't want to do those transforms as canonicalizations. The motivation for not turning a vector-select-of-vectors into a scalar operation is shown in PR33301: https://bugs.llvm.org/show_bug.cgi?id=33301 ...in those cases, we'll get vector ops with this patch rather than the vector/scalar mix that we currently see. Differential Revision: https://reviews.llvm.org/D38006 llvm-svn: 314147
*	revert r314117 because there are bogus clang tests that depend on the optimizer	Sanjay Patel	2017-09-25	1	-0/+33
\| \| \| \|	llvm-svn: 314118
*	[InstCombine] remove extract-of-select vector transform	Sanjay Patel	2017-09-25	1	-33/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The transform to convert an extract-of-a-select-of-vectors was added at: rL194013 And a question about the validity of this transform was raised in the review: https://reviews.llvm.org/D1539: ...but not answered AFAICT> Most of the motivating cases in that patch are now handled by other combines. These are the tests that were added with the original commit, but they are not regressing even after we remove the transform in this patch. The diffs we see after removing this transform cause us to avoid increasing the instruction count, so we don't want to do those transforms as canonicalizations. The motivation for not turning a vector-select-of-vectors into a scalar operation is shown in PR33301: https://bugs.llvm.org/show_bug.cgi?id=33301 ...in those cases, we'll get vector ops with this patch rather than the vector/scalar mix that we currently see. Differential Revision: https://reviews.llvm.org/D38006 llvm-svn: 314117
*	[InstCombine] Teach foldICmpUsingKnownBits to simplify SLE/SGE/ULE/UGE to ↵	Craig Topper	2017-09-22	1	-0/+8
\| \| \| \| \| \| \| \|	equality comparisons when the min/max ranges intersect in a single value. This is the inverse of what we do for SGT/SLT/UGT/ULT. llvm-svn: 314032
*	[InstCombine] Add constant splat handling to one of the ICMP_SLT/SGT cases ↵	Craig Topper	2017-09-22	1	-6/+5
\| \| \| \| \| \|	in foldICmpUsingKnownBits. llvm-svn: 314025
*	[InstCombine] Move the call to isSignBitCheck into getDemandedBitsLHSMask ↵	Craig Topper	2017-09-22	1	-15/+8
\| \| \| \| \| \| \| \|	instead of calling it outside and passing its result through a flag. NFCI The result of the isSignBitCheck isn't used anywhere else and this allows us to share the m_APInt call in the likely case that it isn't a sign bit check. llvm-svn: 314018
*	[InstCombine] Simplify check for RHS being a splat constant in ↵	Craig Topper	2017-09-22	1	-8/+6
\| \| \| \| \| \|	foldICmpUsingKnownBits by just checking Op1Min==Op1Max rather than going through m_APInt. llvm-svn: 314017
*	[InstCombine] Make cases for ICMP_UGT/ICMP_ULT use similar formatting since ↵	Craig Topper	2017-09-22	1	-6/+3
\| \| \| \| \| \|	they use similar code. NFC llvm-svn: 314016
*	Re-land r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of ↵	Reid Kleckner	2017-09-21	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	llvm.dbg.declare" The fix is to avoid invalidating our insertion point in replaceDbgDeclare: Builder.insertDeclare(NewAddress, DIVar, DIExpr, Loc, InsertBefore); + if (DII == InsertBefore) + InsertBefore = &std::next(InsertBefore->getIterator()); DII->eraseFromParent(); I had to write a unit tests for this instead of a lit test because the use list order matters in order to trigger the bug. The reduced C test case for this was: void useit(int); static inline void inlineme() { int x[2]; useit(x); } void f() { inlineme(); inlineme(); } llvm-svn: 313905
*	Revert r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of ↵	Daniel Jasper	2017-09-21	1	-7/+6
\| \| \| \| \| \| \| \| \| \| \|	llvm.dbg.declare" .. as well as the two subsequent changes r313826 and r313875. This leads to segfaults in combination with ASAN. Will forward repro instructions to the original author (rnk). llvm-svn: 313876
*	[InstCombine] Teach getDemandedBitsLHSMask to handle constant splat vectors	Craig Topper	2017-09-20	1	-11/+7
\| \| \| \| \| \| \| \|	This replaces a ConstantInt dyn_cast with m_APInt Differential Revision: https://reviews.llvm.org/D38100 llvm-svn: 313840
*	[IR] Add llvm.dbg.addr, a control-dependent version of llvm.dbg.declare	Reid Kleckner	2017-09-20	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This implements the design discussed on llvm-dev for better tracking of variables that live in memory through optimizations: http://lists.llvm.org/pipermail/llvm-dev/2017-September/117222.html This is tracked as PR34136 llvm.dbg.addr is intended to be produced and used in almost precisely the same way as llvm.dbg.declare is today, with the exception that it is control-dependent. That means that dbg.addr should always have a position in the instruction stream, and it will allow passes that optimize memory operations on local variables to insert llvm.dbg.value calls to reflect deleted stores. See SourceLevelDebugging.rst for more details. The main drawback to generating DBG_VALUE machine instrs is that they usually cause LLVM to emit a location list for DW_AT_location. The next step will be to teach DwarfDebug.cpp how to recognize more DBG_VALUE ranges as not needing a location list, and possibly start setting DW_AT_start_offset for variables whose lifetimes begin mid-scope. Reviewers: aprantl, dblaikie, probinson Subscribers: eraman, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D37768 llvm-svn: 313825
*	[InstCombine] Handle (X & C2) < C1 --> (X & C2) == 0	Craig Topper	2017-09-20	1	-6/+13
\| \| \| \| \| \| \| \|	We already did (X & C2) > C1 --> (X & C2) != 0, if any bit set in (X & C2) will produce a result greater than C1. But there is an equivalent inverse condition with <= C1 (which will be canonicalized to < C1+1) Differential Revision: https://reviews.llvm.org/D38065 llvm-svn: 313819
*	[InstCombine] Use APInt::getActiveBits() to avoid creating an APInt from a ↵	Craig Topper	2017-09-20	1	-1/+1
\| \| \| \| \| \|	trailing zero count to do a comparison. NFCI llvm-svn: 313792
*	[InstCombine] Add select simplifications	Quentin Colombet	2017-09-20	4	-52/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In these cases, two selects have constant selectable operands for both the true and false components and have the same conditional expression. We then create two arithmetic operations of the same type and feed a final select operation using the result of the true arithmetic for the true operand and the result of the false arithmetic for the false operand and reuse the original conditionl expression. The arithmetic operations are naturally folded as a consequence, leaving only the newly formed select to replace the old arithmetic operation. Patch by: Michael Berg <michael_c_berg@apple.com> Differential Revision: https://reviews.llvm.org/D37019 llvm-svn: 313774
*	[X86] Remove VPERM2F128/VPERM2I128 intrinsics and autoupgrade to native ↵	Craig Topper	2017-09-16	1	-74/+0
\| \| \| \| \| \| \| \|	shuffles. I've moved the test cases from the InstCombine optimizations to the backend to keep the coverage we had there. It covered every possible immediate so I've preserved the resulting shuffle mask for each of those immediates. llvm-svn: 313450
*	[InstCombine] Add a flag to disable LowerDbgDeclare	Reid Kleckner	2017-09-13	1	-1/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This should improve optimized debug info for address-taken variables at the cost of inaccurate debug info in some situations. We patched this into clang and deployed this change to Chromium developers, and this significantly improved debuggability of optimized code. The long-term solution to PR34136 seems more and more like it's going to take a while, so I would like to commit this change under a flag so that it can be used as a stop-gap measure. This flag should really help so for C++ aggregates like std::string and std::vector, which are typically address-taken, even after inlining, and cannot be SROA-ed. Reviewers: aprantl, dblaikie, probinson, dberlin Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D36596 llvm-svn: 313108
*	Test commit	Uriel Korach	2017-09-10	1	-1/+1
\| \| \| \|	llvm-svn: 312878
*	Merge isKnownNonNull into isKnownNonZero	Nuno Lopes	2017-09-09	1	-2/+2
\| \| \| \| \| \| \| \| \|	It now knows the tricks of both functions. Also, fix a bug that considered allocas of non-zero address space to be always non null Differential Revision: https://reviews.llvm.org/D37628 llvm-svn: 312869
*	[ValueTracking, InstCombine] canonicalize fcmp ord/uno with non-NAN ops to ↵	Sanjay Patel	2017-09-05	2	-15/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	null constants This is a preliminary step towards solving the remaining part of PR27145 - IR for isfinite(): https://bugs.llvm.org/show_bug.cgi?id=27145 In order to solve that one more generally, we need to add matching for and/or of fcmp ord/uno with a constant operand. But while looking at those patterns, I realized we were missing a canonicalization for nonzero constants. Rather than limiting to just folds for constants, we're adding a general value tracking method for this based on an existing DAG helper. By transforming everything to 0.0, we can simplify the existing code in foldLogicOfFCmps() and pick up missing vector folds. Differential Revision: https://reviews.llvm.org/D37427 llvm-svn: 312591
*	[InstCombine] Move foldSelectICmpAnd helper function earlier in the file to ↵	Craig Topper	2017-09-05	1	-105/+105
\| \| \| \| \| \|	enable reuse in a future patch. llvm-svn: 312518
*	[InstCombine] In foldSelectIntoOp, avoid creating a Constant before we know ↵	Craig Topper	2017-09-05	1	-17/+18
\| \| \| \| \| \| \| \| \| \|	for sure we're going to use it and avoid an unnecessary call to m_APInt. Instead of creating a Constant and then calling m_APInt with it (which will always return true). Just create an APInt initially, and use that for the checks in isSelect01 function. If it turns out we do need the Constant, create it from the APInt. This is a refactor for a future patch that will do some more checks of the constant values here. llvm-svn: 312517
*	[InstCombine] replace unnecessary fcmp fold with assert	Sanjay Patel	2017-09-02	1	-6/+3
\| \| \| \| \| \|	See https://reviews.llvm.org/rL312411 for related InstSimplify tests. llvm-svn: 312421
*	[InstCombine] combine foldAndOfFCmps and foldOrOfFcmps; NFCI	Sanjay Patel	2017-09-02	2	-77/+35
\| \| \| \| \| \| \| \|	In addition to removing chunks of duplicated code, we don't want these to diverge. If there's a fold for one, there should be a fold of the other via DeMorgan's Laws. llvm-svn: 312420
*	[InstCombine] fix misnamed locals and use them to reduce code; NFCI	Sanjay Patel	2017-09-02	1	-34/+34
\| \| \| \| \| \| \| \| \|	We had these locals: Value Op0RHS = LHS->getOperand(1); Value Op1LHS = RHS->getOperand(0); ...so we confusingly transposed the meaning of left/right and op0/op1. llvm-svn: 312418
*	[InstCombine] remove unnecessary code; NFC	Sanjay Patel	2017-09-02	1	-3/+0
\| \| \| \|	llvm-svn: 312416
*	[InstCombine] move related functions next to each other; NFC	Sanjay Patel	2017-09-02	1	-51/+51
\| \| \| \| \| \| \| \|	This makes it easier to see that they're almost duplicates. As with the similar icmp functions, there should be identical folds for both logic ops because those are DeMorganized variants. llvm-svn: 312415
*	[InstCombine] use local variable to reduce code duplication; NFCI	Sanjay Patel	2017-09-02	1	-11/+9
\| \| \| \|	llvm-svn: 312414
*	[InstCombine][InstSimplify] Teach decomposeBitTestICmp to look through ↵	Craig Topper	2017-09-01	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	truncate instructions This patch teaches decomposeBitTestICmp to look through truncate instructions on the input to the compare. If a truncate is found it will now return the pre-truncated Value and appropriately extend the APInt mask. This allows some code to be removed from InstSimplify that was doing this functionality. This allows InstCombine's bit test combining code to match a pre-truncate Value with the same Value appear with an 'and' on another icmp. Or it allows us to combine a truncate to i16 and a truncate to i8. This also required removing the type check from the beginning of getMaskedTypeForICmpPair, but I believe that's ok because we still have to find two values from the input to each icmp that are equal before we'll do any transformation. So the type check was really just serving as an early out. There was one user of decomposeBitTestICmp that didn't want to look through truncates, so I've added a flag to prevent that behavior when necessary. Differential Revision: https://reviews.llvm.org/D37158 llvm-svn: 312382
*	[InstCombine] Don't require the compare types to be the same in ↵	Craig Topper	2017-09-01	1	-3/+2
\| \| \| \| \| \| \| \| \| \|	getMaskedTypeForICmpPair. A future patch will make the code look through truncates feeding the compare. So the compares might be different types but the pretruncated types might be the same. This should be safe because we still require the same Value* to be used truncated or not in both compares. So that serves to ensure the types are the same. llvm-svn: 312381