bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	OpaquePtr: add Type parameter to Loads analysis API.	Tim Northover	2019-07-09	7	-21/+63
\| \| \| \| \| \| \| \| \| \| \| \| \|	This makes the functions in Loads.h require a type to be specified independently of the pointer Value so that when pointers have no structure other than address-space, it can still do its job. Most callers had an obvious memory operation handy to provide this type, but a SROA and ArgumentPromotion were doing more complicated analysis. They get updated to merge the properties of the various instructions they were considering. llvm-svn: 365468
*	[Loop Peeling] Add support for peeling of loops with multiple exits	Serguei Katkov	2019-07-09	2	-23/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch modifies the loop peeling transformation so that it does not expect that there is only one loop exit from latch. It modifies only transformation. Update of branch weights remains only for exit from latch. The motivation is that in follow-up patch I plan to enable loop peeling for loops with multiple exits but only if other exits then from latch one goes to block with call to deopt. For now this patch is NFC. Reviewers: reames, mkuper, iajbar, fhahn Reviewed By: reames, fhahn Subscribers: zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D63921 llvm-svn: 365441
*	[LoopInfo] Update getExitEdges to accept vector of pairs for non const ↵	Serguei Katkov	2019-07-09	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	BasicBlock D63921 requires getExitEdges fills a vector of Edge pairs where BasicBlocks are not constant. The rest Loop API mostly returns non-const BasicBlocks, so to be more consistent with other Loop API getExitEdges is modified to return non-const BasicBlocks as well. This is an alternative solution to D64060. Reviewers: reames, fhahn Reviewed By: reames, fhahn Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D64309 llvm-svn: 365437
*	[LoopPred] Stylistic improvement to recently added NE/EQ normalization [NFC]	Philip Reames	2019-07-09	1	-9/+5
\| \| \| \|	llvm-svn: 365425
*	[LoopPred] Extend LFTR normalization to the inverse EQ case	Philip Reames	2019-07-09	1	-0/+5
\| \| \| \| \| \|	A while back, I added support for NE latches formed by LFTR. I didn't think that quite through, as LFTR will also produce the inverse EQ form for some loops and I hadn't handled that. This change just adds handling for that case as well. llvm-svn: 365419
*	[Attributor] Deduce the "returned" argument attribute	Johannes Doerfert	2019-07-08	1	-0/+426
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Deduce the "returned" argument attribute by collecting all potentially returned values. Not only the unique return value, if any, can be used by subsequent attributes but also the set of all potentially returned values as well as the mapping from returned values to return instructions that they originate from (see AAReturnedValues::checkForallReturnedValues). Change in statistics (-stats) for LLVM-TS + Spec2006, totaling ~19% more "returned" arguments. ADDED: attributor NumAttributesManifested n/a -> 637 ADDED: attributor NumAttributesValidFixpoint n/a -> 25545 ADDED: attributor NumFnArgumentReturned n/a -> 637 ADDED: attributor NumFnKnownReturns n/a -> 25545 ADDED: attributor NumFnUniqueReturned n/a -> 14118 CHANGED: deadargelim NumRetValsEliminated 470 -> 449 ( -4.468%) REMOVED: functionattrs NumReturned 535 -> n/a CHANGED: indvars NumElimIdentity 138 -> 164 ( +18.841%) Reviewers: homerdin, hfinkel, fedor.sergeev, sanjoy, spatel, nlopes, nicholas, reames, efriedma, chandlerc Subscribers: hiraditya, bollu, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D59919 llvm-svn: 365407
*	[InstCombine] fold insertelement into splat of same scalar	Sanjay Patel	2019-07-08	1	-0/+37
\| \| \| \| \| \| \| \| \| \| \| \|	Forming the canonical splat shuffle improves analysis and may allow follow-on transforms (although some possibilities are missing as shown in the test diffs). The backend generically turns these patterns into build_vector, so there should be no codegen regressions. All targets are expected to be able to lower splats efficiently. llvm-svn: 365379
*	Keep the order of the basic blocks in the cloned loop as the original	Whitney Tsang	2019-07-08	1	-24/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	loop Summary: Do the cloning in two steps, first allocate all the new loops, then clone the basic blocks in the same order as the original loop. Reviewer: Meinersbur, fhahn, kbarton, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, hiraditya, llvm-commits Tag: https://reviews.llvm.org/D64224 Differential Revision: llvm-svn: 365366
*	Add, and infer, a nofree function attribute	Brian Homerding	2019-07-08	1	-5/+0
\| \| \| \| \| \| \| \| \| \|	Removing dead code leftover from refactor. Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D49165 llvm-svn: 365345
*	[InstCombine] canonicalize insert+splat to/from element 0 of vector	Sanjay Patel	2019-07-08	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \|	We recognize a splat from element 0 in (VectorUtils) llvm::getSplatValue() and also in ShuffleVectorInst::isZeroEltSplatMask(), so this converts to that form for better matching. The backend generically turns these patterns into build_vector, so there should be no codegen difference. llvm-svn: 365342
*	Add, and infer, a nofree function attribute	Brian Homerding	2019-07-08	3	-1/+67
\| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a function attribute, nofree, to indicate that a function does not, directly or indirectly, call a memory-deallocation function (e.g., free, C++'s operator delete). Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D49165 llvm-svn: 365336
*	[Float2Int] Add support for unary FNeg to Float2Int	Cameron McInally	2019-07-08	1	-0/+14
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D63941 llvm-svn: 365324
*	[IRBuilder] Introduce helpers for and/or of multiple values at once	Philip Reames	2019-07-06	3	-25/+10
\| \| \| \| \| \| \| \|	We had versions of this code scattered around, so consolidate into one location. Not strictly NFC since the order of intermediate results may change in some places, but since these operations are associatives, should not change results. llvm-svn: 365259
*	[ThinLTO] Attempt to recommit r365188 after alignment fix	Eugene Leviant	2019-07-05	2	-11/+14
\| \| \| \|	llvm-svn: 365215
*	Reverted r365188 due to alignment problems on i686-android	Eugene Leviant	2019-07-05	2	-14/+11
\| \| \| \|	llvm-svn: 365206
*	[ThinLTO] Attempt to recommit r365040 after caching fix	Eugene Leviant	2019-07-05	2	-11/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's possible that some function can load and store the same variable using the same constant expression: store %Derived* @foo, %Derived bitcast (%Base @bar to %Derived*) %42 = load %Derived, %Derived bitcast (%Base @bar to %Derived**) The bitcast expression was mistakenly cached while processing loads, and never examined later when processing store. This caused @bar to be mistakenly treated as read-only variable. See load-store-caching.ll. llvm-svn: 365188
*	[InstCombine] allow undef elements when forming splat from chain of ↵	Sanjay Patel	2019-07-04	1	-4/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	insertelements We allow forming a splat (broadcast) shuffle, but we were conservatively limiting that to cases where all elements of the vector are specified. It should be safe from a codegen perspective to allow undefined lanes of the vector because the expansion of a splat shuffle would become the chain of inserts again. Forming splat shuffles can reduce IR and help enable further IR transforms. Motivating bugs: https://bugs.llvm.org/show_bug.cgi?id=42174 https://bugs.llvm.org/show_bug.cgi?id=16739 Differential Revision: https://reviews.llvm.org/D63848 llvm-svn: 365147
*	[LoopPeel] Some small comment update. NFC.	Serguei Katkov	2019-07-04	1	-3/+3
\| \| \| \| \| \| \|	Follow-up change of comment after https://reviews.llvm.org/D63917 is landed. llvm-svn: 365107
*	[PowerPC] Hardware Loop branch instruction's condition may not be icmp.	Chen Zheng	2019-07-04	1	-1/+1
\| \| \| \| \| \| \|	This fixes pr42492. Differential Revision: https://reviews.llvm.org/D64124 llvm-svn: 365104
*	Revert [ThinLTO] Optimize writeonly globals out	Reid Kleckner	2019-07-04	2	-14/+11
\| \| \| \| \| \| \| \| \|	This reverts r365040 (git commit 5cacb914758c7f436b47c8362100f10cef14bbc4) Speculatively reverting, since this appears to have broken check-lld on Linux. Partial analysis in https://crbug.com/981168. llvm-svn: 365097
*	[JumpThreading] Fix threading with unusual PHI nodes.	Eli Friedman	2019-07-03	1	-3/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the block being cloned contains a PHI node, in general, we need to clone that PHI node, even though it's trivial. If the operand of the PHI is an instruction in the block being cloned, the correct value for the operand doesn't exist until SSAUpdater constructs it. We usually don't hit this issue because we try to avoid threading across loop headers, but it's possible to hit this in some cases involving irreducible CFGs. I added a flag to allow threading across loop headers to make the testcase easier to understand. Thanks to Brian Rzycki for reducing the testcase. Fixes https://bugs.llvm.org/show_bug.cgi?id=42085. Differential Revision: https://reviews.llvm.org/D63913 llvm-svn: 365094
*	[LFTR] Use SCEVExpander for the pointer limit case instead of manual IR gen	Philip Reames	2019-07-03	1	-10/+5
\| \| \| \| \| \|	As noted in the test change, this is not trivially NFC, but all of the changes in output are cases where the SCEVExpander form is more canonical/optimal than the hand generation. llvm-svn: 365075
*	[LFTR] Remove a stray variable shadow of the same value [NFC]	Philip Reames	2019-07-03	1	-1/+0
\| \| \| \|	llvm-svn: 365072
*	[LFTR] Style and comment changes to clarify the narrow vs wide bitwidth ↵	Philip Reames	2019-07-03	1	-17/+18
\| \| \| \| \| \|	evaluation behavior [NFC] llvm-svn: 365071
*	[LFTR] Sink the decision not use truncate scheme for constants into ↵	Philip Reames	2019-07-03	1	-46/+43
\| \| \| \| \| \| \| \|	genLoopLimit [NFC] We might as well just evaluate the constants using SCEV, and having the cases grouped makes the logic slightly easier to read anyway. llvm-svn: 365070
*	[LFTR] Remove falsely generalized (dead) code [NFC]	Philip Reames	2019-07-03	1	-5/+2
\| \| \| \|	llvm-svn: 365067
*	[LFTR] Hoist extend expressions outside of loops w/o waiting for LICM	Philip Reames	2019-07-03	1	-1/+4
\| \| \| \| \| \| \| \|	The motivation for this is two fold: 1) Make the output (and thus tests) a bit more readable to a human trying to understand the result of the transform 2) Reduce spurious diffs in a potential future change to restructure all of this logic to use SCEVExpander (which hoists by default) llvm-svn: 365066
*	[ThinLTO] Optimize writeonly globals out	Eugene Leviant	2019-07-03	2	-11/+14
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D63444 llvm-svn: 365040
*	[InstCombine] Y - ~X --> X + Y + 1 fold (PR42457)	Roman Lebedev	2019-07-03	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I think we'd want this new variant, because we obviously have better handling for `add` as compared to `sub`/`not`. https://rise4fun.com/Alive/WMn Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42457 \| PR42457 ]] Reviewers: spatel, nikic, huihuiz, efriedma Reviewed By: spatel Subscribers: RKSimon, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63992 llvm-svn: 365011
*	MSan: handle callbr instructions	Alexander Potapenko	2019-07-03	1	-21/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Handling callbr is very similar to handling an inline assembly call: MSan must checks the instruction's inputs. callbr doesn't (yet) have outputs, so there's nothing to unpoison, and conservative assembly handling doesn't apply either. Fixes PR42479. Reviewers: eugenis Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64072 llvm-svn: 365008
*	[LoopPeel] Re-factor llvm::peelLoop method. NFC.	Serguei Katkov	2019-07-03	1	-25/+49
\| \| \| \| \| \| \| \| \| \| \|	Extract code dealing with branch weights in separate functions. Reviewers: reames, mkuper, iajbar, fhahn Reviewed By: reames, fhahn Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D63917 llvm-svn: 365002
*	[PowerPC] exclude ICmpZero in LSR if icmp can be replaced in later hardware ↵	Chen Zheng	2019-07-03	1	-7/+26
\| \| \| \| \| \| \| \| \|	loop. Differential Revision: https://reviews.llvm.org/D63477 llvm-svn: 364993
*	[NFC] Strenghten isInteger condition for rL364940	David Bolvansky	2019-07-02	1	-2/+3
\| \| \| \|	llvm-svn: 364969
*	[SLP] Recommit: Look-ahead operand reordering heuristic.	Vasileios Porpodas	2019-07-02	1	-46/+248
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch introduces a new heuristic for guiding operand reordering. The new "look-ahead" heuristic can look beyond the immediate predecessors. This helps break ties when the immediate predecessors have identical opcodes (see lit test for an example). Reviewers: RKSimon, ABataev, dtemirbulatov, Ayal, hfinkel, rnk Reviewed By: RKSimon, dtemirbulatov Subscribers: hiraditya, phosek, rnk, rcorcs, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60897 llvm-svn: 364964
*	[ThinLTO] Add summary entries for index-based WPD	Teresa Johnson	2019-07-02	1	-12/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If LTOUnit splitting is disabled, the module summary analysis computes the summary information necessary to perform single implementation devirtualization during the thin link with the index and no IR. The information collected from the regular LTO IR in the current hybrid WPD algorithm is summarized, including: 1) For vtable definitions, record the function pointers and their offset within the vtable initializer (subsumes the information collected from IR by tryFindVirtualCallTargets). 2) A record for each type metadata summarizing the vtable definitions decorated with that metadata (subsumes the TypeIdentiferMap collected from IR). Also added are the necessary bitcode records, and the corresponding assembly support. The follow-on index-based WPD patch is D55153. Depends on D53890. Reviewers: pcc Subscribers: mehdi_amini, Prazek, inglorion, eraman, steven_wu, dexonsmith, arphaman, llvm-commits Differential Revision: https://reviews.llvm.org/D54815 llvm-svn: 364960
*	[SimplifyLibCalls] powf(x, sitofp(n)) -> powi(x, n)	David Bolvansky	2019-07-02	1	-12/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Partially solves https://bugs.llvm.org/show_bug.cgi?id=42190 Reviewers: spatel, nikic, efriedma Reviewed By: efriedma Subscribers: efriedma, nikic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63038 llvm-svn: 364940
*	Provide basic Full LTO extension points	Serge Guelton	2019-07-02	1	-0/+4
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D61738 llvm-svn: 364937
*	[InstCombine] Shift amount reassociation: fixup constantexpr handling (PR42484)	Roman Lebedev	2019-07-02	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I was actually wondering if there was some nicer way than m_Value()+cast, but apparently what i was really "subconsciously" thinking about was correctness issue. hasNoUnsignedWrap()/hasNoUnsignedWrap() exist for Instruction, not for BinaryOperator, so let's just use m_Instruction(), thus both avoiding a cast, and a crash. Fixes https://bugs.llvm.org/show_bug.cgi?id=42484, https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=15587 llvm-svn: 364915
*	[PGO] Update ICP pass for recent byval type changes	Reid Kleckner	2019-07-01	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	Fixes verifier errors encountered in PR42413. Reviewers: xur, t.p.northover, inglorion, gbiv, george.burgess.iv Differential Revision: https://reviews.llvm.org/D63842 llvm-svn: 364861
*	[InstCombine] reduce more checks for power-of-2-or-zero using ctpop	Sanjay Patel	2019-07-01	1	-1/+7
\| \| \| \| \| \| \| \| \|	Extends the transform from: rL364341 ...to include another (more common?) pattern that tests whether a value is a power-of-2 (including or excluding zero). llvm-svn: 364856
*	Revert [SLP] Look-ahead operand reordering heuristic.	Jordan Rupprecht	2019-07-01	1	-236/+46
\| \| \| \| \| \| \| \|	This reverts r364478 (git commit 574cb0eb3a7ac95e62d223a60bef891171dfe321) The patch is causing compilation timeouts. llvm-svn: 364846
*	[InstCombine] (Y + ~X) + 1 --> Y - X fold (PR42459)	Roman Lebedev	2019-07-01	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: To be noted, this pattern is not unhandled by instcombine per-se, it is somehow does end up being folded when one runs opt -O3, but not if it's just -instcombine. Regardless, that fold is indirect, depends on some other folds, and is thus blind when there are extra uses. This does address the regression being exposed in D63992. https://godbolt.org/z/7DGltU https://rise4fun.com/Alive/EPO0 Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42459 \| PR42459 ]] Reviewers: spatel, nikic, huihuiz Reviewed By: spatel Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63993 llvm-svn: 364792
*	[InstCombine] Shift amount reassociation in bittest (PR42399)	Roman Lebedev	2019-07-01	1	-0/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Given pattern: `icmp eq/ne (and ((x shift Q), (y oppositeshift K))), 0` we should move shifts to the same hand of 'and', i.e. rewrite as `icmp eq/ne (and (x shift (Q+K)), y), 0` iff `(Q+K) u< bitwidth(x)` It might be tempting to not restrict this to situations where we know we'd fold two shifts together, but i'm not sure what rules should there be to avoid endless combine loops. We pick the same shift that was originally used to shift the variable we picked to shift: https://rise4fun.com/Alive/6x1v Should fix [[ https://bugs.llvm.org/show_bug.cgi?id=42399 \| PR42399]]. Reviewers: spatel, nikic, RKSimon Reviewed By: spatel Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63829 llvm-svn: 364791
*	[InstCombine] Omit 'urem' where possible	Roman Lebedev	2019-07-01	1	-4/+20
\| \| \| \| \| \| \| \|	This was added in D63390 / rL364286 to backend, but it makes sense to also handle it in middle-end. https://rise4fun.com/Alive/Zsln llvm-svn: 364738
*	[SimpleLoopUnswitch] Implement handling of prof branch_weights metadata for ↵	Yevgeny Rouban	2019-07-01	1	-17/+39
\| \| \| \| \| \| \| \|	SwitchInst Differential Revision: https://reviews.llvm.org/D60606 llvm-svn: 364734
*	[InstCombine] canonicalize fcmp+select to minnum/maxnum intrinsics	Sanjay Patel	2019-06-30	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \|	This is the opposite direction of D62158 (we have to choose 1 form or the other). Now that we have FMF on the select, this becomes more palatable. And the benefits of having a single IR instruction for this operation (less chances of missing folds based on extra uses, etc) overcome my previous comments about the potential advantage of larger pattern matching/analysis. Differential Revision: https://reviews.llvm.org/D62414 llvm-svn: 364721
*	Cleanup: llvm::bsearch -> llvm::partition_point after r364719	Fangrui Song	2019-06-30	1	-2/+2
\| \| \| \|	llvm-svn: 364720
*	[LFTR] Rephrase getLoopTest into "based-on" check; NFCI	Nikita Popov	2019-06-29	1	-23/+23
\| \| \| \| \| \| \| \| \|	What we want to know here is whether we're already using this value for the loop condition, so make the query about that. We can extend this to a more general "based-on" relationship, rather than a direct icmp use later. llvm-svn: 364715
*	[InstCombine] canonicalize fmin/fmax to LLVM intrinsics minnum/maxnum	Sanjay Patel	2019-06-29	1	-24/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This transform came up in D62414, but we should deal with it first. We have LLVM intrinsics that correspond exactly to libm calls (unlike most libm calls, these libm calls never set errno). This holds without any fast-math-flags, so we should always canonicalize to those intrinsics directly for better optimization. Currently, we convert to fcmp+select only when we have FMF (nnan) because fcmp+select does not preserve the semantics of the call in the general case. Differential Revision: https://reviews.llvm.org/D63214 llvm-svn: 364714
*	[LFTR] Remove unnecessary latch check; NFCI	Nikita Popov	2019-06-29	1	-14/+9
\| \| \| \| \| \| \| \| \| \| \|	The whole indvars pass works on loops in simplified form, so there is always a unique latch. Convert the condition into an assertion in needsLFTR (though we also assert this in later LFTR functions). Additionally update the comment on getLoopTest() now that we are dealing with multiple exits. llvm-svn: 364713