bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert r347934 "[SCEV] Guard movement of insertion point for loop-invariants"	David L. Jones	2018-12-05	1	-42/+41
\| \| \| \| \| \| \| \|	This change caused SEGVs in instcombine. (The r347934 change seems to me to be a precipitating cause, not a root cause. Details are on the llvm-commits thread for r347934.) llvm-svn: 348426
*	[CmpInstAnalysis] fix function signature for ICmp code to predicate; NFC	Sanjay Patel	2018-12-04	1	-10/+10
\| \| \| \| \| \| \|	The old function underspecified the return type, took an unused parameter, and had a misleading name. llvm-svn: 348292
*	[CmpInstAnalysis] fix formatting; NFC	Sanjay Patel	2018-12-03	1	-4/+4
\| \| \| \| \| \| \| \|	There are potential improvements to the structure of this API raised by D54994, but remove some cosmetic blemishes before making any functional changes. llvm-svn: 348149
*	Fixing -print-module-scope for legacy SCC passes	Fedor Sergeev	2018-12-03	1	-4/+21
\| \| \| \| \| \| \| \| \| \|	It appears that print-module-scope was not implemented for legacy SCC passes. Fixed to print a whole module instead of just current SCC. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D54793 llvm-svn: 348144
*	[ValueTracking] Support funnel shifts in computeKnownBits()	Nikita Popov	2018-12-02	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \|	If the shift amount is known, we can determine the known bits of the output based on the known bits of two inputs. This is essentially the same functionality as implemented in D54869, but for ValueTracking rather than InstCombine SimplifyDemandedBits. Differential Revision: https://reviews.llvm.org/D55140 llvm-svn: 348091
*	[ValueTracking] add helper function for testing implied condition; NFCI	Sanjay Patel	2018-12-02	2	-38/+35
\| \| \| \| \| \| \| \|	We were duplicating code around the existing isImpliedCondition() that checks for a predecessor block/dominating condition, so make that a wrapper call. llvm-svn: 348088
*	[ThinLTO] Allow importing of functions with var args	Teresa Johnson	2018-12-01	1	-9/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Follow up to D54270, which allowed importing of var args functions unless they called va_start. As pointed out in the post-commit comments on that patch, the inliner can handle functions that call va_start in certain situations as well. Go ahead and enable importing of all var args functions. Measurements on a large binary show that this increases imports and binary size by an insignificant amount. Reviewers: davidxl Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D54607 llvm-svn: 348068
*	LegacyDivergenceAnalysis: fix uninitialized value	Nicolai Haehnle	2018-11-30	1	-1/+3
\| \| \| \| \|	Change-Id: I014502e431a68f7beddf169f6a3d19dac5dd2c26 llvm-svn: 348051
*	[DA] GPUDivergenceAnalysis for unstructured GPU kernels	Nicolai Haehnle	2018-11-30	2	-26/+108
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch #3 of the new DivergenceAnalysis <https://lists.llvm.org/pipermail/llvm-dev/2018-May/123606.html> The GPUDivergenceAnalysis is intended to eventually supersede the existing LegacyDivergenceAnalysis. The existing LegacyDivergenceAnalysis produces incorrect results on unstructured Control-Flow Graphs: <https://bugs.llvm.org/show_bug.cgi?id=37185> This patch adds the option -use-gpu-divergence-analysis to the LegacyDivergenceAnalysis to turn it into a transparent wrapper for the GPUDivergenceAnalysis. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: jholewinski, jvesely, jfb, llvm-commits, alex-t, sameerds, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D53493 llvm-svn: 348048
*	Fix parenthesis warning in IVDescriptors	Renato Golin	2018-11-30	1	-3/+3
\| \| \| \|	llvm-svn: 347990
*	Add a new reduction pattern match	Renato Golin	2018-11-30	1	-6/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adding a new reduction pattern match for vectorizing code similar to TSVC s3111: for (int i = 0; i < N; i++) if (a[i] > b) sum += a[i]; This patch adds support for fadd, fsub and fmull, as well as multiple branches and different (but compatible) instructions (ex. add+sub) in different branches. The difference from the previous patch(https://reviews.llvm.org/D49168) is as follows: - Added check of fast-math property of fp-instruction to the previous patch - Fix/add some pattern for if-reduction.ll Differential Revision: https://reviews.llvm.org/D54464 Patch by Takahiro Miyoshi <takahiro.miyoshi@linaro.org> and Masakazu Ueno <masakazu.ueno@linaro.org> llvm-svn: 347989
*	[SCEV] Guard movement of insertion point for loop-invariants	Warren Ristow	2018-11-30	1	-41/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r320789 suppressed moving the insertion point of SCEV expressions with dev/rem operations to the loop header in non-loop-invariant situations. This, and similar, hoisting is also unsafe in the loop-invariant case, since there may be a guard against a zero denominator. This is an adjustment to the fix of r320789 to suppress the movement even in the loop-invariant case. This fixes PR30806. Differential Revision: https://reviews.llvm.org/D54713 llvm-svn: 347934
*	Introduce MaxUsesToExplore argument to capture tracking	Artur Pilipenko	2018-11-29	1	-11/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently CaptureTracker gives up if it encounters a value with more than 20 uses. The motivation for this cap is to keep it relatively cheap for BasicAliasAnalysis use case, where the results can't be cached. Although, other clients of CaptureTracker might be ok with higher cost. This patch introduces an argument for PointerMayBeCaptured functions to specify the max number of uses to explore. The motivation for this change is a downstream user of CaptureTracker, but I believe upstream clients of CaptureTracker might also benefit from more fine grained cap. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D55042 llvm-svn: 347910
*	[InstSimplify] fold select with implied condition	Sanjay Patel	2018-11-29	1	-0/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an almost direct move of the functionality from InstCombine to InstSimplify. There's no reason not to do this in InstSimplify because we never create a new value with this transform. (There's a question of whether any dominance-based transform belongs in either of these passes, but that's a separate issue.) I've changed 1 of the conditions for the fold (1 of the blocks for the branch must be the block we started with) into an assert because I'm not sure how that could ever be false. We need 1 extra check to make sure that the instruction itself is in a basic block because passes other than InstCombine may be using InstSimplify as an analysis on values that are not wired up yet. The 3-way compare changes show that InstCombine has some kind of phase-ordering hole. Otherwise, we would have already gotten the intended final result that we now show here. llvm-svn: 347896
*	NFC. Use unsigned type for uses counter in CaptureTracking	Artur Pilipenko	2018-11-29	1	-2/+2
\| \| \| \|	llvm-svn: 347826
*	[ValueTracking] Determine always-overflow condition for unsigned sub	Nikita Popov	2018-11-28	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Always-overflow was already determined for unsigned addition, but not subtraction. This patch establishes parity. This allows us to perform some additional simplifications for signed saturating subtractions. This change is part of https://reviews.llvm.org/D54534. llvm-svn: 347771
*	[stack-safety] Update comment	Vitaly Buka	2018-11-27	1	-1/+1
\| \| \| \|	llvm-svn: 347626
*	[stack-safety] Fix and uncomment assert	Vitaly Buka	2018-11-27	1	-3/+2
\| \| \| \|	llvm-svn: 347625
*	[stack-safety] Fix build on gcc 5.4	Vitaly Buka	2018-11-27	1	-1/+1
\| \| \| \|	llvm-svn: 347624
*	Fix debug build break	JF Bastien	2018-11-26	1	-2/+3
\| \| \| \| \| \|	Comment out an assertion from D54543 which failed with error: no member named 'Range' in '(anonymous namespace)::PassAsArgInfo'. llvm-svn: 347616
*	[stack-safety] Inter-Procedural Analysis implementation	Vitaly Buka	2018-11-26	1	-4/+203
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: IPA is implemented as module pass which produce map from Function or Alias to StackSafetyInfo for a single function. From prototype by Evgenii Stepanov and Vlad Tsyrklevich. Reviewers: eugenis, vlad.tsyrklevich, pcc, glider Subscribers: hiraditya, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D54543 llvm-svn: 347611
*	[stack-safety] Empty local passes for Stack Safety Global Analysis	Vitaly Buka	2018-11-26	2	-0/+46
\| \| \| \| \| \| \| \| \| \|	Reviewers: eugenis, vlad.tsyrklevich Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D54541 llvm-svn: 347610
*	[stack-safety] Local analysis implementation	Vitaly Buka	2018-11-26	1	-5/+377
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Analysis produces StackSafetyInfo which contains information with how allocas and parameters were used in functions. From prototype by Evgenii Stepanov and Vlad Tsyrklevich. Reviewers: eugenis, vlad.tsyrklevich, pcc, glider Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D54504 llvm-svn: 347603
*	[stack-safety] Empty local passes for Stack Safety Local Analysis	Vitaly Buka	2018-11-26	3	-0/+62
\| \| \| \| \| \| \| \| \| \|	Reviewers: eugenis, vlad.tsyrklevich Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D54502 llvm-svn: 347602
*	[DemandedBits] Add support for funnel shifts	Nikita Popov	2018-11-26	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for funnel shifts to the DemandedBits analysis. The demanded bits of the first two operands can be determined if the shift amount is constant. The demanded bits of the third operand (shift amount) can be determined if the bitwidth is a power of two. This is basically the same functionality as implemented in D54869 and D54478, but for DemandedBits rather than InstCombine. Differential Revision: https://reviews.llvm.org/D54876 llvm-svn: 347561
*	Revert unapproved commit	Joel Jones	2018-11-24	1	-154/+4
\| \| \| \|	llvm-svn: 347511
*	[AArch64] Enable libm vectorized functions via SLEEF	Joel Jones	2018-11-24	1	-4/+154
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This changeset is modeled after Intel's submission for SVML. It enables trigonometry functions vectorization via SLEEF: http://sleef.org/. * A new vectorization library enum is added to TargetLibraryInfo.h: SLEEF. * A new option is added to TargetLibraryInfoImpl - ClVectorLibrary: SLEEF. * A comprehensive test case is included in this changeset. * In a separate changeset (for clang), a new vectorization library argument is added to -fveclib: -fveclib=SLEEF. Trigonometry functions that are vectorized by sleef: acos asin atan atanh cos cosh exp exp2 exp10 lgamma log10 log2 log sin sinh sqrt tan tanh tgamma Patch by Stefan Teleman Differential Revision: https://reviews.llvm.org/D53927 llvm-svn: 347510
*	[LVI] run transfer function for binary operator even when the RHS isn't a ↵	John Regehr	2018-11-21	1	-36/+39
\| \| \| \| \| \| \| \| \| \| \| \| \|	constant LVI was symbolically executing binary operators only when the RHS was constant, missing the case where we have a ConstantRange for the RHS, but not an actual constant. Tested using check-all and by bootstrapping. Compile time is not impacted measurably. Differential Revision: https://reviews.llvm.org/D19859 llvm-svn: 347379
*	[InstSimplify] fold funnel shifts with undef operands	Sanjay Patel	2018-11-20	1	-1/+10
\| \| \| \| \| \| \| \|	Splitting these off from the D54666. Patch by: nikic (Nikita Popov) llvm-svn: 347332
*	[InstructionSimplify] Add support for saturating add/sub	Sanjay Patel	2018-11-20	1	-0/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for saturating add/sub in InstructionSimplify. In particular, the following simplifications are supported: sat(X + 0) -> X sat(X + undef) -> -1 sat(X uadd MAX) -> MAX (and commutative variants) sat(X - 0) -> X sat(X - X) -> 0 sat(X - undef) -> 0 sat(undef - X) -> 0 sat(0 usub X) -> 0 sat(X usub MAX) -> 0 Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54532 llvm-svn: 347330
*	[ConstantFolding] Add support for saturating add/sub	Sanjay Patel	2018-11-20	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	Support saturating add/sub in constant folding, based on the APInt methods introduced in D54332. Patch by: @nikic (Nikita Popov) Differential Revision: https://reviews.llvm.org/D54531 llvm-svn: 347328
*	[IR] Add hasNPredecessors, hasNPredecessorsOrMore to BasicBlock	Vedant Kumar	2018-11-19	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add methods to BasicBlock which make it easier to efficiently check whether a block has N (or more) predecessors. This can be more efficient than using pred_size(), which is a linear time operation. We might consider adding similar methods for successors. I haven't done so in this patch because succ_size() is already O(1). With this patch applied, I measured a 0.065% compile-time reduction in user time for running `opt -O3` on the sqlite3 amalgamation (30 trials). The change in mergeStoreIntoSuccessor alone saves 45 million linked list iterations in a stage2 Release build of llc. See llvm.org/PR39702 for a harder but more general way of achieving similar results. Differential Revision: https://reviews.llvm.org/D54686 llvm-svn: 347256
*	[LV] Avoid vectorizing unsafe dependencies in uniform address	Anna Thomas	2018-11-19	1	-4/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently, when vectorizing stores to uniform addresses, the only instance we prevent vectorization is if there are multiple stores to the same uniform address causing an unsafe dependency. This patch teaches LAA to avoid vectorizing loops that have an unsafe cross-iteration dependency between a load and a store to the same uniform address. Fixes PR39653. Reviewers: Ayal, efriedma Subscribers: rkruppe, llvm-commits Differential Revision: https://reviews.llvm.org/D54538 llvm-svn: 347220
*	[LoopPass] fixing 'Modification' messages in -debug-pass=Executions for loop ↵	Fedor Sergeev	2018-11-19	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	passes Legacy loop pass manager is issuing "Made Modification" message after each Loop Pass run, however condition for issuing it is accumulated among all the runs. That leads to confusing 'modification' messages as soon as the first modification is done. Changing condition to be "current pass made modifications", similar to how it is being done in all other pass managers. llvm-svn: 347215
*	[ProfileSummary] Standardize methods and fix comment	Vedant Kumar	2018-11-19	2	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Every Analysis pass has a get method that returns a reference of the Result of the Analysis, for example, BlockFrequencyInfo &BlockFrequencyInfoWrapperPass::getBFI(). I believe that ProfileSummaryInfo::getPSI() is the only exception to that, as it was returning a pointer. Another change is renaming isHotBB and isColdBB to isHotBlock and isColdBlock, respectively. Most methods use BB as the argument of variable names while methods usually refer to Basic Blocks as Blocks, instead of BB. For example, Function::getEntryBlock, Loop:getExitBlock, etc. I also fixed one of the comments. Patch by Rodrigo Caetano Rocha! Differential Revision: https://reviews.llvm.org/D54669 llvm-svn: 347182
*	Use llvm::copy. NFC	Fangrui Song	2018-11-17	2	-2/+2
\| \| \| \|	llvm-svn: 347126
*	[ThinLTO] Internalize readonly globals	Eugene Leviant	2018-11-16	1	-17/+58
\| \| \| \| \| \| \| \|	An attempt to recommit r346584 after failure on OSX build bot. Fixed cache key computation in ThinLTOCodeGenerator and added test case llvm-svn: 347033
*	[InstSimplify] delete shift-of-zero guard ops around funnel shifts	Sanjay Patel	2018-11-15	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a problem seen in common rotate idioms as noted in: https://bugs.llvm.org/show_bug.cgi?id=34924 Note that we are not canonicalizing standard IR (shifts and logic) to the intrinsics yet. (Although I've written this before...) I think this is the last step before we enable that transform. Ie, we could regress code by doing that transform without this simplification in place. In PR34924, I questioned whether this is a valid transform for target-independent IR, but I convinced myself this is ok. If we're speculating a funnel shift by turning cmp+br into select, then SimplifyCFG has already determined that the transform is justified. It's possible that SimplifyCFG is not taking into account profile or other metadata, but if that's true, then it's a bug independent of funnel shifts. Also, we do have CGP code to restore a guard like this around an intrinsic if it can't be lowered cheaply. But that isn't necessary for funnel shift because the default expansion in SelectionDAGBuilder includes this same cmp+select. Differential Revision: https://reviews.llvm.org/D54552 llvm-svn: 346960
*	[ThinLTO] Update handling of vararg functions to match inliner	Teresa Johnson	2018-11-14	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously we marked all vararg functions as non-inlinable in the function summary, which prevented their importing. However, the corresponding inliner restriction was loosened in r321940/r342675 to only apply to functions calling va_start. Adjust the summary flag computation to match. Reviewers: davidxl Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D54270 llvm-svn: 346883
*	[TTI] getOperandInfo - a broadcast shuffle means the result is OK_UniformValue	Simon Pilgrim	2018-11-14	1	-0/+7
\| \| \| \|	llvm-svn: 346868
*	[MemorySSA] Create query after checking if instruction is a fence.	Alina Sbirlea	2018-11-13	1	-2/+3
\| \| \| \| \| \| \|	The alternative is checking if I is a fence in the Query constructor, so as to not attempt to get a non-existent MemoryLocation. llvm-svn: 346798
*	Revert "[ThinLTO] Internalize readonly globals"	Steven Wu	2018-11-13	1	-58/+17
\| \| \| \| \| \|	This reverts commit 10c84a8f35cae4a9fc421648d9608fccda3925f2. llvm-svn: 346768
*	[VectorUtils] Use namespace for InterleaveGroup template specialization.	Florian Hahn	2018-11-13	1	-4/+6
\| \| \| \|	llvm-svn: 346759
*	[VPlan] VPlan version of InterleavedAccessInfo.	Florian Hahn	2018-11-13	1	-10/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch turns InterleaveGroup into a template with the instruction type being a template parameter. It also adds a VPInterleavedAccessInfo class, which only contains a mapping from VPInstructions to their respective InterleaveGroup. As we do not have access to scalar evolution in VPlan, we can re-use convert InterleavedAccessInfo to VPInterleavedAccess info. Reviewers: Ayal, mssimpso, hfinkel, dcaballe, rengolin, mkuper, hsaito Reviewed By: rengolin Differential Revision: https://reviews.llvm.org/D49489 llvm-svn: 346758
*	[TTI] Make TargetTransformInfo::getOperandInfo static. NFCI.	Simon Pilgrim	2018-11-13	1	-2/+1
\| \| \| \| \| \|	It has no member dependencies and this makes it easier to reuse in other cost analysis code. llvm-svn: 346755
*	[VectorUtils] add funnel-shifts to the list of vectorizable intrinsics	Sanjay Patel	2018-11-12	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This just identifies the intrinsics as candidates for vectorization. It does not mean we will attempt to vectorize under normal conditions (the test file is forcing vectorization). The cost model must be fixed to show that the transform is profitable in general. Allowing vectorization with these intrinsics is required to avoid potential regressions from canonicalizing to the intrinsics from generic IR: https://bugs.llvm.org/show_bug.cgi?id=37417 llvm-svn: 346661
*	[VectorUtils] reorder list of vectorizable intrinsics; NFC	Sanjay Patel	2018-11-12	1	-10/+9
\| \| \| \| \| \| \|	We need to add funnel-shifts to this list, so clean up the random order before it gets worse. llvm-svn: 346660
*	[LICM] Hoist guards from non-header blocks	Max Kazantsev	2018-11-12	2	-0/+36
\| \| \| \| \| \| \| \| \| \| \|	This patch relaxes overconservative checks on whether or not we could write memory before we execute an instruction. This allows us to hoist guards out of loops even if they are not in the header block. Differential Revision: https://reviews.llvm.org/D50891 Reviewed By: fedor.sergeev llvm-svn: 346643
*	[ThinLTO] Internalize readonly globals	Eugene Leviant	2018-11-10	1	-17/+58
\| \| \| \| \| \| \| \| \|	This patch allows internalising globals if all accesses to them (from live functions) are from non-volatile load instructions Differential revision: https://reviews.llvm.org/D49362 llvm-svn: 346584
*	[TTI] Flip vector types in getShuffleCost SK_ExtractSubvector call	Simon Pilgrim	2018-11-09	1	-1/+1
\| \| \| \| \| \| \| \|	For SK_ExtractSubvector, the default 'Ty' type is the source operand type and 'SubTy' is the destination subvector type I got this the wrong way around when I added rL346510 llvm-svn: 346534