bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	Add basic loop fusion pass.	Kit Barton	2019-04-17	3	-0/+1217
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a basic loop fusion pass. It will fuse loops that conform to the following 4 conditions: 1. Adjacent (no code between them) 2. Control flow equivalent (if one loop executes, the other loop executes) 3. Identical bounds (both loops iterate the same number of iterations) 4. No negative distance dependencies between the loop bodies. The pass does not make any changes to the IR to create opportunities for fusion. Instead, it checks if the necessary conditions are met and if so it fuses two loops together. The pass has not been added to the pass pipeline yet, and thus is not enabled by default. It can be run stand alone using the -loop-fusion option. Differential Revision: https://reviews.llvm.org/D55851 llvm-svn: 358607
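The four conditions above can be illustrated with a small hand-written analogy. This is only a sketch in Python, not the pass itself (which operates on LLVM IR); the function names `unfused` and `fused` are hypothetical. The two loops are adjacent, always execute together, share the same bounds, and the second loop reads only the element the first loop wrote in the same iteration, so there is no negative distance dependence.

```python
def unfused(a):
    """Two adjacent loops with identical bounds (hypothetical example)."""
    n = len(a)
    b = [0] * n
    c = [0] * n
    for i in range(n):      # first loop: writes b[i]
        b[i] = a[i] * 2
    for i in range(n):      # second loop: reads b[i] from the same iteration index
        c[i] = b[i] + 1
    return c


def fused(a):
    """The same computation after fusing the two loop bodies into one loop."""
    n = len(a)
    b = [0] * n
    c = [0] * n
    for i in range(n):
        b[i] = a[i] * 2
        c[i] = b[i] + 1     # b[i] was written earlier in this same iteration
    return c
```

If the second loop read `b[i + 1]` instead, the fused loop would read an element that is only written in the next iteration. That is a negative distance dependence, and fusing would change the result.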
*	[InstCombine] Factor out unreachable inst idiom creation [NFC]	Philip Reames	2019-04-17	3	-13/+15
\| \| \| \| \| \| \| \|	In InstCombine, we use an idiom of "store i1 true, i1 undef" to indicate we've found a path which we've proven unreachable. We can't actually insert the unreachable instruction since that would require changing the CFG. We leave that to simplifycfg later. This just factors out that idiom creation so we don't duplicate the same mostly undocumented idiom creation in multiple places. llvm-svn: 358600
*	[LoopUnroll] Allow unrolling if the unrolled size does not exceed loop size.	Florian Hahn	2019-04-17	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In the following cases, unrolling can be beneficial, even when optimizing for code size: 1) very low trip counts 2) potential to constant fold most instructions after fully unrolling. We can unroll in those cases, by setting the unrolling threshold to the loop size. This might highlight some cost modeling issues and fixing them will have a positive impact in general. Reviewers: vsk, efriedma, dmgreen, paquette Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D60265 llvm-svn: 358586
*	[CVP] processOverflowIntrinsic(): don't crash if constant-holding happened	Roman Lebedev	2019-04-17	1	-4/+7
\| \| \| \| \| \| \|	As reported by Mikael Holmén in post-commit review in https://reviews.llvm.org/D60791#1469765 llvm-svn: 358559
*	Revert "Add basic loop fusion pass." Per request.	Eric Christopher	2019-04-17	3	-1214/+0
\| \| \| \| \| \|	This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358553
*	Revert "Temporarily Revert "Add basic loop fusion pass.""	Eric Christopher	2019-04-17	3	-0/+1214
\| \| \| \| \| \| \| \|	The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552
*	Remove the run-slp-after-loop-vectorization option.	Eric Christopher	2019-04-17	1	-12/+3
\| \| \| \| \| \| \|	It's been on by default for 4 years and cleans up the pass hierarchy. llvm-svn: 358548
*	Temporarily Revert "Add basic loop fusion pass."	Eric Christopher	2019-04-17	3	-1214/+0
\| \| \| \| \| \| \| \|	As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546
*	Add basic loop fusion pass.	Kit Barton	2019-04-17	3	-0/+1214
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a basic loop fusion pass. It will fuse loops that conform to the following 4 conditions: 1. Adjacent (no code between them) 2. Control flow equivalent (if one loop executes, the other loop executes) 3. Identical bounds (both loops iterate the same number of iterations) 4. No negative distance dependencies between the loop bodies. The pass does not make any changes to the IR to create opportunities for fusion. Instead, it checks if the necessary conditions are met and if so it fuses two loops together. The pass has not been added to the pass pipeline yet, and thus is not enabled by default. It can be run stand alone using the -loop-fusion option. Phabricator: https://reviews.llvm.org/D55851 llvm-svn: 358543
*	Fix a typo in comments. [NFC]	Ali Tamur	2019-04-16	1	-1/+1
\| \| \| \|	llvm-svn: 358531
*	[EarlyCSE] detect equivalence of selects with inverse conditions and commuted operands (PR41101)	Sanjay Patel	2019-04-16	1	-2/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is 1 of the problems discussed in the post-commit thread for: rL355741 / http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190311/635516.html and filed as: https://bugs.llvm.org/show_bug.cgi?id=41101 Instcombine tries to canonicalize some of these cases (and there's room for improvement there independently of this patch), but it can't always do that because of extra uses. So we need to recognize these commuted operand patterns here in EarlyCSE. This is similar to how we detect commuted compares and commuted min/max/abs. Differential Revision: https://reviews.llvm.org/D60723 llvm-svn: 358523
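The equivalence being detected is that a select with an inverted condition and swapped operands yields the same value. A minimal sketch of that identity, assuming a hypothetical Python `select` helper standing in for the IR instruction:

```python
from itertools import product


def select(cond, true_val, false_val):
    """Model of a select: pick true_val when cond holds, else false_val."""
    return true_val if cond else false_val


def inverse_commuted_selects_agree():
    """Check select(c, a, b) == select(not c, b, a) for every small input."""
    for c, a, b in product([False, True], [0, 1], [0, 1]):
        if select(c, a, b) != select(not c, b, a):
            return False
    return True
```

Because the two forms always agree, a CSE pass may treat them as the same value even when neither has been canonicalized into the other.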
*	[CVP] Simplify umulo and smulo that cannot overflow	Nikita Popov	2019-04-16	1	-6/+1
\| \| \| \| \| \| \| \| \| \| \|	If a umul.with.overflow or smul.with.overflow operation cannot overflow, simplify it to a simple mul nuw / mul nsw. After the refactoring in D60668 this is just a matter of removing an explicit check against multiplications. Differential Revision: https://reviews.llvm.org/D60791 llvm-svn: 358521
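As a rough illustration of "cannot overflow", the sketch below models an 8-bit `umul.with.overflow` and a range-based check. The helper names and the use of simple upper bounds are assumptions made for this example; the real pass works with the value ranges it computes for the IR operands.

```python
def umul_with_overflow(a, b, bits=8):
    """Model of an unsigned multiply returning (wrapped result, overflow bit)."""
    mask = (1 << bits) - 1
    exact = a * b
    return exact & mask, exact > mask


def umul_cannot_overflow(a_max, b_max, bits=8):
    """True if no pair with 0 <= a <= a_max and 0 <= b <= b_max can overflow."""
    return a_max * b_max <= (1 << bits) - 1
```

With both operands known to be at most 15, the largest product is 225, which fits in 8 bits, so the overflow bit is always false and a plain multiply with the no-unsigned-wrap flag computes the same result.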
*	[SLP] Refactoring of the operand reordering code.	Simon Pilgrim	2019-04-16	1	-171/+463
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a refactoring patch which should have all the functionality of the current code. Its goal is twofold: i. Cleanup and simplify the reordering code, and ii. Generalize reordering so that it will work for an arbitrary number of operands, not just 2. This is the second patch in a series of patches that will enable operand reordering across chains of operations. An example of this was presented in EuroLLVM'18 https://www.youtube.com/watch?v=gIEn34LvyNo . Committed on behalf of @vporpo (Vasileios Porpodas) Differential Revision: https://reviews.llvm.org/D59973 llvm-svn: 358519
*	[InstCombine] Prune fshl/fshr with masked operands	Nikita Popov	2019-04-16	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a constant shift amount is used, then only some of the LHS/RHS operand bits are demanded and we may be able to simplify based on that. InstCombineSimplifyDemanded already had the necessary support for that, we just weren't calling it with fshl/fshr as root. In particular, this allows us to relax some masked funnel shifts into simple shifts, as shown in the tests. Patch by Shawn Landden. Differential Revision: https://reviews.llvm.org/D60660 llvm-svn: 358515
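To see why only some operand bits are demanded for a constant shift amount, here is a small 8-bit model of `fshl`. It is a sketch for illustration only; the Python function and the constants `WIDTH` and `MASK` are assumptions of this example, not LLVM code.

```python
WIDTH = 8
MASK = (1 << WIDTH) - 1


def fshl(a, b, s):
    """Funnel shift left: shift the concatenation a:b left by s, keep the high WIDTH bits."""
    a &= MASK
    b &= MASK
    s %= WIDTH
    if s == 0:
        return a
    return ((a << s) | (b >> (WIDTH - s))) & MASK


def shl(a, s):
    """Plain left shift truncated to WIDTH bits."""
    return (a << s) & MASK
```

For a shift amount of 3, only the low 5 bits of `a` and the high 3 bits of `b` reach the result, so `fshl(a & 0x1F, b & 0xE0, 3)` equals `fshl(a, b, 3)`. If the high 3 bits of `b` are already masked off, as in `fshl(a, b & 0x1F, 3)`, the result is just `shl(a, 3)`.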
*	[IR] Add WithOverflowInst class	Nikita Popov	2019-04-16	4	-161/+68
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a WithOverflowInst class with a few helper methods to get the underlying binop, signedness and nowrap type and makes use of it where sensible. There will be two more uses in D60650/D60656. The refactorings are all NFC, though I left some TODOs where things could be improved. In particular we have two places where add/sub are handled but mul isn't. Differential Revision: https://reviews.llvm.org/D60668 llvm-svn: 358512
*	Re-commit r357452: SimplifyCFG SinkCommonCodeFromPredecessors: Also sink function calls without used results (PR41259)	Hans Wennborg	2019-04-16	1	-14/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original commit caused false positives from AddressSanitizer's use-after-scope checks, which have now been fixed in r358478. > The code was previously checking that candidates for sinking had exactly > one use or were a store instruction (which can't have uses). This meant > we could sink call instructions only if they had a use. > > That limitation seemed a bit arbitrary, so this patch changes it to > "instruction has zero or one use" which seems more natural and removes > the need to special-case stores. > > Differential revision: https://reviews.llvm.org/D59936 llvm-svn: 358483
*	Asan use-after-scope: don't poison allocas if there were untraced lifetime intrinsics in the function (PR41481)	Hans Wennborg	2019-04-16	1	-1/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If there are any intrinsics that cannot be traced back to an alloca, we might have missed the start of a variable's scope, leading to false error reports if the variable is poisoned at function entry. Instead, if there are some intrinsics that can't be traced, fail safe and don't poison the variables in that function. Differential revision: https://reviews.llvm.org/D60686 llvm-svn: 358478
*	[CodeExtractor] Add a few debug lines to understand why a region is not extracted	Quentin Colombet	2019-04-16	1	-3/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The CodeExtractor is not smart enough to compute which basic block is the entry of a region. Instead it relies on the order of the list of basic blocks that is handed to it and assumes that the entry is the first block in the list. Without the additional debug information, it is hard to understand why a valid region does not get extracted, because we would miss that the order of the list just doesn't match what the CodeExtractor wants. NFC llvm-svn: 358471
*	[LSR] Rewrite misses some fixup locations if it splits critical edge	Quentin Colombet	2019-04-15	1	-1/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If LSR splits a critical edge while rewriting phi operands and the phi node has other pending fixup operands, we need to update those pending fixups. Otherwise the formulae will not be implemented completely and some instructions will not be eliminated. llvm.org/PR41445 Differential Revision: https://reviews.llvm.org/D60645 Patch by: Denis Bakhvalov <denis.bakhvalov@intel.com> llvm-svn: 358457
*	[LoopPred] Stop passing around builders [NFC]	Philip Reames	2019-04-15	1	-31/+49
\| \| \| \| \| \| \| \| \| \| \| \|	This is a preparatory patch for D60093. This patch itself is NFC, but while preparing this I noticed and committed a small hoisting change in rL358419. The basic structure of the new scheme is that we pass around the guard ("the using instruction"), and select an optimal insert point by examining operands at each construction point. This seems conceptually a bit cleaner to start with as it isolates the knowledge about insertion safety at the actual insertion point. Note that the non-hoisting path is not actually used at the moment. That's not exercised until D60093 is rebased on this one. Differential Revision: https://reviews.llvm.org/D60718 llvm-svn: 358434
*	[DEBUGINFO] Prevent Instcombine from dropping debuginfo when removing zexts	Wolfgang Pieb	2019-04-15	1	-5/+4
\| \| \| \| \| \| \| \| \| \| \|	Zexts can be treated like no-op casts when it comes to assessing whether their removal affects debug info. Reviewer: aprantl Differential Revision: https://reviews.llvm.org/D60641 llvm-svn: 358431
*	[PGO] Profile guided code size optimization.	Hiroshi Yamauchi	2019-04-15	11	-38/+169
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Enable some of the existing size optimizations for cold code under PGO. A ~5% code size saving in a big internal app under PGO. The way it gets BFI/PSI is discussed in the RFC thread http://lists.llvm.org/pipermail/llvm-dev/2019-March/130894.html Note that it doesn't currently touch loop passes. Reviewers: davidxl, eraman Reviewed By: eraman Subscribers: mgorny, javed.absar, smeenai, mehdi_amini, eraman, zzheng, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59514 llvm-svn: 358422
*	[LoopPred] Hoist and of predicated checks where legal	Philip Reames	2019-04-15	1	-2/+16
\| \| \| \| \| \|	If we have multiple range checks which can be predicated, hoist the and of the results outside the loop. This minorly cleans up the resulting IR, but the main motivation is as a building block for D60093. llvm-svn: 358419
*	[InstCombine] canonicalize fdiv after fmul if reassociation is allowed	Sanjay Patel	2019-04-15	1	-0/+8
\| \| \| \| \| \| \| \|	(X / Y) * Z --> (X * Z) / Y This can allow other optimizations/reassociations as shown in the test diffs. llvm-svn: 358404
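The rewrite is exact in real arithmetic but not necessarily bit-identical in floating point, which is why it is gated on reassociation being allowed. A minimal sketch, with hypothetical helper names, that checks the identity exactly with rationals and only approximately with floats:

```python
import math
from fractions import Fraction


def before(x, y, z):
    """Original form: (X / Y) * Z."""
    return (x / y) * z


def after(x, y, z):
    """Canonical form: (X * Z) / Y."""
    return (x * z) / y


def close_in_floats(x, y, z):
    """Floats round at each step, so only approximate equality is expected in general."""
    return math.isclose(before(x, y, z), after(x, y, z))
```

With `Fraction` inputs the two forms compare equal exactly. With `float` inputs they are only guaranteed to be close, since each intermediate operation rounds separately.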
*	[Transforms][ASan] Move findAllocaForValue() to Utils/Local.cpp. NFC	Alexander Potapenko	2019-04-15	2	-39/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Factor out findAllocaForValue() from ASan so that we can use it in MSan to handle lifetime intrinsics. Reviewers: eugenis, pcc Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60615 llvm-svn: 358380
*	[Mem2Reg] Delete unused PointerAllocaValues	Fangrui Song	2019-04-14	1	-5/+0
\| \| \| \| \| \|	It is unused after AliasSetTracker support was removed. llvm-svn: 358352
*	[Mem2Reg] Simplify and micro optimize	Fangrui Song	2019-04-14	1	-13/+9
\| \| \| \| \| \| \| \|	* Rearrange continue/break * BBNumbers.lookup(A) -> BBNumbers.find(A)->second BBNumbers has been computed, thus we can assume the value exists in the predicate. llvm-svn: 358351
*	[Mem2Reg] Don't call LBI.deleteValue on AllocInst/DbgVariableIntrinsic	Fangrui Song	2019-04-14	1	-6/+1
\| \| \| \| \| \|	Only StoreInst/LoadInst are assigned numbers. Other types of instructions are not in LBI. llvm-svn: 358350
*	[Mem2Reg] Simplify rewriteSingleStoreAlloca	Fangrui Song	2019-04-14	1	-5/+2
\| \| \| \|	llvm-svn: 358349
*	[InstCombine] Remove redundant/bogus mul_with_overflow combines	Nikita Popov	2019-04-13	1	-8/+0
\| \| \| \| \| \| \| \| \| \| \| \|	As pointed out in D60518 folding mulo(%x, undef) to {undef, undef} isn't correct. As a correct version of this already exists in InstructionSimplify (https://github.com/llvm-mirror/llvm/blob/bd8056ef326e075cc500f3f0cfcd1193bc200594/lib/Analysis/InstructionSimplify.cpp#L4750-L4757) this is just dead code though. Drop it together with the mul(%x, 0) -> {0, false} fold that is also already handled by InstSimplify. Differential Revision: https://reviews.llvm.org/D60649 llvm-svn: 358339
*	[Mem2Reg] Delete unused AllocaPointerVal	Fangrui Song	2019-04-13	1	-4/+0
\| \| \| \| \| \|	It is no longer used after the AliasSetTracker updating logic was removed. llvm-svn: 358334
*	[InstCombine] Canonicalize (-X srem Y) to -(X srem Y).	Chen Zheng	2019-04-13	1	-0/+5
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D60647 llvm-svn: 358328
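The identity behind this canonicalization can be checked with a small model of signed remainder. This is a sketch over unbounded Python integers, so it does not model the fixed-width case where negating the minimum signed value wraps around; the `srem` helper is an assumption of this example.

```python
def srem(a, b):
    """Signed remainder whose sign follows the dividend (truncating, C-style)."""
    r = abs(a) % abs(b)
    return -r if a < 0 else r


def negation_commutes_with_srem(limit=20):
    """Check srem(-x, y) == -srem(x, y) for all small x and nonzero y."""
    values = range(-limit, limit + 1)
    return all(srem(-x, y) == -srem(x, y) for x in values for y in values if y != 0)
```

Note that Python's own `%` operator takes the sign of the divisor (`-7 % 3` is `2`), which is why the helper is needed to model the truncating behavior (`srem(-7, 3)` is `-1`).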
*	[SCEV] Add option to forget everything in SCEV.	Alina Sbirlea	2019-04-12	5	-36/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Create a method to forget everything in SCEV. Add a cl::opt and PassManagerBuilder option to use this in LoopUnroll. Motivation: Certain Halide applications spend a very long time compiling in forgetLoop, and prefer to forget everything and rebuild SCEV from scratch. Sample difference in compile time reduction: 21.04 to 14.78 using current ToT release build. Testcase showcasing this cannot be opensourced and is fairly large. The option is disabled by default, but it may be desirable to enable by default. Evidence in favor (two different runs on different days/ToT state), given as File: Before (s) / After (s). Run 1: clang-9.bc: 7267.91 / 6639.14; llvm-as.bc: 194.12 / 194.12; llvm-dis.bc: 62.50 / 62.50; opt.bc: 1855.85 / 1857.53. Run 2: clang-9.bc: 8588.70 / 7812.83; llvm-as.bc: 196.20 / 194.78; llvm-dis.bc: 61.55 / 61.97; opt.bc: 1739.78 / 1886.26. Reviewers: sanjoy Subscribers: mehdi_amini, jlebar, zzheng, javed.absar, dmgreen, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60144 llvm-svn: 358304
*	[InstCombine] Fix a nasty miscompile introduced w/masked.gather demanded elts	Philip Reames	2019-04-12	1	-1/+5
\| \| \| \| \| \| \| \|	This fixes a miscompile which was introduced in r356510 (https://reviews.llvm.org/D57372). The problem is that the original patch removed pointer operands where the load results weren't demanded, but without considering the legality of the load itself. If the masked.gather had active, but undemanded, lanes, then we could end up creating a load which loaded from an undef address. The result could be a segfault, or, in theory, an arbitrary read from a random memory location into an unused register. llvm-svn: 358299
*	[CVP] Set NSW/NUW flags when simplifying with.overflow	Nikita Popov	2019-04-12	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	When CVP determines that a with.overflow intrinsic cannot overflow, it currently inserts a simple add/sub. As we already determined that there can be no overflow, we should add the appropriate NUW/NSW flag. Differential Revision: https://reviews.llvm.org/D60585 llvm-svn: 358298
*	[DebugInfo] Fix pr41175 Dead Store Elimination missing debug loc	Jeremy Morse	2019-04-12	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bug: https://bugs.llvm.org/show_bug.cgi?id=41175 In the bug test case the DSE pass is shortening the range of memory that a memset is working on. A getelementptr is generated so that the new starting address can be passed to memset. This instruction was not given a DebugLoc. To fix the bug, copy the DebugLoc from the memset instruction. Patch by Orlando Cazalet-Hyams! Differential Revision: https://reviews.llvm.org/D60556 llvm-svn: 358270
*	Use llvm::lower_bound. NFC	Fangrui Song	2019-04-12	2	-11/+8
\| \| \| \| \| \|	This reapplies rL358161. That commit inadvertently reverted an exegesis file to an old version. llvm-svn: 358246
*	[PGO] Better handling of profile hash mismatch	Rong Xu	2019-04-11	1	-6/+20
\| \| \| \| \| \| \| \| \| \| \|	We currently assume profile hash conflicts will be caught by an upfront check and we assert for the cases that escape the check. The assumption is not always true as there are chances of conflict. This patch prints a warning and skips annotating the function for the escaped cases. Differential Revision: https://reviews.llvm.org/D60154 llvm-svn: 358225
*	Revert "Use llvm::lower_bound. NFC"	Ali Tamur	2019-04-11	2	-8/+11
\| \| \| \| \| \| \| \| \|	This reverts commit rL358161. This patch has broken the test: llvm/test/tools/llvm-exegesis/X86/uops-CMOV16rm-noreg.s llvm-svn: 358199
*	Use llvm::lower_bound. NFC	Fangrui Song	2019-04-11	2	-11/+8
\| \| \| \|	llvm-svn: 358161
*	[InstCombine] Handle ssubo always overflow	Nikita Popov	2019-04-10	1	-3/+3
\| \| \| \| \| \| \| \| \|	Following D60483 and D60497, this adds support for AlwaysOverflows handling for ssubo. This is the last case we can handle right now. Differential Revision: https://reviews.llvm.org/D60518 llvm-svn: 358100
*	[InstCombine] ssubo X, C -> saddo X, -C	Nikita Popov	2019-04-10	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \|	ssubo X, C is equivalent to saddo X, -C. Make the transformation in InstCombine and allow the logic implemented for saddo to fold prior usages of add nsw or sub nsw with constants. Patch by Dan Robertson. Differential Revision: https://reviews.llvm.org/D60061 llvm-svn: 358099
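The equivalence can be checked exhaustively on a small bit width. The sketch below models 8-bit `ssubo` and `saddo` in Python; the helper names and the bit width are assumptions of this example. It also shows that the case where `C` is the minimum signed value has to be treated separately, because `-C` is not representable there.

```python
BITS = 8
INT_MIN = -(1 << (BITS - 1))
INT_MAX = (1 << (BITS - 1)) - 1


def wrap(v):
    """Wrap an integer to signed BITS-bit two's complement."""
    v &= (1 << BITS) - 1
    return v - (1 << BITS) if v > INT_MAX else v


def saddo(x, y):
    """Signed add returning (wrapped result, overflow bit)."""
    exact = x + y
    return wrap(exact), not (INT_MIN <= exact <= INT_MAX)


def ssubo(x, y):
    """Signed subtract returning (wrapped result, overflow bit)."""
    exact = x - y
    return wrap(exact), not (INT_MIN <= exact <= INT_MAX)


def equivalent_for_all_inputs():
    """ssubo(x, c) matches saddo(x, -c) whenever -c is representable."""
    xs = range(INT_MIN, INT_MAX + 1)
    cs = range(INT_MIN + 1, INT_MAX + 1)   # excludes c == INT_MIN
    return all(ssubo(x, c) == saddo(x, -c) for x in xs for c in cs)
```

For `c == INT_MIN` the negation wraps back to `INT_MIN`, and the two forms disagree: `ssubo(0, INT_MIN)` overflows while `saddo(0, wrap(-INT_MIN))` does not.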
*	[InstCombine] Handle saddo always overflow	Nikita Popov	2019-04-10	1	-3/+3
\| \| \| \| \| \| \| \| \|	Followup to D60483: Handle AlwaysOverflow conditions for saddo as well. Differential Revision: https://reviews.llvm.org/D60497 llvm-svn: 358095
*	[VPLAN] Minor improvement to testing and debug messages.	Florian Hahn	2019-04-10	1	-7/+10
\| \| \| \| \| \| \| \| \| \| \| \|	1. Use computed VF for stress testing. 2. If the computed VF does not produce vector code (VF smaller than 2), force VF to be 4. 3. Test vectorization of i64 data on AArch64 to make sure we generate VF != 4 (on X86 that was already tested on AVX). Patch by Francesco Petrogalli <francesco.petrogalli@arm.com> Differential Revision: https://reviews.llvm.org/D59952 llvm-svn: 358056
*	[InstCombine] Handle usubo always overflow	Nikita Popov	2019-04-10	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	Check AlwaysOverflow condition for usubo. The implementation is the same as the existing handling for uaddo and umulo. Handling for saddo and ssubo will follow (smulo doesn't have the necessary ValueTracking support). Differential Revision: https://reviews.llvm.org/D60483 llvm-svn: 358052
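A rough picture of an "always overflows" condition for `usubo`: if every possible value of the first operand is smaller than every possible value of the second, the unsigned subtraction wraps on every input. The sketch below models that with plain bounds; the helper names and the bound-based check are assumptions of this example rather than the actual ValueTracking logic.

```python
def usubo(a, b, bits=8):
    """Unsigned subtract returning (wrapped result, overflow bit)."""
    mask = (1 << bits) - 1
    return (a - b) & mask, a < b


def usubo_always_overflows(a_max, b_min):
    """True if a - b wraps for every a <= a_max and every b >= b_min."""
    return a_max < b_min
```

When the check holds, the overflow bit can be replaced with the constant true while the wrapped result is still computed by an ordinary subtraction.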
*	[InstCombine] Directly call computeOverflow methods in OptimizeOverflowCheck; NFC	Nikita Popov	2019-04-10	1	-6/+13
\| \| \| \| \| \| \| \| \|	Instead of using the willOverflow helpers. This makes it easier to extend handling of AlwaysOverflows. llvm-svn: 358051
*	[InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y).	Chen Zheng	2019-04-10	1	-0/+6
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D60395 llvm-svn: 358050
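For signed division that truncates toward zero, negating the dividend negates the quotient. A minimal sketch of that identity over unbounded Python integers follows; it does not model fixed-width wraparound when negating the minimum signed value, and the `sdiv` helper is an assumption of this example.

```python
def sdiv(a, b):
    """Signed division that truncates toward zero (unlike Python's floor division)."""
    q = abs(a) // abs(b)
    return -q if (a < 0) != (b < 0) else q


def negation_commutes_with_sdiv(limit=20):
    """Check sdiv(-x, y) == -sdiv(x, y) for all small x and nonzero y."""
    values = range(-limit, limit + 1)
    return all(sdiv(-x, y) == -sdiv(x, y) for x in values for y in values if y != 0)
```

The helper is needed because Python's `//` rounds toward negative infinity: `-7 // 2` is `-4`, whereas `sdiv(-7, 2)` is `-3`.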
*	[ObjC][ARC] Convert the retainRV marker that is passed as a named	Akira Hatanaka	2019-04-10	1	-9/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	metadata into a module flag in the auto-upgrader and make the ARC contract pass read the marker as a module flag. This is needed to fix a bug where ARC contract wasn't inserting the retainRV marker when LTO was enabled, which caused objects returned from a function to be auto-released. rdar://problem/49464214 Differential Revision: https://reviews.llvm.org/D60303 llvm-svn: 358047
*	Revert "[InstCombine] [InstCombine] Canonicalize (-X s/ Y) to -(X s/ Y)."	Nikita Popov	2019-04-09	1	-6/+0
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit 1383a9168948aabfd827220c9445ce0ce5765800. sdiv-canonicalize.ll fails after this revision. The fold needs to be moved outside the branch handling constant operands. However when this is done there are further test changes, so I'm reverting this in the meantime. llvm-svn: 358026
*	[InstCombine] Restructure OptimizeOverflowCheck; NFC	Nikita Popov	2019-04-09	1	-31/+28
\| \| \| \| \| \| \| \| \|	Change the code to always handle the unsigned+signed cases together with the same basic structure for add/sub/mul. The simple folds are always handled first and then the ValueTracking overflow checks are used. llvm-svn: 358025