bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Add LoadCombine pass.	Michael J. Spencer	2014-05-29	3	-0/+270
\| \| \| \| \| \| \| \|	This pass is disabled by default. Use -combine-loads to enable in -O[1-3] Differential revision: http://reviews.llvm.org/D3580 llvm-svn: 209791
*	Distribute sext/zext to the operands of and/or/xor	Jingyue Wu	2014-05-27	1	-13/+29
\| \| \| \| \| \| \| \| \| \| \| \|	This is an enhancement to SeparateConstOffsetFromGEP. With this patch, we can extract a constant offset from "s/zext and/or/xor A, B". Added a new test @ext_or to verify this enhancement. Refactoring the code, I also extracted some common logic to function Distributable. llvm-svn: 209670
*	Make the LoopRotate pass's maximum header size configurable both ↵	Owen Anderson	2014-05-26	1	-4/+14
\| \| \| \| \| \| \| \| \| \|	programmatically and via the command line, mirroring similar functionality in LoopUnroll. In situations where clients used custom unrolling thresholds, their intent could previously be foiled by LoopRotate having a hardcoded threshold. llvm-svn: 209617
*	Add the extracted constant offset using GEP	Jingyue Wu	2014-05-23	1	-26/+50
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fixed a TODO in r207783. Add the extracted constant offset using GEP instead of ugly ptrtoint+add+inttoptr. Using GEP simplifies future optimizations and makes IR easier to understand. Updated all affected tests, and added a new test in split-gep.ll to cover a corner case where emitting uglygep is necessary. llvm-svn: 209537
*	Add support for missed and analysis optimization remarks.	Diego Novillo	2014-05-22	1	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds two new diagnostics: -pass-remarks-missed and -pass-remarks-analysis. They take the same values as -pass-remarks but are intended to be triggered in different contexts. -pass-remarks-missed is used by LLVMContext::emitOptimizationRemarkMissed, which passes call when they tried to apply a transformation but couldn't. -pass-remarks-analysis is used by LLVMContext::emitOptimizationRemarkAnalysis, which passes call when they want to inform the user about analysis results. The patch also: 1- Adds support in the inliner for the two new remarks and a test case. 2- Moves emitOptimizationRemark* functions to the llvm namespace. 3- Adds an LLVMContext argument instead of making them member functions of LLVMContext. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3682 llvm-svn: 209442
*	[LSR] Canonicalize reg1 + ... + regN into reg1 + ... + 1*regN.	Quentin Colombet	2014-05-20	1	-183/+375
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit introduces a canonical representation for the formulae. Basically, as soon as a formula has more that one base register, the scaled register field is used for one of them. The register put into the scaled register is preferably a loop variant. The commit refactors how the formulae are built in order to produce such representation. This yields a more accurate, but still perfectible, cost model. <rdar://problem/16731508> llvm-svn: 209230
*	Use range for	Matt Arsenault	2014-05-19	1	-4/+1
\| \| \| \|	llvm-svn: 209147
*	Revert "Implement global merge optimization for global variables."	Rafael Espindola	2014-05-16	2	-76/+10
\| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r208934. The patch depends on aliases to GEPs with non zero offsets. That is not supported and fairly broken. The good news is that GlobalAlias is being redesigned and will have support for offsets, so this patch should be a nice match for it. llvm-svn: 208978
*	Implement global merge optimization for global variables.	Jiangning Liu	2014-05-15	2	-10/+76
\| \| \| \| \| \| \| \| \| \| \|	This commit implements two command line switches -global-merge-on-external and -global-merge-aligned, and both of them are false by default, so this optimization is disabled by default for all targets. For ARM64, some back-end behaviors need to be tuned to get this optimization further enabled. llvm-svn: 208934
*	Fix typos	Alp Toker	2014-05-15	1	-2/+2
\| \| \| \|	llvm-svn: 208839
*	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been	Jay Foad	2014-05-14	1	-1/+1
\| \| \| \| \| \|	inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811
*	GVN: Fix non-determinism in map iteration.	Benjamin Kramer	2014-05-13	1	-4/+7
\| \| \| \| \| \| \| \| \|	Iterating over a DenseMaop is non-deterministic and results to unpredictable IR output. Based on a patch by Daniel Reynaud! llvm-svn: 208728
*	GVN: rangify a couple of loops.	Benjamin Kramer	2014-05-13	1	-13/+9
\| \| \| \| \| \|	No functionality change. llvm-svn: 208727
*	Improve wording to make it sounds more like a change than an analysis.	Nick Lewycky	2014-05-08	1	-2/+3
\| \| \| \|	llvm-svn: 208370
*	Simplify and fix incorrect comment. No functionality change.	Richard Smith	2014-05-08	1	-22/+15
\| \| \| \|	llvm-svn: 208272
*	Detabify.	Nick Lewycky	2014-05-06	1	-2/+2
\| \| \| \|	llvm-svn: 208019
*	Improve 'tail' call marking in TRE. A bootstrap of clang goes from 375k ↵	Nick Lewycky	2014-05-05	1	-73/+241
\| \| \| \| \| \| \| \| \| \|	calls marked tail in the IR to 470k, however this improvement does not carry into an improvement of the call/jmp ratio on x86. The most common pattern is a tail call + br to a block with nothing but a 'ret'. The number of tail call to loop conversions remains the same (1618 by my count). The new algorithm does a local scan over the use-def chains to identify local "alloca-derived" values, as well as points where the alloca could escape. Then, a visit over the CFG marks blocks as being before or after the allocas have escaped, and annotates the calls accordingly. llvm-svn: 208017
*	LoopUnroll: If we're doing partial unrolling, use the PartialThreshold to ↵	Benjamin Kramer	2014-05-04	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \|	limit unrolling. Otherwise we use the same threshold as for complete unrolling, which is way too high. This made us unroll any loop smaller than 150 instructions by 8 times, but only if someone specified -march=core2 or better, which happens to be the default on darwin. llvm-svn: 207940
*	[GVN] Pass the phi-translated address of a load instead of the untranslated	Akira Hatanaka	2014-05-02	1	-2/+1
\| \| \| \| \| \| \| \| \|	address to AnalyzeLoadFromClobberingLoad. This fixes a bug in load-PRE where PRE is applied to a load that is not partially redundant. <rdar://problem/16638765>. llvm-svn: 207853
*	Update and sort CMakeLists.	Benjamin Kramer	2014-05-01	1	-5/+6
\| \| \| \|	llvm-svn: 207785
*	Add an optimization that does CSE in a group of similar GEPs.	Eli Bendersky	2014-05-01	2	-0/+584
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This optimization merges the common part of a group of GEPs, so we can compute each pointer address by adding a simple offset to the common part. The optimization is currently only enabled for the NVPTX backend, where it has a large payoff on some benchmarks. Review: http://reviews.llvm.org/D3462 Patch by Jingyue Wu. llvm-svn: 207783
*	ConstantHoisting.cpp: Add <tuple> for std::tie, since r207593 removed ↵	NAKAMURA Takumi	2014-04-30	1	-0/+1
\| \| \| \| \| \|	FileSystem.h, it includes <tuple>. llvm-svn: 207614
*	Tidy up.	Jim Grosbach	2014-04-29	1	-2/+2
\| \| \| \|	llvm-svn: 207585
*	Spelling.	Jim Grosbach	2014-04-29	1	-1/+1
\| \| \| \|	llvm-svn: 207584
*	Reapply r207271 without the testcase	Adam Nemet	2014-04-29	1	-9/+12
\| \| \| \| \| \|	PR19608 was filed to find a suitable testcase. llvm-svn: 207569
*	Revert r207271 for now. This commit introduced a test case that ran	Chandler Carruth	2014-04-28	1	-12/+9
\| \| \| \| \| \| \| \|	clang directly from the LLVM test suite! That doesn't work. I've followed up on the review thread to try and get a viable solution sorted out, but trying to get the tree clean here. llvm-svn: 207462
*	[C++] Use 'nullptr'.	Craig Topper	2014-04-28	3	-4/+4
\| \| \| \|	llvm-svn: 207394
*	RecursivelyDeleteTriviallyDeadInstructions() could remove	Gerolf Hoflehner	2014-04-26	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \|	more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 Repaired r207302. llvm-svn: 207309
*	Revert commit r207302 since build failures	Gerolf Hoflehner	2014-04-26	1	-9/+1
\| \| \| \| \| \|	have been reported. llvm-svn: 207303
*	RecursivelyDeleteTriviallyDeadInstructions() could remove	Gerolf Hoflehner	2014-04-26	1	-1/+9
\| \| \| \| \| \| \| \| \|	more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 llvm-svn: 207302
*	[LoopStrengthReduce] Don't trim formula that uses a subset of required registers	Adam Nemet	2014-04-25	1	-9/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Consider this use from the new testcase: LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i32 reg({1000,+,-1}<nw><%for.body>) -3003 + reg({3,+,3}<nw><%for.body>) -1001 + reg({1,+,1}<nuw><nsw><%for.body>) -1000 + reg({0,+,1}<nw><%for.body>) -3000 + reg({0,+,3}<nuw><%for.body>) reg({-1000,+,1}<nw><%for.body>) reg({-3000,+,3}<nsw><%for.body>) This is the last use we consider for a solution in SolveRecurse, so CurRegs is a large set. (CurRegs is the set of registers that are needed by the previously visited uses in the in-progress solution.) ReqRegs is { {3,+,3}<nw><%for.body>, {1,+,1}<nuw><nsw><%for.body> } This is the intersection of the regs used by any of the formulas for the current use and CurRegs. Now, the code requires a formula to contain all these regs (the comment is simply wrong), otherwise the formula is immediately disqualified. Obviously, no formula for this use contains two regs so they will all get disqualified. The fix modifies the check to allow the formula in this case. The idea is that neither of these formulae is introducing any new registers which is the point of this early pruning as far as I understand. In terms of set arithmetic, we now allow formulas whose used regs are a subset of the required regs not just the other way around. There are few more loops in the test-suite that are now successfully LSRed. I have benchmarked those and found very minimal change. Fixes <rdar://problem/13965777> llvm-svn: 207271
*	SCC: Change clients to use const, NFC	Duncan P. N. Exon Smith	2014-04-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	It's fishy to be changing the `std::vector<>` owned by the iterator, and no one actual does it, so I'm going to remove the ability in a subsequent commit. First, update the users. <rdar://problem/14292693> llvm-svn: 207252
*	[C++] Use 'nullptr'. Transforms edition.	Craig Topper	2014-04-25	28	-413/+422
\| \| \| \|	llvm-svn: 207196
*	Remove more default address space argument usage.	Matt Arsenault	2014-04-23	1	-1/+2
\| \| \| \| \| \|	These places are inconsequential in practice. llvm-svn: 207021
*	[Constant Hoisting] Materialize the constant before the cloned cast instruction.	Juergen Ributzka	2014-04-22	1	-2/+11
\| \| \| \| \| \| \| \| \| \| \| \|	In the case where the constant comes from a cloned cast instruction, the materialization code has to go before the cloned cast instruction. This commit fixes the method that finds the materialization insertion point by making it aware of this case. This fixes <rdar://problem/15532441> llvm-svn: 206913
*	[Constant Hoisting] Print the instructions in the correct order for ↵	Juergen Ributzka	2014-04-22	1	-2/+2
\| \| \| \| \| \|	debugging. No functional change. llvm-svn: 206912
*	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE	Chandler Carruth	2014-04-22	35	-36/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	definition below all of the header #include lines, lib/Transforms/... edition. This one is tricky for two reasons. We again have a couple of passes that define something else before the includes as well. I've sunk their name macros with the DEBUG_TYPE. Also, InstCombine contains headers that need DEBUG_TYPE, so now those headers #define and #undef DEBUG_TYPE around their code, leaving them well formed modular headers. Fixing these headers was a large motivation for all of these changes, as "leaky" macros of this form are hard on the modules implementation. llvm-svn: 206844
*	Fix PR7272 in -tailcallelim instead of the inliner	Reid Kleckner	2014-04-21	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The -tailcallelim pass should be checking if byval or inalloca args can be captured before marking calls as tail calls. This was the real root cause of PR7272. With a better fix in place, revert the inliner change from r105255. The test case it introduced still passes and has been moved to test/Transforms/Inline/byval-tail-call.ll. Reviewers: chandlerc Differential Revision: http://reviews.llvm.org/D3403 llvm-svn: 206789
*	Remove some empty statements	Alp Toker	2014-04-19	1	-1/+1
\| \| \| \| \| \|	Cleanup only. llvm-svn: 206710
*	remove some dead code	Nuno Lopes	2014-04-17	1	-21/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	lib/Analysis/IPA/InlineCost.cpp \| 18 ------------------ lib/Analysis/RegionPass.cpp \| 1 - lib/Analysis/TypeBasedAliasAnalysis.cpp \| 1 - lib/Transforms/Scalar/LoopUnswitch.cpp \| 21 --------------------- lib/Transforms/Utils/LCSSA.cpp \| 2 -- lib/Transforms/Utils/LoopSimplify.cpp \| 6 ------ utils/TableGen/AsmWriterEmitter.cpp \| 13 ------------- utils/TableGen/DFAPacketizerEmitter.cpp \| 7 ------- utils/TableGen/IntrinsicEmitter.cpp \| 2 -- 9 files changed, 71 deletions(-) llvm-svn: 206506
*	verify-di: Implement DebugInfoVerifier	Duncan P. N. Exon Smith	2014-04-15	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement DebugInfoVerifier, which steals verification relying on DebugInfoFinder from Verifier. - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps DebugInfoVerifier. Uses -verify-di command-line flag. - Change verifyModule() to invoke DebugInfoVerifier as well as Verifier. - Add a call to createDebugInfoVerifierPass() wherever there was a call to createVerifierPass(). This implementation as a module pass should sidestep efficiency issues, allowing us to turn debug info verification back on. <rdar://problem/15500563> llvm-svn: 206300
*	D3348 - [BUG] "Rotate Loop" pass kills "llvm.vectorizer.enable" metadata	Alexey Bataev	2014-04-15	1	-0/+9
\| \| \| \|	llvm-svn: 206266
*	Implement depth_first and inverse_depth_first range factory functions.	David Blaikie	2014-04-11	1	-7/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Also updated as many loops as I could find using df_begin/idf_begin - strangely I found no uses of idf_begin. Is that just used out of tree? Also a few places couldn't use df_begin because either they used the member functions of the depth first iterators or had specific ordering constraints (I added a comment in the latter case). Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T> where you needed iterator_range<idf_iterator<T>>) llvm-svn: 206016
*	Fix some doc and comment typos	Alp Toker	2014-04-09	1	-1/+1
\| \| \| \|	llvm-svn: 205899
*	Revert "[Constant Hoisting] Lazily compute the idom and cache the result."	Juergen Ributzka	2014-04-03	1	-43/+4
\| \| \| \| \| \| \|	This code is no longer usefull, because we only compute and use the IDom once. There is no benefit in caching it anymore. llvm-svn: 205498
*	Add some additional fields to TTI::UnrollingPreferences	Hal Finkel	2014-04-01	1	-4/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In preparation for an upcoming commit implementing unrolling preferences for x86, this adds additional fields to the UnrollingPreferences structure: - PartialThreshold and PartialOptSizeThreshold - Like Threshold and OptSizeThreshold, but used when not fully unrolling. These are necessary because we need different thresholds for full unrolling from those used when partially unrolling (the full unrolling thresholds are generally going to be larger). - MaxCount - A cap on the unrolling factor when partially unrolling. This can be used by a target to prevent the unrolled loop from exceeding some resource limit independent of the loop size (such as number of branches). There should be no functionality change for any in-tree targets. llvm-svn: 205347
*	Move partial/runtime unrolling late in the pipeline	Hal Finkel	2014-03-31	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The generic (concatenation) loop unroller is currently placed early in the standard optimization pipeline. This is a good place to perform full unrolling, but not the right place to perform partial/runtime unrolling. However, most targets don't enable partial/runtime unrolling, so this never mattered. However, even some x86 cores benefit from partial/runtime unrolling of very small loops, and follow-up commits will enable this. First, we need to move partial/runtime unrolling late in the optimization pipeline (importantly, this is after SLP and loop vectorization, as vectorization can drastically change the size of a loop), while keeping the full unrolling where it is now. This change does just that. llvm-svn: 205264
*	Revert "GVN: merge overflow intrinsics with non-overflow instructions."	Erik Verbruggen	2014-03-28	1	-124/+58
\| \| \| \| \| \| \| \| \|	This reverts commit r203553, and follow-up commits r203558 and r203574. I will follow this up on the mailinglist to do it in a way that won't cause subtle PRE bugs. llvm-svn: 205009
*	Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch ↵	Nick Lewycky	2014-03-26	1	-4/+16
\| \| \| \| \| \|	by Björn Steinbrink! llvm-svn: 204876
*	[Constant Hoisting] Make the constant candidate map local to the ↵	Juergen Ributzka	2014-03-25	1	-11/+14
\| \| \| \| \| \|	collectConstantCandidates method. llvm-svn: 204758