bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Move discriminator assignment to where it is used. (NFC)	Dehao Chen	2016-10-25	1	-1/+1
\| \| \| \|	llvm-svn: 285084
*	Merge two if conditions into one. NFCI.	Davide Italiano	2016-10-24	1	-3/+2
\| \| \| \|	llvm-svn: 285008
*	add-discriminators: Fix handling of lexical scopes.	Adrian Prantl	2016-10-24	1	-9/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes a bug in the handling of lexical scopes, when more than one scope is defined on the same line or functions are inlined into call sites that are on the same line as the function definition. This situation can easily happen in macro expansions. The problem is solved by introducing a SmallDenseMap<DIScope , DILexicalBlockFile , 1> that keeps track of all the different lexical scopes that share a line/file location. Fixes PR30681. llvm-svn: 284998
*	Check the number of Args in LibCallsShrinkWrap.	Rong Xu	2016-10-24	1	-0/+2
\| \| \| \| \| \|	Some library fucntions can have no argument. llvm-svn: 284989
*	Now that VS2013 is gone, make a memoryssa structure an anonymous union again	Daniel Berlin	2016-10-22	1	-4/+4
\| \| \| \|	llvm-svn: 284910
*	[CtorUtils] Modernize. No functional changes intended.	Davide Italiano	2016-10-22	1	-5/+5
\| \| \| \|	llvm-svn: 284904
*	[StripGCRelocates] New pass to remove gc.relocates added by RS4GC	Anna Thomas	2016-10-21	3	-0/+82
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Utility pass to remove gc.relocates created by rewrite statepoints for GC. With respect to safepoint verification, the IR generated would be incorrect, and cannot run as such. This would be a single transformation on the final optimized IR. The benefit of the pass is for easy analysis when the IRs are 'polluted' by too many gc.relocates. Added tests. test run: All RS4GC tests with -verify option. Local downstream tests on large IR files. This also works when the pointer being gc.relocated is another gc.relocate. Reviewers: sanjoy, reames Subscribers: beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D25096 llvm-svn: 284855
*	[LoopUnroll] Keep the loop test only on the first iteration of max-or-zero loops	John Brawn	2016-10-21	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When we have a loop with a known upper bound on the number of iterations, and furthermore know that either the number of iterations will be either exactly that upper bound or zero, then we can fully unroll up to that upper bound keeping only the first loop test to check for the zero iteration case. Most of the work here is in plumbing this 'max-or-zero' information from the part of scalar evolution where it's detected through to loop unrolling. I've also gone for the safe default of 'false' everywhere but howManyLessThans which could probably be improved. Differential Revision: https://reviews.llvm.org/D25682 llvm-svn: 284818
*	[MSSA] Avoid unnecessary use walks when calling getClobberingMemoryAccess	Daniel Berlin	2016-10-20	1	-6/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This allows us to mark when uses have been optimized. This lets us avoid rewalking (IE when people call getClobberingAccess on everything), and also enables us to later relax the requirement of use optimization during updates with less cost. Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25172 llvm-svn: 284771
*	Do a sweep over move ctors and remove those that are identical to the default.	Benjamin Kramer	2016-10-20	1	-20/+0
\| \| \| \| \| \| \| \| \| \|	All of these existed because MSVC 2013 was unable to synthesize default move ctors. We recently dropped support for it so all that error-prone boilerplate can go. No functionality change intended. llvm-svn: 284721
*	[asan] Replace std::to_string with llvm::to_string	Vitaly Buka	2016-10-19	1	-1/+2
\| \| \| \|	llvm-svn: 284557
*	[asan] Simplify calculation of stack frame layout extraction calculation of ↵	Vitaly Buka	2016-10-18	1	-14/+20
\| \| \| \| \| \| \| \| \| \| \| \|	stack description into separate function. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25754 llvm-svn: 284547
*	[asan] Append line number to variable name if line is available and in the ↵	Vitaly Buka	2016-10-18	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \|	same file as the function. PR30498 Reviewers: eugenis Differential Revision: https://reviews.llvm.org/D25715 llvm-svn: 284546
*	Conditionally eliminate library calls where the result value is not used	Rong Xu	2016-10-18	3	-0/+566
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This pass shrink-wraps a condition to some library calls where the call result is not used. For example: sqrt(val); is transformed to if (val < 0) sqrt(val); Even if the result of library call is not being used, the compiler cannot safely delete the call because the function can set errno on error conditions. Note in many functions, the error condition solely depends on the incoming parameter. In this optimization, we can generate the condition can lead to the errno to shrink-wrap the call. Since the chances of hitting the error condition is low, the runtime call is effectively eliminated. These partially dead calls are usually results of C++ abstraction penalty exposed by inlining. This optimization hits 108 times in 19 C/C++ programs in SPEC2006. Reviewers: hfinkel, mehdi_amini, davidxl Subscribers: modocache, mgorny, mehdi_amini, xur, llvm-commits, beanz Differential Revision: https://reviews.llvm.org/D24414 llvm-svn: 284542
*	Ignore debug info when making optimization decisions in SimplifyCFG.	Dehao Chen	2016-10-17	1	-11/+18
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Debug info should not affect code generation. This patch properly handles debug info to make sure the generated code are the same with or without debug info. Reviewers: davidxl, mzolotukhin, jmolloy Subscribers: aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D25286 llvm-svn: 284415
*	[SimplifyCFG] Don't lower complex ConstantExprs to lookup tables	Oliver Stannard	2016-10-17	1	-1/+4
\| \| \| \| \| \| \| \| \| \|	Not all ConstantExprs can be represented by a global variable, for example most pointer arithmetic other than addition of a constant, so we can't convert these values from switch statements to lookup tables. Differential Revision: https://reviews.llvm.org/D25550 llvm-svn: 284379
*	[SimplifyCFG] Use the error checking provided by getPrevNode.	Benjamin Kramer	2016-10-15	1	-7/+11
\| \| \| \| \| \| \| \| \|	BasicBlock::size is O(insts), making this loop O(blocks*insts), which can be really slow on generated code. getPrevNode already checks if we're at the beginning of the block and returns nullptr if so, just use that instead. No functionality change intended. llvm-svn: 284303
*	Memory-SSA: strengthen defClobbersUseOrDef interface	Sebastian Pop	2016-10-13	1	-19/+15
\| \| \| \| \| \| \|	As Danny pointed out, defClobbersUseOrDef should use MemoryLocOrCall to make sure fences are properly handled. llvm-svn: 284099
*	commit back "GVN-hoist: fix store past load dependence analysis (PR30216, ↵	Sebastian Pop	2016-10-13	1	-49/+61
\| \| \| \| \| \| \| \| \| \|	PR30499)" This is with an extra change to avoid calling MemoryLocation::get() on a call instruction. Differential Revision: https://reviews.llvm.org/D25542 llvm-svn: 284098
*	Revert "GVN-hoist: fix store past load dependence analysis (PR30216, PR30499)"	Reid Kleckner	2016-10-13	1	-56/+49
\| \| \| \| \| \| \| \| \| \| \|	This CL didn't actually address the test case in PR30499, and clang still crashes. Also revert dependent change "Memory-SSA cleanup of clobbers interface, NFC" Reverts r283965 and r283967. llvm-svn: 284093
*	Reapply "[LoopUnroll] Use the upper bound of the loop trip count to fullly ↵	Haicheng Wu	2016-10-12	1	-9/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	unroll a loop" Reappy r284044 after revert in r284051. Krzysztof fixed the error in r284049. The original summary: This patch tries to fully unroll loops having break statement like this for (int i = 0; i < 8; i++) { if (a[i] == value) { found = true; break; } } GCC can fully unroll such loops, but currently LLVM cannot because LLVM only supports loops having exact constant trip counts. The upper bound of the trip count can be obtained from calling ScalarEvolution::getMaxBackedgeTakenCount(). Part of the patch is the refactoring work in SCEV to prevent duplicating code. The feature of using the upper bound is enabled under the same circumstance when runtime unrolling is enabled since both are used to unroll loops without knowing the exact constant trip count. llvm-svn: 284053
*	Revert "[LoopUnroll] Use the upper bound of the loop trip count to fullly ↵	Haicheng Wu	2016-10-12	1	-18/+9
\| \| \| \| \| \| \| \|	unroll a loop" This reverts commit r284044. llvm-svn: 284051
*	[LoopUnroll] Use the upper bound of the loop trip count to fullly unroll a loop	Haicheng Wu	2016-10-12	1	-9/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch tries to fully unroll loops having break statement like this for (int i = 0; i < 8; i++) { if (a[i] == value) { found = true; break; } } GCC can fully unroll such loops, but currently LLVM cannot because LLVM only supports loops having exact constant trip counts. The upper bound of the trip count can be obtained from calling ScalarEvolution::getMaxBackedgeTakenCount(). Part of the patch is the refactoring work in SCEV to prevent duplicating code. The feature of using the upper bound is enabled under the same circumstance when runtime unrolling is enabled since both are used to unroll loops without knowing the exact constant trip count. Differential Revision: https://reviews.llvm.org/D24790 llvm-svn: 284044
*	[SimplifyCFG] Don't create PHI nodes for constant bundle operands	Sanjoy Das	2016-10-12	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Constant bundle operands may need to retain their constant-ness for correctness. I'll admit that this is slightly odd, but it looks like SimplifyCFG already does this for things like @llvm.frameaddress and @llvm.stackmap, so I suppose adding one more case is not a big deal. It is possible to add a mechanism to denote bundle operands that need to remain constants, but that's probably too complicated for the time being. Reviewers: jmolloy Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D25502 llvm-svn: 284028
*	Memory-SSA cleanup of clobbers interface, NFC	Sebastian Pop	2016-10-12	1	-4/+10
\| \| \| \| \| \| \| \| \|	This implements the cleanup that Danny asked to commit separately from the previous fix to GVN-hoist in https://reviews.llvm.org/D25476#inline-219818 Tested with ninja check on x86_64-linux. llvm-svn: 283967
*	GVN-hoist: fix store past load dependence analysis (PR30216, PR30499)	Sebastian Pop	2016-10-12	1	-53/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a refreshed version of a patch that was reverted: it fixes the problems reported in both PR30216 and PR30499, and contains all the test-cases from both bugs. To hoist stores past loads, we used to search for potential conflicting loads on the hoisting path by following a MemorySSA def-def link from the store to be hoisted to the previous defining memory access, and from there we followed the def-use chains to all the uses that occur on the hoisting path. The problem is that the def-def link may point to a store that does not alias with the store to be hoisted, and so the loads that are walked may not alias with the store to be hoisted, and even as in the testcase of PR30216, the loads that may alias with the store to be hoisted are not visited. The current patch visits all loads on the path from the store to be hoisted to the hoisting position and uses the alias analysis to ask whether the store may alias the load. I was not able to use the MemorySSA functionality to ask for whether load and store are clobbered: I'm not sure which function to call, so I used a call to AA->isNoAlias(). Store past store is still working as before using a MemorySSA query: I added an extra test to pr30216.ll to make sure store past store does not regress. Tested on x86_64-linux with check and a test-suite run. Differential Revision: https://reviews.llvm.org/D25476 llvm-svn: 283965
*	[LCSSA] Implement linear algorithm for the isRecursivelyLCSSAForm	Igor Laevsky	2016-10-11	2	-6/+7
\| \| \| \| \| \| \| \|	For each block check that it doesn't have any uses outside of it's innermost loop. Differential Revision: https://reviews.llvm.org/D25364 llvm-svn: 283877
*	Invoke add-discriminator at -g0 -fsample-profile	Dehao Chen	2016-10-07	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: -fsample-profile needs discriminator, which will not be added if built with -g0. This patch makes sure the discriminator is added for sample-profile at -g0. A followup patch will be send out to update clang tests. Reviewers: davidxl, dblaikie, echristo, dnovillo Subscribers: mehdi_amini, probinson, llvm-commits Differential Revision: https://reviews.llvm.org/D25132 llvm-svn: 283565
*	[ARM] Don't convert switches to lookup tables of pointers with ROPI/RWPI	Oliver Stannard	2016-10-07	1	-17/+27
\| \| \| \| \| \| \| \| \| \| \| \|	With the ROPI and RWPI relocation models we can't always have pointers to global data or functions in constant data, so don't try to convert switches into lookup tables if any value in the lookup table would require a relocation. We can still safely emit lookup tables of other values, such as simple constants. Differential Revision: https://reviews.llvm.org/D24462 llvm-svn: 283530
*	[SimplifyCFG] Correctly test for unconditional branches in GetCaseResults	David Majnemer	2016-10-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	GetCaseResults assumed that a terminator with one successor was an unconditional branch. This is not necessarily the case, it could be a cleanupret. Strengthen the check by querying whether or not the terminator is exceptional. llvm-svn: 283517
*	Revert "Add -strip-nonlinetable-debuginfo capability"	Michael Ilseman	2016-10-06	3	-44/+0
\| \| \| \| \| \| \| \|	This reverts commit r283473. Reverted until review is completed. llvm-svn: 283478
*	Add -strip-nonlinetable-debuginfo capability	Michael Ilseman	2016-10-06	3	-0/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a new function to DebugInfo.cpp that takes an llvm::Module as input and removes all debug info metadata that is not directly needed for line tables, thus effectively stripping all type and variable information from the module. The primary motivation for this feature was the bitcode work flow (cf. http://lists.llvm.org/pipermail/llvm-dev/2016-June/100643.html for more background). This is not wired up yet, but will be in subsequent patches. For testing, the new functionality is exposed to opt with a -strip-nonlinetable-debuginfo option. The secondary use-case (and one that works right now!) is as a reduction pass in bugpoint. I added two new bugpoint options (-disable-strip-debuginfo and -disable-strip-debug-types) to control the new features. By default it will first attempt to remove all debug information, then only the type info, and then proceed to hack at any remaining MDNodes. llvm-svn: 283473
*	Use StringRef in Pass/PassManager APIs (NFC)	Mehdi Amini	2016-10-01	1	-1/+1
\| \| \| \|	llvm-svn: 283004
*	[LoopUnroll] Port to the new streaming interface for opt remarks.	Adam Nemet	2016-09-30	1	-10/+13
\| \| \| \|	llvm-svn: 282834
*	[LoopSimplify] When simplifying phis in loop-simplify, do it only if it ↵	Michael Zolotukhin	2016-09-27	1	-2/+4
\| \| \| \| \| \|	preserves LCSSA form. llvm-svn: 282541
*	[DebugInfo] Add comments to phi dbg.value tracking code, NFC	Reid Kleckner	2016-09-27	1	-6/+8
\| \| \| \| \| \| \| \| \| \| \| \|	LLVM developers might be surprised to learn that there are blocks without valid insertion points (catchswitch), so it seems worth calling that out explicitly. Also add a FIXME about what we should really be doing if we ever need to make optimized Windows EH code debuggable. While I'm here, make auto usage more consistent with LLVM standards and avoid an unecessary call to insertBefore. llvm-svn: 282521
*	Remove pruning of phi nodes in MemorySSA - it makes updating harder	Daniel Berlin	2016-09-26	1	-40/+5
\| \| \| \| \| \| \| \| \| \|	Reviewers: george.burgess.iv Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24923 llvm-svn: 282419
*	GlobalStatus: Don't walk use-lists of ConstantData	Duncan P. N. Exon Smith	2016-09-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Return early from llvm::isSafeToDestroyConstant() whenever the value `isa<ConstantData>()`. These constants are shared across the LLVMContext. We never really want to delete them here, and walking their use-lists can be very expensive. (This is motivated by an eventual goal of removing use-lists entirely from ConstantData.) llvm-svn: 282320
*	Reapplying r281895 (and follow-up r281964) after fixing pr30468.	Keith Walker	2016-09-22	2	-5/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The additional fix is: When adding debug information to a lowered phi node in mem2reg check that we have a valid insertion point after the phi for adding the debug information. This change addresses the issue in pr30468 where a lowered phi was added before a catchswitch and no debug information should be added after the phi in this case. Differential Revision: https://reviews.llvm.org/D24797 llvm-svn: 282155
*	Revert r281895 "Add @llvm.dbg.value entries for the phi node created by ↵	Hans Wennborg	2016-09-21	2	-49/+5
\| \| \| \| \| \| \| \| \| \|	-mem2reg" (And follow-up r281964.) It caused PR30468. llvm-svn: 282077
*	Make llvm::ConvertDebugDeclareToDebugValue() be a void function (NFC)	Keith Walker	2016-09-20	1	-8/+5
\| \| \| \| \| \| \| \| \| \|	The routines llvm::ConvertDebugDeclareToDebugValue() always returned a true value which was never checked at the call site; change the function return type to void. This NFC cleanup was approved in the review https://reviews.llvm.org/D23715 llvm-svn: 281964
*	[LCSSA] Cache LoopExits to avoid wasted work	Philip Reames	2016-09-19	1	-3/+9
\| \| \| \| \| \| \| \| \| \|	When looking at the scribus_1.3 example from https://llvm.org/bugs/show_bug.cgi?id=10584, I noticed that we were spending a large amount of time computing loop exits in LCSSA. This code appears to be written with the assumption that LoopExits are stored in the Loop and thus cheap to query. This is not true, so we should cache the result across the potentially long running loop which tends to visit a small handful of Loops. On the particular example from 10584, this change drops the time spent in LCSSA computation by about 80%. Differential Revision: https://reviews.llvm.org/D24509 llvm-svn: 281949
*	Add @llvm.dbg.value entries for the phi node created by -mem2reg	Keith Walker	2016-09-19	2	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When phi nodes are created in the -mem2reg phase, the @llvm.dbg.declare entries are converted to @llvm.dbg.value entries at the place where the store instructions existed. However no entry is created to describe the resulting value of the phi node. The effect of this is especially noticeable in for loops which have a constant for the intial value; the loop control variable's location would be described as the intial constant value in the loop body once the -mem2reg optimization phase was run. This change adds the creation of the @llvm.dbg.value entries to describe variables whose location is the result of a phi node created in -mem2reg. Also when the phi node is finally lowered to a machine instruction it is important that the lowered "load" instruction is placed before the associated DEBUG_VALUE entry describing the value loaded. Differential Revision: https://reviews.llvm.org/D23715 llvm-svn: 281895
*	[SimplifyCFG] Update (AND) IR flags when CSE'ing instructions	James Molloy	2016-09-19	1	-2/+4
\| \| \| \| \| \| \| \|	We were updating metadata but not IR flags. Because we pick an arbitrary instruction to be the CSE candidate, it comes down to luck (50% or less chance) if this results in broken codegen or not, which is why PR30373 which is actually not the fault of the commit it was bisected down to. Fixes PR30373. llvm-svn: 281889
*	Rename NameAnonFunctions to NameAnonGlobals to match what it is doing (NFC)	Mehdi Amini	2016-09-16	3	-25/+25
\| \| \| \|	llvm-svn: 281745
*	Fix NameAnonFunctions pass: for ThinLTO we need to rename global variables ↵	Mehdi Amini	2016-09-16	1	-5/+10
\| \| \| \| \| \| \| \| \| \| \|	as well A follow-up patch will rename this pass and the source file accordingly, but I figured the non-NFC change will be easier to spot in isolation. Differential Revision: https://reviews.llvm.org/D24641 llvm-svn: 281744
*	Fix misleading comment for getOrEnforceKnownAlignment	Matt Arsenault	2016-09-13	1	-4/+0
\| \| \| \| \| \| \|	It does not return 0 to indicate failure, and returns the known alignment. llvm-svn: 281350
*	Enable simplify libcalls for ARM PCS	Sam Parker	2016-09-13	1	-3/+35
\| \| \| \| \| \| \| \| \| \|	Teach SimplifyLibcalls that in can treat functions annotated with apcs, aapcs or aapcs_vfp like normal C functions if they only take and return integer or pointer values, and the target is not iOS. Differential Revision: https://reviews.llvm.org/D24453 llvm-svn: 281322
*	[SimplifyCFG] Be even more conservative in SinkThenElseCodeToEnd	James Molloy	2016-09-11	1	-15/+19
\| \| \| \| \| \| \| \|	This should actually fix PR30244. This cranks up the workaround for PR30188 so that we never sink loads or stores of allocas. The idea is that these should be removed by SROA/Mem2Reg, and any movement of them may well confuse SROA or just cause unwanted code churn. It's not ideal that the midend should be crippled like this, but that unwanted churn can really cause significant regressions in important workloads (tsan). llvm-svn: 281162
*	[SimplifyCFG] Harden up the profitability heuristic for block splitting ↵	James Molloy	2016-09-11	1	-5/+20
\| \| \| \| \| \| \| \| \| \| \| \|	during sinking Exposed by PR30244, we will split a block currently if we think we can sink at least one instruction. However this isn't right - the reason we split predecessors is so that we can sink instructions that otherwise couldn't be sunk because it isn't safe to do so - stores, for example. So, change the heuristic to only split if it thinks it can sink at least one non-speculatable instruction. Should fix PR30244. llvm-svn: 281160