bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[LTO][ThinLTO] Use the linker resolutions to mark global values as dso_local.	Sean Fertile	2017-11-04	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \|	Now that we have a way to mark GlobalValues as local we can use the symbol resolutions that the linker plugin provides as part of lto/thinlto link step to refine the compilers view on what symbols will end up being local. Originally commited as r317374, but reverted in r317395 to update some missed tests. Differential Revision: https://reviews.llvm.org/D35702 llvm-svn: 317408
*	Revert "[LTO][ThinLTO] Use the linker resolutions to mark global values ..."	Sean Fertile	2017-11-04	1	-17/+0
\| \| \| \| \| \| \| \| \|	Changes more tests then expected on one of the build bots. reverting to investigate. This reverts https://llvm.org/svn/llvm-project/llvm/trunk@317374 llvm-svn: 317395
*	[CallSiteSplitting] clang-format my last commit. NFCI.	Davide Italiano	2017-11-04	1	-3/+2
\| \| \| \| \| \|	Thanks to Rui for pointing out. llvm-svn: 317393
*	[CallSiteSplitting] Silence GCC's -Wparentheses. NFCI.	Davide Italiano	2017-11-03	1	-2/+2
\| \| \| \|	llvm-svn: 317385
*	Invoke salvageDebugInfo from CodeGenPrepare's SinkCast()	Adrian Prantl	2017-11-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	This preserves the debug info for the cast operation in the original location. rdar://problem/33460652 Reapplied r317340 with the test moved into an ARM-specific directory. llvm-svn: 317375
*	[LTO][ThinLTO] Use the linker resolutions to mark global values as dso_local.	Sean Fertile	2017-11-03	1	-0/+17
\| \| \| \| \| \| \| \| \| \|	Now that we have a way to mark GlobalValues as local we can use the symbol resolutions that the linker plugin provides as part of lto/thinlto link step to refine the compilers view on what symbols will end up being local. Differential Revision: https://reviews.llvm.org/D35702 llvm-svn: 317374
*	[SimplifyCFG] When merging conditional stores, don't count the store we're ↵	Craig Topper	2017-11-03	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \|	merging against the PHINodeFoldingThreshold Merging conditional stores tries to check to see if the code is if convertible after the store is moved. But the store hasn't been moved yet so its being counted against the threshold. The patch adds 1 to the threshold comparison to make sure we don't count the store. I've adjusted a test to use a lower threshold to ensure we still do that conversion with the lower threshold. Differential Revision: https://reviews.llvm.org/D39570 llvm-svn: 317368
*	Recommit r317351 : Add CallSiteSplitting pass	Jun Bum Lim	2017-11-03	4	-0/+501
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This recommit r317351 after fixing a buildbot failure. Original commit message: Summary: This change add a pass which tries to split a call-site to pass more constrained arguments if its argument is predicated in the control flow so that we can expose better context to the later passes (e.g, inliner, jump threading, or IPA-CP based function cloning, etc.). As of now we support two cases : 1) If a call site is dominated by an OR condition and if any of its arguments are predicated on this OR condition, try to split the condition with more constrained arguments. For example, in the code below, we try to split the call site since we can predicate the argument (ptr) based on the OR condition. Split from : if (!ptr \|\| c) callee(ptr); to : if (!ptr) callee(null ptr) // set the known constant value else if (c) callee(nonnull ptr) // set non-null attribute in the argument 2) We can also split a call-site based on constant incoming values of a PHI For example, from : BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2, label %BB1 BB1: br label %BB2 BB2: %p = phi i32 [ 0, %BB0 ], [ 1, %BB1 ] call void @bar(i32 %p) to BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2-split0, label %BB1 BB1: br label %BB2-split1 BB2-split0: call void @bar(i32 0) br label %BB2 BB2-split1: call void @bar(i32 1) br label %BB2 BB2: %p = phi i32 [ 0, %BB2-split0 ], [ 1, %BB2-split1 ] llvm-svn: 317362
*	Add llvm::for_each as a range-based extensions to <algorithm> and make use ↵	Aaron Ballman	2017-11-03	1	-9/+9
\| \| \| \| \| \|	of it in some cases where it is a more clear alternative to std::for_each. llvm-svn: 317356
*	Revert "Add CallSiteSplitting pass"	Jun Bum Lim	2017-11-03	4	-500/+0
\| \| \| \| \| \| \| \|	Revert due to Buildbot failure. This reverts commit r317351. llvm-svn: 317353
*	Add CallSiteSplitting pass	Jun Bum Lim	2017-11-03	4	-0/+500
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change add a pass which tries to split a call-site to pass more constrained arguments if its argument is predicated in the control flow so that we can expose better context to the later passes (e.g, inliner, jump threading, or IPA-CP based function cloning, etc.). As of now we support two cases : 1) If a call site is dominated by an OR condition and if any of its arguments are predicated on this OR condition, try to split the condition with more constrained arguments. For example, in the code below, we try to split the call site since we can predicate the argument (ptr) based on the OR condition. Split from : if (!ptr \|\| c) callee(ptr); to : if (!ptr) callee(null ptr) // set the known constant value else if (c) callee(nonnull ptr) // set non-null attribute in the argument 2) We can also split a call-site based on constant incoming values of a PHI For example, from : BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2, label %BB1 BB1: br label %BB2 BB2: %p = phi i32 [ 0, %BB0 ], [ 1, %BB1 ] call void @bar(i32 %p) to BB0: %c = icmp eq i32 %i1, %i2 br i1 %c, label %BB2-split0, label %BB1 BB1: br label %BB2-split1 BB2-split0: call void @bar(i32 0) br label %BB2 BB2-split1: call void @bar(i32 1) br label %BB2 BB2: %p = phi i32 [ 0, %BB2-split0 ], [ 1, %BB2-split1 ] Reviewers: davidxl, huntergr, chandlerc, mcrosier, eraman, davide Reviewed By: davidxl Subscribers: sdesmalen, ashutosh.nema, fhahn, mssimpso, aemerson, mgorny, mehdi_amini, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D39137 llvm-svn: 317351
*	The patch fixes PR35131	Evgeny Stupachenko	2017-11-03	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fix a misprint which led to false CTLZ recognition. Reviewers: craig.topper Differential Revision: https://reviews.llvm.org/D39585 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 317348
*	Revert "Invoke salvageDebugInfo from CodeGenPrepare's SinkCast()"	Adrian Prantl	2017-11-03	1	-1/+1
\| \| \| \| \| \|	This reverts commit 317342 while investigating bot breakage. llvm-svn: 317345
*	Invoke salvageDebugInfo from CodeGenPrepare's SinkCast()	Adrian Prantl	2017-11-03	1	-1/+1
\| \| \| \| \| \| \| \|	This preserves the debug info for the cast operation in the original location. rdar://problem/33460652 llvm-svn: 317340
*	[LICM] sink through non-trivially replicable PHI	Jun Bum Lim	2017-11-03	1	-56/+140
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The current LICM allows sinking an instruction only when it is exposed to exit blocks through a trivially replacable PHI of which all incoming values are the same instruction. This change enhance LICM to sink a sinkable instruction through non-trivially replacable PHIs by spliting predecessors of loop exits. Reviewers: hfinkel, majnemer, davidxl, bmakam, mcrosier, danielcdh, efriedma, jtony Reviewed By: efriedma Subscribers: nemanjai, dberlin, llvm-commits Differential Revision: https://reviews.llvm.org/D37163 llvm-svn: 317335
*	[LoopPredication] NFC: Refactored code to separate out functions being reused	Anna Thomas	2017-11-03	1	-62/+92
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Refactored the code to separate out common functions that are being reused. This is to reduce the changes for changes coming up wrt loop predication with reverse loops. This refactoring is what we have in our downstream code. llvm-svn: 317324
*	[ADCE] Use MapVector for BlockInfo to make iteration order deterministic	Mikael Holmen	2017-11-03	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Also added a reserve() method to MapVector since we want to use that from ADCE. DenseMap does not provide deterministic iteration order so with that we will handle the members of BlockInfo in random order, eventually leading to random order of the blocks in the predecessor lists. Without this change, I get the same predecessor order in about 90% of the time when I compile a certain reproducer and in 10% I get a different one. No idea how to make a proper test case for this. Reviewers: kuhar, david2050 Reviewed By: kuhar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39593 llvm-svn: 317323
*	[PartialInliner] Skip call sites where inlining fails.	Florian Hahn	2017-11-03	1	-7/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: InlineFunction can fail, for example when trying to inline vararg fuctions. In those cases, we do not want to bump partial inlining counters or set AnyInlined to true, because this could leave an unused function hanging around. Reviewers: davidxl, davide, gyiu Reviewed By: davide Subscribers: llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D39581 llvm-svn: 317314
*	[LSR] Clarify a comment. NFC.	Vedant Kumar	2017-11-03	1	-1/+1
\| \| \| \|	llvm-svn: 317295
*	IndVarSimplify: preserve debug information attached to widened PHI nodes.	Adrian Prantl	2017-11-02	1	-0/+10
\| \| \| \| \| \| \| \| \| \|	This fixes PR35015. https://bugs.llvm.org/show_bug.cgi?id=35015 Differential Revision: https://reviews.llvm.org/D39345 llvm-svn: 317282
*	Irreducible loop metadata for more accurate block frequency under PGO.	Hiroshi Yamauchi	2017-11-02	1	-2/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently the block frequency analysis is an approximation for irreducible loops. The new irreducible loop metadata is used to annotate the irreducible loop headers with their header weights based on the PGO profile (currently this is approximated to be evenly weighted) and to help improve the accuracy of the block frequency analysis for irreducible loops. This patch is a basic support for this. Reviewers: davidxl Reviewed By: davidxl Subscribers: mehdi_amini, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D39028 llvm-svn: 317278
*	[LoopPredication] Enable predication when latchCheckIV is wider than rangeCheck	Anna Thomas	2017-11-02	1	-10/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch allows us to predicate range checks that have a type narrower than the latch check type. We leverage SCEV analysis to identify a truncate for the latchLimit and latchStart. There is also safety checks in place which requires the start and limit to be known at compile time. We require this to make sure that the SCEV truncate expr for the IV corresponding to the latch does not cause us to lose information about the IV range. Added tests show the loop predication over range checks that are of various types and are narrower than the latch type. This enhancement has been in our downstream tree for a while. Reviewers: apilipenko, sanjoy, mkazantsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39500 llvm-svn: 317269
*	Strip off invariant.start because memory locations arent invariant	Anna Thomas	2017-11-02	1	-9/+33
\| \| \| \| \| \| \| \| \| \| \| \|	The original change was reverted in rL317217 because of the failure in the RS4GC testcase. I couldn't reproduce the failure on my local machine (macbook) but could reproduce it on a linux box. The failure was around removing the uses of invariant.start. The fix here is to just RAUW undef (which was the first implementation in D39388). This is perfectly valid IR as discussed in the review. llvm-svn: 317225
*	Revert "[RS4GC] Strip off invariant.start because memory locations arent ↵	Anna Thomas	2017-11-02	1	-39/+9
\| \| \| \| \| \| \| \|	invariant" This reverts commit r317215, investigating the test failure. llvm-svn: 317217
*	[RS4GC] Strip off invariant.start because memory locations arent invariant	Anna Thomas	2017-11-02	1	-9/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Invariant.start on memory locations has the property that the memory location is unchanging. However, this is not true in the face of rewriting statepoints for GC. Teach RS4GC about removing invariant.start so that optimizations after RS4GC does not incorrect sink a load from the memory location past a statepoint. Added test showcasing the issue. Reviewers: reames, apilipenko, dneilson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39388 llvm-svn: 317215
*	Revert "[ExpandMemCmp] Split ExpandMemCmp from CodeGen into its own pass."	Clement Courbet	2017-11-02	3	-830/+0
\| \| \| \| \| \| \| \| \|	undefined reference to `llvm::TargetPassConfig::ID' on clang-ppc64le-linux-multistage This reverts commit eea333c33fa73ad225ef28607795984829f65688. llvm-svn: 317213
*	[ExpandMemCmp] Split ExpandMemCmp from CodeGen into its own pass.	Clement Courbet	2017-11-02	3	-0/+830
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is mostly a noop (most of the test diffs are renamed blocks). There are a few temporary register renames (eax<->ecx) and a few blocks are shuffled around. See the discussion in PR33325 for more details. Reviewers: spatel Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D39456 llvm-svn: 317211
*	[SimplifyCFG] Discard speculated dbg intrinsics	Bjorn Pettersson	2017-11-02	1	-1/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: SpeculativelyExecuteBB can flatten the CFG by doing speculative execution followed by a select instruction. When the speculatively executed BB contained dbg intrinsics the result could be a little bit weird, since those dbg intrinsics were inserted before the select in the flattened CFG. So when single stepping in the debugger, printing the value of the variable referenced in the dbg intrinsic, it could happen that it looked like the variable had values that never actually were assigned to the variable. This patch simply discards all dbg intrinsics that were found in the speculatively executed BB. Reviewers: aprantl, chandlerc, craig.topper Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39494 llvm-svn: 317198
*	loop-unroll: teach remapInstruction to update dbg.value intrinsics.	Adrian Prantl	2017-11-01	1	-1/+15
\| \| \| \| \| \| \| \|	Fixes PR35112. https://bugs.llvm.org/show_bug.cgi?id=35112 llvm-svn: 317138
*	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block.	Adrian Prantl	2017-11-01	1	-0/+24
\| \| \| \| \| \| \| \|	This fixes the second half of PR35113. This reapplies r317106 without modifications. llvm-svn: 317121
*	loop-rotate: eliminate duplicate debug intrinsics after splicing.	Adrian Prantl	2017-11-01	1	-1/+26
\| \| \| \| \| \| \| \| \|	Fixes part of PR35113. This reapplies r317105 with an additional check for isa<Instruction> as found by the bots. llvm-svn: 317120
*	Include GUIDs from the same module when computing GUIDs that needs to be ↵	Dehao Chen	2017-11-01	1	-15/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	imported. Summary: In the compile phase of SamplePGO+ThinLTO, ICP is not invoked. Instead, indirect call targets will be included as function metadata for ThinIndex to buidl the call graph. This should not only include functions defined in other modules, but also functions defined in the same module, otherwise ThinIndex may find the callee dead and eliminate it, while ICP in backend will revive the symbol, which leads to undefined symbol. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D39480 llvm-svn: 317118
*	Revert 317016 and 317048	Philip Reames	2017-11-01	1	-44/+50
\| \| \| \| \| \|	The former appears to have introduced a miscompile in a stage2 clang build. Revert so I can investigate offline. llvm-svn: 317116
*	Revert r317105 to investigate bot breakage.	Adrian Prantl	2017-11-01	1	-23/+1
\| \| \| \|	llvm-svn: 317110
*	Revert r317106 to facilitate reverting r317105.	Adrian Prantl	2017-11-01	1	-24/+0
\| \| \| \|	llvm-svn: 317109
*	LTO: Apply global DCE to ThinLTO modules at LTO opt level 0.	Peter Collingbourne	2017-11-01	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is necessary because DCE is applied to full LTO modules. Without this change, a reference from a dead ThinLTO global to a dead full LTO global will result in an undefined reference at link time. This problem is only observable when --gc-sections is disabled, or when targeting COFF, as the COFF port of lld requires all symbols to have a definition even if all references are dead (this is consistent with link.exe). This change also adds an EliminateAvailableExternally pass at -O0. This is necessary to handle the situation on Windows where a non-prevailing copy of a linkonce_odr function has an SEH filter function; any such filters must be DCE'd because they will contain a call to the llvm.localrecover intrinsic, passing as an argument the address of the function that the filter belongs to, and llvm.localrecover requires this function to be defined locally. Fixes PR35142. Differential Revision: https://reviews.llvm.org/D39484 llvm-svn: 317108
*	loop-rotate: avoid duplicating dbg.value intrinsics in the entry block.	Adrian Prantl	2017-11-01	1	-0/+24
\| \| \| \| \| \|	This fixes the second half of PR35113. llvm-svn: 317106
*	loop-rotate: eliminate duplicate debug intrinsics after splicing.	Adrian Prantl	2017-11-01	1	-1/+23
\| \| \| \| \| \|	Fixes part of PR35113. llvm-svn: 317105
*	Revert rL311205 "[IRCE] Fix buggy behavior in Clamp"	Max Kazantsev	2017-11-01	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	This patch reverts rL311205 that was initially a wrong fix. The real problem was in intersection of signed and unsigned ranges (see rL316552), and the patch being reverted masked the problem instead of fixing it. By now, the test against which rL311205 was made works OK even without this code. This revert patch also contains a test case that demonstrates incorrect behavior caused by rL311205: it is caused by incorrect choise of signed max instead of unsigned. llvm-svn: 317088
*	[CodeExtractor] Fix iterator invalidation in findOrCreateBlockForHoisting.	Florian Hahn	2017-11-01	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: By replacing branches to CommonExitBlock, we remove the node from CommonExitBlock's predecessors, invalidating the iterator. The problem is exposed when the common exit block has multiple predecessors and needs to sink lifetime info. The modification in the test case trigger the issue. Reviewers: davidxl, davide, wmi Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39112 llvm-svn: 317084
*	[SimplifyIndVar] Inline makIVComparisonInvariant to eleminate code ↵	Philip Reames	2017-10-31	1	-51/+29
\| \| \| \| \| \| \| \|	duplication [NFC] This formulation might be slightly slower since I eagerly compute the cheap replacements. If anyone sees this having a compile time impact, let me know and I'll use lazy population instead. llvm-svn: 317048
*	loop-rotate: simplify code by using llvm::findDbgValues(). (NFC)	Adrian Prantl	2017-10-31	1	-31/+23
\| \| \| \|	llvm-svn: 317037
*	[coro] Make Spill a proper struct instead of deriving from pair.	Benjamin Kramer	2017-10-31	1	-12/+10
\| \| \| \| \| \|	No functionality change. llvm-svn: 317027
*	[SimplifyCFG] Use a more generic name for the selects created by ↵	Craig Topper	2017-10-31	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	SpeculativelyExecuteBB to prevent long names from being created Currently the selects are created with the names of their inputs concatenated together. It's possible to get cases that chain these selects together resulting in long names due to multiple levels of concatenation. Our internal branch of llvm managed to generate names over 100000 characters in length on a particular test due to an extreme compounding of the names. This patch changes the name to a generic name that is not dependent on its inputs. Differential Revision: https://reviews.llvm.org/D39440 llvm-svn: 317024
*	[IndVarSimplify] Extract wrapper around SE-.isLoopInvariantPredicate [NFC]	Philip Reames	2017-10-31	1	-17/+33
\| \| \| \| \| \|	This an intermediate state, the next patch will re-inline the markLoopInvariantPredicate function to reduce code duplication. llvm-svn: 317016
*	[IndVarSimplify] Simplify code using a dictionary	Philip Reames	2017-10-31	1	-16/+8
\| \| \| \| \| \|	Possibly very slightly slower, but this code is not performance critical and the readability benefit alone is huge. llvm-svn: 317012
*	[asan] Upgrade private linkage globals to internal linkage on COFF	Reid Kleckner	2017-10-31	1	-2/+7
\| \| \| \| \| \| \|	COFF comdats require symbol table entries, which means the comdat leader cannot have private linkage. llvm-svn: 317009
*	[LoopVectorize] Replace manual VPlan memory management with unique_ptr.	Benjamin Kramer	2017-10-31	1	-26/+10
\| \| \| \| \| \|	No functionality change intended. llvm-svn: 317003
*	[InstCombine] Simplify selects that test cmpxchg instructions	Matthew Simpson	2017-10-31	1	-0/+76
\| \| \| \| \| \| \| \| \| \|	If a select instruction tests the returned flag of a cmpxchg instruction and selects between the returned value of the cmpxchg instruction and its compare operand, the result of the select will always be equal to its false value. Differential Revision: https://reviews.llvm.org/D39383 llvm-svn: 316994
*	[LoopUnroll] Clean up remarks for unroll remainder	David Green	2017-10-31	2	-31/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The optimisation remarks for loop unrolling with an unrolled remainder looks something like: test.c:7:18: remark: completely unrolled loop with 3 iterations [-Rpass=loop-unroll] C[i] += A[i*N+j]; ^ test.c:6:9: remark: unrolled loop by a factor of 4 with run-time trip count [-Rpass=loop-unroll] for(int j = 0; j < N; j++) ^ This removes the first of the two messages. Differential revision: https://reviews.llvm.org/D38725 llvm-svn: 316986