bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[PM] Port StripDeadPrototypes to the new pass manager	Justin Bogner	2015-10-30	2	-22/+32
\| \| \| \| \| \| \|	This is a really straightforward port. Also adds a test for the pass, since it only seemed to be tested tangentially before. llvm-svn: 251726
*	[PM] Port ADCE to the new pass manager	Justin Bogner	2015-10-30	2	-27/+34
\| \| \| \|	llvm-svn: 251725
*	Whitespace. NFC	Justin Bogner	2015-10-30	2	-5/+5
\| \| \| \|	llvm-svn: 251724
*	[FunctionAttrs] Separate another chunk of the logic for functionattrs	Chandler Carruth	2015-10-30	1	-10/+16
\| \| \| \| \| \| \| \| \| \| \|	from its pass harness by providing a lambda to query for AA results. This allows the legacy pass to easily provide a lambda that uses the special helpers to construct function AA results from a legacy CGSCC pass. With the new pass manager (the next patch) the lambda just directly wraps the intuitive query API. llvm-svn: 251715
*	Recommit r251680 (also need to update clang test)	Dehao Chen	2015-10-30	1	-11/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Update the discriminator assignment algorithm * If a scope has already been assigned a discriminator, do not reassign a nested discriminator for it. * If the file and line both match, even if the column does not match, we should assign a new discriminator for the stmt. original code: ; #1 int foo(int i) { ; #2 if (i == 3 \|\| i == 5) return 100; else return 99; ; #3 } ; i == 3: discriminator 0 ; i == 5: discriminator 2 ; return 100: discriminator 1 ; return 99: discriminator 3 llvm-svn: 251689
*	Revert r251680:	Dehao Chen	2015-10-30	1	-12/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Update the discriminator assignment algorithm * If a scope has already been assigned a discriminator, do not reassign a nested discriminator for it. * If the file and line both match, even if the column does not match, we should assign a new discriminator for the stmt. original code: ; #1 int foo(int i) { ; #2 if (i == 3 \|\| i == 5) return 100; else return 99; ; #3 } ; i == 3: discriminator 0 ; i == 5: discriminator 2 ; return 100: discriminator 1 ; return 99: discriminator 3 llvm-svn: 251685
*	Update the discriminator assignment algorithm	Dehao Chen	2015-10-30	1	-11/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* If a scope has already been assigned a discriminator, do not reassign a nested discriminator for it. * If the file and line both match, even if the column does not match, we should assign a new discriminator for the stmt. original code: ; #1 int foo(int i) { ; #2 if (i == 3 \|\| i == 5) return 100; else return 99; ; #3 } ; i == 3: discriminator 0 ; i == 5: discriminator 2 ; return 100: discriminator 1 ; return 99: discriminator 3 llvm-svn: 251680
*	clang-format lib/Transforms/Utils/AddDiscriminators.cpp	Dehao Chen	2015-10-29	1	-12/+11
\| \| \| \|	llvm-svn: 251656
*	[FunctionAttrs] Provide a single SCC node set to all of the	Chandler Carruth	2015-10-29	1	-91/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	transformations in FunctionAttrs rather than building a new one each time. This isn't trivial because there are different heuristics from different passes for exactly what set they want. The primary difference is whether an overridable function completely disables the synthesis of attributes. I've modeled this by directly testing for overridable, and using the common set that excludes external and opt-none functions. This does cause some changes by disabling more optimizations in the face of opt-none. Specifically, we were still optimizing calls to opt-none functions based on their attributes, just not the bodies. It seems better to be conservative on both fronts given the intended semanticas here (best effort to not assume or disturb anything). I've not tried to test this change as it seems complex, brittle, and not important to the implicit contract of opt-none. Instead, it seems more like a choice that should be dictated by the simplified implementation and the change to be acceptable differences within the space of opt-none. A big benefit here is that these transformations no longer rely on the legacy pass manager's SCC types, they just work on generic sets of function pointers. This will make it easy to re-use their logic in the new pass manager. I've also made the transforms static functions instead of members where trivial while I was touching the signatures. llvm-svn: 251640
*	[sanitizer] [msan] Unify aarch64 mapping	Adhemerval Zanella	2015-10-29	1	-21/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch unify the 39-bit and 42-bit mapping for aarch64 to use only one instrumentation algorithm. This removes compiler flag SANITIZER_AARCH64_VMA requirement for MSAN on aarch64. The mapping to use now is for 39 and 42-bits: 0x00000000000ULL-0x01000000000ULL MappingDesc::INVALID 0x01000000000ULL-0x02000000000ULL MappingDesc::SHADOW 0x02000000000ULL-0x03000000000ULL MappingDesc::ORIGIN 0x03000000000ULL-0x04000000000ULL MappingDesc::SHADOW 0x04000000000ULL-0x05000000000ULL MappingDesc::ORIGIN 0x05000000000ULL-0x06000000000ULL MappingDesc::APP 0x06000000000ULL-0x07000000000ULL MappingDesc::INVALID 0x07000000000ULL-0x08000000000ULL MappingDesc::APP And only for 42-bits: 0x08000000000ULL-0x09000000000ULL MappingDesc::INVALID 0x09000000000ULL-0x0A000000000ULL MappingDesc::SHADOW 0x0A000000000ULL-0x0B000000000ULL MappingDesc::ORIGIN 0x0B000000000ULL-0x0F000000000ULL MappingDesc::INVALID 0x0F000000000ULL-0x10000000000ULL MappingDesc::APP 0x10000000000ULL-0x11000000000ULL MappingDesc::INVALID 0x11000000000ULL-0x12000000000ULL MappingDesc::APP 0x12000000000ULL-0x17000000000ULL MappingDesc::INVALID 0x17000000000ULL-0x18000000000ULL MappingDesc::SHADOW 0x18000000000ULL-0x19000000000ULL MappingDesc::ORIGIN 0x19000000000ULL-0x20000000000ULL MappingDesc::INVALID 0x20000000000ULL-0x21000000000ULL MappingDesc::APP 0x21000000000ULL-0x26000000000ULL MappingDesc::INVALID 0x26000000000ULL-0x27000000000ULL MappingDesc::SHADOW 0x27000000000ULL-0x28000000000ULL MappingDesc::ORIGIN 0x28000000000ULL-0x29000000000ULL MappingDesc::SHADOW 0x29000000000ULL-0x2A000000000ULL MappingDesc::ORIGIN 0x2A000000000ULL-0x2B000000000ULL MappingDesc::APP 0x2B000000000ULL-0x2C000000000ULL MappingDesc::INVALID 0x2C000000000ULL-0x2D000000000ULL MappingDesc::SHADOW 0x2D000000000ULL-0x2E000000000ULL MappingDesc::ORIGIN 0x2E000000000ULL-0x2F000000000ULL MappingDesc::APP 0x2F000000000ULL-0x39000000000ULL MappingDesc::INVALID 0x39000000000ULL-0x3A000000000ULL MappingDesc::SHADOW 0x3A000000000ULL-0x3B000000000ULL MappingDesc::ORIGIN 0x3B000000000ULL-0x3C000000000ULL MappingDesc::APP 0x3C000000000ULL-0x3D000000000ULL MappingDesc::INVALID 0x3D000000000ULL-0x3E000000000ULL MappingDesc::SHADOW 0x3E000000000ULL-0x3F000000000ULL MappingDesc::ORIGIN 0x3F000000000ULL-0x40000000000ULL MappingDesc::APP And although complex it provides a better memory utilization that previous one. llvm-svn: 251624
*	Fix use-after-free. Thanks ASAN for giving me a detailed report :-).	Daniel Jasper	2015-10-29	1	-2/+2
\| \| \| \|	llvm-svn: 251623
*	Revert the revision 251592 as it fails a test on some platforms.	Cong Hou	2015-10-29	1	-93/+28
\| \| \| \|	llvm-svn: 251617
*	[PGO] Do not emit runtime hook user function for Linux	Xinliang David Li	2015-10-29	1	-0/+5
\| \| \| \| \| \| \| \| \| \|	Clang driver now injects -u<hook_var> flag in the linker command line, in which case user function is not needed any more. Differential Revision: http://reviews.llvm.org/D14033 llvm-svn: 251612
*	[LVI/CVP] Teach LVI about range metadata	Philip Reames	2015-10-29	1	-26/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Somewhat shockingly for an analysis pass which is computing constant ranges, LVI did not understand the ranges provided by range metadata. As part of this change, I included a change to CVP primarily because doing so made it much easier to write small self contained test cases. CVP was previously only handling the non-local operand case, but given that LVI can sometimes figure out information about instructions standalone, I don't see any reason to restrict this. There could possibly be a compile time impact from this, but I suspect it should be minimal. If anyone has an example which substaintially regresses, please let me know. I could restrict the block local handling to ICmps feeding Terminator instructions if needed. Note that this patch continues a somewhat bad practice in LVI. In many cases, we know facts about values, and separate context sensitive facts about values. LVI makes no effort to distinguish and will frequently cache the same value fact repeatedly for different contexts. I would like to change this, but that's a large enough change that I want it to go in separately with clear documentation of what's changing. Other examples of this include the non-null handling, and arguments. As a meta comment: the entire motivation of this change was being able to write smaller (aka reasonable sized) test cases for a future patch teaching LVI about select instructions. Differential Revision: http://reviews.llvm.org/D13543 llvm-svn: 251606
*	[SimplifyCFG] Constant fold a branch implied by it's incoming edge	Philip Reames	2015-10-29	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \|	The most common use case is when eliminating redundant range checks in an example like the following: c = a[i+1] + a[i]; Note that all the smarts of the transform (the implication engine) is already in ValueTracking and is tested directly through InstructionSimplify. Differential Revision: http://reviews.llvm.org/D13040 llvm-svn: 251596
*	[SimplifyLibCalls] Factor out common unsafe-math checks.	Davide Italiano	2015-10-29	1	-29/+23
\| \| \| \|	llvm-svn: 251595
*	Add a flag vectorizer-maximize-bandwidth in loop vectorizer to enable using ↵	Cong Hou	2015-10-29	1	-28/+93
\| \| \| \| \| \| \| \|	larger vectorization factor. To be able to maximize the bandwidth during vectorization, this patch provides a new flag vectorizer-maximize-bandwidth. When it is turned on, the vectorizer will determine the vectorization factor (VF) using the smallest instead of widest type in the loop. To avoid increasing register pressure too much, estimates of the register usage for different VFs are calculated so that we only choose a VF when its register usage doesn't exceed the number of available registers. llvm-svn: 251592
*	SamplePGO - Add flag to check sampling coverage.	Diego Novillo	2015-10-28	1	-3/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the flag -mllvm -sample-profile-check-coverage=N to the SampleProfile pass. N is the percent of input sample records that the user expects to apply. If the pass does not use N% (or more) of the sample records in the input, it emits a warning. This is useful to detect some forms of stale profiles. If the code has drifted enough from the original profile, there will be records that do not match the IR anymore. This will not detect cases where a sample profile record for line L is referring to some other instructions that also used to be at line L. llvm-svn: 251568
*	[JumpThreading] Use dominating conditions to prove implications	Sanjoy Das	2015-10-28	1	-2/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If P branches to Q conditional on C and Q branches to R conditional on C' and C => C' then the branch conditional on C' can be folded to an unconditional branch. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13972 llvm-svn: 251557
*	SamplePGO - Clear per-function data after applying a profile.	Diego Novillo	2015-10-28	1	-4/+21
\| \| \| \| \| \| \| \| \| \|	The pass was keeping around a lot of per-function data (visited blocks, edges, dominance, etc) that is just taking up memory for no reason. In fact, from function to function it could potentially confuse the propagator since some maps are indexed by line offsets which can be common between functions. llvm-svn: 251531
*	Typo.	Chad Rosier	2015-10-28	1	-1/+1
\| \| \| \|	llvm-svn: 251521
*	Reapply: [LIR] Add support for creating memsets from loops with a negative ↵	Chad Rosier	2015-10-28	1	-24/+32
\| \| \| \| \| \| \| \|	stride. The simple fix is to prevent forming memcpy from loops with a negative stride. llvm-svn: 251518
*	[GlobalOpt] Add newlines to DEBUG messages	James Molloy	2015-10-28	1	-4/+4
\| \| \| \| \| \| \| \|	I think these were affected by a change way back when to stop printing newlines in Value::dump() by default. This change simply allows the debug output to be readable. NFC. llvm-svn: 251517
*	Revert "[LIR] Add support for creating memsets from loops with a negative ↵	Chad Rosier	2015-10-28	1	-28/+24
\| \| \| \| \| \| \| \|	stride." This reverts commit r251512. This is causing LNT/chomp to fail. llvm-svn: 251513
*	[LIR] Add support for creating memsets from loops with a negative stride.	Chad Rosier	2015-10-28	1	-24/+28
\| \| \| \| \| \|	http://reviews.llvm.org/D14125 llvm-svn: 251512
*	Revert r251492 "[IndVarSimplify] Rewrite loop exit values with their	Chen Li	2015-10-28	1	-71/+0
\| \| \| \| \| \|	initial values from loop preheader", because it broke some bots. llvm-svn: 251498
*	[IndVarSimplify] Rewrite loop exit values with their initial values from ↵	Chen Li	2015-10-28	1	-0/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	loop preheader Summary: This patch adds support to check if a loop has loop invariant conditions which lead to loop exits. If so, we know that if the exit path is taken, it is at the first loop iteration. If there is an induction variable used in that exit path whose value has not been updated, it will keep its initial value passing from loop preheader. We can therefore rewrite the exit value with its initial value. This will help remove phis created by LCSSA and enable other optimizations like loop unswitch. Reviewers: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13974 llvm-svn: 251492
*	[SimplifyCFG] Don't DCE catchret because the successor is unreachable	David Majnemer	2015-10-27	1	-2/+1
\| \| \| \| \| \| \|	CatchReturnInst has side-effects: it runs a destructor. This destructor could conceivably run forever/call exit/etc. and should not be removed. llvm-svn: 251461
*	Whitespace.	NAKAMURA Takumi	2015-10-27	1	-1/+1
\| \| \| \|	llvm-svn: 251437
*	Revert r251291, "Loop Vectorizer - skipping "bitcast" before GEP"	NAKAMURA Takumi	2015-10-27	1	-16/+3
\| \| \| \| \| \| \|	It causes miscompilation of llvm/lib/ExecutionEngine/Interpreter/Execution.cpp. See also PR25324. llvm-svn: 251436
*	Tidy a comment. NFC.	Diego Novillo	2015-10-27	1	-1/+1
\| \| \| \|	llvm-svn: 251434
*	[SLP] Be more aggressive about reduction width selection.	Charlie Turner	2015-10-27	1	-12/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change could be way off-piste, I'm looking for any feedback on whether it's an acceptable approach. It never seems to be a problem to gobble up as many reduction values as can be found, and then to attempt to reduce the resulting tree. Some of the workloads I'm looking at have been aggressively unrolled by hand, and by selecting reduction widths that are not constrained by a vector register size, it becomes possible to profitably vectorize. My test case shows such an unrolling which SLP was not vectorizing (on neither ARM nor X86) before this patch, but with it does vectorize. I measure no significant compile time impact of this change when combined with D13949 and D14063. There are also no significant performance regressions on ARM/AArch64 in SPEC or LNT. The more principled approach I thought of was to generate several candidate tree's and use the cost model to pick the cheapest one. That seemed like quite a big design change (the algorithms seem very much one-shot), and would likely be a costly thing for compile time. This seemed to do the job at very little cost, but I'm worried I've misunderstood something! Reviewers: nadav, jmolloy Subscribers: mssimpso, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D14116 llvm-svn: 251428
*	[SLP] Try a bit harder to find reduction PHIs	Charlie Turner	2015-10-27	1	-5/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently, when the SLP vectorizer considers whether a phi is part of a reduction, it dismisses phi's whose incoming blocks are not the same as the block containing the phi. For the patterns I'm looking at, extending this rule to allow phis whose incoming block is a containing loop latch allows me to vectorize certain workloads. There is no significant compile-time impact, and combined with D13949, no performance improvement measured in ARM/AArch64 in any of SPEC2000, SPEC2006 or LNT. Reviewers: jmolloy, mcrosier, nadav Subscribers: mssimpso, nadav, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D14063 llvm-svn: 251425
*	[SLP] Treat SelectInsts as reduction values.	Charlie Turner	2015-10-27	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Certain workloads, in particular sum-of-absdiff loops, can be vectorized using SLP if it can treat select instructions as reduction values. The test case is a bit awkward. The AArch64 cost model needs some tuning to not be so pessimistic about selects. I've had to tweak the SLP threshold here. Reviewers: jmolloy, mzolotukhin, spatel, nadav Subscribers: nadav, mssimpso, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D13949 llvm-svn: 251424
*	Fix SamplePGO segfault when debug info is missing.	Diego Novillo	2015-10-27	1	-2/+4
\| \| \| \| \| \| \| \| \| \|	When emitting a remark for a conditional branch annotation, the remark uses the line location information of the conditional branch in the message. In some cases, that information is unavailable and the optimization would segfaul. I'm still not sure whether this is a bug or WAI, but the optimizer should not die because of this. llvm-svn: 251420
*	[SimplifyLibCalls] Use range-based loop. No functional change.	Davide Italiano	2015-10-27	1	-4/+2
\| \| \| \|	llvm-svn: 251383
*	[function-attrs] Refactor code to handle shorter code with early exits.	Chandler Carruth	2015-10-27	1	-31/+37
\| \| \| \| \| \| \| \| \| \| \|	No functionality changed here, but the indentation is substantially reduced and IMO the code is much easier to read. I've also added some helpful comments. This is just a clean-up I wrote while studying the code, and that has been in my backlog for a while. llvm-svn: 251381
*	Remove unused local variable. NFC.	Diego Novillo	2015-10-26	1	-2/+0
\| \| \| \|	llvm-svn: 251344
*	[RS4GC] Strip noalias attribute after statepoint rewrite	Igor Laevsky	2015-10-26	1	-1/+4
\| \| \| \| \| \| \| \| \|	We should remove noalias along with dereference and dereference_or_null attributes because statepoint could potentially touch the entire heap including noalias objects. Differential Revision: http://reviews.llvm.org/D14032 llvm-svn: 251333
*	SamplePGO - Add optimization reports.	Diego Novillo	2015-10-26	1	-6/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a couple of optimization remarks to the SamplePGO transformation. When it decides to inline a hot function (to mimic the inline stack and repeat useful inline decisions in the original build). It will also report branch destinations. For instance, given the code fragment: 6 if (i < 1000) 7 sum -= i; 8 else 9 sum += -i * rand(); If the 'else' branch is taken most of the time, building this code with -Rpass=sample-profile will produce: a.cc:9:14: remark: most popular destination for conditional branches at small.cc:6:9 [-Rpass=sample-profile] sum += -i * rand(); ^ llvm-svn: 251330
*	Move the canonical header to the top of its matching cpp file as per coding ↵	David Blaikie	2015-10-26	1	-1/+2
\| \| \| \| \| \| \| \| \|	convention This ensures that the header will be verified to be standalone (and avoid mistakes like the one fixed in r251178) llvm-svn: 251326
*	[safestack] Fast access to the unsafe stack pointer on AArch64/Android.	Evgeniy Stepanov	2015-10-26	1	-47/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Android libc provides a fixed TLS slot for the unsafe stack pointer, and this change implements direct access to that slot on AArch64 via __builtin_thread_pointer() + offset. This change also moves more code into TargetLowering and its target-specific subclasses to get rid of target-specific codegen in SafeStackPass. This change does not touch the ARM backend because ARM lowers builting_thread_pointer as aeabi_read_tp, which is not available on Android. The previous iteration of this change was reverted in r250461. This version leaves the generic, compiler-rt based implementation in SafeStack.cpp instead of moving it to TargetLoweringBase in order to allow testing without a TargetMachine. llvm-svn: 251324
*	Refactor: Simplify boolean conditional return statements in ↵	Alexey Samsonov	2015-10-26	1	-4/+2
\| \| \| \| \| \| \| \| \| \| \| \|	lib/Transforms/Instrumentation Summary: Use clang-tidy to simplify boolean conditional return statements. Differential Revision: http://reviews.llvm.org/D9996 Patch by Richard (legalize@xmission.com)! llvm-svn: 251318
*	Loop Vectorizer - skipping "bitcast" before GEP	Elena Demikhovsky	2015-10-26	1	-3/+16
\| \| \| \| \| \| \| \| \| \|	Vectorization of memory instruction (Load/Store) is possible when the pointer is coming from GEP. The GEP analysis allows to estimate the profit. In some cases we have a "bitcast" between GEP and memory instruction. I added code that skips the "bitcast". http://reviews.llvm.org/D13886 llvm-svn: 251291
*	[InstCombine] Teach instcombine not to create extra PHI nodes when folding GEPs	Silviu Baranga	2015-10-26	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: InstCombine tries to transform GEP(PHI(GEP1, GEP2, ..)) into GEP(GEP(PHI(...)) when possible. However, this may leave the old PHI node around. Even if we do end up folding the GEPs, having an extra PHI node might not be beneficial. This change makes the transformation more conservative. We now only do this if the PHI has only one use, and can therefore be removed after the transformation. Reviewers: jmolloy, majnemer Subscribers: mcrosier, mssimpso, llvm-commits Differential Revision: http://reviews.llvm.org/D13887 llvm-svn: 251281
*	Convert assert(false) into llvm_unreachable where it makes sense.	Benjamin Kramer	2015-10-25	1	-1/+1
\| \| \| \|	llvm-svn: 251266
*	[LCSSA] Unbreak build, don't reuse L; NFC	Sanjoy Das	2015-10-25	1	-2/+2
\| \| \| \| \| \|	The build broke in r251248. llvm-svn: 251251
*	[LCSSA] Use range for loops; NFC	Sanjoy Das	2015-10-25	1	-28/+21
\| \| \| \|	llvm-svn: 251248
*	Refactor: Simplify boolean conditional return statements in ↵	Michael Zolotukhin	2015-10-24	2	-11/+5
\| \| \| \| \| \| \| \| \| \| \| \|	lib/Transforms/Vectorize (NFC). Summary: Use clang-tidy to simplify boolean conditional return statements Differential Revision: http://reviews.llvm.org/D10003 Patch by Richard<legalize@xmission.com> llvm-svn: 251206
*	ScalarReplAggregates.cpp: Try to appease clash of anonymous::SROA in modules ↵	NAKAMURA Takumi	2015-10-24	1	-0/+1
\| \| \| \| \| \|	build. llvm-svn: 251181