bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	LoopVectorize: External uses must use the last value in a reduction cycle	Arnold Schwaighofer	2013-10-07	1	-0/+6
\| \| \| \| \| \| \| \| \|	Otherwise, we don't perform operations that would have been performed on the scalar version. Fixes PR17498. llvm-svn: 192133
*	Revert r191834 until we measure the effect of this benchmarks and maybe find ↵	Alexey Samsonov	2013-10-07	1	-3/+56
\| \| \| \| \| \|	a better way to fix it llvm-svn: 192121
*	UpdatePHINodes in BasicBlockUtils should not crash on duplicate predecessors	Hal Finkel	2013-10-04	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \|	UpdatePHINodes has an optimization to reuse an existing PHI node, where it first deletes all of its entries and then replaces them. Unfortunately, in the case where we had duplicate predecessors (which are allowed so long as the associated PHI entries have the same value), the loop removing the existing PHI entries from the to-be-reused PHI would assert (if that PHI was not the one which had the duplicates). llvm-svn: 192001
*	SLPVectorizer: Sort inputs to commutative binary operations	Arnold Schwaighofer	2013-10-04	1	-4/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Sort the operands of the other entries in the current vectorization root according to the first entry's operands opcodes. %conv0 = uitofp ... %load0 = load float ... = fmul %conv0, %load0 = fmul %load0, %conv1 = fmul %load0, %conv2 Make sure that we recursively vectorize <%conv0, %conv1, %conv2> and <%load0, %load0, %load0>. This makes it more likely to obtain vectorizable trees. We have to be careful when we sort that we don't destroy 'good' existing ordering implied by source order. radar://15080067 llvm-svn: 191977
*	Pull fptrunc's upwards through selects when one of the select's selectands ↵	Owen Anderson	2013-10-03	1	-0/+13
\| \| \| \| \| \|	was a constant. This has a number of benefits, including producing small immediates (easier to materialize, smaller constant pools) as well as being more likely to allow the fptrunc to fuse with a preceding instruction (truncating selects are unusual). llvm-svn: 191929
*	Optimize linkonce_odr unnamed_addr functions during LTO.	Rafael Espindola	2013-10-03	3	-11/+39
\| \| \| \| \| \| \| \| \| \| \|	Generalize the API so we can distinguish symbols that are needed just for a DSO symbol table from those that are used from some native .o. The symbols that are only wanted for the dso symbol table can be dropped if llvm can prove every other dso has a copy (linkonce_odr) and the address is not important (unnamed_addr). llvm-svn: 191922
*	Make gep i8* X, -(ptrtoint Y) transform work with address spaces	Matt Arsenault	2013-10-03	1	-8/+10
\| \| \| \|	llvm-svn: 191920
*	Don't use runtime bounds check between address spaces.	Matt Arsenault	2013-10-02	1	-11/+49
\| \| \| \| \| \| \| \| \|	Don't vectorize with a runtime check if it requires a comparison between pointers with different address spaces. The values can't be assumed to be directly comparable. Previously it would create an illegal bitcast. llvm-svn: 191862
*	Apply slp vectorization on fully-vectorizable tree of height 2	Yi Jiang	2013-10-02	1	-4/+21
\| \| \| \|	llvm-svn: 191852
*	Fix debug printing spacing.	Matt Arsenault	2013-10-02	1	-32/+33
\| \| \| \| \| \|	Fix missing newlines, missing and extra spaces in printed messages. llvm-svn: 191851
*	Fix comment grammar and capitalization.	Matt Arsenault	2013-10-02	1	-3/+3
\| \| \| \|	llvm-svn: 191850
*	SLPVectorizer: Make store chain finding more aggressive with ↵	Benjamin Kramer	2013-10-02	1	-9/+4
\| \| \| \| \| \| \| \| \|	GetUnderlyingObject. This recursively strips all GEPs like the existing code. It also handles bitcasts and other operations that do not change the pointer value. llvm-svn: 191847
*	StructurizeCFG: Add dependency on LowerSwitch pass	Tom Stellard	2013-10-02	1	-1/+3
\| \| \| \| \| \| \| \|	Switch instructions were crashing the StructurizeCFG pass, and it's probably easier anyway if we don't need to handle them in this pass. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 191841
*	Remove the very substantial, largely unmaintained legacy PGO	Chandler Carruth	2013-10-02	8	-1794/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	infrastructure. This was essentially work toward PGO based on a design that had several flaws, partially dating from a time when LLVM had a different architecture, and with an effort to modernize it abandoned without being completed. Since then, it has bitrotted for several years further. The result is nearly unusable, and isn't helping any of the modern PGO efforts. Instead, it is getting in the way, adding confusion about PGO in LLVM and distracting everyone with maintenance on essentially dead code. Removing it paves the way for modern efforts around PGO. Among other effects, this removes the last of the runtime libraries from LLVM. Those are being developed in the separate 'compiler-rt' project now, with somewhat different licensing specifically more approriate for runtimes. llvm-svn: 191835
*	Remove "localize global" optimization	Alexey Samsonov	2013-10-02	1	-56/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As discussed in http://llvm-reviews.chandlerc.com/D1754, this optimization isn't really valid for C, and fires too rarely anyway. Reviewers: rafael, nicholas Reviewed By: nicholas CC: rnk, llvm-commits, nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D1769 llvm-svn: 191834
*	Don't merge tiny functions.	Matt Arsenault	2013-10-01	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's silly to merge functions like these: define void @foo(i32 %x) { ret void } define void @bar(i32 %x) { ret void } to get define void @bar(i32) { tail call void @foo(i32 %0) ret void } llvm-svn: 191786
*	Remove several unused variables.	Rafael Espindola	2013-10-01	5	-6/+1
\| \| \| \| \| \|	Patch by Alp Toker. llvm-svn: 191757
*	Fix code duplication	Matt Arsenault	2013-10-01	1	-21/+10
\| \| \| \|	llvm-svn: 191716
*	Use right address space size in InstCombineCompares	Matt Arsenault	2013-09-30	1	-3/+6
\| \| \| \| \| \| \|	The test's output doesn't change, but this ensures this is actually hit with a different address space. llvm-svn: 191701
*	Constant fold ptrtoint + compare with address spaces	Matt Arsenault	2013-09-30	1	-1/+1
\| \| \| \|	llvm-svn: 191699
*	BoundsChecking: Fix refacto.	Benjamin Kramer	2013-09-30	1	-0/+1
\| \| \| \|	llvm-svn: 191676
*	Convert manual insert point restores to the new RAII object.	Benjamin Kramer	2013-09-30	3	-13/+3
\| \| \| \|	llvm-svn: 191675
*	InstCombine: Replace manual fast math flag copying with the new IRBuilder ↵	Benjamin Kramer	2013-09-30	1	-22/+20
\| \| \| \| \| \| \| \| \|	RAII helper. Defines away the issue where cast<Instruction> would fail because constant folding happened. Also slightly cleaner. llvm-svn: 191674
*	IRBuilder: Add RAII objects to reset insertion points or fast math flags.	Benjamin Kramer	2013-09-30	1	-21/+1
\| \| \| \| \| \| \| \|	Inspired by the object from the SLPVectorizer. This found a minor bug in the debug loc restoration in the vectorizer where the location of a following instruction was attached instead of the location from the original instruction. llvm-svn: 191673
*	Fix a bug in InstCombine where it attempted to cast a Value* to an Instruction*	Joey Gouly	2013-09-30	1	-2/+2
\| \| \| \| \| \| \| \| \|	when it was actually a Constant*. There are quite a few other casts to Instruction that might have the same problem, but this is the only one I have a test case for. llvm-svn: 191668
*	Even more spelling fixes for "instruction".	Robert Wilhelm	2013-09-28	3	-4/+4
\| \| \| \|	llvm-svn: 191611
*	Fix spelling intruction -> instruction.	Robert Wilhelm	2013-09-28	2	-2/+2
\| \| \| \|	llvm-svn: 191610
*	Use right pointer type in DebugIR	Matt Arsenault	2013-09-27	1	-1/+1
\| \| \| \|	llvm-svn: 191576
*	Use type helper functions	Matt Arsenault	2013-09-27	2	-3/+2
\| \| \| \|	llvm-svn: 191574
*	Fix SLPVectorizer using wrong address space for load/store	Matt Arsenault	2013-09-27	1	-4/+7
\| \| \| \|	llvm-svn: 191564
*	InstCombine: Only foldSelectICmpAndOr for integer types	Justin Bogner	2013-09-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Currently foldSelectICmpAndOr asserts if the "or" involves a vector containing several of the same power of two. We can easily avoid this by only performing the fold on integer types, like foldSelectICmpAnd does. Fixes <rdar://problem/15012516> llvm-svn: 191552
*	Transforms: Use getFirstNonPHI to set the insertion point for PHIs	Justin Bogner	2013-09-27	2	-2/+2
\| \| \| \| \| \| \| \| \| \|	We were previously using getFirstInsertionPt to insert PHI instructions when vectorizing, but getFirstInsertionPt also skips past landingpads, causing this to generate invalid IR. We can avoid this issue by using getFirstNonPHI instead. llvm-svn: 191526
*	First check in. Modified a comment.	Puyan Lotfi	2013-09-27	1	-1/+1
\| \| \| \|	llvm-svn: 191491
*	SLPVectorize: Put horizontal reductions feeding a store under separate flag	Arnold Schwaighofer	2013-09-25	1	-13/+18
\| \| \| \| \| \| \|	Put them under a separate flag for experimentation. They are more likely to interfere with loop vectorization which happens later in the pass pipeline. llvm-svn: 191371
*	[msan] Fix -Wreturn-type warnings in non-self-hosted build.	Evgeniy Stepanov	2013-09-25	1	-0/+2
\| \| \| \|	llvm-svn: 191361
*	set the cost of tiny trees to INT_MAX in SLP vectorizer to disable ↵	Yi Jiang	2013-09-24	1	-1/+1
\| \| \| \| \| \|	vectorization on them llvm-svn: 191314
*	Push analysis passes to InstSimplify when they're around anyways.	Benjamin Kramer	2013-09-24	3	-3/+4
\| \| \| \|	llvm-svn: 191309
*	[msan] Handling of atomic load/store, atomic rmw, cmpxchg.	Evgeniy Stepanov	2013-09-24	1	-2/+96
\| \| \| \|	llvm-svn: 191287
*	Revert "LoopVectorizer: Only allow vectorization of intrinsics."	Arnold Schwaighofer	2013-09-23	1	-29/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Revert 191122 - with extra checks we are allowed to vectorize math library function calls. Standard library indentifiers are reserved names so functions with external linkage must not overrided them. However, functions with internal linkage can. Therefore, we can vectorize calls to math library functions with a check for external linkage and matching signature. This matches what we do during SelectionDAG building. llvm-svn: 191206
*	Provide basic type safety for array_pod_sort comparators.	Benjamin Kramer	2013-09-22	2	-11/+6
\| \| \| \| \| \| \| \|	This makes using array_pod_sort significantly safer. The implementation relies on function pointer casting but that should be safe as we're dealing with void* here. llvm-svn: 191175
*	Drop spurious handle in comment.	Benjamin Kramer	2013-09-22	1	-1/+1
\| \| \| \|	llvm-svn: 191172
*	SROA: Handle casts involving vectors of pointers and integer scalars.	Benjamin Kramer	2013-09-21	1	-11/+47
\| \| \| \| \| \| \| \| \| \|	SROA wants to convert any types of equivalent widths but it's not possible to convert vectors of pointers to an integer scalar with a single cast. As a workaround we add a bitcast to the corresponding int ptr type first. This type of cast used to be an edge case but has become common with SLP vectorization. Fixes PR17271. llvm-svn: 191143
*	SLPVectorizer: Fix multiline comment warning	Arnold Schwaighofer	2013-09-21	1	-2/+2
\| \| \| \|	llvm-svn: 191135
*	Reapply "SLPVectorizer: Handle more horizontal reductions (disabled)""	Arnold Schwaighofer	2013-09-21	1	-8/+364
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reapply r191108 with a fix for a memory corruption error I introduced. Of course, we can't reference the scalars that we replace by vectorizing and then call their eraseFromParent method. I only 'needed' the scalars to get the DebugLoc. Just store the DebugLoc before actually vectorizing instead. As a nice side effect, this also simplifies the interface between BoUpSLP and the HorizontalReduction class to returning a value pointer (the vectorized tree root). radar://14607682 llvm-svn: 191123
*	LoopVectorizer: Only allow vectorization of intrinsics. We can't know for ↵	Nadav Rotem	2013-09-21	1	-2/+11
\| \| \| \| \| \| \| \|	sure that the functions 'abs' or 'round' are the functions from libm. rdar://15012650 llvm-svn: 191122
*	Revert "SLPVectorizer: Handle more horizontal reductions (disabled)"	Arnold Schwaighofer	2013-09-21	1	-368/+8
\| \| \| \| \| \| \| \| \|	This reverts commit r191108. The horizontal.ll test case fails under libgmalloc. Thanks Shuxin for pointing this out to me. llvm-svn: 191121
*	Resurrect r191017 " GVN proceeds in the presence of dead code" plus a fix to ↵	Shuxin Yang	2013-09-20	1	-6/+168
\| \| \| \| \| \| \| \| \|	PR17307 & 17308. The problem of r191017 is that when GVN fabricate a val-number for a dead instruction (in order to make following expr-PRE happy), it forget to fabricate a leader-table entry for it as well. llvm-svn: 191118
*	InstCombine: Remove unused argument. No functionality change.	Benjamin Kramer	2013-09-20	2	-12/+6
\| \| \| \|	llvm-svn: 191112
*	SLPVectorizer: Handle more horizontal reductions (disabled)	Arnold Schwaighofer	2013-09-20	1	-8/+368
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Match reductions starting at binary operation feeding into a phi. The code handles trees like r += v1 + v2 + v3 ... and r += v1 r += v2 ... and r *= v1 + v2 + ... We currently only handle associative operations (add, fadd fast). The code can now also handle reductions feeding into stores. a[i] = v1 + v2 + v3 + ... The code is currently disabled behind the flag "-slp-vectorize-hor". The cost model for most architectures is not there yet. I found one opportunity of a horizontal reduction feeding a phi in TSVC (LoopRerolling-flt) and there are several opportunities where reductions feed into stores. radar://14607682 llvm-svn: 191108
*	Revert r191017, it results in segmentation faults in Qt.	Joerg Sonnenberger	2013-09-20	1	-164/+6
\| \| \| \|	llvm-svn: 191104