order to reduce ((x<<30)>>24) to x<<6, check the correct bits. PR 8547.
llvm-svn: 118665
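
A minimal LLVM IR sketch of the kind of shift pair involved (the value and the masking and are assumed for illustration, not taken from PR 8547):
define i32 @shl_lshr(i32 %v) {
  %x = and i32 %v, 3          ; only the low 2 bits of %x can be nonzero
  %a = shl i32 %x, 30
  %b = lshr i32 %a, 24        ; with the other bits of %x known zero, this pair reduces to shl i32 %x, 6
  ret i32 %b
}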

will BECOME the low bits are zero, not that the current low bits are zero.
Fixes <rdar://problem/8606771>.
llvm-svn: 117953

llvm-svn: 117728

llvm-svn: 117727

llvm-svn: 117722

This code had previously used 2*N, where N is the mask length, to represent
undef. That is not safe because the shufflevector operands may have more
than N elements -- they don't have to match the result type.
llvm-svn: 117721
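
For illustration (a hedged sketch, not the original test case; names assumed), a shufflevector whose operands are longer than its mask:
define <2 x float> @pick(<4 x float> %a, <4 x float> %b) {
  ; the mask has N = 2 entries, but valid operand element indices run 0..7,
  ; so using 2*N (here 4) as an undef marker would collide with a real index
  %r = shufflevector <4 x float> %a, <4 x float> %b, <2 x i32> <i32 0, i32 5>
  ret <2 x float> %r
}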

Allow splats even if they don't match either of the original shuffles,
possibly due to undef entries in the shuffle masks. Radar 8597790.
Also fix some 80-column violations.
llvm-svn: 117719
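
An assumed LLVM IR illustration (not from the commit) of a mask that is still a splat despite an undef entry:
define <4 x float> @splat2(<4 x float> %v) {
  %s = shufflevector <4 x float> %v, <4 x float> undef, <4 x i32> <i32 2, i32 undef, i32 2, i32 2>
  ; every defined mask entry selects element 2, so the result is a splat of element 2
  ret <4 x float> %s
}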

llvm-svn: 117510

from stores when folding in bitcasts.
llvm-svn: 117265

llvm-svn: 117213

llvm-svn: 117154

exposes an initializeMyPassFunction(), which must be called in the pass's constructor.
This function uses static dependency declarations to recursively initialize the pass's dependencies.
Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the
CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h
before parsing command-line arguments.
I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems
with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass
registration/creation, please send the testcase to me directly.
llvm-svn: 116820

llvm-svn: 115996

llvm-svn: 115965

llvm-svn: 115296

llvm-svn: 114999

This reverts commit r113632.
Conflicts:
    cmake/modules/AddLLVM.cmake
llvm-svn: 113819

instcombine transforms to expose greater opportunities for store narrowing
in codegen. This patch fixes a potential infinite loop in instcombine
caused by one of the introduced transforms being overly aggressive.
llvm-svn: 113763

on to Owen.
llvm-svn: 113720

of the and's mask. This can result in increased opportunities for store narrowing in code generation.
Update a number of tests for this change. This fixes <rdar://problem/8285027>.
Additionally, because this inverts the order of ors and ands, some patterns for optimizing or-of-and-of-or
no longer fire in instances where they did originally. Add a simple transform which recaptures most of these
opportunities: if we have an or-of-constant-or and have failed to fold away the inner or, commute the order
of the two ors, to give the non-constant or a chance for simplification instead.
llvm-svn: 113679
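
A hedged IR sketch of the commute described above (operand names and the constant are assumed): when the inner or with a constant has not folded away, swapping the two ors gives the non-constant or a chance to simplify:
define i32 @commute_or(i32 %x, i32 %y) {
  %inner = or i32 %x, 32       ; or-of-constant that failed to fold
  %outer = or i32 %inner, %y
  ; commuted form: %t = or i32 %x, %y, then %outer = or i32 %t, 32
  ret i32 %outer
}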

llvm-svn: 113632

llvm-svn: 113608

single test. Patch by Dirk Steinke!
llvm-svn: 113423

turning (fptrunc (sqrt (fpext x))) -> (sqrtf x) is great, but we have
to delete the original sqrt as well. Not doing so causes us to do
two sqrts when building with -fmath-errno (the default on Linux).
llvm-svn: 113260
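
In IR the pattern looks roughly like this (an illustrative sketch; the libm declaration is assumed):
declare double @sqrt(double)
define float @foo(float %x) {
  %ext = fpext float %x to double
  %call = call double @sqrt(double %ext)
  %res = fptrunc double %call to float
  ; folds to a call to float @sqrtf(float %x); the original call to @sqrt must be
  ; erased too, otherwise -fmath-errno builds keep it around and execute both
  ret float %res
}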

discovered a miscompilation in it, and it's not easily fixable at the
optimizer level. I'll investigate reimplementing it in DAGCombine.
llvm-svn: 112575

llvm-svn: 112351

like this:
struct S { float A, B, C, D; };
struct S g;
struct S bar() {
  struct S A = g;
  ++A.B;
  A.A = 42;
  return A;
}
we now generate:
_bar: ## @bar
## BB#0: ## %entry
movq _g@GOTPCREL(%rip), %rax
movss 12(%rax), %xmm0
pshufd $16, %xmm0, %xmm0
movss 4(%rax), %xmm2
movss 8(%rax), %xmm1
pshufd $16, %xmm1, %xmm1
unpcklps %xmm0, %xmm1
addss LCPI1_0(%rip), %xmm2
pshufd $16, %xmm2, %xmm2
movss LCPI1_1(%rip), %xmm0
pshufd $16, %xmm0, %xmm0
unpcklps %xmm2, %xmm0
ret
instead of:
_bar: ## @bar
## BB#0: ## %entry
movq _g@GOTPCREL(%rip), %rax
movss 12(%rax), %xmm0
pshufd $16, %xmm0, %xmm0
movss 4(%rax), %xmm2
movss 8(%rax), %xmm1
pshufd $16, %xmm1, %xmm1
unpcklps %xmm0, %xmm1
addss LCPI1_0(%rip), %xmm2
movd %xmm2, %eax
shlq $32, %rax
addq $1109917696, %rax ## imm = 0x42280000
movd %rax, %xmm0
ret
llvm-svn: 112345

element insertion from the pieces that feed into the vector.
This handles a pattern that occurs frequently due to code
generated for the x86-64 ABI. We now compile something like
this:
struct S { float A, B, C, D; };
struct S g;
struct S bar() {
  struct S A = g;
  ++A.A;
  ++A.C;
  return A;
}
into all nice vector operations:
_bar: ## @bar
## BB#0: ## %entry
movq _g@GOTPCREL(%rip), %rax
movss LCPI1_0(%rip), %xmm1
movss (%rax), %xmm0
addss %xmm1, %xmm0
pshufd $16, %xmm0, %xmm0
movss 4(%rax), %xmm2
movss 12(%rax), %xmm3
pshufd $16, %xmm2, %xmm2
unpcklps %xmm2, %xmm0
addss 8(%rax), %xmm1
pshufd $16, %xmm1, %xmm1
pshufd $16, %xmm3, %xmm2
unpcklps %xmm2, %xmm1
ret
instead of icky integer operations:
_bar: ## @bar
movq _g@GOTPCREL(%rip), %rax
movss LCPI1_0(%rip), %xmm1
movss (%rax), %xmm0
addss %xmm1, %xmm0
movd %xmm0, %ecx
movl 4(%rax), %edx
movl 12(%rax), %esi
shlq $32, %rdx
addq %rcx, %rdx
movd %rdx, %xmm0
addss 8(%rax), %xmm1
movd %xmm1, %eax
shlq $32, %rsi
addq %rax, %rsi
movd %rsi, %xmm1
ret
This resolves rdar://8360454
llvm-svn: 112343

A = shl x, 42
...
B = lshr ..., 38
which can be transformed into:
A = shl x, 4
...
iff we can prove that the would-be-shifted-in bits are already zero.
This eliminates two shifts in the testcase and allows elimination of
the whole i128 chain in the real example.
llvm-svn: 112314

framework, which is good at ripping through bitfield operations. This
generalizes a bunch of the existing xforms that instcombine does, such as
(x << c) >> c -> and
to handle intermediate logical nodes. This is useful for ripping up the
"promote to large integer" code produced by SRoA.
llvm-svn: 112304
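
An illustrative sketch (values assumed) of an intermediate logical node the demanded-bits walk can now see through:
define i32 @through_xor(i32 %x) {
  %a = shl i32 %x, 8
  %b = xor i32 %a, 255         ; logical node sitting between the two shifts
  %c = lshr i32 %b, 8
  ; the xor only touches bits the lshr discards, so this still simplifies to
  ; %x masked with 0x00FFFFFF
  ret i32 %c
}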

more general simplify demanded bits logic.
llvm-svn: 112291

computation can be truncated if it is fed by a sext/zext that doesn't have
to be exactly equal to the truncation result type.
llvm-svn: 112285
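
A small assumed example of what this permits: the zext feeding the computation is from i8, narrower than the i16 truncation result, yet the whole add can still be evaluated in i16:
define i16 @narrow(i8 %x, i16 %y) {
  %zx = zext i8 %x to i32
  %zy = zext i16 %y to i32
  %s = add i32 %zx, %zy
  %t = trunc i32 %s to i16     ; can become: add i16 (zext i8 %x to i16), %y
  ret i16 %t
}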

by the SRoA "promote to large integer" code, eliminating some type
conversions like this:
%94 = zext i16 %93 to i32 ; <i32> [#uses=2]
%96 = lshr i32 %94, 8 ; <i32> [#uses=1]
%101 = trunc i32 %96 to i8 ; <i8> [#uses=1]
This also unblocks other xforms from happening; now clang is able to compile:
struct S { float A, B, C, D; };
float foo(struct S A) { return A.A + A.B + A.C + A.D; }
into:
_foo: ## @foo
## BB#0: ## %entry
pshufd $1, %xmm0, %xmm2
addss %xmm0, %xmm2
movdqa %xmm1, %xmm3
addss %xmm2, %xmm3
pshufd $1, %xmm1, %xmm0
addss %xmm3, %xmm0
ret
on x86-64, instead of:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movapd %xmm1, %xmm3
addss %xmm2, %xmm3
movd %xmm1, %rax
shrq $32, %rax
movd %eax, %xmm0
addss %xmm3, %xmm0
ret
This seems pretty close to optimal to me, at least without
using horizontal adds. This also triggers in lots of other
code, including SPEC.
llvm-svn: 112278

by SRoA. This is part of rdar://7892780, but needs another xform to
expose this.
llvm-svn: 112232

is a vector to be a vector element extraction. This allows clang to
compile:
struct S { float A, B, C, D; };
float foo(struct S A) { return A.A + A.B + A.C + A.D; }
into:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movapd %xmm1, %xmm3
addss %xmm2, %xmm3
movd %xmm1, %rax
shrq $32, %rax
movd %eax, %xmm0
addss %xmm3, %xmm0
ret
instead of:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
movd %eax, %xmm0
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movd %xmm1, %rax
movd %eax, %xmm1
addss %xmm2, %xmm1
shrq $32, %rax
movd %eax, %xmm0
addss %xmm1, %xmm0
ret
... eliminating half of the horribleness.
llvm-svn: 112227

llvm-svn: 111665

llvm-svn: 111571

only modifies the low bytes of a value, we can narrow the store to
only overwrite the affected bytes.
llvm-svn: 111568
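
Roughly the shape of the pattern, as a hedged IR sketch in the typed-pointer syntax of the time (not the actual test case):
define void @set_low_byte(i32* %p) {
  %old = load i32* %p
  %clear = and i32 %old, -256  ; keep the upper three bytes
  %new = or i32 %clear, 42     ; change only the low byte
  store i32 %new, i32* %p      ; can be narrowed to a one-byte store of 42
  ret void
}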

vector-heavy code. I'll re-enable when we've tracked down the problem.
llvm-svn: 111318

it previously failed.
llvm-svn: 110987

in an external testsuite.
llvm-svn: 110905

to recognize patterns generated by clang for transpose of a matrix in
generic vectors. This is made of two parts:
1) Propagating vector extracts of hi/lo half into their users
2) Recognizing an insertion of even elements followed by the odd elements as an unpack (see the sketch below).
Testcase to come, but this shrinks the # of shuffle instructions generated on x86 from ~40 to the minimal 8.
llvm-svn: 110734
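
For reference, an assumed IR sketch of the unpack that part 2) recognizes -- interleaving elements of two vectors so one supplies the even result lanes and the other the odd ones:
define <4 x float> @unpack_lo(<4 x float> %a, <4 x float> %b) {
  %r = shufflevector <4 x float> %a, <4 x float> %b, <4 x i32> <i32 0, i32 4, i32 1, i32 5>
  ; corresponds to unpcklps on x86
  ret <4 x float> %r
}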

it doesn't regress again.
llvm-svn: 110597

llvm-svn: 110460

llvm-svn: 110410

address of the static ID member as the sole unique type identifier.
Clean up APIs related to this change.
llvm-svn: 110396

instructions with alignment 0, so that subsequent passes don't
need to bother checking the TargetData ABI size manually.
llvm-svn: 110128

around std::min vs static const friction.
llvm-svn: 110112

llvm-svn: 110036

like my instcombine patch.", in an attempt to fix Clang i386 bootstrap.
- Also PR7719.
llvm-svn: 109953