more general simplify demanded bits logic.
llvm-svn: 112291
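(Editorial note, not part of the commit: a hypothetical example of the kind of
shift pattern that generic demanded-bits simplification can handle; the value
names are invented.)
  %t = shl i32 %x, 24
  %r = lshr i32 %t, 24       ; only the low 8 bits of %x are demanded
  ; demanded-bits reasoning simplifies %r to:
  %r = and i32 %x, 255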
llvm-svn: 112286
computation can be truncated if it is fed by a sext/zext that doesn't
have to be exactly equal to the truncation result type.
llvm-svn: 112285
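(Editorial note, not part of the commit: an invented sketch of the narrowing
this enables; note the zext source type (i16) differs from the trunc result
type (i8).)
  %a = zext i16 %x to i32
  %b = lshr i32 %a, 8
  %c = trunc i32 %b to i8
  ; the intermediate computation can be done at i16 instead:
  %s = lshr i16 %x, 8
  %c = trunc i16 %s to i8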
by the SRoA "promote to large integer" code, eliminating
some type conversions like this:
%94 = zext i16 %93 to i32 ; <i32> [#uses=2]
%96 = lshr i32 %94, 8 ; <i32> [#uses=1]
%101 = trunc i32 %96 to i8 ; <i8> [#uses=1]
This also unblocks other xforms from happening; now clang is able to compile:
struct S { float A, B, C, D; };
float foo(struct S A) { return A.A + A.B+A.C+A.D; }
into:
_foo: ## @foo
## BB#0: ## %entry
pshufd $1, %xmm0, %xmm2
addss %xmm0, %xmm2
movdqa %xmm1, %xmm3
addss %xmm2, %xmm3
pshufd $1, %xmm1, %xmm0
addss %xmm3, %xmm0
ret
on x86-64, instead of:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movapd %xmm1, %xmm3
addss %xmm2, %xmm3
movd %xmm1, %rax
shrq $32, %rax
movd %eax, %xmm0
addss %xmm3, %xmm0
ret
This seems pretty close to optimal to me, at least without
using horizontal adds. This also triggers in lots of other
code, including SPEC.
llvm-svn: 112278
condition previously. Update tests for this change.
This fixes PR5652.
llvm-svn: 112270
by SRoA. This is part of rdar://7892780, but needs another xform to
expose this.
llvm-svn: 112232
is a vector to be a vector element extraction. This allows clang to
compile:
struct S { float A, B, C, D; };
float foo(struct S A) { return A.A + A.B+A.C+A.D; }
into:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movapd %xmm1, %xmm3
addss %xmm2, %xmm3
movd %xmm1, %rax
shrq $32, %rax
movd %eax, %xmm0
addss %xmm3, %xmm0
ret
instead of:
_foo: ## @foo
## BB#0: ## %entry
movd %xmm0, %rax
movd %eax, %xmm0
shrq $32, %rax
movd %eax, %xmm2
addss %xmm0, %xmm2
movd %xmm1, %rax
movd %eax, %xmm1
addss %xmm2, %xmm1
shrq $32, %rax
movd %eax, %xmm0
addss %xmm1, %xmm0
ret
... eliminating half of the horribleness.
llvm-svn: 112227
compiled with clang++.
llvm-svn: 112198
fix: add a flag to MapValue and friends which indicates whether
any module-level mappings are being made. In the common case of
inlining, no module-level mappings are needed, so MapValue doesn't
need to examine non-function-local metadata, which can be very
expensive in the case of a large module with really deep metadata
(e.g. a large C++ program compiled with -g).
This flag is a little awkward; perhaps eventually it can be moved
into the ClonedCodeInfo class.
llvm-svn: 112190
except ...", it is causing *massive* performance regressions when building Clang
with itself (-O3 -g).
llvm-svn: 112158
individual ...", which depends on r111922, which I am reverting.
llvm-svn: 112157
llvm-svn: 112130
and was over-complicated, and replacing it with a simple implementation.
llvm-svn: 112120
llvm-svn: 112104
instructions, not when remapping modules.
llvm-svn: 112091
directly folded into a constant by FE.
llvm-svn: 112072
which does the same thing. This eliminates redundant code and
handles MDNodes better. MDNode linking doesn't fully
work yet, though.
llvm-svn: 111941
llvm-svn: 111923
that it avoids a lot of unnecessary cloning by avoiding remapping
MDNode cycles when none of the nodes in the cycle actually need to
be remapped. Also it uses the new temporary MDNode mechanism.
llvm-svn: 111922
llvm-svn: 111834
llvm-svn: 111816
passes over to the new registration API.
llvm-svn: 111815
llvm-svn: 111665
llvm-svn: 111571
only modifies the low bytes of a value,
we can narrow the store to overwrite only the affected bytes.
llvm-svn: 111568
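(Editorial note, not part of the commit: an invented before/after sketch of
the store narrowing on a little-endian target, in the typed-pointer IR syntax
of the time.)
  %old = load i32* %p
  %hi = and i32 %old, -256          ; keep the upper 24 bits
  %new = or i32 %hi, 42             ; replace only the low byte
  store i32 %new, i32* %p
  ; only the low byte is modified, so the store can be narrowed to:
  %p8 = bitcast i32* %p to i8*
  store i8 42, i8* %p8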
llvm-svn: 111551
llvm-svn: 111543
of the two.
llvm-svn: 111495
issues.
llvm-svn: 111382
from the LHS should disable reconsidering that pred on the
RHS. However, knowing something about the pred on the RHS
shouldn't prevent subsequent additions on the RHS from
happening.
llvm-svn: 111349
llvm-svn: 111348
llvm-svn: 111344
llvm-svn: 111342
vector heavy code. I'll re-enable when we've tracked down the problem.
llvm-svn: 111318
loop, making the resulting loop significantly less ugly. Also, zap
its trivial PHI nodes, since it's easy.
llvm-svn: 111255
what it does manually.
llvm-svn: 111248
PHI elimination is already doing all (most?) of the splitting needed. But machine-licm and machine-sink seem to miss some important optimizations when splitting is disabled.
llvm-svn: 111224
uninteresting, just put all the operands on one list and make
GenerateReassociations make the decision about what's interesting.
This is simpler, and it avoids an extra ScalarEvolution::getAddExpr call.
llvm-svn: 111133
This isn't necessary, because ScalarEvolution sorts them anyway,
but it's tidier this way.
llvm-svn: 111132
actually use ScalarEvolution.
llvm-svn: 111124
indirectbr destination lists.
llvm-svn: 111122
llvm-svn: 111061
- Eliminate redundant successors.
- Convert an indirectbr with one successor into a direct branch.
Also, generalize SimplifyCFG so that it can be run on a function entry block.
It knows quite a few simplifications which are applicable to the entry
block, and it only needs a few checks to avoid trouble with the entry block.
llvm-svn: 111060
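(Editorial note, not part of the commit: the two indirectbr simplifications
shown in IR form, with invented labels.)
  indirectbr i8* %dest, [label %bb, label %bb, label %other]
  ; duplicate successors can be dropped:
  indirectbr i8* %dest, [label %bb, label %other]
  indirectbr i8* %addr, [label %only]
  ; a lone successor becomes an unconditional branch:
  br label %only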
ScalarEvolution::getAddExpr, which can be pretty expensive, when nothing
has changed, which is pretty common.
llvm-svn: 111042
it previously failed.
llvm-svn: 110987
before it rewrites the code, we need to use that in the post-rewrite pass.
llvm-svn: 110962
in an external testsuite.
llvm-svn: 110905
to recognize
patterns generated by clang for a matrix transpose in generic vectors. This is done
in two parts:
1) Propagating vector extracts of hi/lo half into their users
2) Recognizing an insertion of even elements followed by the odd elements as an unpack.
Testcase to come, but this shrinks the # of shuffle instructions generated on x86 from ~40 to the minimal 8.
llvm-svn: 110734
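(Editorial note, not part of the commit: an invented example of the "unpack"
pattern. Filling the even lanes of a result from one vector and the odd lanes
from another is the interleaving shuffle below, which maps to a single
unpcklps on x86.)
  %r = shufflevector <4 x float> %a, <4 x float> %b,
                     <4 x i32> <i32 0, i32 4, i32 1, i32 5>
  ; result is <a0, b0, a1, b1>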
llvm-svn: 110601
it doesn't regress again.
llvm-svn: 110597