bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Add support for external calls that we know how to constant fold. This ↵	Chris Lattner	2005-09-27	1	-11/+20
\| \| \| \| \| \| \| \|	implements ctor-list-opt.ll:CTOR8 llvm-svn: 23465
*	Fix a bug where we would evaluate stores into linkonce objects which could be	Chris Lattner	2005-09-27	1	-1/+6
\| \| \| \| \| \|	potentially replaced at link-time. llvm-svn: 23463
*	Implement support for static constructors with calls in them. This is useful	Chris Lattner	2005-09-27	1	-23/+54
\| \| \| \| \| \| \| \|	because gccas runs globalopt before inlining. This implements ctor-list-opt.ll:CTOR7 llvm-svn: 23462
*	Refactor this code a bit, no functionality changes.	Chris Lattner	2005-09-27	1	-22/+40
\| \| \| \|	llvm-svn: 23460
*	Remove some dead code. ctor evaluation subsumes empty ctor elim	Chris Lattner	2005-09-26	1	-12/+0
\| \| \| \|	llvm-svn: 23453
*	Add support for alloca, implementing ctor-list-opt.ll:CTOR6	Chris Lattner	2005-09-26	1	-17/+48
\| \| \| \|	llvm-svn: 23452
*	Add a debug printout, fix a crash on kc++	Chris Lattner	2005-09-26	1	-1/+6
\| \| \| \|	llvm-svn: 23450
*	Implement loads/stores through GEP's of globals. This implements	Chris Lattner	2005-09-26	1	-6/+98
\| \| \| \| \| \|	ctor-list-opt.ll:CTOR5. llvm-svn: 23449
*	Replace TraverseGEPInitializer with ConstantFoldLoadThroughGEPConstantExpr	Chris Lattner	2005-09-26	1	-17/+5
\| \| \| \|	llvm-svn: 23447
*	Eliminate GetGEPGlobalInitializer in favor of the more powerful	Chris Lattner	2005-09-26	1	-27/+1
\| \| \| \| \| \|	ConstantFoldLoadThroughGEPConstantExpr function in the utils lib. llvm-svn: 23446
*	Factor the GetGEPGlobalInitializer out of this pass and into Transforms/Utils	Chris Lattner	2005-09-26	1	-44/+2
\| \| \| \| \| \|	as ConstantFoldLoadThroughGEPConstantExpr. llvm-svn: 23445
*	Move the ConstantFoldLoadThroughGEPConstantExpr function out of the InstCombine	Chris Lattner	2005-09-26	1	-1/+45
\| \| \| \| \| \|	pass. llvm-svn: 23444
*	add a comment	Chris Lattner	2005-09-26	1	-0/+3
\| \| \| \|	llvm-svn: 23442
*	Add support for getelementptr, load, and correctly reject volatile stores.	Chris Lattner	2005-09-26	1	-0/+29
\| \| \| \|	llvm-svn: 23441
*	Add support for br/brcond/switch and phi	Chris Lattner	2005-09-26	1	-3/+47
\| \| \| \|	llvm-svn: 23439
*	Add a simple interpreter to this code, allowing us to statically evaluate	Chris Lattner	2005-09-26	1	-4/+110
\| \| \| \| \| \|	global ctors that are simple enough. This implements ctor-list-opt.ll:CTOR2. llvm-svn: 23437
*	factor some code into a InstallGlobalCtors method, add comments. No ↵	Chris Lattner	2005-09-26	1	-35/+52
\| \| \| \| \| \|	functionality change. llvm-svn: 23435
*	Make the global opt optimizer work on modules with a null terminator, by	Chris Lattner	2005-09-26	1	-8/+13
\| \| \| \| \| \|	accepting the null even with a non-65535 init prio llvm-svn: 23434
*	Factor this code out into a few methods.	Chris Lattner	2005-09-26	1	-33/+190
\| \| \| \| \| \| \| \| \| \| \| \| \|	Implement the start of global ctor optimization. It is currently smart enough to remove the global ctor for cases like this: struct foo { foo() {} } x; ... saving a bit of startup time for the program. llvm-svn: 23433
*	Fix some logic I broke that caused a regression on	Chris Lattner	2005-09-25	1	-3/+5
\| \| \| \| \| \|	SimplifyLibCalls/2005-05-20-sprintf-crash.ll llvm-svn: 23430
*	Move MaskedValueIsZero up.	Chris Lattner	2005-09-24	1	-77/+146
\| \| \| \| \| \|	Match a bunch of idioms for sign extensions, implementing InstCombine/signext.ll llvm-svn: 23428
*	Simplify this code a bit by relying on recursive simplification. Support	Chris Lattner	2005-09-24	1	-51/+43
\| \| \| \| \| \| \| \|	sprintf("%s", P)'s that have uses. s/hasNUses(0)/use_empty()/ llvm-svn: 23425
*	remove some debugging code	Chris Lattner	2005-09-23	1	-1/+0
\| \| \| \|	llvm-svn: 23411
*	Fold two consequtive branches that share a common destination between them.	Chris Lattner	2005-09-23	1	-33/+119
\| \| \| \| \| \| \|	This implements SimplifyCFG/branch-fold.ll, and is useful on ?:/min/max heavy code llvm-svn: 23410
*	simplify some logic further	Chris Lattner	2005-09-23	1	-6/+1
\| \| \| \|	llvm-svn: 23408
*	pull a bunch of logic out of SimplifyCFG into a helper fn	Chris Lattner	2005-09-23	1	-112/+112
\| \| \| \|	llvm-svn: 23407
*	Start threading across blocks with code in them, so long as the code does	Chris Lattner	2005-09-20	1	-15/+64
\| \| \| \| \| \| \| \| \|	not define a value that is used outside of it's block. This catches many more simplifications, e.g. 854 in 176.gcc, 137 in vpr, etc. This implements branch-phi-thread.ll:test3.ll llvm-svn: 23397
*	Implement merging of blocks with the same condition if the block has multiple	Chris Lattner	2005-09-20	1	-21/+59
\| \| \| \| \| \|	predecessors. This implements branch-phi-thread.ll::test1 llvm-svn: 23395
*	Reject a case we don't handle yet	Chris Lattner	2005-09-19	1	-1/+3
\| \| \| \|	llvm-svn: 23393
*	remove debugging code :-/	Chris Lattner	2005-09-19	1	-2/+0
\| \| \| \|	llvm-svn: 23392
*	Implement SimplifyCFG/branch-phi-thread.ll, the most trivial case of threading	Chris Lattner	2005-09-19	1	-0/+73
\| \| \| \| \| \| \|	control across branches with determined outcomes. More generality to follow. This triggers a couple thousand times in specint. llvm-svn: 23391
*	Refactor this code a bit and make it more general. This now compiles:	Chris Lattner	2005-09-18	1	-24/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus2 (unsigned int x) { b.j += x; } To: _plus2: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) slwi r3, r3, 6 add r3, r4, r3 rlwimi r3, r4, 0, 26, 14 stw r3, 0(r2) blr instead of: _plus2: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) rlwinm r5, r4, 26, 21, 31 add r3, r5, r3 rlwimi r4, r3, 6, 15, 25 stw r4, 0(r2) blr by eliminating an 'and'. I'm pretty sure this is as small as we can go :) llvm-svn: 23386
*	Compile	Chris Lattner	2005-09-18	1	-31/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus2 (unsigned int x) { b.j += x; } to: plus2: mov %EAX, DWORD PTR [b] mov %ECX, %EAX and %ECX, 131008 mov %EDX, DWORD PTR [%ESP + 4] shl %EDX, 6 add %EDX, %ECX and %EDX, 131008 and %EAX, -131009 or %EDX, %EAX mov DWORD PTR [b], %EDX ret instead of: plus2: mov %EAX, DWORD PTR [b] mov %ECX, %EAX shr %ECX, 6 and %ECX, 2047 add %ECX, DWORD PTR [%ESP + 4] shl %ECX, 6 and %ECX, 131008 and %EAX, -131009 or %ECX, %EAX mov DWORD PTR [b], %ECX ret llvm-svn: 23385
*	Generalize this transform, using MaskedValueIsZero, allowing us to compile:	Chris Lattner	2005-09-18	1	-14/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus3 (unsigned int x) { b.k += x; } To: plus3: mov %EAX, DWORD PTR [%ESP + 4] shl %EAX, 17 add DWORD PTR [b], %EAX ret instead of: plus3: mov %EAX, DWORD PTR [%ESP + 4] shl %EAX, 17 mov %ECX, DWORD PTR [b] add %EAX, %ECX and %EAX, -131072 and %ECX, 131071 or %ECX, %EAX mov DWORD PTR [b], %ECX ret llvm-svn: 23384
*	fix typeo	Chris Lattner	2005-09-18	1	-1/+1
\| \| \| \|	llvm-svn: 23383
*	Remove unintentionally committed code	Chris Lattner	2005-09-18	1	-3/+0
\| \| \| \|	llvm-svn: 23382
*	implement shift.ll:test25. This compiles:	Chris Lattner	2005-09-18	1	-3/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus3 (unsigned int x) { b.k += x; } to: _plus3: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r3, 0(r2) rlwinm r4, r3, 0, 0, 14 add r4, r4, r3 rlwimi r4, r3, 0, 15, 31 stw r4, 0(r2) blr instead of: _plus3: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) srwi r5, r4, 17 add r3, r5, r3 slwi r3, r3, 17 rlwimi r3, r4, 0, 15, 31 stw r3, 0(r2) blr llvm-svn: 23381
*	Implement add.ll:test29. Codegening:	Chris Lattner	2005-09-18	1	-0/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus1 (unsigned int x) { b.i += x; } as: _plus1: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) add r3, r4, r3 rlwimi r3, r4, 0, 0, 25 stw r3, 0(r2) blr instead of: _plus1: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) rlwinm r5, r4, 0, 26, 31 add r3, r5, r3 rlwimi r3, r4, 0, 0, 25 stw r3, 0(r2) blr llvm-svn: 23379
*	remove debug output	Chris Lattner	2005-09-18	1	-1/+0
\| \| \| \|	llvm-svn: 23377
*	Implement or.ll:test21. This teaches instcombine to be able to turn this:	Chris Lattner	2005-09-18	1	-3/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct { unsigned int bit0:1; unsigned int ubyte:31; } sdata; void foo() { sdata.ubyte++; } into this: foo: add DWORD PTR [sdata], 2 ret instead of this: foo: mov %EAX, DWORD PTR [sdata] mov %ECX, %EAX add %ECX, 2 and %ECX, -2 and %EAX, 1 or %EAX, %ECX mov DWORD PTR [sdata], %EAX ret llvm-svn: 23376
*	Fix the regression last night compiling povray	Chris Lattner	2005-09-14	1	-2/+3
\| \| \| \|	llvm-svn: 23348
*	Add a simple xform to simplify array accesses with casts in the way.	Chris Lattner	2005-09-13	1	-2/+62
\| \| \| \| \| \| \|	This is useful for 178.galgel where resolution of dope vectors (by the optimizer) causes the scales to become apparent. llvm-svn: 23328
*	Fix an issue where LSR would miss rewriting a use of an IV expression by a ↵	Chris Lattner	2005-09-13	1	-4/+8
\| \| \| \| \| \| \| \| \|	PHI node that is not the original PHI. This fixes up a dot-product loop in galgel, speeding it up from 18.47s to 16.13s. llvm-svn: 23327
*	Add a helper function, allowing us to simplify some code a bit, changing	Chris Lattner	2005-09-13	1	-39/+47
\| \| \| \| \| \|	indentation, no functionality change llvm-svn: 23325
*	Implement a simple xform to turn code like this:	Chris Lattner	2005-09-12	1	-0/+66
\| \| \| \| \| \| \| \| \|	if () { store A -> P; } else { store B -> P; } into a PHI node with one store, in the most trival case. This implements load.ll:test10. llvm-svn: 23324
*	Another load-peephole optimization: do gcse when two loads are next to	Chris Lattner	2005-09-12	1	-2/+5
\| \| \| \| \| \|	each other. This implements InstCombine/load.ll:test9 llvm-svn: 23322
*	Implement a trivial form of store->load forwarding where the store and the	Chris Lattner	2005-09-12	1	-0/+9
\| \| \| \| \| \| \| \|	load are exactly consequtive. This is picked up by other passes, but this triggers thousands of times in fortran programs that use static locals (and is thus a compile-time speedup). llvm-svn: 23320
*	Fix a regression from last night, which caused this pass to create invalid	Chris Lattner	2005-09-12	1	-8/+6
\| \| \| \| \| \| \| \| \| \| \| \|	code for IV uses outside of loops that are not dominated by the latch block. We should only convert these uses to use the post-inc value if they ARE dominated by the latch block. Also use a new LoopInfo method to simplify some code. This fixes Transforms/LoopStrengthReduce/2005-09-12-UsesOutOutsideOfLoop.ll llvm-svn: 23318
*	_test:	Chris Lattner	2005-09-12	1	-5/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	li r2, 0 LBB_test_1: ; no_exit.2 li r5, 0 stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmpwi cr0, r2, 701 blt cr0, LBB_test_1 ; no_exit.2 LBB_test_2: ; loopexit.2.loopexit addi r2, r2, 1 stw r2, 0(r4) blr [zion ~/llvm]$ cat > ~/xx Uses of IV's outside of the loop should use hte post-incremented version of the IV, not the preincremented version. This helps many loops (e.g. in sixtrack) which used to generate code like this (this is the code from the dont-hoist-simple-loop-constants.ll testcase): _test: li r2, 0 ** IV starts at 0 LBB_test_1: ; no_exit.2 or r5, r2, r2 Copy for loop exit li r2, 0 stw r2, 0(r3) addi r3, r3, 4 addi r2, r5, 1 addi r6, r5, 2 IV+2 cmpwi cr0, r6, 701 blt cr0, LBB_test_1 ; no_exit.2 LBB_test_2: ; loopexit.2.loopexit addi r2, r5, 2 IV+2 stw r2, 0(r4) blr And now generated code like this: _test: li r2, 1 * IV starts at 1 LBB_test_1: ; no_exit.2 li r5, 0 stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmpwi cr0, r2, 701 * IV.postinc + 0 blt cr0, LBB_test_1 LBB_test_2: ; loopexit.2.loopexit stw r2, 0(r4) * IV.postinc + 0 blr llvm-svn: 23313
*	implement Transforms/LoopStrengthReduce/dont-hoist-simple-loop-constants.ll.	Chris Lattner	2005-09-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We used to emit this code for it: _test: li r2, 1 ;; Value tying up a register for the whole loop li r5, 0 LBB_test_1: ; no_exit.2 or r6, r5, r5 li r5, 0 stw r5, 0(r3) addi r5, r6, 1 addi r3, r3, 4 add r7, r2, r5 ;; should be addi r7, r5, 1 cmpwi cr0, r7, 701 blt cr0, LBB_test_1 ; no_exit.2 LBB_test_2: ; loopexit.2.loopexit addi r2, r6, 2 stw r2, 0(r4) blr now we emit this: _test: li r2, 0 LBB_test_1: ; no_exit.2 or r5, r2, r2 li r2, 0 stw r2, 0(r3) addi r3, r3, 4 addi r2, r5, 1 addi r6, r5, 2 ;; whoa, fold those adds! cmpwi cr0, r6, 701 blt cr0, LBB_test_1 ; no_exit.2 LBB_test_2: ; loopexit.2.loopexit addi r2, r5, 2 stw r2, 0(r4) blr more improvement coming. llvm-svn: 23306