bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Simplify this code a bit by relying on recursive simplification. Support	Chris Lattner	2005-09-24	1	-51/+43
\| \| \| \| \| \| \| \|	sprintf("%s", P)'s that have uses. s/hasNUses(0)/use_empty()/ llvm-svn: 23425
*	Add support for a marker byte that indicates that we shouldn't add the user	Chris Lattner	2005-09-24	1	-7/+12
\| \| \| \| \| \|	prefix to a symbol name llvm-svn: 23421
*	Teach the dag isel generator how to construct arbitrary immediates. The	Chris Lattner	2005-09-24	1	-1/+6
\| \| \| \| \| \|	generated isel now tries li then lis, then lis+ori. llvm-svn: 23418
*	remove some debugging code	Chris Lattner	2005-09-23	1	-1/+0
\| \| \| \|	llvm-svn: 23411
*	Fold two consequtive branches that share a common destination between them.	Chris Lattner	2005-09-23	1	-33/+119
\| \| \| \| \| \| \|	This implements SimplifyCFG/branch-fold.ll, and is useful on ?:/min/max heavy code llvm-svn: 23410
*	simplify some logic further	Chris Lattner	2005-09-23	1	-6/+1
\| \| \| \|	llvm-svn: 23408
*	pull a bunch of logic out of SimplifyCFG into a helper fn	Chris Lattner	2005-09-23	1	-112/+112
\| \| \| \|	llvm-svn: 23407
*	speed up Archive::isBytecodeArchive in the case when the archive doesn't have	Chris Lattner	2005-09-23	1	-18/+24
\| \| \| \| \| \| \|	an llvm-ranlib symtab. This speeds up gccld -native on an almost empty .o file from 1.63s to 0.18s. llvm-svn: 23406
*	Turn (X^C1) == C2 into X == C1^C2 iff X&~C1 = 0 (and move a function)	Chris Lattner	2005-09-23	1	-72/+86
\| \| \| \| \| \| \| \| \|	This happens all the time on PPC for bool values, e.g. eliminating a xori in inverted-bool-compares.ll. This should be added to the dag combiner as well. llvm-svn: 23403
*	Expose the LiveInterval interfaces as public headers.	Chris Lattner	2005-09-21	6	-400/+4
\| \| \| \|	llvm-svn: 23400
*	Start threading across blocks with code in them, so long as the code does	Chris Lattner	2005-09-20	1	-15/+64
\| \| \| \| \| \| \| \| \|	not define a value that is used outside of it's block. This catches many more simplifications, e.g. 854 in 176.gcc, 137 in vpr, etc. This implements branch-phi-thread.ll:test3.ll llvm-svn: 23397
*	Implement merging of blocks with the same condition if the block has multiple	Chris Lattner	2005-09-20	1	-21/+59
\| \| \| \| \| \|	predecessors. This implements branch-phi-thread.ll::test1 llvm-svn: 23395
*	Reject a case we don't handle yet	Chris Lattner	2005-09-19	1	-1/+3
\| \| \| \|	llvm-svn: 23393
*	remove debugging code :-/	Chris Lattner	2005-09-19	1	-2/+0
\| \| \| \|	llvm-svn: 23392
*	Implement SimplifyCFG/branch-phi-thread.ll, the most trivial case of threading	Chris Lattner	2005-09-19	1	-0/+73
\| \| \| \| \| \| \|	control across branches with determined outcomes. More generality to follow. This triggers a couple thousand times in specint. llvm-svn: 23391
*	Stub out the rest of the DAG Combiner. Just need to fill in the	Nate Begeman	2005-09-19	1	-8/+104
\| \| \| \| \| \| \|	select_cc bits and then wrap it in a convenience function for use with regular select. llvm-svn: 23389
*	Teach the local spiller to turn stack slot loads into register-register copies	Chris Lattner	2005-09-19	1	-26/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	when possible, avoiding the load (and avoiding the copy if the value is already in the right register). This patch came about when I noticed code like the following being generated: store R17 -> [SS1] ...blah... R4 = load [SS1] This was causing an LSU reject on the G5. This problem was due to the register allocator folding spill code into a reg-reg copy (producing the load), which prevented the spiller from being able to rewrite the load into a copy, despite the fact that the value was already available in a register. In the case above, we now rip out the R4 load and replace it with a R4 = R17 copy. This speeds up several programs on X86 (which spills a lot :) ), e.g. smg2k from 22.39->20.60s, povray from 12.93->12.66s, 168.wupwise from 68.54->53.83s (!), 197.parser from 7.33->6.62s (!), etc. This may have a larger impact in some cases on the G5 (by avoiding LSU rejects), though it probably won't trigger as often (less spilling in general). Targets that implement folding of loads/stores into copies should implement the isLoadFromStackSlot hook to get this. llvm-svn: 23388
*	Implement the isLoadFromStackSlot interface	Chris Lattner	2005-09-19	2	-0/+28
\| \| \| \|	llvm-svn: 23387
*	Refactor this code a bit and make it more general. This now compiles:	Chris Lattner	2005-09-18	1	-24/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus2 (unsigned int x) { b.j += x; } To: _plus2: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) slwi r3, r3, 6 add r3, r4, r3 rlwimi r3, r4, 0, 26, 14 stw r3, 0(r2) blr instead of: _plus2: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) rlwinm r5, r4, 26, 21, 31 add r3, r5, r3 rlwimi r4, r3, 6, 15, 25 stw r4, 0(r2) blr by eliminating an 'and'. I'm pretty sure this is as small as we can go :) llvm-svn: 23386
*	Compile	Chris Lattner	2005-09-18	1	-31/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus2 (unsigned int x) { b.j += x; } to: plus2: mov %EAX, DWORD PTR [b] mov %ECX, %EAX and %ECX, 131008 mov %EDX, DWORD PTR [%ESP + 4] shl %EDX, 6 add %EDX, %ECX and %EDX, 131008 and %EAX, -131009 or %EDX, %EAX mov DWORD PTR [b], %EDX ret instead of: plus2: mov %EAX, DWORD PTR [b] mov %ECX, %EAX shr %ECX, 6 and %ECX, 2047 add %ECX, DWORD PTR [%ESP + 4] shl %ECX, 6 and %ECX, 131008 and %EAX, -131009 or %ECX, %EAX mov DWORD PTR [b], %ECX ret llvm-svn: 23385
*	Generalize this transform, using MaskedValueIsZero, allowing us to compile:	Chris Lattner	2005-09-18	1	-14/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus3 (unsigned int x) { b.k += x; } To: plus3: mov %EAX, DWORD PTR [%ESP + 4] shl %EAX, 17 add DWORD PTR [b], %EAX ret instead of: plus3: mov %EAX, DWORD PTR [%ESP + 4] shl %EAX, 17 mov %ECX, DWORD PTR [b] add %EAX, %ECX and %EAX, -131072 and %ECX, 131071 or %ECX, %EAX mov DWORD PTR [b], %ECX ret llvm-svn: 23384
*	fix typeo	Chris Lattner	2005-09-18	1	-1/+1
\| \| \| \|	llvm-svn: 23383
*	Remove unintentionally committed code	Chris Lattner	2005-09-18	1	-3/+0
\| \| \| \|	llvm-svn: 23382
*	implement shift.ll:test25. This compiles:	Chris Lattner	2005-09-18	1	-3/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus3 (unsigned int x) { b.k += x; } to: _plus3: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r3, 0(r2) rlwinm r4, r3, 0, 0, 14 add r4, r4, r3 rlwimi r4, r3, 0, 15, 31 stw r4, 0(r2) blr instead of: _plus3: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) srwi r5, r4, 17 add r3, r5, r3 slwi r3, r3, 17 rlwimi r3, r4, 0, 15, 31 stw r3, 0(r2) blr llvm-svn: 23381
*	Implement add.ll:test29. Codegening:	Chris Lattner	2005-09-18	1	-0/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct S { unsigned int i : 6, j : 11, k : 15; } b; void plus1 (unsigned int x) { b.i += x; } as: _plus1: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) add r3, r4, r3 rlwimi r3, r4, 0, 0, 25 stw r3, 0(r2) blr instead of: _plus1: lis r2, ha16(L_b$non_lazy_ptr) lwz r2, lo16(L_b$non_lazy_ptr)(r2) lwz r4, 0(r2) rlwinm r5, r4, 0, 26, 31 add r3, r5, r3 rlwimi r3, r4, 0, 0, 25 stw r3, 0(r2) blr llvm-svn: 23379
*	remove debug output	Chris Lattner	2005-09-18	1	-1/+0
\| \| \| \|	llvm-svn: 23377
*	Implement or.ll:test21. This teaches instcombine to be able to turn this:	Chris Lattner	2005-09-18	1	-3/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct { unsigned int bit0:1; unsigned int ubyte:31; } sdata; void foo() { sdata.ubyte++; } into this: foo: add DWORD PTR [sdata], 2 ret instead of this: foo: mov %EAX, DWORD PTR [sdata] mov %ECX, %EAX add %ECX, 2 and %ECX, -2 and %EAX, 1 or %EAX, %ECX mov DWORD PTR [sdata], %EAX ret llvm-svn: 23376
*	Implement hook for ppc	Chris Lattner	2005-09-17	2	-0/+18
\| \| \| \|	llvm-svn: 23374
*	More DAG combining. Still need the branch instructions, and select_cc	Nate Begeman	2005-09-16	1	-5/+425
\| \| \| \|	llvm-svn: 23371
*	disable this for now	Chris Lattner	2005-09-15	1	-0/+2
\| \| \| \|	llvm-svn: 23366
*	Give all operands names	Chris Lattner	2005-09-14	1	-1/+1
\| \| \| \|	llvm-svn: 23357
*	give all operands names	Chris Lattner	2005-09-14	2	-12/+14
\| \| \| \|	llvm-svn: 23356
*	Fix some issues exposed by more testing. XORIS had the wrong operands	Chris Lattner	2005-09-14	1	-5/+5
\| \| \| \| \| \| \|	specified. The various *imm operands defined by PPC are really all i32, even though the actual immediate is restricted to a smaller value in it. llvm-svn: 23352
*	Fix some bugs noticed by new checking code	Chris Lattner	2005-09-14	1	-8/+14
\| \| \| \|	llvm-svn: 23350
*	Fix the regression last night compiling povray	Chris Lattner	2005-09-14	1	-2/+3
\| \| \| \|	llvm-svn: 23348
*	fix a major regression from my patch this afternoon	Chris Lattner	2005-09-14	1	-0/+1
\| \| \| \|	llvm-svn: 23347
*	we don't need this proto any longer	Chris Lattner	2005-09-13	1	-1/+0
\| \| \| \|	llvm-svn: 23342
*	move the #include for the generated code into the isel class body so we	Chris Lattner	2005-09-13	1	-1/+3
\| \| \| \| \| \|	can use/define class methods llvm-svn: 23339
*	Change the arg lowering code to use copyfromreg from vregs associated	Chris Lattner	2005-09-13	1	-12/+17
\| \| \| \| \| \| \| \|	with incoming arguments instead of the pregs themselves. This fixes the scheduler from causing problems by moving a copyfromreg for an argument to after a select_cc node (now it can, and bad things won't happen). llvm-svn: 23334
*	This has been moved to the target-indep code	Chris Lattner	2005-09-13	1	-22/+0
\| \| \| \|	llvm-svn: 23333
*	This code is no longer needed, it is moved to the target-indep code	Chris Lattner	2005-09-13	2	-49/+0
\| \| \| \|	llvm-svn: 23332
*	If a function has liveins, and if the target requested that they be plopped	Chris Lattner	2005-09-13	1	-0/+15
\| \| \| \| \| \|	into particular vregs, emit copies into the entry MBB. llvm-svn: 23331
*	Majik numbers are bad	Chris Lattner	2005-09-13	1	-2/+2
\| \| \| \|	llvm-svn: 23330
*	Remove some dead vectors	Chris Lattner	2005-09-13	1	-4/+0
\| \| \| \|	llvm-svn: 23329
*	Add a simple xform to simplify array accesses with casts in the way.	Chris Lattner	2005-09-13	1	-2/+62
\| \| \| \| \| \| \|	This is useful for 178.galgel where resolution of dope vectors (by the optimizer) causes the scales to become apparent. llvm-svn: 23328
*	Fix an issue where LSR would miss rewriting a use of an IV expression by a ↵	Chris Lattner	2005-09-13	1	-4/+8
\| \| \| \| \| \| \| \| \|	PHI node that is not the original PHI. This fixes up a dot-product loop in galgel, speeding it up from 18.47s to 16.13s. llvm-svn: 23327
*	Add a helper function, allowing us to simplify some code a bit, changing	Chris Lattner	2005-09-13	1	-39/+47
\| \| \| \| \| \|	indentation, no functionality change llvm-svn: 23325
*	Implement a simple xform to turn code like this:	Chris Lattner	2005-09-12	1	-0/+66
\| \| \| \| \| \| \| \| \|	if () { store A -> P; } else { store B -> P; } into a PHI node with one store, in the most trival case. This implements load.ll:test10. llvm-svn: 23324
*	Another load-peephole optimization: do gcse when two loads are next to	Chris Lattner	2005-09-12	1	-2/+5
\| \| \| \| \| \|	each other. This implements InstCombine/load.ll:test9 llvm-svn: 23322
*	Implement a trivial form of store->load forwarding where the store and the	Chris Lattner	2005-09-12	1	-0/+9
\| \| \| \| \| \| \| \|	load are exactly consequtive. This is picked up by other passes, but this triggers thousands of times in fortran programs that use static locals (and is thus a compile-time speedup). llvm-svn: 23320