bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Implement ComputeMaskedBits/SimplifyDemandedBits for ISD::TRUNCATE	Chris Lattner	2006-05-05	1	-0/+18
\| \| \| \|	llvm-svn: 28135
*	Print a grouping around inline asm blocks so that we can tell when we are	Chris Lattner	2006-05-05	1	-1/+2
\| \| \| \| \| \|	using them. llvm-svn: 28134
*	Print some grouping around inline asm blocks so we know where they are.	Chris Lattner	2006-05-05	1	-1/+2
\| \| \| \|	llvm-svn: 28133
*	Indent multiline asm strings more nicely	Chris Lattner	2006-05-05	1	-5/+9
\| \| \| \|	llvm-svn: 28132
*	Teach the code generator to use cvtss2sd as extload f32 -> f64	Chris Lattner	2006-05-05	2	-5/+1
\| \| \| \|	llvm-svn: 28131
*	Fold (fpext (load x)) -> (extload x)	Chris Lattner	2006-05-05	1	-0/+14
\| \| \| \|	llvm-svn: 28130
*	More aggressively sink GEP offsets into loops. For example, before we	Chris Lattner	2006-05-05	1	-56/+115
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	generated: movl 8(%esp), %eax movl %eax, %edx addl $4316, %edx cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, (%edx) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %edx movl %edx, 4460(%eax) ret ... Now we generate: movl 8(%esp), %eax cmpb $1, %cl ja LBB1_2 #cond_false LBB1_1: #cond_true movl L_QuantizationTables720$non_lazy_ptr, %ecx movl %ecx, 4316(%eax) movl L_QNOtoQuantTableShift720$non_lazy_ptr, %ecx movl %ecx, 4460(%eax) ret ... which uses one fewer register. llvm-svn: 28129
*	Fix an infinite loop compiling oggenc last night.	Chris Lattner	2006-05-05	1	-6/+9
\| \| \| \|	llvm-svn: 28128
*	Need extload patterns after Chris' DAG combiner changes	Evan Cheng	2006-05-05	1	-1/+11
\| \| \| \|	llvm-svn: 28127
*	Implement InstCombine/cast.ll:test29	Chris Lattner	2006-05-05	1	-0/+40
\| \| \| \|	llvm-svn: 28126
*	Fold some common code.	Chris Lattner	2006-05-05	1	-14/+2
\| \| \| \|	llvm-svn: 28124
*	Implement:	Chris Lattner	2006-05-05	1	-5/+7
\| \| \| \| \| \| \| \| \| \| \|	// fold (and (sext x), (sext y)) -> (sext (and x, y)) // fold (or (sext x), (sext y)) -> (sext (or x, y)) // fold (xor (sext x), (sext y)) -> (sext (xor x, y)) // fold (and (aext x), (aext y)) -> (aext (and x, y)) // fold (or (aext x), (aext y)) -> (aext (or x, y)) // fold (xor (aext x), (aext y)) -> (aext (xor x, y)) llvm-svn: 28123
*	Pull and through and/or/xor. This compiles some bitfield code to:	Chris Lattner	2006-05-05	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	mov EAX, DWORD PTR [ESP + 4] mov ECX, DWORD PTR [EAX] mov EDX, ECX add EDX, EDX or EDX, ECX and EDX, -2147483648 and ECX, 2147483647 or EDX, ECX mov DWORD PTR [EAX], EDX ret instead of: sub ESP, 4 mov DWORD PTR [ESP], ESI mov EAX, DWORD PTR [ESP + 8] mov ECX, DWORD PTR [EAX] mov EDX, ECX add EDX, EDX mov ESI, ECX and ESI, -2147483648 and EDX, -2147483648 or EDX, ESI and ECX, 2147483647 or EDX, ECX mov DWORD PTR [EAX], EDX mov ESI, DWORD PTR [ESP] add ESP, 4 ret llvm-svn: 28122
*	Implement a variety of simplifications for ANY_EXTEND.	Chris Lattner	2006-05-05	1	-0/+51
\| \| \| \|	llvm-svn: 28121
*	Factor some code, add these transformations:	Chris Lattner	2006-05-05	1	-55/+66
\| \| \| \| \| \| \| \|	// fold (and (trunc x), (trunc y)) -> (trunc (and x, y)) // fold (or (trunc x), (trunc y)) -> (trunc (or x, y)) // fold (xor (trunc x), (trunc y)) -> (trunc (xor x, y)) llvm-svn: 28120
*	Better implementation of truncate. ISel matches it to a pseudo instruction	Evan Cheng	2006-05-05	6	-240/+162
\| \| \| \| \| \| \| \|	that gets emitted as movl (for r32 to i16, i8) or a movw (for r16 to i8). And if the destination gets allocated a subregister of the source operand, then the instruction will not be emitted at all. llvm-svn: 28119
*	New note, Nate, please check to see if I'm full of it :)	Chris Lattner	2006-05-05	1	-0/+33
\| \| \| \|	llvm-svn: 28118
*	Fix VC++ compilation error.	Jeff Cohen	2006-05-05	1	-1/+1
\| \| \| \|	llvm-svn: 28117
*	Sink noop copies into the basic block that uses them. This reduces the number	Chris Lattner	2006-05-05	1	-4/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	of cross-block live ranges, and allows the bb-at-a-time selector to always coallesce these away, at isel time. This reduces the load on the coallescer and register allocator. For example on a codec on X86, we went from: 1643 asm-printer - Number of machine instrs printed 419 liveintervals - Number of loads/stores folded into instructions 1144 liveintervals - Number of identity moves eliminated after coalescing 1022 liveintervals - Number of interval joins performed 282 liveintervals - Number of intervals after coalescing 1304 liveintervals - Number of original intervals 86 regalloc - Number of times we had to backtrack 1.90232 regalloc - Ratio of intervals processed over total intervals 40 spiller - Number of values reused 182 spiller - Number of loads added 121 spiller - Number of stores added 132 spiller - Number of register spills 6 twoaddressinstruction - Number of instructions commuted to coalesce 360 twoaddressinstruction - Number of two-address instructions to: 1636 asm-printer - Number of machine instrs printed 403 liveintervals - Number of loads/stores folded into instructions 1155 liveintervals - Number of identity moves eliminated after coalescing 1033 liveintervals - Number of interval joins performed 279 liveintervals - Number of intervals after coalescing 1312 liveintervals - Number of original intervals 76 regalloc - Number of times we had to backtrack 1.88998 regalloc - Ratio of intervals processed over total intervals 1 spiller - Number of copies elided 41 spiller - Number of values reused 191 spiller - Number of loads added 114 spiller - Number of stores added 128 spiller - Number of register spills 4 twoaddressinstruction - Number of instructions commuted to coalesce 356 twoaddressinstruction - Number of two-address instructions On this testcase, this change provides a modest reduction in spill code, regalloc iterations, and total instructions emitted. It increases the number of register coallesces. llvm-svn: 28115
*	Adjust to use proper TargetData copy ctor	Chris Lattner	2006-05-04	1	-1/+1
\| \| \| \|	llvm-svn: 28112
*	Final pass of minor cleanups for MachineInstr	Chris Lattner	2006-05-04	1	-4/+0
\| \| \| \|	llvm-svn: 28110
*	Initial support for register pressure aware scheduling. The register reduction	Evan Cheng	2006-05-04	1	-50/+238
\| \| \| \| \| \| \| \| \| \|	scheduler can go into a "vertical mode" (i.e. traversing up the two-address chain, etc.) when the register pressure is low. This does seem to reduce the number of spills in the cases I've looked at. But with x86, it's no guarantee the performance of the code improves. It can be turned on with -sched-vertically option. llvm-svn: 28108
*	Remove redundancy and a level of indirection when creating machine operands	Chris Lattner	2006-05-04	1	-21/+5
\| \| \| \|	llvm-svn: 28107
*	Remove and simplify some more machineinstr/machineoperand stuff.	Chris Lattner	2006-05-04	5	-18/+18
\| \| \| \|	llvm-svn: 28105
*	Rename MO_VirtualRegister -> MO_Register. Clean up immediate handling.	Chris Lattner	2006-05-04	9	-18/+18
\| \| \| \|	llvm-svn: 28104
*	Move some methods out of MachineInstr into MachineOperand	Chris Lattner	2006-05-04	14	-60/+39
\| \| \| \|	llvm-svn: 28102
*	Fix Transforms/InstCombine/2006-05-04-DemandedBitCrash.ll	Chris Lattner	2006-05-04	1	-0/+4
\| \| \| \|	llvm-svn: 28101
*	There shalt be only one "immediate" operand type!	Chris Lattner	2006-05-04	16	-61/+47
\| \| \| \|	llvm-svn: 28099
*	Change "value" in MachineOperand to be a GlobalValue, as that is the only	Chris Lattner	2006-05-04	1	-14/+3
\| \| \| \| \| \|	thing that can be in it. Remove a dead method. llvm-svn: 28098
*	Revert Nate's CR patch from last night, which caused many regressions (e.g. ↵	Chris Lattner	2006-05-04	2	-26/+9
\| \| \| \| \| \| \| \| \|	fhourstones). Loading and storing off R0 isn't what we wanted. Also, taking some CR's out of CRRC seems to cause failures as well. Further investigation is required. llvm-svn: 28097
*	Make external globals public; other minor cleanup.	Jeff Cohen	2006-05-04	1	-15/+17
\| \| \| \|	llvm-svn: 28096
*	Make Intel syntax the default when LLVM is built with VC++.	Jeff Cohen	2006-05-04	1	-1/+6
\| \| \| \|	llvm-svn: 28095
*	Remove a bunch more dead V9 specific stuff	Chris Lattner	2006-05-04	4	-40/+10
\| \| \| \|	llvm-svn: 28094
*	Remove a bunch more SparcV9 specific stuff	Chris Lattner	2006-05-04	12	-54/+16
\| \| \| \|	llvm-svn: 28093
*	Remove some more V9-specific stuff.	Chris Lattner	2006-05-04	3	-39/+3
\| \| \| \|	llvm-svn: 28092
*	Remove some more unused stuff from MachineInstr that was leftover from V9.	Chris Lattner	2006-05-04	7	-69/+0
\| \| \| \|	llvm-svn: 28091
*	Simplify handling of relocations	Chris Lattner	2006-05-04	1	-24/+38
\| \| \| \|	llvm-svn: 28090
*	Use movsd to shuffle in the lowest two elements of a v4f32 / v4i32 vector when	Evan Cheng	2006-05-03	1	-0/+8
\| \| \| \| \| \|	movlps cannot be used (e.g. when load from m64 has multiple uses). llvm-svn: 28089
*	Change from using MachineRelocation ctors to using static methods	Chris Lattner	2006-05-03	3	-8/+8
\| \| \| \| \| \|	in MachineRelocation to create Relocations. llvm-svn: 28088
*	minor cleanups, no functionality change	Chris Lattner	2006-05-03	1	-7/+7
\| \| \| \|	llvm-svn: 28087
*	inline a simple method	Chris Lattner	2006-05-03	1	-10/+7
\| \| \| \|	llvm-svn: 28083
*	Suck block address tracking out of targets into the JIT Emitter. This	Chris Lattner	2006-05-03	5	-62/+57
\| \| \| \| \| \| \|	simplifies the MachineCodeEmitter interface just a little bit and makes BasicBlocks work like constant pools and jump tables. llvm-svn: 28082
*	Fix a bug in Owen's checkin that broke the CBE on all non sparc v9 platforms.	Chris Lattner	2006-05-03	1	-1/+1
\| \| \| \|	llvm-svn: 28081
*	Teach the x86 jit how to handle jump tables not directly used by a jump	Nate Begeman	2006-05-03	1	-0/+3
\| \| \| \| \| \|	instruction. llvm-svn: 28080
*	Finish up the initial jump table implementation by allowing jump tables to	Nate Begeman	2006-05-03	1	-26/+34
\| \| \| \| \| \| \|	not be 100% dense. Increase the minimum threshold for the number of cases in a switch statement from 4 to 6 in order to create a jump table. llvm-svn: 28079
*	Bottom up register pressure reduction work: clean up some hacks and enhanced	Evan Cheng	2006-05-03	1	-75/+72
\| \| \| \| \| \| \|	the heuristic to further reduce spills for several test cases. (Note, it may not necessarily translate to runtime win!) llvm-svn: 28076
*	Refactor TargetMachine, pushing handling of TargetData into the ↵	Owen Anderson	2006-05-03	30	-117/+121
\| \| \| \| \| \| \| \|	target-specific subclasses. This has one caller-visible change: getTargetData() now returns a pointer instead of a reference. This fixes PR 759. llvm-svn: 28074
*	Align function bodies correctly.	Chris Lattner	2006-05-03	1	-4/+2
\| \| \| \|	llvm-svn: 28073
*	Simplify some code. Don't add memory blocks to the Blocks list twice.	Chris Lattner	2006-05-03	1	-16/+8
\| \| \| \|	llvm-svn: 28071
*	Add assertions that verify that the actual arguments to a call or invoke match	Chris Lattner	2006-05-03	1	-4/+22
\| \| \| \| \| \|	the prototype of the called function. llvm-svn: 28070