| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
and handle it like constant stride vars. This fixes some bad codegen in
variable stride cases. For example, it compiles this:
void foo(int k, int i) {
for (k=i+i; k <= 8192; k+=i)
flags2[k] = 0;
}
to:
LBB1_1: #bb.preheader
movl %eax, %ecx
addl %ecx, %ecx
movl L_flags2$non_lazy_ptr, %edx
LBB1_2: #bb
movb $0, (%edx,%ecx)
addl %eax, %ecx
cmpl $8192, %ecx
jle LBB1_2 #bb
LBB1_5: #return
ret
or (if the array is local and we are in dynamic-nonpic or static mode):
LBB3_2: #bb
movb $0, _flags2(%ecx)
addl %eax, %ecx
cmpl $8192, %ecx
jle LBB3_2 #bb
and:
lis r2, ha16(L_flags2$non_lazy_ptr)
lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
slwi r3, r4, 1
LBB1_2: ;bb
li r5, 0
add r6, r4, r3
stbx r5, r2, r3
cmpwi cr0, r6, 8192
bgt cr0, LBB1_5 ;return
instead of:
leal (%eax,%eax,2), %ecx
movl %eax, %edx
addl %edx, %edx
addl L_flags2$non_lazy_ptr, %edx
xorl %esi, %esi
LBB1_2: #bb
movb $0, (%edx,%esi)
movl %eax, %edi
addl %esi, %edi
addl %ecx, %esi
cmpl $8192, %esi
jg LBB1_5 #return
and:
lis r2, ha16(L_flags2$non_lazy_ptr)
lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
mulli r3, r4, 3
slwi r5, r4, 1
li r6, 0
add r2, r2, r5
LBB1_2: ;bb
li r5, 0
add r7, r3, r6
stbx r5, r2, r6
add r6, r4, r6
cmpwi cr0, r7, 8192
ble cr0, LBB1_2 ;bb
This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and
implements LoopStrengthReduce/var_stride_used_by_compare.ll
llvm-svn: 31809
|
|
|
|
| |
llvm-svn: 31751
|
|
|
|
|
|
|
|
| |
(X >> Z) op (Y >> Z) -> (X op Y) >> Z
for all shifts and all ops={and/or/xor}.
llvm-svn: 31729
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common;
int foo(tree_common *a, tree_common *b) { return a->code == b->code; }
into:
_foo:
movl 4(%esp), %eax
movl 8(%esp), %ecx
movl (%eax), %eax
xorl (%ecx), %eax
# TRUNCATE movb %al, %al
shrb $4, %al
testb %al, %al
sete %al
movzbl %al, %eax
ret
instead of:
_foo:
movl 8(%esp), %eax
movb (%eax), %al
shrb $4, %al
movl 4(%esp), %ecx
movb (%ecx), %cl
shrb $4, %cl
cmpb %al, %cl
sete %al
movzbl %al, %eax
ret
saving one cycle by eliminating a shift.
llvm-svn: 31727
|
|
|
|
|
|
| |
by the shr -> [al]shr patch. This was reduced from 176.gcc.
llvm-svn: 31653
|
|
|
|
| |
llvm-svn: 31610
|
|
|
|
| |
llvm-svn: 31608
|
|
|
|
|
|
| |
add.ll:test33, add.ll:test34, shift-sra.ll:test2
llvm-svn: 31586
|
|
|
|
|
|
| |
case that it is bad to do.
llvm-svn: 31563
|
|
|
|
|
|
| |
delete in the first place. This also makes it simpler.
llvm-svn: 31562
|
|
|
|
| |
llvm-svn: 31561
|
|
|
|
|
|
| |
loops.
llvm-svn: 31560
|
|
|
|
|
|
|
|
| |
This patch converts the old SHR instruction into two instructions,
AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not
dependent on the sign of their operands.
llvm-svn: 31542
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
int func(vFloat v0, vFloat v1) {
int ii;
vSInt32 vsiidx[2];
vsiidx[0] = _mm_cvttps_epi32(v0);
vsiidx[1] = _mm_cvttps_epi32(v1);
ii = ((int *) vsiidx)[4];
return ii;
}
This fixes Transforms/ScalarRepl/2006-11-07-InvalidArrayPromote.ll
llvm-svn: 31524
|
|
|
|
| |
llvm-svn: 31464
|
|
|
|
| |
llvm-svn: 31460
|
|
|
|
| |
llvm-svn: 31431
|
|
|
|
| |
llvm-svn: 31398
|
|
|
|
|
|
|
|
|
|
| |
Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting
fallout by removing unused variables. Remaining warnings have to do with
unused functions (I didn't want to delete code without review) and unused
variables in generated code. Maintainers should clean up the remaining
issues when they see them. All changes pass DejaGnu tests and Olden.
llvm-svn: 31380
|
|
|
|
|
|
| |
Replace the REM instruction with UREM, SREM and FREM.
llvm-svn: 31369
|
|
|
|
| |
llvm-svn: 31362
|
|
|
|
|
|
| |
This fixes http://llvm.org/bugs/show_bug.cgi?id=979
llvm-svn: 31358
|
|
|
|
| |
llvm-svn: 31352
|
|
|
|
| |
llvm-svn: 31346
|
|
|
|
|
|
|
| |
result. This can significantly shrink code and exposes identities more
aggressively.
llvm-svn: 31344
|
|
|
|
| |
llvm-svn: 31342
|
|
|
|
|
|
| |
This triggers thousands of times on multisource.
llvm-svn: 31341
|
|
|
|
|
|
| |
Transforms/LCSSA/2006-10-31-UnreachableBlock-2.ll
llvm-svn: 31317
|
|
|
|
| |
llvm-svn: 31315
|
|
|
|
| |
llvm-svn: 31284
|
|
|
|
| |
llvm-svn: 31258
|
|
|
|
| |
llvm-svn: 31257
|
|
|
|
| |
llvm-svn: 31256
|
|
|
|
| |
llvm-svn: 31255
|
|
|
|
| |
llvm-svn: 31248
|
|
|
|
|
|
|
|
| |
InsertNewInstBefore(new CastInst(Val, ValTy, Val->GetName()), I)
into:
InsertCastBefore(Val, ValTy, I)
llvm-svn: 31204
|
|
|
|
|
|
|
|
| |
Make necessary changes to support DIV -> [SUF]Div. This changes llvm to
have three division instructions: signed, unsigned, floating point. The
bytecode and assembler are backwards compatible, however.
llvm-svn: 31195
|
|
|
|
|
|
| |
produce an EQ property.
llvm-svn: 31193
|
|
|
|
|
|
| |
Fix and comment the "or", "and" and "xor" transformations.
llvm-svn: 31189
|
|
|
|
| |
llvm-svn: 31184
|
|
|
|
| |
llvm-svn: 31151
|
|
|
|
|
|
| |
passes llvm-gcc bootstrap.
llvm-svn: 31146
|
|
|
|
|
|
| |
Prolangs-C/agrep and SCCP/2006-10-23-IPSCCP-Crash.ll
llvm-svn: 31132
|
|
|
|
|
|
|
| |
property is added by running through the list of uses of the value and
adding resolved properties to the property set.
llvm-svn: 31126
|
|
|
|
| |
llvm-svn: 31123
|
|
|
|
| |
llvm-svn: 31121
|
|
|
|
|
|
| |
optimization opportunity pointed out by Chris Lattner.
llvm-svn: 31118
|
|
|
|
|
|
| |
opportunity pointed out by Andrew Lewycky.
llvm-svn: 31115
|
|
|
|
|
|
| |
transformation. This speeds up a C++ app 2.25x.
llvm-svn: 31113
|
|
|
|
|
|
|
|
|
|
| |
1. Better document what is going on here.
2. Only hack on one branch per iteration, making the results less conservative.
3. Handle the problematic case by marking edges executable instead of by
playing with value lattice states. This is far less pessimistic, and fixes
SCCP/ipsccp-gvar.ll.
llvm-svn: 31106
|