bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	trunc to integer, not to FP.	Chris Lattner	2006-12-11	1	-2/+11
\| \| \| \|	llvm-svn: 32426
*	implement promotion of unions containing two packed types of the same width.	Chris Lattner	2006-12-11	1	-15/+30
\| \| \| \| \| \|	This implements Transforms/ScalarRepl/union-packed.ll llvm-svn: 32422
*	* Eliminate calls to CastInst::createInferredCast.	Chris Lattner	2006-12-10	1	-40/+93
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Add support for promoting unions with fp values in them. This produces our new int<->fp bitcast instructions, implementing Transforms/ScalarRepl/union-fp-int.ll As an example, this allows us to compile this: union intfloat { int i; float f; }; float invsqrt(const float arg_x) { union intfloat x = { .f = arg_x }; const float xhalf = arg_x * 0.5f; x.i = 0x5f3759df - (x.i >> 1); return x.f * (1.5f - xhalf * x.f * x.f); } into: _invsqrt: movss 4(%esp), %xmm0 movd %xmm0, %eax sarl %eax movl $1597463007, %ecx subl %eax, %ecx movd %ecx, %xmm1 mulss LCPI1_0, %xmm0 mulss %xmm1, %xmm0 movss LCPI1_1, %xmm2 mulss %xmm1, %xmm0 subss %xmm0, %xmm2 movl 8(%esp), %eax mulss %xmm2, %xmm1 movss %xmm1, (%eax) ret instead of: _invsqrt: subl $4, %esp movss 8(%esp), %xmm0 movss %xmm0, (%esp) movl (%esp), %eax movl $1597463007, %ecx sarl %eax subl %eax, %ecx movl %ecx, (%esp) mulss LCPI1_0, %xmm0 movss (%esp), %xmm1 mulss %xmm1, %xmm0 mulss %xmm1, %xmm0 movss LCPI1_1, %xmm2 subss %xmm0, %xmm2 mulss %xmm2, %xmm1 movl 12(%esp), %eax movss %xmm1, (%eax) addl $4, %esp ret llvm-svn: 32418
*	Incorporate any changes in the successor blocks into the result of	Reid Spencer	2006-12-08	1	-1/+1
\| \| \| \| \| \|	MarkAliveBlocks. llvm-svn: 32375
*	What should be the last unnecessary <iostream>s in the library.	Bill Wendling	2006-12-07	1	-6/+5
\| \| \| \|	llvm-svn: 32333
*	Removing even more <iostream> includes.	Bill Wendling	2006-12-07	4	-90/+79
\| \| \| \|	llvm-svn: 32320
*	Changed llvm_ostream et all to OStream. llvm_cerr, llvm_cout, llvm_null, are	Bill Wendling	2006-12-07	13	-49/+46
\| \| \| \| \| \|	now cerr, cout, and NullStream resp. llvm-svn: 32298
*	Update ConstantIntegral Max/Min tests for new interface.	Reid Spencer	2006-12-06	1	-4/+4
\| \| \| \|	llvm-svn: 32288
*	add missing #include	Chris Lattner	2006-12-06	1	-0/+1
\| \| \| \|	llvm-svn: 32280
*	Detemplatize the Statistic class. The only type it is instantiated with	Chris Lattner	2006-12-06	49	-125/+125
\| \| \| \| \| \|	is 'unsigned'. llvm-svn: 32279
*	Remove the 'printname' argument to WriteAsOperand. It is always true, and	Chris Lattner	2006-12-06	1	-1/+1
\| \| \| \| \| \|	passing false would make the asmprinter fail anyway. llvm-svn: 32264
*	counter should be unsigned.	Chris Lattner	2006-12-06	1	-1/+1
\| \| \| \|	llvm-svn: 32252
*	add an instcombine xform. This speeds up 462.libquantum from 9.78s to	Chris Lattner	2006-12-05	1	-0/+17
\| \| \| \| \| \|	7.48s. This regression is due to unforseen consequences of the cast patch. llvm-svn: 32209
*	SCCP does not handle Packed Type properly. Disable Packed Type handling	Devang Patel	2006-12-04	1	-1/+17
\| \| \| \| \| \|	for now. llvm-svn: 32208
*	Update call to CastInst::getCastOpcode for its new signature.	Reid Spencer	2006-12-04	1	-1/+2
\| \| \| \|	llvm-svn: 32166
*	Unbreak VC++ build.	Jeff Cohen	2006-12-02	1	-7/+7
\| \| \| \|	llvm-svn: 32113
*	disable transformations that are invalid for fp vectors. This fixes	Chris Lattner	2006-12-02	1	-4/+4
\| \| \| \| \| \|	Transforms/InstCombine/2006-12-01-BadFPVectorXform.ll llvm-svn: 32112
*	Remove 4 FIXMEs to hack around cast-to-bool problems which no longer exist.	Reid Spencer	2006-11-30	1	-46/+3
\| \| \| \|	llvm-svn: 32051
*	make it clear that this is always a zext	Chris Lattner	2006-11-30	1	-1/+1
\| \| \| \|	llvm-svn: 32044
*	One more bugfix, 3 cases of making casts explicit.	Chris Lattner	2006-11-30	1	-5/+8
\| \| \| \|	llvm-svn: 32043
*	Fix a bug in globalopt due to the recent cast patch.	Chris Lattner	2006-11-30	1	-1/+2
\| \| \| \|	llvm-svn: 32042
*	implement cast.ll:test35. With this, we recognize:	Chris Lattner	2006-11-29	1	-0/+16
\| \| \| \| \| \| \| \| \| \|	unsigned short swp(unsigned short a) { return ((a & 0xff00) >> 8 \| (a & 0x00ff) << 8); } as an idiom for bswap. llvm-svn: 32011
*	Teach instcombine to turn trunc(srl x, c) -> srl (trunc(x), c) when safe.	Chris Lattner	2006-11-29	1	-1/+33
\| \| \| \| \| \| \|	This implements InstCombine/cast.ll:test34. It fires hundreds of times on 176.gcc. llvm-svn: 32009
*	Implement Regression/Transforms/InstCombine/bswap-fold.ll,	Chris Lattner	2006-11-29	1	-1/+24
\| \| \| \| \| \|	folding seteq (bswap(x)), c -> seteq(x,bswap(c)) llvm-svn: 32006
*	Join a split line.	Reid Spencer	2006-11-29	1	-2/+1
\| \| \| \|	llvm-svn: 31996
*	Undo the last patch until 253.perlbmk passes with these changes.	Reid Spencer	2006-11-28	1	-3/+46
\| \| \| \|	llvm-svn: 31977
*	Remove 4 FIXME's from the CAST patch now that the back end is correctly	Reid Spencer	2006-11-28	1	-46/+3
\| \| \| \| \| \|	producing code for "trunc to bool". This passes all tests on Linux. llvm-svn: 31963
*	Fix PR1014 and InstCombine/2006-11-27-XorBug.ll.	Chris Lattner	2006-11-27	1	-10/+8
\| \| \| \|	llvm-svn: 31941
*	For PR950:	Reid Spencer	2006-11-27	21	-786/+908
\| \| \| \| \| \| \| \| \| \|	The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931
*	Remove #include <iostream> and use llvm_* streams instead.	Bill Wendling	2006-11-26	3	-40/+37
\| \| \| \|	llvm-svn: 31925
*	Replace #include <iostream> with llvm_* streams.	Bill Wendling	2006-11-26	6	-69/+62
\| \| \| \|	llvm-svn: 31924
*	Removed #include <iostream> and replaced with llvm_* streams.	Bill Wendling	2006-11-26	11	-115/+100
\| \| \| \|	llvm-svn: 31923
*	Removed #include <iostream> and used the llvm_cerr/DOUT streams instead.	Bill Wendling	2006-11-26	7	-44/+34
\| \| \| \|	llvm-svn: 31922
*	Update to new predicate simplifier VRP design. Fixes PR966 and PR967.	Nick Lewycky	2006-11-22	1	-574/+1105
\| \| \| \| \| \| \| \|	Remove predicate simplifier from default gcc3 pipeline. New design is too slow to enable by default. Add new testcases for problems encountered in development. llvm-svn: 31895
*	This xform is handled by FoldOpIntoPhi in visitCastInst in a more elegant way.	Chris Lattner	2006-11-21	1	-30/+1
\| \| \| \|	llvm-svn: 31889
*	Do not convert massive blocks on phi nodes into select statements. Instead	Chris Lattner	2006-11-18	1	-0/+27
\| \| \| \| \| \| \|	only do these transformations if there are a small number of phi's. This speeds up Ptrdist/ks from 2.35s to 2.19s on my mac pro. llvm-svn: 31853
*	If an indvar with a variable stride is used by the exit condition, go ahead	Chris Lattner	2006-11-17	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and handle it like constant stride vars. This fixes some bad codegen in variable stride cases. For example, it compiles this: void foo(int k, int i) { for (k=i+i; k <= 8192; k+=i) flags2[k] = 0; } to: LBB1_1: #bb.preheader movl %eax, %ecx addl %ecx, %ecx movl L_flags2$non_lazy_ptr, %edx LBB1_2: #bb movb $0, (%edx,%ecx) addl %eax, %ecx cmpl $8192, %ecx jle LBB1_2 #bb LBB1_5: #return ret or (if the array is local and we are in dynamic-nonpic or static mode): LBB3_2: #bb movb $0, _flags2(%ecx) addl %eax, %ecx cmpl $8192, %ecx jle LBB3_2 #bb and: lis r2, ha16(L_flags2$non_lazy_ptr) lwz r2, lo16(L_flags2$non_lazy_ptr)(r2) slwi r3, r4, 1 LBB1_2: ;bb li r5, 0 add r6, r4, r3 stbx r5, r2, r3 cmpwi cr0, r6, 8192 bgt cr0, LBB1_5 ;return instead of: leal (%eax,%eax,2), %ecx movl %eax, %edx addl %edx, %edx addl L_flags2$non_lazy_ptr, %edx xorl %esi, %esi LBB1_2: #bb movb $0, (%edx,%esi) movl %eax, %edi addl %esi, %edi addl %ecx, %esi cmpl $8192, %esi jg LBB1_5 #return and: lis r2, ha16(L_flags2$non_lazy_ptr) lwz r2, lo16(L_flags2$non_lazy_ptr)(r2) mulli r3, r4, 3 slwi r5, r4, 1 li r6, 0 add r2, r2, r5 LBB1_2: ;bb li r5, 0 add r7, r3, r6 stbx r5, r2, r6 add r6, r4, r6 cmpwi cr0, r7, 8192 ble cr0, LBB1_2 ;bb This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and implements LoopStrengthReduce/var_stride_used_by_compare.ll llvm-svn: 31809
*	Fix a gcc 4.2 warning.	Chris Lattner	2006-11-15	1	-0/+2
\| \| \| \|	llvm-svn: 31751
*	implement InstCombine/shift-simplify.ll by transforming:	Chris Lattner	2006-11-14	1	-3/+46
\| \| \| \| \| \| \| \|	(X >> Z) op (Y >> Z) -> (X op Y) >> Z for all shifts and all ops={and/or/xor}. llvm-svn: 31729
*	implement InstCombine/and-compare.ll:test1. This compiles:	Chris Lattner	2006-11-14	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common; int foo(tree_common a, tree_common b) { return a->code == b->code; } into: _foo: movl 4(%esp), %eax movl 8(%esp), %ecx movl (%eax), %eax xorl (%ecx), %eax # TRUNCATE movb %al, %al shrb $4, %al testb %al, %al sete %al movzbl %al, %eax ret instead of: _foo: movl 8(%esp), %eax movb (%eax), %al shrb $4, %al movl 4(%esp), %ecx movb (%ecx), %cl shrb $4, %cl cmpb %al, %cl sete %al movzbl %al, %eax ret saving one cycle by eliminating a shift. llvm-svn: 31727
*	Fix InstCombine/2006-11-10-ashr-miscompile.ll a miscompilation introduced	Chris Lattner	2006-11-10	1	-3/+3
\| \| \| \| \| \|	by the shr -> [al]shr patch. This was reduced from 176.gcc. llvm-svn: 31653
*	second patch to fix PR992/993.	Chris Lattner	2006-11-09	1	-4/+17
\| \| \| \|	llvm-svn: 31610
*	Minimal patch to fix PR992/PR993	Chris Lattner	2006-11-09	1	-2/+1
\| \| \| \|	llvm-svn: 31608
*	Teach ShrinkDemandedConstant how to handle X+C. This implements:	Chris Lattner	2006-11-09	1	-1/+100
\| \| \| \| \| \|	add.ll:test33, add.ll:test34, shift-sra.ll:test2 llvm-svn: 31586
*	reenable factoring of GEP expressions, being more precise about the	Chris Lattner	2006-11-08	1	-5/+10
\| \| \| \| \| \|	case that it bad to do. llvm-svn: 31563
*	make this code more efficient by not creating a phi node we are just going to	Chris Lattner	2006-11-08	1	-36/+33
\| \| \| \| \| \|	delete in the first place. This also makes it simpler. llvm-svn: 31562
*	Remove redundant <cmath>.	Jim Laskey	2006-11-08	1	-1/+0
\| \| \| \|	llvm-svn: 31561
*	disable this factoring optzn for GEPs for now, this severely pessimizes some	Chris Lattner	2006-11-08	1	-1/+1
\| \| \| \| \| \|	loops. llvm-svn: 31560
*	For PR950:	Reid Spencer	2006-11-08	5	-215/+194
\| \| \| \| \| \| \| \|	This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542
*	scalarrepl should not split the two elements of the vsiidx array:	Chris Lattner	2006-11-07	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	int func(vFloat v0, vFloat v1) { int ii; vSInt32 vsiidx[2]; vsiidx[0] = _mm_cvttps_epi32(v0); vsiidx[1] = _mm_cvttps_epi32(v1); ii = ((int *) vsiidx)[4]; return ii; } This fixes Transforms/ScalarRepl/2006-11-07-InvalidArrayPromote.ll llvm-svn: 31524