| Commit message | Author | Age | Files | Lines |
llvm-svn: 21871
llvm-svn: 21855
llvm-svn: 21838
llvm-svn: 21824
arithmetic lowering.
llvm-svn: 21818
being stored/loaded through!
llvm-svn: 21806
llvm-svn: 21805
llvm-svn: 21803
population (ctpop). Generic lowering is implemented; however, only promotion
is implemented for SelectionDAG at the moment.
More coming soon.
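For reference, here is a sketch in C of the classic parallel-sum popcount
that generic lowering of this kind typically reduces to on targets without
a native instruction (an illustration, not the exact expansion the
legalizer emits):

/* Hacker's Delight-style population count; illustrative only.
   Assumes a 32-bit unsigned int. */
unsigned popcount32(unsigned x) {
  x = x - ((x >> 1) & 0x55555555);                /* 2-bit sums */
  x = (x & 0x33333333) + ((x >> 2) & 0x33333333); /* 4-bit sums */
  x = (x + (x >> 4)) & 0x0F0F0F0F;                /* 8-bit sums */
  return (x * 0x01010101) >> 24;                  /* total in top byte */
}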
llvm-svn: 21676
llvm-svn: 21605
enables one to use alias analysis in the backends.
(TRUNC)Stores and (EXT|ZEXT|SEXT)Loads have an extra SDOperand, a
SrcValueSDNode, which contains the Value*. Note that if the operation is
introduced by the backend, it will still have the operand, but the Value*
will be null.
llvm-svn: 21599
llvm-svn: 21552
int foo1(int x, int y) {
  int t1 = x >= 0;
  int t2 = y >= 0;
  return t1 & t2;
}
int foo2(int x, int y) {
  int t1 = x == -1;
  int t2 = y == -1;
  return t1 & t2;
}
produces:
_foo1:
or r2, r4, r3
srwi r2, r2, 31
xori r3, r2, 1
blr
_foo2:
and r2, r4, r3
addic r2, r2, 1
li r2, 0
addze r3, r2
blr
instead of:
_foo1:
srwi r2, r4, 31
xori r2, r2, 1
srwi r3, r3, 31
xori r3, r3, 1
and r3, r2, r3
blr
_foo2:
addic r2, r4, 1
li r2, 0
addze r2, r2
addic r3, r3, 1
li r3, 0
addze r3, r3
and r3, r2, r3
blr
llvm-svn: 21547
_foo:
or r2, r4, r3
srwi r3, r2, 31
blr
instead of:
_foo:
srwi r2, r4, 31
srwi r3, r3, 31
or r3, r2, r3
blr
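The source that plausibly produces this (an assumption; the commit shows
only the assembly) is an or of two sign tests:

int foo(int x, int y) {
  return (x < 0) | (y < 0);  /* sign(x) | sign(y) == sign(x | y) */
}

The fold is sound because x | y has its sign bit set exactly when either
x or y does, so one or plus one shift replaces two shifts and an or.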
llvm-svn: 21544
llvm-svn: 21439
llvm-svn: 21420
bool %test(int %X) {
  %Y = and int %X, 8
  %Z = setne int %Y, 0
  ret bool %Z
}
we now generate this:
rlwinm r2, r3, 0, 28, 28
srwi r3, r2, 3
instead of this:
rlwinm r2, r3, 0, 28, 28
srwi r2, r2, 3
rlwinm r3, r2, 0, 31, 31
I'll leave it to Nate to get it down to one instruction. :)
llvm-svn: 21391
This turns this PPC code:
rlwinm r2, r3, 0, 28, 28
cmpwi cr7, r2, 8
mfcr r2
rlwinm r3, r2, 31, 31, 31
into this:
rlwinm r2, r3, 0, 28, 28
srwi r2, r2, 3
rlwinm r3, r2, 0, 31, 31
Next up, nuking the extra and.
llvm-svn: 21390
llvm-svn: 21319
llvm-svn: 21318
llvm-svn: 21317
// (X != 0) | (Y != 0) -> (X|Y != 0)
// (X == 0) & (Y == 0) -> (X|Y == 0)
Compiling this:
int %bar(int %a, int %b) {
entry:
  %tmp.1 = setne int %a, 0
  %tmp.2 = setne int %b, 0
  %tmp.3 = or bool %tmp.1, %tmp.2
  %retval = cast bool %tmp.3 to int
  ret int %retval
}
to this:
_bar:
or r2, r3, r4
addic r3, r2, -1
subfe r3, r3, r2
blr
instead of:
_bar:
addic r2, r3, -1
subfe r2, r2, r3
addic r3, r4, -1
subfe r3, r3, r4
or r3, r2, r3
blr
llvm-svn: 21316
eliminating an and for Nate's testcase:
int %bar(int %a, int %b) {
entry:
  %tmp.1 = setne int %a, 0
  %tmp.2 = setne int %b, 0
  %tmp.3 = or bool %tmp.1, %tmp.2
  %retval = cast bool %tmp.3 to int
  ret int %retval
}
generating:
_bar:
addic r2, r3, -1
subfe r2, r2, r3
addic r3, r4, -1
subfe r3, r3, r4
or r3, r2, r3
blr
instead of:
_bar:
addic r2, r3, -1
subfe r2, r2, r3
addic r3, r4, -1
subfe r3, r3, r4
or r2, r2, r3
rlwinm r3, r2, 0, 31, 31
blr
llvm-svn: 21315
in the PPC Pattern ISel
llvm-svn: 21297
Move the transform for select (a < 0) ? b : 0 into the DAG from the PPC ISel.
Enable the DAG to fold and (setcc, 1) -> setcc for targets where setcc
always produces zero or one.
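A hypothetical source-level example of the first transform; on targets
with an arithmetic right shift, (a < 0) ? b : 0 can be computed
branchlessly, since a >> 31 is all ones exactly when a is negative:

int sel(int a, int b) {
  return (a < 0) ? b : 0;  /* foldable to (a >> 31) & b for 32-bit int */
}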
llvm-svn: 21291
llvm-svn: 21289
llvm-svn: 21288
with != 0 comparisons vanishing.
llvm-svn: 21287
llvm-svn: 21273
llvm-svn: 21272
instead. Overall, this increases the amount of folding we can do.
llvm-svn: 21265
llvm-svn: 21262
Make LLVM undef values generate ISD::UNDEF nodes.
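(For illustration, one common way an undef value reaches the selection
DAG: a read of an uninitialized local, assuming it has been promoted to
a register.)

int leaves_undef(void) {
  int x;      /* never initialized */
  return x;   /* becomes an LLVM undef value after promotion */
}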
llvm-svn: 21261
compile this:
int foo(unsigned long a, unsigned long long g) {
  return a >= g;
}
To:
foo:
movl 8(%esp), %eax
cmpl %eax, 4(%esp)
setae %al
cmpl $0, 12(%esp)
sete %cl
andb %al, %cl
movzbl %cl, %eax
ret
instead of:
foo:
movl 8(%esp), %eax
cmpl %eax, 4(%esp)
setae %al
movzbw %al, %cx
movl 12(%esp), %edx
cmpl $0, %edx
sete %al
movzbw %al, %ax
cmpl $0, %edx
cmove %cx, %ax
movzbl %al, %eax
ret
llvm-svn: 21244
unsigned long long g;
unsigned long foo(unsigned long a) {
  return (a >= g) ? 1 : 0;
}
It changes the PPC code from:
_foo:
.LBB_foo_0: ; entry
mflr r11
stw r11, 8(r1)
bl "L00000$pb"
"L00000$pb":
mflr r2
addis r2, r2, ha16(L_g$non_lazy_ptr-"L00000$pb")
lwz r2, lo16(L_g$non_lazy_ptr-"L00000$pb")(r2)
lwz r4, 0(r2)
lwz r2, 4(r2)
cmplw cr0, r3, r2
li r2, 1
li r3, 0
bge .LBB_foo_2 ; entry
.LBB_foo_1: ; entry
or r2, r3, r3
.LBB_foo_2: ; entry
cmplwi cr0, r4, 1
li r3, 1
li r5, 0
blt .LBB_foo_4 ; entry
.LBB_foo_3: ; entry
or r3, r5, r5
.LBB_foo_4: ; entry
cmpwi cr0, r4, 0
beq .LBB_foo_6 ; entry
.LBB_foo_5: ; entry
or r2, r3, r3
.LBB_foo_6: ; entry
rlwinm r3, r2, 0, 31, 31
lwz r11, 8(r1)
mtlr r11
blr
to:
_foo:
.LBB_foo_0: ; entry
mflr r11
stw r11, 8(r1)
bl "L00000$pb"
"L00000$pb":
mflr r2
addis r2, r2, ha16(L_g$non_lazy_ptr-"L00000$pb")
lwz r2, lo16(L_g$non_lazy_ptr-"L00000$pb")(r2)
lwz r4, 0(r2)
lwz r2, 4(r2)
cmplw cr0, r3, r2
li r2, 1
li r3, 0
bge .LBB_foo_2 ; entry
.LBB_foo_1: ; entry
or r2, r3, r3
.LBB_foo_2: ; entry
cntlzw r3, r4
srwi r3, r3, 5
cmpwi cr0, r4, 0
beq .LBB_foo_4 ; entry
.LBB_foo_3: ; entry
or r2, r3, r3
.LBB_foo_4: ; entry
rlwinm r3, r2, 0, 31, 31
lwz r11, 8(r1)
mtlr r11
blr
llvm-svn: 21241
the result does change as a result of the extend.
This improves codegen for Alpha on this testcase:
int %a(ushort* %i) {
  %tmp.1 = load ushort* %i
  %tmp.2 = cast ushort %tmp.1 to int
  %tmp.4 = and int %tmp.2, 1
  ret int %tmp.4
}
Generating:
a:
ldgp $29, 0($27)
ldwu $0,0($16)
and $0,1,$0
ret $31,($26),1
instead of:
a:
ldgp $29, 0($27)
ldwu $0,0($16)
and $0,1,$0
addl $0,0,$0
ret $31,($26),1
btw, alpha really should switch to livein/outs for args :)
llvm-svn: 21213
llvm-svn: 21204
llvm-svn: 21203
the new zero extend, not the original operand. This fixes cast bool -> long
on PPC.
Add an unrelated FIXME.
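A minimal source case that exercises this path (illustrative; in the old
LLVM type system the operation is spelled 'cast bool %b to long'):

long bool_to_long(int x) {
  return x != 0;  /* bool comparison result zero-extended to long */
}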
llvm-svn: 21196
int a(short i) {
  return i & 1;
}
as
_a:
andi. r3, r3, 1
blr
instead of:
_a:
rlwinm r2, r3, 0, 16, 31
andi. r3, r2, 1
blr
on PPC. It should also help the other RISC targets.
llvm-svn: 21189
is deconstructed then reconstructed here. This catches 19 fabs's in 177.mesa,
9 in 168.wupwise, 5 in 171.swim, 3 in 172.mgrid, and 14 in 173.applu out of
SPECfp2000.
This allows the X86 code generator to make MUCH better code than before for
each of these, and saves one instruction on PPC.
This depends on the previous CFE patch to expose these correctly.
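One plausible source idiom (an assumption; the commit does not show the
source) whose compare-and-negate expansion can be matched back into a
single fabs node, ignoring the signed-zero subtlety:

double my_fabs(double x) {
  return (x < 0.0) ? -x : x;  /* differs from fabs only for -0.0 */
}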
llvm-svn: 21171
llvm-svn: 21165
llvm-svn: 21160
this into sign/zero extension instructions later.
On PPC, for example, this testcase:
%G = external global sbyte
implementation
void %test(int %X, int %Y) {
  %C = setlt int %X, %Y
  %D = cast bool %C to sbyte
  store sbyte %D, sbyte* %G
  ret void
}
Now codegens to:
cmpw cr0, r3, r4
li r3, 1
li r4, 0
blt .LBB_test_2 ;
.LBB_test_1: ;
or r3, r4, r4
.LBB_test_2: ;
addis r2, r2, ha16(L_G$non_lazy_ptr-"L00000$pb")
lwz r2, lo16(L_G$non_lazy_ptr-"L00000$pb")(r2)
stb r3, 0(r2)
instead of:
cmpw cr0, r3, r4
li r3, 1
li r4, 0
blt .LBB_test_2 ;
.LBB_test_1: ;
or r3, r4, r4
.LBB_test_2: ;
*** rlwinm r3, r3, 0, 31, 31
addis r2, r2, ha16(L_G$non_lazy_ptr-"L00000$pb")
lwz r2, lo16(L_G$non_lazy_ptr-"L00000$pb")(r2)
stb r3, 0(r2)
llvm-svn: 21148
llvm-svn: 21144
(likewise for <= >=u >=u).
Second, it implements a special case hack to turn 'X gtu SINTMAX' -> 'X lt 0'
On PowerPC, for example, this changes this:
lis r2, 32767
ori r2, r2, 65535
cmplw cr0, r3, r2
bgt .LBB_test_2
into:
cmpwi cr0, r3, 0
blt .LBB_test_2
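A hypothetical source for the sequence above; X >u SINTMAX holds exactly
when the sign bit of X is set, i.e. when X is negative when reinterpreted
as signed:

int in_top_half(unsigned x) {
  return x > 2147483647U;  /* x >u 0x7fffffff  ==  (int)x < 0 */
}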
llvm-svn: 21142
elements out of the autoCSE maps.
llvm-svn: 21128
multiply.
llvm-svn: 21102
llvm-svn: 21008
llvm-svn: 21004