bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fix a crash in scalarrepl for memcpy/memmove where the source and destination	Bob Wilson	2010-01-19	1	-6/+10
\| \| \| \| \| \| \| \| \|	are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848
*	Convert some of the dynamic opcode lookups into static ones.	Owen Anderson	2010-01-17	1	-59/+40
\| \| \| \|	llvm-svn: 93693
*	1) Use the new SimplifyInstructionsInBlock routine instead of the copy	Chris Lattner	2010-01-12	1	-17/+14
\| \| \| \| \| \| \| \| \| \| \| \|	in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253
*	tidy up	Chris Lattner	2010-01-12	1	-5/+1
\| \| \| \|	llvm-svn: 93222
*	Teach jump threading to duplicate small blocks when the branch	Chris Lattner	2010-01-12	1	-9/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221
*	some cleanup, and make it obvious that ProcessJumpOnPHI only works	Chris Lattner	2010-01-11	1	-24/+14
\| \| \| \| \| \|	on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208
*	only factor from expressions whose uses are empty and whose	Chris Lattner	2010-01-09	1	-0/+5
\| \| \| \| \| \|	base is the right expression type. This fixes PR5981. llvm-svn: 93045
*	Suppress an unused variable warning when assertions are off;	Duncan Sands	2010-01-08	1	-2/+3
\| \| \| \| \| \|	remove some trailing whitespace while there. llvm-svn: 93008
*	Use a do-while loop instead of while + boolean.	Benjamin Kramer	2010-01-07	1	-6/+4
\| \| \| \|	llvm-svn: 92912
*	Move the object size intrinsic optimization to inst-combine and make	Eric Christopher	2010-01-06	1	-24/+0
\| \| \| \| \| \|	it work for any integer size return type. llvm-svn: 92853
*	Formatting.	Mikhail Glushenkov	2010-01-06	1	-2/+2
\| \| \| \|	llvm-svn: 92831
*	Move remaining stuff to the isInteger predicate.	Benjamin Kramer	2010-01-05	1	-12/+9
\| \| \| \|	llvm-svn: 92771
*	Convert a ton of simple integer type equality tests to the new predicate.	Benjamin Kramer	2010-01-05	2	-6/+6
\| \| \| \|	llvm-svn: 92760
*	Set Changed properly after calling DeleteDeadPHIs.	Dan Gohman	2010-01-05	2	-2/+2
\| \| \| \|	llvm-svn: 92735
*	Use do+while instead of while for loops which obviously have a	Dan Gohman	2010-01-05	8	-19/+14
\| \| \| \| \| \|	non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734
*	fix an infinite loop in reassociate building emacs.	Chris Lattner	2010-01-05	1	-0/+4
\| \| \| \|	llvm-svn: 92679
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-4/+4
\| \| \| \|	llvm-svn: 92624
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-6/+6
\| \| \| \|	llvm-svn: 92623
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-1/+1
\| \| \| \|	llvm-svn: 92622
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-3/+3
\| \| \| \|	llvm-svn: 92620
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-3/+3
\| \| \| \|	llvm-svn: 92619
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-15/+15
\| \| \| \|	llvm-svn: 92617
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-2/+2
\| \| \| \|	llvm-svn: 92615
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-25/+25
\| \| \| \|	llvm-svn: 92614
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-20/+20
\| \| \| \|	llvm-svn: 92613
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-21/+21
\| \| \| \|	llvm-svn: 92612
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-6/+6
\| \| \| \|	llvm-svn: 92611
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-8/+8
\| \| \| \|	llvm-svn: 92610
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-4/+4
\| \| \| \|	llvm-svn: 92609
*	Change errs() to dbgs().	David Greene	2010-01-05	1	-8/+8
\| \| \| \|	llvm-svn: 92608
*	Remove dead debug info intrinsics.	Devang Patel	2010-01-05	1	-4/+0
\| \| \| \| \| \| \| \| \| \|	Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557
*	80-col violations, trailing whitespace.	Mikhail Glushenkov	2010-01-04	1	-16/+20
\| \| \| \|	llvm-svn: 92470
*	move instcombine to its own library, it's past time.	Chris Lattner	2010-01-04	2	-14088/+0
\| \| \| \|	llvm-svn: 92459
*	implement an instcombine xform needed by clang's codegen	Chris Lattner	2010-01-04	1	-1/+19
\| \| \| \| \| \| \| \|	on the example in PR4216. This doesn't trigger in the testsuite, so I'd really appreciate someone scrutinizing the logic for correctness. llvm-svn: 92458
*	pull my debug hooks out, I'm done with this xform for now.	Chris Lattner	2010-01-03	1	-20/+2
\| \| \| \|	llvm-svn: 92446
*	Small cleanups, refactor some duplicated code into a single method. No	Nick Lewycky	2010-01-03	1	-44/+35
\| \| \| \| \| \|	functionality change. llvm-svn: 92445
*	generalize the previous transformation to handle indexing into	Chris Lattner	2010-01-03	1	-19/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	arrays of structs and other arrays, so long as all the subsequent indexes are constants. This triggers frequently for stuff like: @divisions = internal constant [29 x [2 x i32]] [[2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 2], [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2]], align 32 ; <[29 x [2 x i32]]> [#uses=50] %623 = getelementptr inbounds [29 x [2 x i32]] @divisions, i64 0, i64 %619, i64 0 ; <i32*> [#uses=1] %684 = icmp eq i32 %683, 999 also for the "my_defs" table in 'gs', etc. llvm-svn: 92444
*	Cleanup.	Nick Lewycky	2010-01-03	1	-3/+2
\| \| \| \|	llvm-svn: 92436
*	teach instcombine to optimize idioms like A[i]&42 == 0. This	Chris Lattner	2010-01-02	1	-7/+31
\| \| \| \| \| \| \|	occurs in 403.gcc in mode_mask_array, in safe-ctype.c (which is copied in multiple apps) in _sch_istable, etc. llvm-svn: 92427
*	Teach the table lookup optimization to generate range compares	Chris Lattner	2010-01-02	1	-22/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	when a consequtive sequence of elements all satisfies the predicate. Like the double compare case, this generates better code than the magic constant case and generalizes to more than 32/64 element array lookups. Here are some examples where it triggers. From 403.gcc, most accesses to the rtx_class array are handled, e.g.: @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=547] %142 = icmp eq i8 %141, 105 @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=543] %165 = icmp eq i8 %164, 60 Also, most of the 59-element arrays (mode_class/rid_to_yy, etc) optimized before are actually range compares. This lets 32-bit machines optimize them. 400.perlbmk has stuff like this: 400.perlbmk: PL_regkind, even for 32-bit: @PL_regkind = constant [62 x i8] c"\00\00\02\02\02\06\06\06\06\09\09\0B\0B\0D\0E\0E\0E\11\12\12\14\14\16\16\18\18\1A\1A\1C\1C\1E\1F !!!$$&'((((,-.///88886789:;8$", align 32 ; <[62 x i8]> [#uses=4] %811 = icmp ne i8 %810, 33 @PL_utf8skip = constant [256 x i8] c"\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\04\04\04\04\04\04\04\04\05\05\05\05\06\06\07\0D", align 32 ; <[256 x i8]> [#uses=94] %12 = icmp ult i8 %10, 2 etc. llvm-svn: 92426
*	theoretically the negate we find could be in a different function, check	Chris Lattner	2010-01-02	1	-0/+4
\| \| \| \| \| \|	for this case. llvm-svn: 92425
*	use enums for the over/underdefined markers for clarity. Switch	Chris Lattner	2010-01-02	1	-21/+23
\| \| \| \| \| \|	to using -2/-3 instead of -1/-2 for a future xform. llvm-svn: 92423
*	remove the random sampling framework, which is not maintained anymore.	Chris Lattner	2010-01-02	1	-0/+6
\| \| \| \| \| \|	If there is interest, it can be resurrected from SVN. PR4912. llvm-svn: 92422
*	Fix logic error in previous commit. The != case needs to become an or, not an	Nick Lewycky	2010-01-02	1	-3/+7
\| \| \| \| \| \|	and. llvm-svn: 92419
*	Optimize pointer comparison into the typesafe form, now that the backends will	Nick Lewycky	2010-01-02	1	-2/+20
\| \| \| \| \| \| \|	handle them efficiently. This is the opposite direction of the transformation we used to have here. llvm-svn: 92418
*	Generalize the previous xform to handle cases where exactly	Chris Lattner	2010-01-02	1	-26/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	two elements match or don't match with two comparisons. For example, the testcase compiles into: define i1 @test5(i32 %X) { %1 = icmp eq i32 %X, 2 ; <i1> [#uses=1] %2 = icmp eq i32 %X, 7 ; <i1> [#uses=1] %R = or i1 %1, %2 ; <i1> [#uses=1] ret i1 %R } This generalizes the previous xforms when the array is larger than 64 elements (and this case matches) and generates better code for cases where it overlaps with the magic bitshift case. This generalizes more cases than you might expect. For example, 400.perlbmk has: @PL_utf8skip = constant [256 x i8] c"\01\01\01\... %15 = icmp ult i8 %7, 7 403.gcc has: @rid_to_yy = internal constant [114 x i16] [i16 259, i16 260, ... %18 = icmp eq i16 %16, 295 and xalancbmk has a bunch of examples, such as _ZN11xercesc_2_5L15gCombiningCharsE and _ZN11xercesc_2_5L10gBaseCharsE. llvm-svn: 92417
*	fix a miscompilation I introduced of cdecl with a late change.	Chris Lattner	2010-01-02	1	-1/+1
\| \| \| \|	llvm-svn: 92416
*	enhance the compare/load/index optimization to work on any load	Chris Lattner	2010-01-02	1	-11/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	from a global with 32/64 elements or less (depending on whether i64 is native on the target), generating a bitshift idiom to determine the result. For example, on test4 we produce: define i1 @test4(i32 %X) { %1 = lshr i32 933, %X ; <i32> [#uses=1] %2 = and i32 %1, 1 ; <i32> [#uses=1] %R = icmp ne i32 %2, 0 ; <i1> [#uses=1] ret i1 %R } This triggers in a number of interesting cases, for example, here's an fp case: @A.3255 = internal constant [4 x double] [double 4.100000e+00, double -3.900000e+00, double -1.000000e+00, double 1.000000e+00], align 32 ; <[4 x double]> [#uses=7] ... %7 = fcmp olt double %3, 0.000000e+00 In this case we make the slen2_tab global dead, which is nice: @slen2_tab = internal constant [16 x i32] [i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 2, i32 3], align 32 ; <[16 x i32]> [#uses=1] ... %204 = icmp eq i32 %46, 0 Perl has a bunch of these, also on the 'Perl_regkind' array: @Perl_yygindex = internal constant [51 x i16] [i16 0, i16 0, i16 0, i16 0, i16 374, i16 351, i16 0, i16 -12, i16 0, i16 946, i16 413, i16 -83, i16 0, i16 0, i16 0, i16 -311, i16 -13, i16 4007, i16 2893, i16 0, i16 0, i16 0, i16 0, i16 0, i16 372, i16 -8, i16 0, i16 0, i16 246, i16 -131, i16 43, i16 86, i16 208, i16 -45, i16 -169, i16 987, i16 0, i16 0, i16 0, i16 0, i16 308, i16 0, i16 -271, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0], align 32 ; <[51 x i16]> [#uses=1] ... %1364 = icmp eq i16 %1361, 0 186.crafty really likes this on 64-bit machines, because it triggers on a bunch of globals like this: @white_outpost = internal constant [64 x i8] c"\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\02\02\00\00\00\00\00\04\05\05\04\00\00\00\00\03\06\06\03\00\00\00\00\00\01\01\00\00\00\00\00\00\00\00\00\00\00", align 32 ; <[64 x i8]> [#uses=2] However the big winner is 403.gcc, which triggers hundreds of times, eliminating all the accesses to the 57-element arrays 'mode_class', mode_unit_size, mode_bitsize, regclass_map, etc. go 64-bit machines :) llvm-svn: 92415
*	enhance the previous optimization to work with fcmp in addition	Chris Lattner	2010-01-02	1	-4/+21
\| \| \| \| \| \|	to icmp. llvm-svn: 92412
*	Teach instcombine to fold compares of loads from constant	Chris Lattner	2010-01-02	1	-1/+120
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	arrays with variable indices into a comparison of the index with a constant. The most common occurrence of this that I see by far is stuff like: if ("foobar"[i] == '\0') ... which we compile into: if (i == 6), saving a load and materialization of the global address. This also exposes loop trip count information to later passes in many cases. This triggers hundreds of times in xalancbmk, which is where I first noticed it, but it also triggers in many other apps. Here are a few interesting ones from various apps: @must_be_connected_without = internal constant [8 x i8] [i8 getelementptr inbounds ([3 x i8]* @.str64320, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str27283, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str71327, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str72328, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str18274, i64 0, i64 0), i8* getelementptr inbounds ([6 x i8]* @.str11267, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str32288, i64 0, i64 0), i8* null], align 32 ; <[8 x i8]> [#uses=2] %scevgep.i = getelementptr [8 x i8] @must_be_connected_without, i64 0, i64 %indvar.i ; <i8*> [#uses=1] %17 = load ... %18 = icmp eq i8 %17, null ; <i1> [#uses=1] -> icmp eq i64 %indvar.i, 7 @yytable1095 = internal constant [84 x i8] c"\12\01(\05\06\07\08\09\0A\0B\0C\0D\0E1\0F\10\11266\1D: \10\11,-,0\03'\10\11B6\04\17&\18\1945\05\06\07\08\09\0A\0B\0C\0D\0E\1E\0F\10\11\1A\1B\1C$3+>#%;<IJ=ADFEGH9KL\00\00\00C", align 32 ; <[84 x i8]> [#uses=2] %57 = getelementptr inbounds [84 x i8]* @yytable1095, i64 0, i64 %56 ; <i8> [#uses=1] %mode.0.in = getelementptr inbounds [9 x i32] @mb_mode_table, i64 0, i64 %.pn ; <i32> [#uses=1] load ... %64 = icmp eq i8 %58, 4 ; <i1> [#uses=1] -> icmp eq i64 %.pn, 35 ; <i1> [#uses=0] @gsm_DLB = internal constant [4 x i16] [i16 6554, i16 16384, i16 26214, i16 32767] %scevgep.i = getelementptr [4 x i16] @gsm_DLB, i64 0, i64 %indvar.i ; <i16*> [#uses=1] %425 = load %scevgep.i %426 = icmp eq i16 %425, -32768 ; <i1> [#uses=0] -> false llvm-svn: 92411