bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	It is possible that subprgoram definition is only encoding return value ↵	Devang Patel	2009-02-27	1	-2/+6
\| \| \| \| \| \|	directly, instsad of an DIArray of all argument types. llvm-svn: 65643
*	Refactor TLS code and add some tests. The tests and expected results are:	Rafael Espindola	2009-02-27	3	-14/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632
*	Introduce a new technique for merging BasicBlock with Instruction sentinel ↵	Gabor Greif	2009-02-27	1	-34/+0
\| \| \| \| \| \| \| \| \|	by superposition. This looks dangerous, but isn't because the sentinel is accessed in special way only, namely the Next and Prev fields of it, and these are guaranteed to exist. llvm-svn: 65626
*	Silence compiler warning about use of uninitialized variables (in reality these	Nick Lewycky	2009-02-27	1	-1/+1
\| \| \| \| \| \|	are always set by reference on the path that uses them.) No functional change. llvm-svn: 65621
*	Fix compiler warning about uninitialized variables. No functional change.	Nick Lewycky	2009-02-27	1	-1/+1
\| \| \| \|	llvm-svn: 65620
*	Alignment values for i64 and f64 on ppc64 were wrong,	Dale Johannesen	2009-02-27	1	-1/+3
\| \| \| \| \| \| \| \|	possibly for the reason suggested by the comment. No wonder it didn't work very well. This unblocks bootstrap with assertions on ppc. llvm-svn: 65601
*	MachineLICM CSE should match destination register classes; avoid hoisting ↵	Evan Cheng	2009-02-27	1	-3/+13
\| \| \| \| \| \|	implicit_def's. llvm-svn: 65592
*	Ignore dbg info intrinsics when folding conditional branch to	Zhou Sheng	2009-02-26	1	-1/+5
\| \| \| \| \| \|	conditional branch predecessors. llvm-svn: 65509
*	Enable stack slot coloring DCE. Evan's spiller fixes were needed before ↵	Owen Anderson	2009-02-26	1	-7/+2
\| \| \| \| \| \|	this could happen. llvm-svn: 65501
*	ADDS{D\|S}rr_Int and MULS{D\|S}rr_Int are not commutable. The users of these ↵	Evan Cheng	2009-02-26	1	-8/+4
\| \| \| \| \| \|	intrinsics expect the high bits will not be modified. llvm-svn: 65499
*	The last commit was overly conservative. It's ok to reuse value that's ↵	Evan Cheng	2009-02-26	1	-7/+0
\| \| \| \| \| \|	already marked livein. llvm-svn: 65498
*	If an available register falls through to a succ block, unset the last kill. ↵	Evan Cheng	2009-02-26	1	-37/+76
\| \| \| \| \| \|	Sorry, it's impossible to reduce a sensible test case. It basically requires the moon and stars to align in order to cause a failure. llvm-svn: 65497
*	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296.	Evan Cheng	2009-02-25	11	-324/+329
\| \| \| \|	llvm-svn: 65482
*	Fix big-endian codegen bug. We're splitting up	Dale Johannesen	2009-02-25	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	overly long ints, e.g. i96, into pieces at PHIs and the nodes that feed into them; however big-endian reverses the order of the pieces (for some reason), and wasn't doing it the same way on both sides, so the pieces didn't match and runtime failures ensued. Fixes 188.ammp and sqlite3 on ppc32. llvm-svn: 65481
*	Print variable's display name in dwarf DIE.	Devang Patel	2009-02-25	1	-1/+1
\| \| \| \|	llvm-svn: 65468
*	Fix PR3667	Chris Lattner	2009-02-25	1	-1/+1
\| \| \| \|	llvm-svn: 65464
*	Don't block basic block with only SwitchInst to fold into predecessors.	Zhou Sheng	2009-02-25	1	-1/+5
\| \| \| \|	llvm-svn: 65456
*	Clean up dwarf writer, part 1. This eliminated the horrible recursive ↵	Evan Cheng	2009-02-25	4	-335/+414
\| \| \| \| \| \| \| \|	getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438
*	Add a totally synthetic situation I came up with while looking at a bug in	Nick Lewycky	2009-02-25	1	-0/+17
\| \| \| \| \| \|	related code. llvm-svn: 65437
*	Expand tabs to spaces (overlooked in previous commit)	Scott Michel	2009-02-25	1	-12/+12
\| \| \| \|	llvm-svn: 65427
*	Remove all "cached" data from BuildVectorSDNode, preferring to retrieve	Scott Michel	2009-02-25	2	-19/+14
\| \| \| \| \| \| \| \| \|	results via reference parameters. This patch also appears to fix Evan's reported problem supplied as a reduced bugpoint test case. llvm-svn: 65426
*	Added support to have TableGen provide information if an intrinsic (core	Mon P Wang	2009-02-24	1	-0/+10
\| \| \| \| \| \|	or target) can be overloaded or not. llvm-svn: 65404
*	If compile unit's language is not set then don't crash while dump'ing ↵	Devang Patel	2009-02-24	1	-1/+2
\| \| \| \| \| \|	compile unit. llvm-svn: 65402
*	Extension of GEP in constant folder was broken (apparently this code	Daniel Dunbar	2009-02-24	1	-1/+1
\| \| \| \| \| \| \|	has never been run!). - Sorry, don't know how to make an LLVM test case for this. llvm-svn: 65383
*	Rename ScalarEvolution's getIterationCount to getBackedgeTakenCount,	Dan Gohman	2009-02-24	5	-122/+144
\| \| \| \| \| \| \| \| \|	to more accurately describe what it does. Expand its doxygen comment to describe what the backedge-taken count is and how it differs from the actual iteration count of the loop. Adjust names and comments in associated code accordingly. llvm-svn: 65382
*	Overhaul my earlier submission due to feedback. It's a large patch, but most of	Bill Wendling	2009-02-24	40	-129/+166
\| \| \| \| \| \| \| \| \| \| \| \|	them are generic changes. - Use the "fast" flag that's already being passed into the asm printers instead of shoving it into the DwarfWriter. - Instead of calling "MI->getParent()->getParent()" for every MI, set the machine function when calling "runOnMachineFunction" in the asm printers. llvm-svn: 65379
*	Add a debugging option for SSC DCE.	Owen Anderson	2009-02-24	1	-0/+5
\| \| \| \|	llvm-svn: 65375
*	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit	Bill Wendling	2009-02-24	4	-14/+15
\| \| \| \| \| \| \| \| \| \|	a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367
*	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple	Dan Gohman	2009-02-24	3	-3/+8
\| \| \| \| \| \| \|	ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364
*	Generalize the ChangeCompareStride code, in preparation for	Dan Gohman	2009-02-24	1	-94/+96
\| \| \| \| \| \|	handling non-constant strides. No functionality change. llvm-svn: 65363
*	Preserve the DominanceFrontier analysis in the LoopDeletion pass.	Dan Gohman	2009-02-24	1	-2/+7
\| \| \| \|	llvm-svn: 65359
*	gdb uses DW_AT_prototyped to identify K&R style in C based languages.	Devang Patel	2009-02-24	1	-0/+5
\| \| \| \| \| \|	This fixes objc.dg/dwarf-prototypes.m scan-assembler DW_AT_prototyped from llvmgcc42 test suite. llvm-svn: 65357
*	While folding unconditional return move DbgRegionEndInst into the ↵	Devang Patel	2009-02-24	2	-23/+10
\| \| \| \| \| \| \| \| \| \| \|	predecessor, instead of removing it. This fixes following tests from llvmgcc42 testsuite. gcc.c-torture/execute/20000605-3.c gcc.c-torture/execute/20020619-1.c gcc.c-torture/execute/20030920-1.c gcc.c-torture/execute/loop-ivopts-1.c llvm-svn: 65353
*	If there is not any debug info available for any global variables and any ↵	Devang Patel	2009-02-24	1	-13/+26
\| \| \| \| \| \|	subprograms then there is not any debug info to emit. llvm-svn: 65352
*	Back out the change in 64918 that used sign-extensions when promoting	Dan Gohman	2009-02-23	1	-34/+12
\| \| \| \| \| \| \| \| \|	trip counts that use signed comparisons. It's not obviously the best approach for preserving trip count information, and at any rate there isn't anything in the tree right now that makes use of that, so for now always using zero-extensions is preferable. llvm-svn: 65347
*	Fast-isel can't do TLS yet, so it should fall back to SDISel	Dan Gohman	2009-02-23	1	-0/+6
\| \| \| \| \| \|	if it sees TLS addresses. llvm-svn: 65341
*	LoopDeletion needs to inform ScalarEvolution when a loop is deleted,	Dan Gohman	2009-02-23	1	-1/+4
\| \| \| \| \| \| \| \|	so that ScalarEvolution doesn't hang onto a dangling Loop*, which could be a problem if another Loop happens to get allocated at the same address. llvm-svn: 65323
*	IndVarSimplify preserves ScalarEvolution. In the	Dan Gohman	2009-02-23	1	-0/+1
\| \| \| \| \| \| \|	-std-compile-opts sequence, this avoids the need for ScalarEvolution to be rerun before LoopDeletion. llvm-svn: 65318
*	Should reset DBI_Prev if DBI_Next == 0.	Zhou Sheng	2009-02-23	1	-0/+2
\| \| \| \|	llvm-svn: 65314
*	Only v1i16 (i.e. _m64) is returned via RAX / RDX.	Evan Cheng	2009-02-23	3	-19/+50
\| \| \| \|	llvm-svn: 65313
*	Generate better code for v8i16 shuffles on SSE2	Nate Begeman	2009-02-23	3	-249/+360
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Generate better code for v16i8 shuffles on SSE2 (avoids stack) Generate pshufb for v8i16 and v16i8 shuffles on SSSE3 where it is fewer uops. Document the shuffle matching logic and add some FIXMEs for later further cleanups. New tests that test the above. Examples: New: _shuf2: pextrw $7, %xmm0, %eax punpcklqdq %xmm1, %xmm0 pshuflw $128, %xmm0, %xmm0 pinsrw $2, %eax, %xmm0 Old: _shuf2: pextrw $2, %xmm0, %eax pextrw $7, %xmm0, %ecx pinsrw $2, %ecx, %xmm0 pinsrw $3, %eax, %xmm0 movd %xmm1, %eax pinsrw $4, %eax, %xmm0 ret ========= New: _shuf4: punpcklqdq %xmm1, %xmm0 pshufb LCPI1_0, %xmm0 Old: _shuf4: pextrw $3, %xmm0, %eax movsd %xmm1, %xmm0 pextrw $3, %xmm1, %ecx pinsrw $4, %ecx, %xmm0 pinsrw $5, %eax, %xmm0 ======== New: _shuf1: pushl %ebx pushl %edi pushl %esi pextrw $1, %xmm0, %eax rolw $8, %ax movd %xmm0, %ecx rolw $8, %cx pextrw $5, %xmm0, %edx pextrw $4, %xmm0, %esi pextrw $3, %xmm0, %edi pextrw $2, %xmm0, %ebx movaps %xmm0, %xmm1 pinsrw $0, %ecx, %xmm1 pinsrw $1, %eax, %xmm1 rolw $8, %bx pinsrw $2, %ebx, %xmm1 rolw $8, %di pinsrw $3, %edi, %xmm1 rolw $8, %si pinsrw $4, %esi, %xmm1 rolw $8, %dx pinsrw $5, %edx, %xmm1 pextrw $7, %xmm0, %eax rolw $8, %ax movaps %xmm1, %xmm0 pinsrw $7, %eax, %xmm0 popl %esi popl %edi popl %ebx ret Old: _shuf1: subl $252, %esp movaps %xmm0, (%esp) movaps %xmm0, 16(%esp) movaps %xmm0, 32(%esp) movaps %xmm0, 48(%esp) movaps %xmm0, 64(%esp) movaps %xmm0, 80(%esp) movaps %xmm0, 96(%esp) movaps %xmm0, 224(%esp) movaps %xmm0, 208(%esp) movaps %xmm0, 192(%esp) movaps %xmm0, 176(%esp) movaps %xmm0, 160(%esp) movaps %xmm0, 144(%esp) movaps %xmm0, 128(%esp) movaps %xmm0, 112(%esp) movzbl 14(%esp), %eax movd %eax, %xmm1 movzbl 22(%esp), %eax movd %eax, %xmm2 punpcklbw %xmm1, %xmm2 movzbl 42(%esp), %eax movd %eax, %xmm1 movzbl 50(%esp), %eax movd %eax, %xmm3 punpcklbw %xmm1, %xmm3 punpcklbw %xmm2, %xmm3 movzbl 77(%esp), %eax movd %eax, %xmm1 movzbl 84(%esp), %eax movd %eax, %xmm2 punpcklbw %xmm1, %xmm2 movzbl 104(%esp), %eax movd %eax, %xmm1 punpcklbw %xmm1, %xmm0 punpcklbw %xmm2, %xmm0 movaps %xmm0, %xmm1 punpcklbw %xmm3, %xmm1 movzbl 127(%esp), %eax movd %eax, %xmm0 movzbl 135(%esp), %eax movd %eax, %xmm2 punpcklbw %xmm0, %xmm2 movzbl 155(%esp), %eax movd %eax, %xmm0 movzbl 163(%esp), %eax movd %eax, %xmm3 punpcklbw %xmm0, %xmm3 punpcklbw %xmm2, %xmm3 movzbl 188(%esp), %eax movd %eax, %xmm0 movzbl 197(%esp), %eax movd %eax, %xmm2 punpcklbw %xmm0, %xmm2 movzbl 217(%esp), %eax movd %eax, %xmm4 movzbl 225(%esp), %eax movd %eax, %xmm0 punpcklbw %xmm4, %xmm0 punpcklbw %xmm2, %xmm0 punpcklbw %xmm3, %xmm0 punpcklbw %xmm1, %xmm0 addl $252, %esp ret llvm-svn: 65311
*	Changed option name from inline-threshold to basic-inline-threshold because	Mon P Wang	2009-02-23	1	-1/+1
\| \| \| \| \| \|	inline-threshold option is used by the inliner. llvm-svn: 65309
*	fix some typos that Duncan noticed	Chris Lattner	2009-02-23	1	-3/+3
\| \| \| \|	llvm-svn: 65306
*	Propagate debug loc info through prologue/epilogue.	Bill Wendling	2009-02-23	7	-28/+39
\| \| \| \|	llvm-svn: 65298
*	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR	Scott Michel	2009-02-22	11	-316/+320
\| \| \| \| \| \| \| \| \|	instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296
*	Revert the part of 64623 that attempted to align the source in a	Dan Gohman	2009-02-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	memcpy to match the alignment of the destination. It isn't necessary for making loads and stores handled like the SSE loadu/storeu intrinsics, and it was causing a performance regression in MultiSource/Applications/JM/lencod. The problem appears to have been a memcpy that copies from some highly aligned array into an alloca; the alloca was then being assigned a large alignment, which required codegen to perform dynamic stack-pointer re-alignment, which forced the enclosing function to have a frame pointer, which led to increased spilling. llvm-svn: 65289
*	Properly parenthesize this expression, fixing a real bug in the new	Dan Gohman	2009-02-22	1	-1/+1
\| \| \| \| \| \|	-full-lsr code, as well as a GCC warning. llvm-svn: 65288
*	If a use operand is marked isKill, don't forget to add kill to its live ↵	Evan Cheng	2009-02-22	1	-4/+6
\| \| \| \| \| \|	interval as well. llvm-svn: 65279
*	Add a note.	Evan Cheng	2009-02-22	1	-0/+28
\| \| \| \|	llvm-svn: 65275
*	Be bug compatible with gcc by returning MMX values in RAX.	Evan Cheng	2009-02-22	2	-7/+13
\| \| \| \|	llvm-svn: 65274