Commit message | Author | Age | Files | Lines
consistent with the way it's generally done in other places.
llvm-svn: 60439
- Incorporate Tilmann Scheller's ISD::TRUNCATE custom lowering patch
- Update SPU calling convention info, even if it's not used yet (but it may be
at some point)
- Ensure that any-extended f32 loads are custom lowered, especially when
they're promoted for use in printf.
llvm-svn: 60438
llvm-svn: 60434
splitting.
llvm-svn: 60433
llvm-svn: 60432
llvm-svn: 60431
llvm-svn: 60429
llvm-svn: 60409
straightforward implementation. This does not require any extra
alias analysis queries beyond what we already do for non-local loads.
Some programs really really like load PRE. For example, SPASS triggers
this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc.
The biggest limitation to the implementation is that it does not split
critical edges. This is a huge killer on many programs and should be
addressed after the initial patch is enabled by default.
The implementation of this should incidentally speed up rejection of
non-local loads because it avoids creating the repl densemap in cases
when it won't be used for fully redundant loads.
This is currently disabled by default.
Before I turn this on, I need to fix a couple of miscompilations in
the testsuite, look at compile time performance numbers, and look at
perf impact. This is pretty close to ready though.
llvm-svn: 60408
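At the source level, the load PRE transformation above can be sketched like this (a minimal illustration with made-up function names, not code from the patch):

```cpp
// Before PRE: the load of *p at the merge point is available on the taken
// path (it was already loaded there) but not on the fall-through path, so
// it is only partially redundant and plain GVN cannot remove it.
int before_pre(const int *p, bool cond) {
    int v = 0;
    if (cond)
        v = *p;       // load #1
    return v + *p;    // load #2: redundant whenever cond is true
}

// After PRE: a load is inserted on the path where the value was missing,
// making it fully available at the merge point, so the second load goes away.
int after_pre(const int *p, bool cond) {
    int v = 0;
    int avail;
    if (cond) {
        avail = *p;   // original load
        v = avail;
    } else {
        avail = *p;   // load inserted by PRE
    }
    return v + avail; // no load here any more
}
```

Inserting the extra load on the `else` path is exactly the step that requires splitting critical edges in the general case, which is why the commit calls that limitation out.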
llvm-svn: 60407
llvm-svn: 60406
llvm-svn: 60405
llvm-svn: 60404
llvm-svn: 60403
llvm-svn: 60402
llvm-svn: 60401
constant. If X is a constant, then this is folded elsewhere.
- Added a note to Target/README.txt to indicate that we'd like to implement
this when we're able.
llvm-svn: 60399
llvm-svn: 60398
- No need to do a swap on a canonicalized pattern.
No functionality change.
llvm-svn: 60397
llvm-svn: 60395
a new value numbering set after splitting a critical edge. This increases
the number of instances of PRE on 403.gcc from ~60 to ~570.
llvm-svn: 60393
llvm-svn: 60392
llvm-svn: 60391
- LowerXADDO lowers [SU]ADDO into an ADD with an implicit EFLAGS define. The
EFLAGS value is fed into a SETCC node carrying the condition code COND_O or
COND_C, depending on the type of ADDO requested.
- LowerBRCOND now recognizes when its condition comes from a SETCC node with
COND_O or COND_C set.
llvm-svn: 60388
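The [SU]ADDO semantics correspond to the add-with-overflow checks exposed by the GCC/Clang overflow builtins (which postdate this commit); a sketch of the two flavors:

```cpp
// ISD::SADDO computes a sum plus a signed-overflow flag; per this patch it
// lowers on x86 to an ADD defining EFLAGS followed by a SETCC on COND_O.
bool sadd_overflows(int a, int b, int *sum) {
    return __builtin_sadd_overflow(a, b, sum);
}

// ISD::UADDO is the unsigned variant: the flag is the carry bit, so the
// SETCC uses COND_C instead.
bool uadd_carries(unsigned a, unsigned b, unsigned *sum) {
    return __builtin_uadd_overflow(a, b, sum);
}
```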
llvm-svn: 60385
llvm-svn: 60383
- Add support for seto, setno, setc, and setnc instructions.
llvm-svn: 60382
types.
llvm-svn: 60381
figuring out the base of the IV. This produces better
code in the example. (Addresses use (IV) instead of
(BASE,IV) - a significant improvement on low-register
machines like x86).
llvm-svn: 60374
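The addressing-mode difference can be sketched in C++ (hypothetical functions, not from the patch); the second loop is the strength-reduced shape that needs only the induction register:

```cpp
// Indexed form: each access is base[i], i.e. an x86 address of the form
// (BASE,IV) that keeps both a base register and an index register live
// inside the loop.
int sum_indexed(const int *base, int n) {
    int s = 0;
    for (int i = 0; i < n; ++i)
        s += base[i];
    return s;
}

// Strength-reduced form: the pointer itself is the induction variable, so
// each address is just (IV) and only one register is live in the loop.
int sum_strength_reduced(const int *base, int n) {
    int s = 0;
    for (const int *p = base, *e = base + n; p != e; ++p)
        s += *p;
    return s;
}
```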
llvm-svn: 60370
llvm-svn: 60369
integer is "minint".
llvm-svn: 60366
- Fix v2[if]64 vector insertion code before IBM files a bug report.
- Ensure that zero (0) offsets relative to $sp don't trip an assert
(add $sp, 0 gets legalized to $sp alone, tripping an assert)
- Shuffle masks passed to SPUISD::SHUFB are now v16i8 or v4i32
llvm-svn: 60358
MERGE_VALUES node with only one operand, so get
rid of special code that only existed to handle
that possibility.
llvm-svn: 60349
ReplaceNodeResults: rather than returning a node which
must have the same number of results as the original
node (which means mucking around with MERGE_VALUES,
and which is also easy to get wrong since SelectionDAG
folding may mean you don't get the node you expect),
return the results in a vector.
llvm-svn: 60348
don't have overlapping bits.
llvm-svn: 60344
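The underlying identity is easy to check: when two operands share no set bits, the addition produces no carries, so it equals the bitwise or (a small illustration, not LLVM code):

```cpp
#include <cstdint>

// If a & b == 0, no bit position receives two 1s, so adding cannot carry
// and a + b == (a | b) == (a ^ b). Knowing the operands' known bits don't
// overlap lets the combiner treat the add as an or.
bool bits_disjoint(uint32_t a, uint32_t b) { return (a & b) == 0; }
```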
llvm-svn: 60343
llvm-svn: 60341
fiddling with constants unless we have to.
llvm-svn: 60340
instead of throughout it.
llvm-svn: 60339
that it isn't reallocated all the time. This is a tiny speedup for
GVN: 3.90->3.88s
llvm-svn: 60338
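The same reuse trick in miniature (an illustrative sketch, not GVN's actual code): hoist a temporary container out of the loop and clear() it each iteration so its allocation is kept.

```cpp
#include <vector>

// The vector is constructed once outside the loop; clear() empties it but
// keeps its allocated capacity, so iterations after the first allocate
// nothing. Declaring it inside the loop would reallocate every time.
int count_evens_per_row(const std::vector<std::vector<int>> &rows) {
    int total = 0;
    std::vector<int> evens;            // hoisted out of the loop
    for (const auto &row : rows) {
        evens.clear();                 // reuse, don't reallocate
        for (int v : row)
            if (v % 2 == 0)
                evens.push_back(v);
        total += static_cast<int>(evens.size());
    }
    return total;
}
```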
llvm-svn: 60337
instead of std::sort. This shrinks the release-asserts LSR.o file
by 1100 bytes of code on my system.
We should start using array_pod_sort where possible.
llvm-svn: 60335
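array_pod_sort's size win comes from funneling every POD sort through libc's single, non-templated qsort instead of instantiating a fresh std::sort per element type; a minimal sketch of the idea (not LLVM's implementation):

```cpp
#include <cstdlib>

// One comparator per element type, but only one copy of the sorting code
// (libc's qsort) ends up in the binary, no matter how many types are sorted.
static int compare_ints(const void *a, const void *b) {
    int lhs = *static_cast<const int *>(a);
    int rhs = *static_cast<const int *>(b);
    return (lhs > rhs) - (lhs < rhs);
}

void pod_sort(int *begin, int *end) {
    qsort(begin, static_cast<size_t>(end - begin), sizeof(int), compare_ints);
}
```

The trade-off is that qsort calls the comparator through a function pointer, so this is usually a little slower than an inlined std::sort, which is why it is reserved for POD element types where code size matters more.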
This is a lot cheaper and conceptually simpler.
llvm-svn: 60332
DeadInsts ivar, just use it directly.
llvm-svn: 60330
buggy rewrite, this notifies ScalarEvolution of a pending instruction
about to be removed and then erases it, instead of erasing it then
notifying.
llvm-svn: 60329
xor in testcase (or is a substring).
llvm-svn: 60328
new instructions it simplifies. Because we're threading jumps on edges
with constants coming in from PHIs, we are inherently exposing a lot more
constants to the new block. Folding them and deleting dead conditions
allows the cost model in jump threading to be more accurate as it iterates.
llvm-svn: 60327
prevents the passmgr from adding yet-another domtree invocation
for Verifier if there is already one live.
llvm-svn: 60326
instead of using FoldPHIArgBinOpIntoPHI. In addition to being more
obvious, this also fixes a problem where instcombine wouldn't merge two
phis that had different variable indices. This prevented instcombine
from factoring big chunks of code in 403.gcc. For example:
insn_cuid.exit:
- %tmp336 = load i32** @uid_cuid, align 4
- %tmp337 = getelementptr %struct.rtx_def* %insn_addr.0.ph.i, i32 0, i32 3
- %tmp338 = bitcast [1 x %struct.rtunion]* %tmp337 to i32*
- %tmp339 = load i32* %tmp338, align 4
- %tmp340 = getelementptr i32* %tmp336, i32 %tmp339
br label %bb62
bb61:
- %tmp341 = load i32** @uid_cuid, align 4
- %tmp342 = getelementptr %struct.rtx_def* %insn, i32 0, i32 3
- %tmp343 = bitcast [1 x %struct.rtunion]* %tmp342 to i32*
- %tmp344 = load i32* %tmp343, align 4
- %tmp345 = getelementptr i32* %tmp341, i32 %tmp344
br label %bb62
bb62:
- %iftmp.62.0.in = phi i32* [ %tmp345, %bb61 ], [ %tmp340, %insn_cuid.exit ]
+ %insn.pn2 = phi %struct.rtx_def* [ %insn, %bb61 ], [ %insn_addr.0.ph.i, %insn_cuid.exit ]
+ %tmp344.pn.in.in = getelementptr %struct.rtx_def* %insn.pn2, i32 0, i32 3
+ %tmp344.pn.in = bitcast [1 x %struct.rtunion]* %tmp344.pn.in.in to i32*
+ %tmp341.pn = load i32** @uid_cuid
+ %tmp344.pn = load i32* %tmp344.pn.in
+ %iftmp.62.0.in = getelementptr i32* %tmp341.pn, i32 %tmp344.pn
%iftmp.62.0 = load i32* %iftmp.62.0.in
llvm-svn: 60325