bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Reapply commit 158073 with a fix (the testcase was already committed). The	Duncan Sands	2012-06-08	1	-123/+120
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	problem was that by moving instructions around inside the function, the pass could accidentally move the iterator being used to advance over the function too. Fix this by only processing the instruction equal to the iterator, and leaving processing of instructions that might not be equal to the iterator to later (later = after traversing the basic block; it could also wait until after traversing the entire function, but this might make the sets quite big). Original commit message: Grab-bag of reassociate tweaks. Unify handling of dead instructions and instructions to reoptimize. Exploit this to more systematically eliminate dead instructions (this isn't very useful in practice but is convenient for analysing some testcase I am working on). No need for WeakVH any more: use an AssertingVH instead. llvm-svn: 158226
*	Remove the TODO statement in the PPC README re: CTR loops	Hal Finkel	2012-06-08	1	-1/+0
\| \| \| \| \| \| \|	As Chris points out, this can now be removed! TODO: check if the associated section on viterbi's inner loop can also be removed. llvm-svn: 158224
*	Enable PPC CTR loop formation by default.	Hal Finkel	2012-06-08	2	-11/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Thanks to Jakob's help, this now causes no new test suite failures! Over the entire test suite, this gives an average 1% speedup. The largest speedups are: SingleSource/Benchmarks/Misc/pi - 108% SingleSource/Benchmarks/CoyoteBench/lpbench - 54% MultiSource/Benchmarks/Prolangs-C/unix-smail/unix-smail - 50% SingleSource/Benchmarks/Shootout/ary3 - 32% SingleSource/Benchmarks/Shootout-C++/matrix - 30% The largest slowdowns are: MultiSource/Benchmarks/mediabench/gsm/toast/toast - -30% MultiSource/Benchmarks/Prolangs-C/bison/mybison - -25% MultiSource/Benchmarks/BitBench/uuencode/uuencode - -22% MultiSource/Applications/d/make_dparser - -14% SingleSource/Benchmarks/Shootout-C++/ary - -13% In light of these slowdowns, additional profiling work is obviously needed! llvm-svn: 158223
*	Mark the PPC CTRRC and CTRRC8 register classes as non-allocatable.	Hal Finkel	2012-06-08	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \|	Marking these classes as non-alocatable allows CTR loop generation to work correctly with the block placement passes, etc. These register classes are currently used only by some unused TCRETURN patterns. In future cleanup, these will be removed. Thanks again to Jakob for suggesting this fix to the CTR loop problem! llvm-svn: 158221
*	Enable optimization for integer ABS on X86 if Subtarget has CMOV.	Manman Ren	2012-06-08	1	-3/+5
\| \| \| \|	llvm-svn: 158220
*	Fix a crash in APInt::lshr when shiftAmt > BitWidth.	Chad Rosier	2012-06-08	1	-1/+1
\| \| \| \| \| \|	Patch by James Benton <jbenton@vmware.com>. llvm-svn: 158213
*	Fix Target->Codegen dependence.	Andrew Trick	2012-06-08	2	-195/+205
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bulk move of TargetInstrInfo implementation into TargetInstrInfoImpl. This is dirty because the code isn't part of TargetInstrInfoImpl class, nor should it be, because the methods are not target hooks. However, it's the current mechanism for keeping libTarget useful outside the backend. You'll get a not-so-nice link error if you invoke a TargetInstrInfo method that depends on CodeGen. The TargetInstrInfoImpl class should probably be removed since it doesn't really solve this problem. To really fix this, we probably need separate interfaces for the CodeGen/nonCodeGen sides of TargetInstrInfo. llvm-svn: 158212
*	BoundsChecking: add support for ConstantPointerNull. fixes a bunch of ↵	Nuno Lopes	2012-06-08	1	-6/+7
\| \| \| \| \| \|	instrumentation failures in loops with reallocs llvm-svn: 158210
*	Disable the PPC CTR-Loops pass by default.	Hal Finkel	2012-06-08	2	-4/+17
\| \| \| \| \| \| \| \| \| \|	The pass itself works well, but the something in the Machine* infrastructure does not understand terminators which define registers. Without the ability to use the block-placement pass, etc. this causes performance regressions (and so is turned off by default). Turning off the analysis turns off the problems with the Machine* infrastructure. llvm-svn: 158206
*	Fix a bug in the new PPC CTR-Loops pass.	Hal Finkel	2012-06-08	1	-0/+1
\| \| \| \| \| \| \| \| \|	The code which tests for an induction operation cannot assume that any ADDI instruction will have a register operand because the operand could also be a frame index; for example: %vreg16<def> = ADDI8 <fi#0>, 0; G8RC:%vreg16 llvm-svn: 158205
*	Add the PPCCTRLoops pass: a PPC machine-code-level optimization pass to form ↵	Hal Finkel	2012-06-08	9	-18/+812
\| \| \| \| \| \| \| \| \| \|	CTR-based loop branching code. This pass is derived from the Hexagon HardwareLoops pass. The only significant enhancement over the Hexagon pass is that PPCCTRLoops will also attempt to delete the replaced add and compare operations if they are no longer otherwise used. Also, invalid preheader DebugLoc is not used. llvm-svn: 158204
*	Revert commit 158073 while waiting for a fix. The issue is that reassociate	Duncan Sands	2012-06-08	1	-111/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	can move instructions within the instruction list. If the instruction just happens to be the one the basic block iterator is pointing to, and it is moved to a different basic block, then we get into an infinite loop due to the iterator running off the end of the basic block (for some reason this doesn't fire any assertions). Original commit message: Grab-bag of reassociate tweaks. Unify handling of dead instructions and instructions to reoptimize. Exploit this to more systematically eliminate dead instructions (this isn't very useful in practice but is convenient for analysing some testcase I am working on). No need for WeakVH any more: use an AssertingVH instead. llvm-svn: 158199
*	X86: optimize generated code for integer ABS	Manman Ren	2012-06-07	1	-2/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch will generate the following for integer ABS: movl %edi, %eax negl %eax cmovll %edi, %eax INSTEAD OF movl %edi, %ecx sarl $31, %ecx leal (%rdi,%rcx), %eax xorl %ecx, %eax There exists a target-independent DAG combine for integer ABS, which converts integer ABS to sar+add+xor. For X86, we match this pattern back to neg+cmov. This is implemented in PerformXorCombine. rdar://10695237 llvm-svn: 158175
*	Do not optimize the used bits of the x86 vselect condition operand, when the ↵	Nadav Rotem	2012-06-07	1	-4/+6
\| \| \| \| \| \| \| \|	condition operand is a vector of 1-bit predicates. This may happen on MIC devices. llvm-svn: 158168
*	Fix a bug in FoldSelectOpOp. Bitcast ops may change the number of vector ↵	Nadav Rotem	2012-06-07	1	-0/+6
\| \| \| \| \| \|	elements, which may disagree with the select condition type. llvm-svn: 158166
*	Continue factoring computeOperandLatency. Use it for ARM hasHighOperandLatency.	Andrew Trick	2012-06-07	2	-24/+67
\| \| \| \|	llvm-svn: 158164
*	ARM getOperandLatency rewrite.	Andrew Trick	2012-06-07	1	-85/+112
\| \| \| \| \| \|	Match expectations of the new latency API. Cleanup and make the logic consistent. llvm-svn: 158163
*	ARM getOperandLatency should return -1 for unknown, consistent with API	Andrew Trick	2012-06-07	1	-1/+4
\| \| \| \|	llvm-svn: 158162
*	Fix ARM getInstrLatency logic to work with the current API.	Andrew Trick	2012-06-07	1	-13/+19
\| \| \| \|	llvm-svn: 158161
*	PR13046: we can't replace usage of SUB with CMP in the lowering phase.	Manman Ren	2012-06-07	1	-1/+2
\| \| \| \| \| \|	It will cause assertion failure later on. llvm-svn: 158160
*	Use a base register instead of an index register with the local dynamic model.	Rafael Espindola	2012-06-07	1	-0/+8
\| \| \| \| \| \|	Fixes pr13048. llvm-svn: 158158
*	Move terminator machine verification to check ↵	Pete Cooper	2012-06-07	1	-11/+11
\| \| \| \| \| \|	MachineBasicBlock::instr_iterator instead of MBB::iterator llvm-svn: 158154
*	X86: replace SUB with CMP if possible	Manman Ren	2012-06-07	1	-1/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch will optimize the following movq %rdi, %rax subq %rsi, %rax cmovsq %rsi, %rdi movq %rdi, %rax to cmpq %rsi, %rdi cmovsq %rsi, %rdi movq %rdi, %rax Perform this optimization if the actual result of SUB is not used. rdar: 11540023 llvm-svn: 158126
*	Revert r157755.	Manman Ren	2012-06-06	3	-42/+0
\| \| \| \| \| \| \| \|	The commit is intended to fix rdar://11540023. It is implemented as part of peephole optimization. We can actually implement this in the SelectionDAG lowering phase. llvm-svn: 158122
*	Properly verify liveness with bundled machine instructions.	Jakob Stoklund Olesen	2012-06-06	1	-13/+34
\| \| \| \| \| \| \| \|	Bundles should be treated as one atomic transaction when checking liveness. That is how the register allocator (and VLIW targets) treats bundles. llvm-svn: 158116
*	Add accessors for all private members of DisasmContext.	Benjamin Kramer	2012-06-06	1	-0/+8
\| \| \| \| \| \|	LLVM should be -Wunused-private-field clean now. llvm-svn: 158103
*	Move RegisterClassInfo.h.	Andrew Trick	2012-06-06	11	-143/+11
\| \| \| \| \| \|	Allow targets to access this API. It's required for RegisterPressure. llvm-svn: 158102
*	Move RegisterPressure.h.	Andrew Trick	2012-06-06	4	-258/+3
\| \| \| \| \| \|	Make it a general utility for use by Targets. llvm-svn: 158097
*	Round 2 of dead private variable removal.	Benjamin Kramer	2012-06-06	14	-43/+14
\| \| \| \| \| \| \| \|	LLVM is now -Wunused-private-field clean except for - lib/MC/MCDisassembler/Disassembler.h. Not sure why it keeps all those unaccessible fields. - gtest. llvm-svn: 158096
*	Remove unused private fields found by clang's new -Wunused-private-field.	Benjamin Kramer	2012-06-06	17	-29/+13
\| \| \| \| \| \| \| \|	There are some that I didn't remove this round because they looked like obvious stubs. There are dead variables in gtest too, they should be fixed upstream. llvm-svn: 158090
*	Add support for dynamic stack realignment in the presence of dynamic allocas on	Chad Rosier	2012-06-06	3	-14/+93
\| \| \| \| \| \| \|	X86. rdar://11496434 llvm-svn: 158087
*	Fix combine of uno && ord -> false so that the ordering of the fcmps doesn't	Chad Rosier	2012-06-06	1	-1/+3
\| \| \| \| \| \| \|	matter. rdar://11579835 llvm-svn: 158084
*	Remove dead debug option -disable-rematerialization.	Jakob Stoklund Olesen	2012-06-06	1	-4/+0
\| \| \| \| \| \| \|	Remat has been stable for years, and it isn't done by LiveIntervalAnalysis any longer. (See LiveRangeEdit). llvm-svn: 158079
*	Grab-bag of reassociate tweaks. Unify handling of dead instructions and	Duncan Sands	2012-06-06	1	-123/+111
\| \| \| \| \| \| \| \| \|	instructions to reoptimize. Exploit this to more systematically eliminate dead instructions (this isn't very useful in practice but is convenient for analysing some testcase I am working on). No need for WeakVH any more: use an AssertingVH instead. llvm-svn: 158073
*	Stop leaking RegScavengers from TailDuplication.	Benjamin Kramer	2012-06-06	1	-3/+4
\| \| \| \|	llvm-svn: 158069
*	Correct decoder for T1 conditional B encoding	Richard Barton	2012-06-06	1	-2/+2
\| \| \| \|	llvm-svn: 158055
*	Mark several instructions SSE2 instead of SSE3 as they should be.	Craig Topper	2012-06-06	2	-9/+11
\| \| \| \|	llvm-svn: 158049
*	Move LiveUnionArray into LiveIntervalUnion.h	Jakob Stoklund Olesen	2012-06-05	4	-47/+54
\| \| \| \| \| \|	It is useful outside RegAllocBase. llvm-svn: 158041
*	Don't print register names in LiveIntervalUnion::print().	Jakob Stoklund Olesen	2012-06-05	3	-5/+2
\| \| \| \| \| \| \| \|	Soon we'll be making LiveIntervalUnions for register units as well. This was the only place using the RepReg member, so just remove it. llvm-svn: 158038
*	Suppress -Wunused-variable in -Asserts build	Matt Beaumont-Gay	2012-06-05	1	-0/+1
\| \| \| \|	llvm-svn: 158037
*	Simplify LiveInterval::print().	Jakob Stoklund Olesen	2012-06-05	4	-48/+19
\| \| \| \| \| \| \| \| \| \|	Don't print out the register number and spill weight, making the TRI argument unnecessary. This allows callers to interpret the reg field. It can currently be a virtual register, a physical register, a spill slot, or a register unit. llvm-svn: 158031
*	Add experimental support for register unit liveness.	Jakob Stoklund Olesen	2012-06-05	1	-0/+130
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of computing a live interval per physreg, LiveIntervals can compute live intervals per register unit. This makes impossible the confusing situation where aliasing registers could have overlapping live intervals. It should also make fixed interferernce checking cheaper since registers have fewer register units than aliases. Live intervals for regunits are computed on demand, using MRI use-def chains and the new LiveRangeCalc class. Only regunits live in to ABI blocks are precomputed during LiveIntervals::runOnMachineFunction(). The regunit liveness computations don't depend on LiveVariables. llvm-svn: 158029
*	Implement LiveRangeCalc::extendToUses() and createDeadDefs().	Jakob Stoklund Olesen	2012-06-05	3	-2/+103
\| \| \| \| \| \| \|	These LiveRangeCalc methods are to be used when computing a live range from scratch. llvm-svn: 158027
*	MachineInstr::eraseFromParent fix for removing bundled instrs.	Andrew Trick	2012-06-05	1	-1/+2
\| \| \| \| \| \|	Patch by Ivan Llopard. llvm-svn: 158025
*	misched: API for minimum vs. expected latency.	Andrew Trick	2012-06-05	7	-118/+226
\| \| \| \| \| \| \|	Minimum latency determines per-cycle scheduling groups. Expected latency determines critical path and cost. llvm-svn: 158021
*	Add a new intrinsic: llvm.fmuladd. This intrinsic represents a multiply-add	Lang Hames	2012-06-05	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \|	expression (a * b + c) that can be implemented as a fused multiply-add (fma) if the target determines that this will be more efficient. This intrinsic will be used to implement FP_CONTRACT support and an aggressive FMA formation mode. If your target has a fast FMA instruction you should override the isFMAFasterThanMulAndAdd method in TargetLowering to return true. llvm-svn: 158014
*	Fix header file include order in NVPTX backend NV_CONTRIB	Yuan Lin	2012-06-05	1	-2/+2
\| \| \| \|	llvm-svn: 158013
*	LoopUnroll: always check for NULL LoopPassManager	Andrew Trick	2012-06-05	1	-3/+5
\| \| \| \|	llvm-svn: 158007
*	PPC32 uses R2 as the TLS register. Fix the copy and paste.	Roman Divacky	2012-06-05	1	-3/+3
\| \| \| \|	llvm-svn: 158004
*	X86 itinerary properties.	Andrew Trick	2012-06-05	2	-2/+29
\| \| \| \|	llvm-svn: 157981