bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	AMDGPU: Don't fold undef uses or copies with implicit uses	Matt Arsenault	2016-10-06	1	-4/+22
	llvm-svn: 283476
*	AMDGPU: Remove leftover implicit operands when folding immediates	Matt Arsenault	2016-10-06	1	-7/+26
	When constant folding an operation to a copy or an immediate mov, the implicit uses/defs of the old instruction were left behind, e.g. replacing v_or_b32 left the implicit exec use on the new copy. llvm-svn: 283471
*	Use StringRef in Pass/PassManager APIs (NFC)	Mehdi Amini	2016-10-01	1	-3/+1
	llvm-svn: 283004
*	AMDGPU: Support folding FrameIndex operands	Matt Arsenault	2016-09-14	1	-9/+26
	This avoids test regressions in a future commit. llvm-svn: 281491
*	AMDGPU: Improve splitting 64-bit bit ops by constants	Matt Arsenault	2016-09-14	1	-0/+126
	This addresses a TODO to handle operations besides and. This also starts eliminating no-op operations with a constant that can emerge later. llvm-svn: 281488
*	AMDGPU: Don't fold subregister extracts into tied operands	Matt Arsenault	2016-08-15	1	-3/+15
	llvm-svn: 278676
*	CodeGen: Use MachineInstr& in TargetInstrInfo, NFC	Duncan P. N. Exon Smith	2016-06-30	1	-4/+4
	This is mostly a mechanical change to make TargetInstrInfo API take MachineInstr& (instead of MachineInstr* or MachineBasicBlock::iterator) when the argument is expected to be a valid MachineInstr. This is a general API improvement.
	Although it would be possible to do this one function at a time, that would demand a quadratic amount of churn since many of these functions call each other. Instead I've done everything as a block and just updated what was necessary.
	This is mostly mechanical fixes: adding and removing `*` and `&` operators. The only non-mechanical change is to split ARMBaseInstrInfo::getOperandLatencyImpl out from ARMBaseInstrInfo::getOperandLatency. Previously, the latter took a `MachineInstr*` which it updated to the instruction bundle leader; now, the latter calls the former either with the same `MachineInstr&` or the bundle leader.
	As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753.
	Note: I updated WebAssembly, Lanai, and AVR (despite being off-by-default) since it turned out to be easy. I couldn't run tests for AVR since llc doesn't link with it turned on.
	llvm-svn: 274189
*	AMDGPU: Cleanup subtarget handling.	Matt Arsenault	2016-06-24	1	-4/+3
	Split AMDGPUSubtarget into amdgcn/r600 specific subclasses. This removes most of the static_casting of the basic codegen classes everywhere, and tries to restrict the features visible on the wrong target. llvm-svn: 273652
*	Add optimization bisect opt-in calls for AMDGPU passes	Andrew Kaylor	2016-04-25	1	-0/+3
	Differential Revision: http://reviews.llvm.org/D19450 llvm-svn: 267485
*	AMDGPU: Fix passes depending on dominator tree for no reason	Matt Arsenault	2016-02-11	1	-8/+2
	llvm-svn: 260494
*	AMDGPU/SI: Fix a bug in SIFoldOperands	Marek Olsak	2016-01-13	1	-0/+11
	Summary: ret.ll will contain a test for this. Reviewers: tstellarAMD, arsenm Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D16029 llvm-svn: 257590
*	AMDGPU/SI: Fold operands with sub-registers	Nicolai Haehnle	2016-01-07	1	-4/+1
	Summary: Multi-dword constant loads generated unnecessary moves from SGPRs into VGPRs, increasing the code size and VGPR pressure. These moves are now folded away. Note that this lack of operand folding was not a problem for VMEM loads, because COPY nodes from VReg_Nnn to VGPR32 are eliminated by the register coalescer.
	Some tests are updated; note that the fsub.ll test explicitly checks that the move is elided.
	With the IR generated by current Mesa, the changes are obviously relatively minor:
	7063 shaders in 3531 tests
	Totals:
		SGPRS: 351872 -> 352560 (0.20 %)
		VGPRS: 199984 -> 200732 (0.37 %)
		Code Size: 9876968 -> 9881112 (0.04 %) bytes
		LDS: 91 -> 91 (0.00 %) blocks
		Scratch: 1779712 -> 1767424 (-0.69 %) bytes per wave
		Wait states: 295164 -> 295337 (0.06 %)
	Totals from affected shaders:
		SGPRS: 65784 -> 66472 (1.05 %)
		VGPRS: 38064 -> 38812 (1.97 %)
		Code Size: 1993828 -> 1997972 (0.21 %) bytes
		LDS: 42 -> 42 (0.00 %) blocks
		Scratch: 795648 -> 783360 (-1.54 %) bytes per wave
		Wait states: 54026 -> 54199 (0.32 %)
	Reviewers: tstellarAMD, arsenm, mareko Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15875 llvm-svn: 257074
*	AMDGPU: Fix verifier error in SIFoldOperands	Matt Arsenault	2015-10-21	1	-1/+4
	There may be other use operands that also need their kill flags cleared. This happens in a few tests when SIFoldOperands is moved after PeepholeOptimizer.
	PeepholeOptimizer rewrites cases that look like:
		%vreg0 = ...
		%vreg1 = COPY %vreg0
		use %vreg1<kill>
		%vreg2 = COPY %vreg0
		use %vreg2<kill>
	to use the earlier source to
		%vreg0 = ...
		use %vreg0
		use %vreg0
	Currently SIFoldOperands sees the copied registers, so there is only one use. So far I haven't managed to come up with a test that currently has multiple uses of a foldable VGPR -> VGPR copy.
	llvm-svn: 250960
*	Improved the interface of methods commuting operands, improved X86-FMA3 mem-folding&coalescing.	Andrew Kaylor	2015-09-28	1	-3/+12
	Patch by Slava Klochkov (vyacheslav.n.klochkov@intel.com) Differential Revision: http://reviews.llvm.org/D11370 llvm-svn: 248735
*	AMDGPU: Fix recomputing dominator tree unnecessarily	Matt Arsenault	2015-09-25	1	-0/+1
	SIFixSGPRCopies does not modify the CFG, but this was being recomputed before running SIFoldOperands. llvm-svn: 248587
*	AMDGPU/SI: Fix creating v_mov_b32s without exec uses	Matt Arsenault	2015-09-10	1	-2/+14
	This will be caught by existing tests with a verifier check to be added in a future commit. llvm-svn: 247229
*	AMDGPU/SI: Fold operands through REG_SEQUENCE instructions	Tom Stellard	2015-09-09	1	-0/+21
	Summary: This helps mostly when we use add instructions for address calculations that contain immediates. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12256 llvm-svn: 247157
*	AMDGPU/SI: Fix some invalid assumptions when folding 64-bit immediates	Tom Stellard	2015-08-29	1	-1/+5
	Summary: We were assuming that if the use operand had a sub-register, the immediate was 64 bits, but this was breaking the case of folding a 64-bit immediate into another 64-bit instruction. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12255 llvm-svn: 246354
*	AMDGPU/SI: Factor operand folding code into its own function	Tom Stellard	2015-08-28	1	-67/+79
	Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12254 llvm-svn: 246353
*	AMDGPU/SI: Select mad patterns to v_mac_f32	Tom Stellard	2015-07-13	1	-0/+31
	The two-address instruction pass will convert these back to v_mad_f32 if necessary. Differential Revision: http://reviews.llvm.org/D11060 llvm-svn: 242038
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+288
	llvm-svn: 239657