bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	AMDGPU: Fix passes depending on dominator tree for no reason	Matt Arsenault	2016-02-11	1	-8/+2
	llvm-svn: 260494
*	AMDGPU/SI: Fix a bug in SIFoldOperands	Marek Olsak	2016-01-13	1	-0/+11
	Summary: ret.ll will contain a test for this
	Reviewers: tstellarAMD, arsenm
	Subscribers: arsenm
	Differential Revision: http://reviews.llvm.org/D16029
	llvm-svn: 257590
*	AMDGPU/SI: Fold operands with sub-registers	Nicolai Haehnle	2016-01-07	1	-4/+1
	Summary:
	Multi-dword constant loads generated unnecessary moves from SGPRs into VGPRs, increasing the code size and VGPR pressure. These moves are now folded away.
	Note that this lack of operand folding was not a problem for VMEM loads, because COPY nodes from VReg_Nnn to VGPR32 are eliminated by the register coalescer.
	Some tests are updated, note that the fsub.ll test explicitly checks that the move is elided.
	With the IR generated by current Mesa, the changes are obviously relatively minor:
	7063 shaders in 3531 tests
	Totals:
	SGPRS: 351872 -> 352560 (0.20 %)
	VGPRS: 199984 -> 200732 (0.37 %)
	Code Size: 9876968 -> 9881112 (0.04 %) bytes
	LDS: 91 -> 91 (0.00 %) blocks
	Scratch: 1779712 -> 1767424 (-0.69 %) bytes per wave
	Wait states: 295164 -> 295337 (0.06 %)
	Totals from affected shaders:
	SGPRS: 65784 -> 66472 (1.05 %)
	VGPRS: 38064 -> 38812 (1.97 %)
	Code Size: 1993828 -> 1997972 (0.21 %) bytes
	LDS: 42 -> 42 (0.00 %) blocks
	Scratch: 795648 -> 783360 (-1.54 %) bytes per wave
	Wait states: 54026 -> 54199 (0.32 %)
	Reviewers: tstellarAMD, arsenm, mareko
	Subscribers: arsenm, llvm-commits
	Differential Revision: http://reviews.llvm.org/D15875
	llvm-svn: 257074
*	AMDGPU: Fix verifier error in SIFoldOperands	Matt Arsenault	2015-10-21	1	-1/+4
	There may be other use operands that also need their kill flags cleared.
	This happens in a few tests when SIFoldOperands is moved after PeepholeOptimizer. PeepholeOptimizer rewrites cases that look like:
	  %vreg0 = ...
	  %vreg1 = COPY %vreg0
	  use %vreg1<kill>
	  %vreg2 = COPY %vreg0
	  use %vreg2<kill>
	to use the earlier source to
	  %vreg0 = ...
	  use %vreg0
	  use %vreg0
	Currently SIFoldOperands sees the copied registers, so there is only one use. So far I haven't managed to come up with a test that currently has multiple uses of a foldable VGPR -> VGPR copy.
	llvm-svn: 250960
*	Improved the interface of methods commuting operands, improved X86-FMA3 mem-folding&coalescing.	Andrew Kaylor	2015-09-28	1	-3/+12
	Patch by Slava Klochkov (vyacheslav.n.klochkov@intel.com)
	Differential Revision: http://reviews.llvm.org/D11370
	llvm-svn: 248735
*	AMDGPU: Fix recomputing dominator tree unnecessarily	Matt Arsenault	2015-09-25	1	-0/+1
	SIFixSGPRCopies does not modify the CFG, but this was being recomputed before running SIFoldOperands.
	llvm-svn: 248587
*	AMDGPU/SI: Fix creating v_mov_b32s without exec uses	Matt Arsenault	2015-09-10	1	-2/+14
	This will be caught by existing tests with a verifier check to be added in a future commit.
	llvm-svn: 247229
*	AMDGPU/SI: Fold operands through REG_SEQUENCE instructions	Tom Stellard	2015-09-09	1	-0/+21
	Summary: This helps mostly when we use add instructions for address calculations that contain immediates.
	Reviewers: arsenm
	Subscribers: arsenm, llvm-commits
	Differential Revision: http://reviews.llvm.org/D12256
	llvm-svn: 247157
*	AMDGPU/SI: Fix some invalid assumptions when folding 64-bit immediates	Tom Stellard	2015-08-29	1	-1/+5
	Summary: We were assuming that if the use operand had a sub-register that the immediate was 64-bits, but this was breaking the case of folding a 64-bit immediate into another 64-bit instruction.
	Reviewers: arsenm
	Subscribers: arsenm, llvm-commits
	Differential Revision: http://reviews.llvm.org/D12255
	llvm-svn: 246354
*	AMDGPU/SI: Factor operand folding code into its own function	Tom Stellard	2015-08-28	1	-67/+79
	Reviewers: arsenm
	Subscribers: arsenm, llvm-commits
	Differential Revision: http://reviews.llvm.org/D12254
	llvm-svn: 246353
*	AMDGPU/SI: Select mad patterns to v_mac_f32	Tom Stellard	2015-07-13	1	-0/+31
	The two-address instruction pass will convert these back to v_mad_f32 if necessary.
	Differential Revision: http://reviews.llvm.org/D11060
	llvm-svn: 242038
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+288
	llvm-svn: 239657