bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU: Create emergency stack slots during frame lowering	Matt Arsenault	2015-11-06	1	-0/+1
\| \| \| \| \| \|	Test has a bogus verifier error which will be fixed by later commits. llvm-svn: 252327
*	AMDGPU: Remove unused scratch resource operands	Matt Arsenault	2015-11-06	1	-72/+129
\| \| \| \| \| \|	The SGPR spill pseudos don't actually use them. llvm-svn: 252324
*	AMDGPU: Fix hardcoded alignment of spill.	Matt Arsenault	2015-11-06	1	-2/+1
\| \| \| \| \| \| \|	Instead of forcing 4 alignment when spilled, set register class alignments. llvm-svn: 252322
*	AMDGPU: Also track whether SGPRs were spilled	Matt Arsenault	2015-11-05	1	-0/+2
\| \| \| \|	llvm-svn: 252145
*	AMDGPU: Fix assert when legalizing atomic operands	Matt Arsenault	2015-11-05	1	-15/+51
\| \| \| \| \| \| \| \| \| \|	The operand layout is slightly different for the atomic opcodes from the usual MUBUF loads and stores. This should only fix it on SI/CI. VI is still broken because it still emits the addr64 replacement. llvm-svn: 252140
*	AMDGPU: Make findUsedSGPR more readable	Matt Arsenault	2015-11-03	1	-7/+18
\| \| \| \| \| \|	Add more comments etc. llvm-svn: 251996
*	AMDGPU: Simplify VOP3 operand legalization.	Matt Arsenault	2015-10-21	1	-41/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was checking for a variety of situations that should never happen. This saves a tiny bit of compile time. We should not be selecting instructions with invalid operands in the first place. Most of the time for registers copys are inserted to the correct operand register class. For VOP3, since all operand types are supported and literal constants never are, we just need to verify the constant bus requirements (all immediates should be legal inline ones). The only possibly tricky case to maybe worry about is if when legalizing operands in moveToVALU with s_add_i32 and similar instructions. If the original s_add_i32 had a literal constant and we need to replace it with v_add_i32_e64 we would have an unsupported literal operand. However, I don't think we should worry about that because SIFoldOperands should handle folding literal constant operands into the SALU instructions based on the uses. At SIFoldOperands time, the legality and profitability of operand types is a bit different. llvm-svn: 250951
*	AMDGPU: Fix not checking implicit operands in verifyInstruction	Matt Arsenault	2015-10-21	1	-15/+29
\| \| \| \| \| \| \|	When verifying constant bus restrictions, this wasn't catching uses in implicit operands. llvm-svn: 250948
*	AMDGPU: Add MachineInstr overloads for instruction format tests	Matt Arsenault	2015-10-20	1	-30/+26
\| \| \| \|	llvm-svn: 250797
*	AMDGPU: Use explicit register size indirect pseudos	Matt Arsenault	2015-10-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This stops using an unknown reg class operand. Currently build_vector selection has a broken looking check where it tries to use a VGPR reg class and an SGPR one if it sees an SGPR use. With the source operand has an explicit VGPR class, illegal copies will be inserted that SIFixSGPRCopies will take care of normally later, which will allow removing the weird check of build_vector users. Without this, when removed v_movrels_b32 would still be emitted even though all of the values were only stored in SGPRs. llvm-svn: 249494
*	AMDGPU/SI: Add verifier check for exec reads	Matt Arsenault	2015-10-02	1	-0/+10
\| \| \| \| \| \| \|	Make sure we aren't accidentally not setting these in the instruction definitions. llvm-svn: 249170
*	AMDGPU/SI: Don't set DATA_FORMAT if ADD_TID_ENABLE is set	Marek Olsak	2015-09-29	1	-0/+13
\| \| \| \| \| \| \| \| \| \|	to prevent setting a huge stride, because DATA_FORMAT has a different meaning if ADD_TID_ENABLE is set. This is a candidate for stable llvm 3.7. Tested-and-Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 248858
*	AMDGPU: Factor switch into separate function	Matt Arsenault	2015-09-28	1	-21/+27
\| \| \| \|	llvm-svn: 248742
*	AMDGPU: Fix splitting x16 SMRD loads	Matt Arsenault	2015-09-28	1	-2/+2
\| \| \| \| \| \| \| \|	When used recursively, this would set the kill flag on the intermediate step from first splitting x16 to x8. llvm-svn: 248741
*	AMDGPU: Fix moving SMRD loads with literal offsets on CI	Matt Arsenault	2015-09-28	1	-3/+9
\| \| \| \|	llvm-svn: 248740
*	AMDGPU: Fix splitting SMRD with large offset	Matt Arsenault	2015-09-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	The splitting of > 4 dword SMRD instructions if using an offset in an SGPR instead of an immediate was not setting the destination register, resulting an an instruction missing an operand which would assert later. Test will be included in a following commit which fixes a related issue. llvm-svn: 248739
*	Improved the interface of methods commuting operands, improved X86-FMA3 ↵	Andrew Kaylor	2015-09-28	1	-16/+37
\| \| \| \| \| \| \| \| \| \|	mem-folding&coalescing. Patch by Slava Klochkov (vyacheslav.n.klochkov@intel.com) Differential Revision: http://reviews.llvm.org/D11370 llvm-svn: 248735
*	AMDGPU: Construct new buffer instruction when moving SMRD	Matt Arsenault	2015-09-25	1	-30/+37
\| \| \| \| \| \| \| \| \|	It's easier to understand creating a full instruction than the current situation where sometimes a new instruction is created and sometimes it is awkwardly mutated in place. llvm-svn: 248627
*	AMDGPU: Re-justify workaround and fix worked around problem	Matt Arsenault	2015-09-25	1	-18/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When buffer resource descriptors were built, the upper two components of the descriptor were first composed into a 64-bit register because legalizeOperands assumed all operands had the same register class. Fix that problem, but keep the workaround. I'm not sure anything actually is actually emitting such a REG_SEQUENCE now. If multiple resource descriptors are set up with different base pointers, this is copied with a single s_mov_b64. We probably should fix this better by recognizing a pair of s_mov_b32 later, but for now delete the dead code. llvm-svn: 248585
*	AMDGPU: Don't create REG_SEQUENCE with SGPR dest and VGPR sources	Matt Arsenault	2015-09-25	1	-6/+15
\| \| \| \| \| \|	This avoids needting to re-legalize the new REG_SEQUENCE. llvm-svn: 248584
*	AMDGPU: Return after instruction is processed.	Matt Arsenault	2015-09-24	1	-0/+4
\| \| \| \|	llvm-svn: 248476
*	AMDGPU: Remove another unnecessary check from commuteInstruction	Matt Arsenault	2015-09-24	1	-5/+0
\| \| \| \|	llvm-svn: 248475
*	AMDGPU: Reduce number of copies emitted	Matt Arsenault	2015-09-24	1	-5/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of always inserting a copy in case the super register is itself a subregister, only extract to the super reg class if this is actually the case. This shouldn't really change codegen, but makes looking at the output of SIFixSGPRCopies easier to read. llvm-svn: 248467
*	AMDGPU: Remove unnecessary check	Matt Arsenault	2015-09-22	1	-4/+0
\| \| \| \| \| \| \|	If the instruction doesn't have enough operands, it either shouldn't be marked as isCommutable or is malformed. llvm-svn: 248242
*	AMDGPU/SI: Fix more cases of losing exec operands	Matt Arsenault	2015-09-10	1	-3/+1
\| \| \| \|	llvm-svn: 247230
*	AMDGPU: Extract full 64-bit subregister and use subregs	Matt Arsenault	2015-09-09	1	-35/+29
\| \| \| \| \| \| \| \| \| \| \| \|	Instead of extracting both 32-bit components from the 128-bit register. This produces fewer copies and is easier for the copy peephole optimizer to understand and see the actual uses as extracts from a reg_sequence. This avoids needing to handle subregister composing in the PeepholeOptimizer's ValueTracker for this case. llvm-svn: 247162
*	AMDGPU: Fix adding redundant implicit operands	Matt Arsenault	2015-09-01	1	-11/+7
\| \| \| \| \| \| \|	These are already added during the MachineInstr construction, so this was adding the implicit registers twice. llvm-svn: 246525
*	AMDGPU: Set mem operands for spill instructions	Matt Arsenault	2015-08-29	1	-13/+30
\| \| \| \|	llvm-svn: 246357
*	AMDGPU: Fix dropping mem operands when moving to VALU	Matt Arsenault	2015-08-29	1	-11/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	Without a memory operand, mayLoad or mayStore instructions are treated as hasUnorderedMemRef, which results in much worse scheduling. We really should have a verifier check that any non-side effecting mayLoad or mayStore has a memory operand. There are a few instructions (interp and images) which I'm not sure what / where to add these. llvm-svn: 246356
*	AMDGPU: Delete dead code	Matt Arsenault	2015-08-26	1	-54/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There is no context where s_mov_b64 is emitted and could potentially be moved to the VALU. It is currently only emitted for materializing immediates, which can't be dependent on vector sources. The immediate splitting is already done when selecting constants. I'm not sure what contexts if any the register splitting would have been used before. Also clean up using s_mov_b64 in place of v_mov_b64_pseudo, although this isn't required and just skips the extra step of eliminating the copy from the SReg_64. llvm-svn: 246080
*	AMDGPU: Don't reprocess instructions when splitting i64 bcnt	Matt Arsenault	2015-08-26	1	-4/+5
\| \| \| \|	llvm-svn: 246079
*	AMDGPU: Fix not moving users of s_bfe_i64 to VALU	Matt Arsenault	2015-08-26	1	-0/+2
\| \| \| \| \| \| \|	This wouldn't propagate to users of the original BFE and would hit a verifier error. llvm-svn: 246078
*	AMDGPU: Don't create intermediate SALU instructions	Matt Arsenault	2015-08-26	1	-27/+40
\| \| \| \| \| \| \| \| \| \| \| \|	When splitting 64-bit operations, create the correct VALU instructions immediately. This was splitting things like s_or_b64 into the two s_or_b32s and then pushing the new instructions onto the worklist. There's no reason we need to do this intermediate step. llvm-svn: 246077
*	Fix some comment typos.	Benjamin Kramer	2015-08-08	1	-1/+1
\| \| \| \|	llvm-svn: 244402
*	AMDGPU/SI: Remove VCCReg	Matt Arsenault	2015-08-08	1	-1/+1
\| \| \| \|	llvm-svn: 244380
*	AMDGPU/SI: Remove EXECReg	Matt Arsenault	2015-08-05	1	-5/+0
\| \| \| \| \| \|	For the same reasons as the other physical registers. llvm-svn: 244062
*	AMDGPU/SI: Add implicit register operands in the correct order.	Alex Lorenz	2015-07-31	1	-19/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit fixes a bug in the class 'SIInstrInfo' where the implicit register machine operands were added to a machine instruction in an incorrect order - the implicit uses were added before the implicit defs. I found this bug while working on moving the implicit register operand verification code from the MIR parser to the machine verifier. This commit also makes the method 'addImplicitDefUseOperands' in the machine instruction class public so that it can be reused in the 'SIInstrInfo' class. Reviewers: Matt Arsenault Differential Revision: http://reviews.llvm.org/D11689 llvm-svn: 243799
*	AMDGPU/SI: Simplify moveSMRDToVALU()	Tom Stellard	2015-07-30	1	-11/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Replace the switch on instruction opcode with a switch on register size. This way we don't need to update the switch statement when we add new SMRD variants. Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11601 llvm-svn: 243652
*	AMDGPU/SI: Remove isTriviallyReMaterializable() function from SIInstrInfo	Tom Stellard	2015-07-30	1	-12/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This function is never called. isReallyTriviallyReMaterializable() is the function that should be implemented instead. Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11620 llvm-svn: 243651
*	AMDGPU/SI: Fix read2 merging into a super register.	Matt Arsenault	2015-07-14	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the read2 produced was supposed to be writing into a super register, it would use the wrong subregister indices. Fix this by inserting copies, so we only ever write to a vreg_64. Run the register coalescer again to clean this up, although this isn't ideal and often does result in an extra move. Also remove the assert that offset1 > offset0. There isn't a real reason to not allow this other than a minor convenience in the compiler, and it doesn't seem worth the effort of avoiding it. llvm-svn: 242174
*	AMDGPU/SI: Select mad patterns to v_mac_f32	Tom Stellard	2015-07-13	1	-5/+51
\| \| \| \| \| \| \| \| \|	The two-address instruction pass will convert these back to v_mad_f32 if necessary. Differential Revision: http://reviews.llvm.org/D11060 llvm-svn: 242038
*	AMDGPU/SI: Fix crash on physical registers in SIInstrInfo::isOperandLegal()	Tom Stellard	2015-07-09	1	-1/+4
\| \| \| \| \| \| \|	No test case for this. I ran into it while working on some improvements to SIShrinkInstructions.cpp. llvm-svn: 241816
*	AMDPGU/SI: Use correct resource descriptors for VI on HSA	Tom Stellard	2015-06-26	1	-1/+6
\| \| \| \| \| \| \| \| \| \|	Summary: We need to set MTYPE = 2 for VI shaders when targeting the HSA runtime. Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D10777 llvm-svn: 240841
*	AMDGPU: really don't commute REV opcodes if the target variant doesn't exist	Marek Olsak	2015-06-26	1	-8/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If pseudoToMCOpcode failed, we would return the original opcode, so operands would be swapped, but the instruction would remain the same. It resulted in LSHLREV a, b ---> LSHLREV b, a. This fixes Glamor text rendering and piglit/arb_sample_shading-builtin-gl-sample-mask on VI. This is a candidate for stable branches. v2: the test was simplified by Tom Stellard llvm-svn: 240824
*	Fix "the the" in comments.	Eric Christopher	2015-06-19	1	-1/+1
\| \| \| \|	llvm-svn: 240112
*	[TargetInstrInfo] Rename getLdStBaseRegImmOfs and implement for x86.	Sanjoy Das	2015-06-15	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: TargetInstrInfo::getLdStBaseRegImmOfs to TargetInstrInfo::getMemOpBaseRegImmOfs and implement for x86. The implementation only handles a few easy cases now and will be made more sophisticated in the future. This is NFCI: the only user of `getLdStBaseRegImmOfs` (now `getmemOpBaseRegImmOfs`) is `LoadClusterMotion` and `LoadClusterMotion` is disabled for x86. Reviewers: reames, ab, MatzeB, atrick Reviewed By: MatzeB, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10199 llvm-svn: 239741
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+2723
\| \| \| \|	llvm-svn: 239657
*	Revert "AMDGPU: Add core backend files for R600/SI codegen v6"	Tom Stellard	2012-07-16	1	-103/+0
\| \| \| \| \| \|	This reverts commit 4ea70107c5e51230e9e60f0bf58a0f74aa4885ea. llvm-svn: 160303
*	AMDGPU: Add core backend files for R600/SI codegen v6	Tom Stellard	2012-07-16	1	-0/+103
	llvm-svn: 160270