bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	AMDGPU: Fix trying to skip from a block with no successors	Matt Arsenault	2016-07-15	1	-2/+3
\| \| \| \| \| \|	Found while reducing bug 28550 llvm-svn: 275509
*	AMDGPU: Follow up to r275203	Matt Arsenault	2016-07-12	1	-24/+27
\| \| \| \| \| \|	I meant to squash this into it. llvm-svn: 275220
*	AMDGPU: Fix verifier error with kill intrinsic	Matt Arsenault	2016-07-12	1	-65/+122
\| \| \| \| \| \| \|	Don't create a terminator in the middle of the block. We should probably get rid of this intrinsic. llvm-svn: 275203
*	Revert "AMDGPU: Remove unused control flow intrinsic"	Matt Arsenault	2016-07-09	1	-0/+19
\| \| \| \|	llvm-svn: 274978
*	AMDGPU: Improve offset folding for register indexing	Matt Arsenault	2016-07-09	1	-22/+40
\| \| \| \|	llvm-svn: 274954
*	AMDGPU: Remove unused control flow intrinsic	Matt Arsenault	2016-07-08	1	-19/+0
\| \| \| \|	llvm-svn: 274939
*	AMDGPU: Minor adjustment to r274817	Matt Arsenault	2016-07-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	The commit message is inaccurate, modifiesRegister will check for partial defs of exec. We currently don't ever emit partial defs of exec, so it doesn't really matter. llvm-svn: 274886
*	AMDGPU: Move si_mask_branch register operand to be a use	Matt Arsenault	2016-07-08	1	-4/+6
\| \| \| \|	llvm-svn: 274818
*	AMDGPU: Cleanup. Use definesRegister instead of manual loop	Matt Arsenault	2016-07-08	1	-6/+2
\| \| \| \| \| \| \|	Also this will be more precise since it will check exec_lo/exec_hi writes. llvm-svn: 274817
*	AMDGPU: Fix return of non-void-returning shaders	Nicolai Haehnle	2016-07-06	1	-6/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Since "AMDGPU: Fix verifier errors in SILowerControlFlow", the logic that ensures that a non-void-returning shader falls off the end of the last basic block was effectively disabled, since SI_RETURN is now used. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96731 Reviewers: arsenm, tstellarAMD Subscribers: arsenm, kzhuravl, llvm-commits Differential Revision: http://reviews.llvm.org/D21975 llvm-svn: 274612
*	AMDGPU: Add m0 vgpr load loop block as successor	Matt Arsenault	2016-06-30	1	-0/+1
\| \| \| \| \| \| \|	This shows up as a verifier error when I move this earlier, not sure why it didn't before. llvm-svn: 274275
*	AMDGPU: Fix out of bounds indirect indexing errors	Matt Arsenault	2016-06-28	1	-8/+19
\| \| \| \| \| \| \|	This was producing acceses to registers beyond the super register's limits, resulting in verifier failures. llvm-svn: 273977
*	AMDGPU: Fix verifier errors with undef vector indices	Matt Arsenault	2016-06-27	1	-27/+37
\| \| \| \| \| \|	Also fix pointlessly adding exec to liveins. llvm-svn: 273916
*	AMDGPU: Cleanup subtarget handling.	Matt Arsenault	2016-06-24	1	-3/+4
\| \| \| \| \| \| \| \| \|	Split AMDGPUSubtarget into amdgcn/r600 specific subclasses. This removes most of the static_casting of the basic codegen classes everywhere, and tries to restrict the features visible on the wrong target. llvm-svn: 273652
*	AMDGPU: Fix liveness when expanding m0 loop	Matt Arsenault	2016-06-22	1	-17/+60
\| \| \| \|	llvm-svn: 273514
*	AMDGPU: Fix verifier errors in SILowerControlFlow	Matt Arsenault	2016-06-22	1	-66/+127
\| \| \| \| \| \| \| \| \| \| \| \| \|	The main sin this was committing was using terminator instructions in the middle of the block, and then not updating the block successors / predecessors. Split the blocks up to avoid this and introduce new pseudo instructions for branches taken with exec masking. Also use a pseudo instead of emitting s_endpgm and erasing it in the special case of a non-void return. llvm-svn: 273467
*	AMDGPU: Also look for s_cbranch_vccz	Matt Arsenault	2016-05-19	1	-1/+2
\| \| \| \|	llvm-svn: 270091
*	AMDGPU: Fix crash with unreachable terminators.	Matt Arsenault	2016-04-29	1	-12/+27
\| \| \| \| \| \| \| \| \| \|	If a block has no successors because it ends in unreachable, this was accessing an invalid iterator. Also stop counting instructions that don't emit any real instructions. llvm-svn: 268119
*	AMDGPU: Add a shader calling convention	Nicolai Haehnle	2016-04-06	1	-6/+4
\| \| \| \| \| \| \| \| \| \| \|	This makes it possible to distinguish between mesa shaders and other kernels even in the presence of compute shaders. Patch By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Differential Revision: http://reviews.llvm.org/D18559 llvm-svn: 265589
*	AMDGPU: Add SIWholeQuadMode pass	Nicolai Haehnle	2016-03-21	1	-12/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Whole quad mode is already enabled for pixel shaders that compute derivatives, but it must be suspended for instructions that cause a shader to have side effects (i.e. stores and atomics). This pass addresses the issue by storing the real (initial) live mask in a register, masking EXEC before instructions that require exact execution and (re-)enabling WQM where required. This pass is run before register coalescing so that we can use machine SSA for analysis. The changes in this patch expose a problem with the second machine scheduling pass: target independent instructions like COPY implicitly use EXEC when they operate on VGPRs, but this fact is not encoded in the MIR. This can lead to miscompilation because instructions are moved past changes to EXEC. This patch fixes the problem by adding use-implicit operands to target independent instructions. Some general codegen passes are relaxed to work with such implicit use operands. Reviewers: arsenm, tstellarAMD, mareko Subscribers: MatzeB, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18162 llvm-svn: 263982
*	AMDGPU/SI: Fix threshold calculation for branching when exec is zero	Tom Stellard	2016-03-21	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When control flow is implemented using the exec mask, the compiler will insert branch instructions to skip over the masked section when exec is zero if the section contains more than a certain number of instructions. The previous code would only count instructions in successor blocks, and this patch modifies the code to start counting instructions in all blocks between the start and end of the branch. Reviewers: nhaehnle, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18282 llvm-svn: 263969
*	AMDGPU: add missing braces around multi-line if block	Nicolai Haehnle	2016-03-18	1	-1/+2
\| \| \| \| \| \|	This fixes an issue with rL263658 pointed out by Tom Stellard. llvm-svn: 263823
*	AMDGPU: Prevent uniform loops from becoming infinite	Nicolai Haehnle	2016-03-16	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Uniform loops where the branch leaving the loop is predicated on VCCNZ must be skipped if EXEC = 0, otherwise they will be infinite. Reviewers: tstellarAMD, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18137 llvm-svn: 263658
*	AMDGPU/SI: Incomplete shader binaries need to finish execution at the end	Marek Olsak	2016-03-14	1	-0/+24
\| \| \| \| \| \| \| \| \| \|	Reviewers: tstellarAMD, arsenm Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D18058 llvm-svn: 263441
*	AMDGPU: Set flat_scratch from flat_scratch_init reg	Matt Arsenault	2016-02-12	1	-35/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was hardcoded to the static private size, but this would be missing the offset and additional size for someday when we have dynamic sizing. Also stops always initializing flat_scratch even when unused. In the future we should stop emitting this unless flat instructions are used to access private memory. For example this will initialize it almost always on VI because flat is used for global access. llvm-svn: 260658
*	AMDGPU: Initialize SILowerControlFlow	Matt Arsenault	2016-02-12	1	-28/+36
\| \| \| \|	llvm-svn: 260645
*	AMDGPU: Remove trailing whitespace	Matt Arsenault	2016-02-12	1	-4/+4
\| \| \| \|	llvm-svn: 260644
*	AMDGPU: Fix adding redundant m0 uses	Matt Arsenault	2015-10-21	1	-2/+0
\| \| \| \| \| \|	BuildMI already adds these since they are defined correctly now. llvm-svn: 250961
*	AMDGPU: Add MachineInstr overloads for instruction format tests	Matt Arsenault	2015-10-20	1	-2/+2
\| \| \| \|	llvm-svn: 250797
*	AMDGPU: Use explicit register size indirect pseudos	Matt Arsenault	2015-10-07	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This stops using an unknown reg class operand. Currently build_vector selection has a broken looking check where it tries to use a VGPR reg class and an SGPR one if it sees an SGPR use. With the source operand has an explicit VGPR class, illegal copies will be inserted that SIFixSGPRCopies will take care of normally later, which will allow removing the weird check of build_vector users. Without this, when removed v_movrels_b32 would still be emitted even though all of the values were only stored in SGPRs. llvm-svn: 249494
*	AMDGPU: Fix recomputing dominator tree unnecessarily	Matt Arsenault	2015-09-25	1	-0/+4
\| \| \| \| \| \| \|	SIFixSGPRCopies does not modify the CFG, but this was being recomputed before running SIFoldOperands. llvm-svn: 248587
*	AMDGPU/SI: Remove VCCReg	Matt Arsenault	2015-08-08	1	-4/+4
\| \| \| \|	llvm-svn: 244380
*	AMDGPU/SI: Remove EXECReg	Matt Arsenault	2015-08-05	1	-8/+4
\| \| \| \| \| \|	For the same reasons as the other physical registers. llvm-svn: 244062
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+605
	llvm-svn: 239657