bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU: Expand register indexing pseudos in custom inserter	Matt Arsenault	2016-07-19	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is to help moveSILowerControlFlow to before regalloc. There are a couple of tradeoffs with this. The complete CFG is visible to more passes, the loop body avoids an extra copy of m0, vcc isn't required, and immediate offsets can be shrunk into s_movk_i32. The disadvantage is the register allocator doesn't understand that the single lane's vector is dead within the loop body, so an extra register is used to outlive the loop block when expanding the VGPR -> m0 loop. This also now results in worse waitcnt insertion before the loop instead of after for pending operations at the point of the indexing, but that should be fixed by future improvements to cross block waitcnt insertion. v_movreld_b32's operands are now modeled more correctly since vdst is not a true output. This is kind of a hack to treat vdst as a use operand. Extra checking is required in the verifier since I can't seem to get tablegen to emit an implicit operand for a virtual register. llvm-svn: 275934
*	AMDGPU: Fix trailing whitespace	Matt Arsenault	2016-06-10	1	-1/+1
\| \| \| \|	llvm-svn: 272364
*	[AMDGPU] Disassembler: Support for sdwa instructions	Sam Kolton	2016-06-09	1	-1/+5
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D21129 llvm-svn: 272255
*	Fix build warning introduced in r270552 "[AMDGPU][llvm-mc] Disassembler: ↵	Artem Tamazov	2016-05-26	1	-1/+2
\| \| \| \| \| \|	support for TTMP/TBA/TMA registers." llvm-svn: 270859
*	[AMDGPU][llvm-mc] Disassembler: support for TTMP/TBA/TMA registers.	Artem Tamazov	2016-05-24	1	-42/+82
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20476 llvm-svn: 270552
*	Fixed/Recommitted r267733 "[AMDGPU][llvm-mc] Add support of TTMP quads. ↵	Artem Tamazov	2016-04-29	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \|	Rework M0 exclusion for SMRD." Previously reverted by r267752. r267733 review: Differential Revision: http://reviews.llvm.org/D19342 llvm-svn: 268066
*	Revert "[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for ↵	Chad Rosier	2016-04-27	1	-6/+0
\| \| \| \| \| \| \| \|	SMRD." This reverts commit r267733 due to a -Werror,-Wunused-function error. llvm-svn: 267752
*	[AMDGPU][llvm-mc] Add support of TTMP quads. Rework M0 exclusion for SMRD.	Artem Tamazov	2016-04-27	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \|	Added support of TTMP quads. Reworked M0 exclusion machinery for SMRD and similar instructions to enable usage of TTMP registers in those instructions as destinations. Tests added. Differential Revision: http://reviews.llvm.org/D19342 llvm-svn: 267733
*	[AMDGPU] Disassembler: support for DPP	Sam Kolton	2016-03-31	1	-7/+19
\| \| \| \| \|	Review: http://reviews.llvm.org/D18642 llvm-svn: 265015
*	[AMDGPU] Fix SMEM instructions encoding/operand namings	Valery Pykhtin	2016-03-10	1	-0/+2
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D17651 llvm-svn: 263108
*	test commit	Valery Pykhtin	2016-03-04	1	-1/+1
\| \| \| \|	llvm-svn: 262709
*	[AMDGPU] Remove unused disassembler code.	Nikolay Haustov	2016-03-01	1	-2/+0
\| \| \| \|	llvm-svn: 262346
*	[AMDGPU] Fix build warnings.	Nikolay Haustov	2016-03-01	1	-2/+2
\| \| \| \|	llvm-svn: 262338
*	[AMDGPU] Disassembler code refactored + error messages.	Nikolay Haustov	2016-03-01	1	-346/+266
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Idea behind this change is to make code shorter and as much common for all targets as possible. Let's even accept more code than is valid for a particular target, leaving it for the assembler to sort out. 64bit instructions decoding added. Error\warning messages on unrecognized instructions operands added, InstPrinter allowed to print invalid operands helping to find invalid/unsupported code. The change is massive and hard to compare with previous version, so it makes sense just to take a look on the new version. As a bonus, with a few TD changes following, it disassembles the majority of instructions. Currently it fully disassembles >300K binary source of some blas kernel. Previous TODOs were saved whenever possible. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17720 llvm-svn: 262332
*	[AMDGPU] Disassembler: Support for all VOP1 instructions.	Nikolay Haustov	2016-02-25	1	-49/+206
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Support all instructions with VOP1 encoding with 32 or 64-bit operands for VI subtarget: VGPR_32 and VReg_64 operand register classes VS_32 and VS_64 operand register classes with inline and literal constants Tests for VOP1 instructions. Patch by: skolton Reviewers: arsenm, tstellarAMD Review: http://reviews.llvm.org/D17194 llvm-svn: 261878
*	[AMDGPU] Disassembler: Added basic disassembler for AMDGPU target	Tom Stellard	2016-02-18	1	-0/+302
	Changes: - Added disassembler project - Fixed all decoding conflicts in .td files - Added DecoderMethod=“NONE” option to Target.td that allows to disable decoder generation for an instruction. - Created decoding functions for VS_32 and VReg_32 register classes. - Added stubs for decoding all register classes. - Added several tests for disassembler Disassembler only supports: - VI subtarget - VOP1 instruction encoding - 32-bit register operands and inline constants [Valery] One of the point that requires to pay attention to is how decoder conflicts were resolved: - Groups of target instructions were separated by using different DecoderNamespace (SICI, VI, CI) using similar to AssemblerPredicate approach. - There were conflicts in IMAGE_<> instructions caused by two different reasons: 1. dmask wasn’t specified for the output (fixed) 2. There are image instructions that differ only by the number of the address components but have the same encoding by the HW spec. The actual number of address components is determined by the HW at runtime using image resource descriptor starting from the VGPR encoded in an IMAGE instruction. This means that we should choose only one instruction from conflicting group to be the rule for decoder. I didn’t find the way to disable decoder generation for an arbitrary instruction and therefore made a onelinear fix to tablegen generator that would suppress decoder generation when DecoderMethod is set to “NONE”. This is a change that should be reviewed and submitted first. Otherwise I would need to specify different DecoderNamespace for every instruction in the conflicting group. I haven’t checked yet if DecoderMethod=“NONE” is not used in other targets. 3. IMAGE_GATHER decoder generation is for now disabled and to be done later. [/Valery] Patch By: Sam Kolton Differential Revision: http://reviews.llvm.org/D16723 llvm-svn: 261185