bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Reduce the size of MCRelaxableFragment.	Akira Hatanaka	2015-11-14	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MCRelaxableFragment previously kept a copy of MCSubtargetInfo and MCInst to enable re-encoding the MCInst later during relaxation. A copy of MCSubtargetInfo (instead of a reference or pointer) was needed because the feature bits could be modified by the parser. This commit replaces the MCSubtargetInfo copy in MCRelaxableFragment with a constant reference to MCSubtargetInfo. The copies of MCSubtargetInfo are kept in MCContext, and the target parsers are now responsible for asking MCContext to provide a copy whenever the feature bits of MCSubtargetInfo have to be toggled. With this patch, I saw a 4% reduction in peak memory usage when I compiled verify-uselistorder.lto.bc using llc. rdar://problem/21736951 Differential Revision: http://reviews.llvm.org/D14346 llvm-svn: 253127
*	[MCTargetAsmParser] Move the member varialbes that reference	Akira Hatanaka	2015-11-14	1	-9/+7
\| \| \| \| \| \| \| \| \| \|	MCSubtargetInfo in the subclasses into MCTargetAsmParser and define a member function getSTI. This is done in preparation for making changes to shrink the size of MCRelaxableFragment. (see http://reviews.llvm.org/D14346). llvm-svn: 253124
*	AMDGPU: Add stony support	Tom Stellard	2015-11-13	1	-0/+4
\| \| \| \| \| \|	Patch by: Alex Deucher llvm-svn: 253053
*	Revert "Remove unnecessary call to getAllocatableRegClass"	Tom Stellard	2015-11-12	3	-6/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r252565. This also includes the revert of the commit mentioned below in order to avoid breaking tests in AMDGPU: Revert "AMDGPU: Set isAllocatable = 0 on VS_32/VS_64" This reverts commit r252674. llvm-svn: 252956
*	AMDGPU: Print more fields in comments	Matt Arsenault	2015-11-11	1	-3/+14
\| \| \| \|	llvm-svn: 252677
*	AMDGPU: Remove dead code	Matt Arsenault	2015-11-11	1	-33/+2
\| \| \| \|	llvm-svn: 252675
*	AMDGPU: Set isAllocatable = 0 on VS_32/VS_64	Matt Arsenault	2015-11-11	3	-16/+6
\| \| \| \|	llvm-svn: 252674
*	AMDGPU/SI: Refactor VOP[12C] tablegen definitions	Tom Stellard	2015-11-06	2	-97/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Pass the VOPProfile object all the through to *_m multiclasses. This will allow us to do more simplifications in the future. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13437 llvm-svn: 252339
*	AMDGPU: Cleanup includes	Matt Arsenault	2015-11-06	2	-6/+4
\| \| \| \|	llvm-svn: 252328
*	AMDGPU: Create emergency stack slots during frame lowering	Matt Arsenault	2015-11-06	7	-14/+89
\| \| \| \| \| \|	Test has a bogus verifier error which will be fixed by later commits. llvm-svn: 252327
*	AMDGPU: Remove unused scratch resource operands	Matt Arsenault	2015-11-06	2	-75/+131
\| \| \| \| \| \|	The SGPR spill pseudos don't actually use them. llvm-svn: 252324
*	AMDGPU: Add pass to detect used kernel features	Matt Arsenault	2015-11-06	4	-0/+138
\| \| \| \| \| \| \| \| \| \| \|	Mark kernels that use certain features that require user SGPRs to support with kernel attributes. We need to know before instruction selection begins because it impacts the kernel calling convention lowering. For now this only detects the workitem intrinsics. llvm-svn: 252323
*	AMDGPU: Fix hardcoded alignment of spill.	Matt Arsenault	2015-11-06	2	-13/+12
\| \| \| \| \| \| \|	Instead of forcing 4 alignment when spilled, set register class alignments. llvm-svn: 252322
*	AMDGPU: Hack for VS_32 register pressure	Matt Arsenault	2015-11-06	2	-4/+17
\| \| \| \| \| \| \| \| \| \| \| \| \|	For some reason VS_32 ends up factoring into the pressure heuristics even though we should never see a virtual register with this class. When SGPRs are reserved for register spilling, this for some reason triggers reg-crit scheduling. Setting isAllocatable = 0 may help with this since that seems to remove it from the default implementation's generated table. llvm-svn: 252321
*	AMDGPU/SI: Emit HSA kernels with symbol type STT_AMDGPU_HSA_KERNEL	Tom Stellard	2015-11-06	6	-0/+60
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13804 llvm-svn: 252291
*	AMDGPU: Also track whether SGPRs were spilled	Matt Arsenault	2015-11-05	3	-2/+20
\| \| \| \|	llvm-svn: 252145
*	AMDGPU: Print number user SGPRs	Matt Arsenault	2015-11-05	1	-0/+6
\| \| \| \| \| \| \|	This doesn't quite match how SC prints it, which doesn't put it in a comment. llvm-svn: 252144
*	AMDGPU: Disallow s[102:103] on VI in assembler	Matt Arsenault	2015-11-05	1	-2/+28
\| \| \| \|	llvm-svn: 252142
*	AMDGPU: Fix assert when legalizing atomic operands	Matt Arsenault	2015-11-05	3	-15/+59
\| \| \| \| \| \| \| \| \| \|	The operand layout is slightly different for the atomic opcodes from the usual MUBUF loads and stores. This should only fix it on SI/CI. VI is still broken because it still emits the addr64 replacement. llvm-svn: 252140
*	AMDGPU: Make addr64 atomic operand order consistent	Matt Arsenault	2015-11-05	1	-2/+2
\| \| \| \| \| \| \|	vaddr comes before srsrc in every other MUBUF instruction, and is the order it is printed. llvm-svn: 252139
*	AMDGPU: Fix typo	Matt Arsenault	2015-11-05	1	-2/+2
\| \| \| \|	llvm-svn: 252116
*	AMDGPU: Make flat_scratch name consistent	Matt Arsenault	2015-11-03	1	-3/+3
\| \| \| \| \| \| \|	The printed name and the parsed assembler names weren't the same. I'm not sure which name SC prints these as, but I think it's this one. llvm-svn: 252010
*	AMDGPU: Fix asserts on invalid register ranges	Matt Arsenault	2015-11-03	1	-5/+13
\| \| \| \| \| \| \| \| \|	If the requested SGPR was not actually aligned, it was accepted and rounded down instead of rejected. Also fix an assert if the range is an invalid size. llvm-svn: 252009
*	AMDGPU: Fix off by one error in register parsing	Matt Arsenault	2015-11-03	1	-4/+5
\| \| \| \| \| \|	If trying to use one past the end, this would assert. llvm-svn: 252008
*	AMDGPU: s[102:103] is unavailable on VI	Matt Arsenault	2015-11-03	1	-1/+10
\| \| \| \|	llvm-svn: 252000
*	AMDGPU: Define correct number of SGPRs	Matt Arsenault	2015-11-03	2	-6/+10
\| \| \| \| \| \| \| \| \|	There are actually 104 so 2 were missing. More assembler tests with high register number tuples will be included in later patches. llvm-svn: 251999
*	AMDGPU: Make findUsedSGPR more readable	Matt Arsenault	2015-11-03	1	-7/+18
\| \| \| \| \| \|	Add more comments etc. llvm-svn: 251996
*	AMDGPU: Initialize SIFixSGPRCopies so -print-after works	Matt Arsenault	2015-11-03	3	-8/+15
\| \| \| \|	llvm-svn: 251995
*	AMDGPU: Alphabetize includes	Matt Arsenault	2015-11-03	1	-1/+1
\| \| \| \|	llvm-svn: 251994
*	ScheduleDAGInstrs: Remove IsPostRA flag; NFC	Matthias Braun	2015-11-03	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ScheduleDAGInstrs doesn't behave differently before or after register allocation. It was only used in a method of MachineSchedulerBase which behaved differently in MachineScheduler/PostMachineScheduler. Change this to let MachineScheduler/PostMachineScheduler just pass in a parameter to that function. The order of the LiveIntervals* and bool RemoveKillFlags paramters have been switched to make out-of-tree code fail instead of unintentionally passing a value intended for the IsPostRA flag to the (previously following and default initialized) RemoveKillFlags. Differential Revision: http://reviews.llvm.org/D14245 llvm-svn: 251883
*	AMDGPU: Stop assuming vreg for build_vector	Matt Arsenault	2015-11-02	2	-20/+40
\| \| \| \| \| \| \| \| \| \| \| \| \|	This was causing a variety of test failures when v2i64 is added as a legal type. SIFixSGPRCopies should correctly handle the case of vector inputs to a scalar reg_sequence, so this isn't necessary anymore. This was hiding some deficiencies in how reg_sequence is handled later, but this shouldn't be a problem anymore since the register class copy of a reg_sequence is now done before the reg_sequence. llvm-svn: 251860
*	AMDGPU: Error on graphics shaders with HSA	Matt Arsenault	2015-11-02	1	-0/+8
\| \| \| \| \| \| \| \|	I've found myself pointlessly debugging problems from running graphics tests with an HSA triple a few times, so stop this from happening again. llvm-svn: 251858
*	AMDGPU: Distribute SGPR->VGPR copies of REG_SEQUENCE	Matt Arsenault	2015-11-02	1	-23/+89
\| \| \| \| \| \| \|	Make the REG_SEQUENCE be a VGPR, and do the register class copy first. llvm-svn: 251855
*	AMDGPU/SI: handle undef for llvm.SI.packf16	Marek Olsak	2015-10-29	1	-0/+4
\| \| \| \|	llvm-svn: 251632
*	AMDGPU/SI: use S_OR for fneg (fabs f32)	Marek Olsak	2015-10-29	1	-2/+1
\| \| \| \|	llvm-svn: 251631
*	AMDGPU/SI: use S_AND for i1 trunc	Marek Olsak	2015-10-29	1	-2/+2
\| \| \| \|	llvm-svn: 251630
*	AMDGPU: Print modifiers when dumping AMDGPUOperand	Matt Arsenault	2015-10-24	1	-1/+1
\| \| \| \|	llvm-svn: 251160
*	AMDGPU: Fix parsing of 32-bit literals with sign bit set	Matt Arsenault	2015-10-23	2	-5/+8
\| \| \| \|	llvm-svn: 251132
*	AMDGPU: Fix adding redundant m0 uses	Matt Arsenault	2015-10-21	1	-2/+0
\| \| \| \| \| \|	BuildMI already adds these since they are defined correctly now. llvm-svn: 250961
*	AMDGPU: Fix verifier error in SIFoldOperands	Matt Arsenault	2015-10-21	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There may be other use operands that also need their kill flags cleared. This happens in a few tests when SIFoldOperands is moved after PeepholeOptimizer. PeepholeOptimizer rewrites cases that look like: %vreg0 = ... %vreg1 = COPY %vreg0 use %vreg1<kill> %vreg2 = COPY %vreg0 use %vreg2<kill> to use the earlier source to %vreg0 = ... use %vreg0 use %vreg0 Currently SIFoldOperands sees the copied registers, so there is only one use. So far I haven't managed to come up with a test that currently has multiple uses of a foldable VGPR -> VGPR copy. llvm-svn: 250960
*	AMDGPU: Split DiagnosticInfoUnsupported into its own file	Matt Arsenault	2015-10-21	4	-41/+76
\| \| \| \|	llvm-svn: 250959
*	AMDGPU: Simplify VOP3 operand legalization.	Matt Arsenault	2015-10-21	3	-42/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was checking for a variety of situations that should never happen. This saves a tiny bit of compile time. We should not be selecting instructions with invalid operands in the first place. Most of the time for registers copys are inserted to the correct operand register class. For VOP3, since all operand types are supported and literal constants never are, we just need to verify the constant bus requirements (all immediates should be legal inline ones). The only possibly tricky case to maybe worry about is if when legalizing operands in moveToVALU with s_add_i32 and similar instructions. If the original s_add_i32 had a literal constant and we need to replace it with v_add_i32_e64 we would have an unsupported literal operand. However, I don't think we should worry about that because SIFoldOperands should handle folding literal constant operands into the SALU instructions based on the uses. At SIFoldOperands time, the legality and profitability of operand types is a bit different. llvm-svn: 250951
*	AMDGPU: Fix not checking implicit operands in verifyInstruction	Matt Arsenault	2015-10-21	1	-15/+29
\| \| \| \| \| \| \|	When verifying constant bus restrictions, this wasn't catching uses in implicit operands. llvm-svn: 250948
*	AMDGPU: Add MachineInstr overloads for instruction format tests	Matt Arsenault	2015-10-20	7	-40/+111
\| \| \| \|	llvm-svn: 250797
*	AMDGPU: Stop reserving v[254:255]	Matt Arsenault	2015-10-20	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \|	This wasn't doing anything useful. They weren't explicitly used anywhere, and the RegScavenger ignores reserved registers. This for some reason caused a random scheduling change in the test. Getting the check lines to pass is too frustrating, and there's probably not too much value in checking the vector case's operands N times. llvm-svn: 250794
*	Make a bunch of static arrays const.	Craig Topper	2015-10-18	2	-2/+2
\| \| \| \|	llvm-svn: 250642
*	Don't pretend AMDGPU backend knows how to custom-lower UDIVREM for vector ↵	Artyom Skrobov	2015-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	types; it can't Reviewers: arsenm, jvesely, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13734 llvm-svn: 250384
*	AMDGPU: Remove implicit ilist iterator conversions, NFC	Duncan P. N. Exon Smith	2015-10-13	9	-18/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	One of the changes in lib/Target/AMDGPU/AMDGPUMCInstLower.cpp was a new one. Previously, bundle iterators and single-instruction iterators could be compared to each other (comparing on underlying pointers). I changed a comparison from using `MBB->end()` to using `MBB->instr_end()`, since both end iterators should point at the some place anyway. I don't think the implicit conversion between the two iterator types is a good idea since it's fairly easy to accidentally compare to the wrong thing (they aren't always end iterators). Otherwise I would have just added the conversion. Even with that, no there should be functionality change here. llvm-svn: 250218
*	AMDGPU: Refactor isVGPRToSGPRCopy	Matt Arsenault	2015-10-13	1	-19/+48
\| \| \| \| \| \| \|	It should now correctly handle physical registers and make it easier to identify the other direction. llvm-svn: 250132
*	DAGCombiner: Combine extract_vector_elt from build_vector	Matt Arsenault	2015-10-12	2	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This basic combine was surprisingly missing. AMDGPU legalizes many operations in terms of 32-bit vector components, so not doing this results in many extra copies and subregister extracts that need to be cleaned up later. InstCombine already does this for the hasOneUse case. The target hook is to fix a handful of tests which break (e.g. ARM/vmov.ll) which turn from a vector materialize repeated immediate instruction to a constant vector load with more scalar copies from it. llvm-svn: 250129