bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	R600/SI: Spill VGPRs to scratch space for compute shaders	Tom Stellard	2015-01-14	11	-96/+353
\| \| \| \|	llvm-svn: 225988
*	[cleanup] Re-sort all the #include lines in LLVM using	Chandler Carruth	2015-01-14	5	-10/+7
\| \| \| \| \| \| \| \| \| \| \|	utils/sort_includes.py. I clearly haven't done this in a while, so more changed than usual. This even uncovered a missing include from the InstrProf library that I've added. No functionality changed here, just mechanical cleanup of the include order. llvm-svn: 225974
*	R600/SI: Fix bad code with unaligned byte vector loads	Matt Arsenault	2015-01-14	2	-5/+18
\| \| \| \| \| \| \| \| \|	Don't do the v4i8 -> v4f32 combine if the load will need to be expanded due to alignment. This stops adding instructions to repack into a single register that the v_cvt_ubyteN_f32 instructions read. llvm-svn: 225926
*	Implement new way of expanding extloads.	Matt Arsenault	2015-01-14	2	-24/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that the source and destination types can be specified, allow doing an expansion that doesn't use an EXTLOAD of the result type. Try to do a legal extload to an intermediate type and extend that if possible. This generalizes the special case custom lowering of extloads R600 has been using to work around this problem. This also happens to fix a bug that would incorrectly use more aligned loads than should be used. llvm-svn: 225925
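The intermediate-type expansion described above can be modelled on plain integers. This is a minimal sketch, not the LLVM legalizer code: the widths (i8 in memory, i32 as the legal intermediate, i64 as the result) and all function names are assumptions chosen for illustration.

```python
def sign_extend(value: int, from_bits: int, to_bits: int) -> int:
    """Sign-extend the low from_bits of value into a to_bits-wide bit pattern."""
    value &= (1 << from_bits) - 1
    if value & (1 << (from_bits - 1)):
        value -= 1 << from_bits
    return value & ((1 << to_bits) - 1)


def extload_direct(byte: int) -> int:
    # Hypothetical single i8 -> i64 sign-extending load.
    return sign_extend(byte, 8, 64)


def extload_via_intermediate(byte: int) -> int:
    # Extload to an intermediate type (i8 -> i32), then extend that (i32 -> i64).
    narrow = sign_extend(byte, 8, 32)
    return sign_extend(narrow, 32, 64)


def all_bytes_agree() -> bool:
    # Exhaustively compare both strategies over every possible byte value.
    return all(extload_direct(b) == extload_via_intermediate(b) for b in range(256))
```

Both paths produce the same 64-bit pattern for every byte, which is why the two-step form is a valid replacement when the direct extload is not legal.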
*	R600/SI: Define a schedule model	Tom Stellard	2015-01-14	4	-21/+155
\| \| \| \| \| \| \| \|	The machine scheduler is still disabled by default. The schedule model is not complete yet, and could be improved. llvm-svn: 225913
*	R600/SI: Add pattern for bitcasting fp immediates to integers	Tom Stellard	2015-01-13	8	-56/+39
\| \| \| \| \| \| \| \|	The backend now assumes that all immediates are integers. This allows us to simplify immediate handling code, because we no longer need to handle fp and integer immediates differently. llvm-svn: 225844
*	R600: Implement getRecipEstimate	Matt Arsenault	2015-01-13	2	-0/+26
\| \| \| \| \| \| \| \| \|	This requires a new hook to prevent expanding sqrt in terms of rsqrt and reciprocal. v_rcp_f32, v_rsq_f32, and v_sqrt_f32 are all the same rate, so this expansion would just double the number of instructions and cycles. llvm-svn: 225828
*	R600: Implement getRsqrtEstimate	Matt Arsenault	2015-01-13	2	-0/+23
\| \| \| \| \| \| \| \|	Only do this for f32, since I'm unclear on both what this is expecting for the refinement steps in terms of accuracy and what the f64 instruction actually provides. llvm-svn: 225827
*	R600: Make cttz / ctlz cheap to speculate	Matt Arsenault	2015-01-13	2	-0/+15
\| \| \| \| \| \| \| \| \|	Speculating things is generally good. SI+ has instructions for these for 32-bit values. This is still probably better even with the expansion for 64-bit values, although it is odd that this callback doesn't have the size as a parameter. llvm-svn: 225822
*	R600/SI: Remove redundant setting expand on f64 vectors	Matt Arsenault	2015-01-12	1	-7/+0
\| \| \| \| \| \| \|	None of these are legal types already, so they default to Expand. llvm-svn: 225728
*	R600/SI: Use RegisterOperands to specify which operands can accept immediates	Tom Stellard	2015-01-12	10	-76/+68
\| \| \| \| \| \| \| \| \| \| \| \|	There are some operands which can take either immediates or registers and we were previously using different register class to distinguish between operands that could take immediates and those that could not. This patch switches to using RegisterOperands which should simplify the backend by reducing the number of register classes and also make it easier to implement the assembler. llvm-svn: 225662
*	Add r224985 back with two fixes.	Rafael Espindola	2015-01-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	One is that AArch64 has additional restrictions on when local relocations can be used. We have to take those into consideration when deciding to put an L symbol in the symbol table or not. The other is that ld64 requires the relocations to cstring to use linker visible symbols on AArch64. Thanks to Michael Zolotukhin for testing this! Remove doesSectionRequireSymbols. In an assembly expression like bar: .long L0 + 1 the intended semantics is that bar will contain a pointer one byte past L0. In sections that are merged by content (strings, 4 byte constants, etc), a single position in the section doesn't give the linker enough information. For example, it would not be able to tell a relocation must point to the end of a string, since that would look just like the start of the next. The solution used in ELF is to use relocations with symbols if there is a non-zero addend. In MachO before this patch we would just keep all symbols in some sections. This would miss some cases (only cstrings on x86_64 were implemented) and was inefficient since most relocations have an addend of 0 and can be represented without the symbol. This patch implements the non-zero addend logic for MachO too. llvm-svn: 225644
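The non-zero addend rule can be written as a one-line decision. This is a simplified model under stated assumptions: the function name and its two parameters are hypothetical, and the AArch64 and ld64 restrictions mentioned in the message are deliberately left out.

```python
def relocate_against_symbol(addend: int, section_merged_by_content: bool) -> bool:
    """Return True if the relocation should keep referring to the symbol.

    Toy model of the rule in the commit message: in a section merged by
    content, a non-zero addend is ambiguous as a bare section position
    (the end of one string looks like the start of the next), so the
    symbol is kept. With a zero addend the symbol can be dropped.
    """
    return section_merged_by_content and addend != 0
```

For `bar: .long L0 + 1` in a cstring section, the addend is 1, so this model keeps `L0`. A plain `.long L0` has addend 0 and needs no symbol.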
*	R600/SI: Remove SIISelLowering::legalizeOperands()	Tom Stellard	2015-01-08	2	-176/+1
\| \| \| \| \| \| \| \| \|	Its functionality has been replaced by calling SIInstrInfo::legalizeOperands() from SIISelLowering::AdjustInstrPostInstrSelection() and running the SIFoldOperands and SIShrinkInstructions passes. llvm-svn: 225445
*	[SelectionDAG] Allow targets to specify legality of extloads' result	Ahmed Bougacha	2015-01-08	3	-39/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	type (in addition to the memory type). The LoadExt legalization handling used to only have one type, the memory type. This forced users to assume that as long as the extload for the memory type was declared legal, and the result type was legal, the whole extload was legal. However, this isn't always the case. For instance, on X86, with AVX, this is legal: v4i32 load, zext from v4i8 but this isn't: v4i64 load, zext from v4i8 Whereas v4i64 is (arguably) legal, even without AVX2. Note that the same thing was done a while ago for truncstores (r46140), but I assume no one needed it yet for extloads, so here we go. Calls to getLoadExtAction were changed to add the value type, found manually in the surrounding code. Calls to setLoadExtAction were mechanically changed, by wrapping the call in a loop, to match previous behavior. The loop iterates over the MVT subrange corresponding to the memory type (FP vectors, etc...). I also pulled neighboring setTruncStoreActions into some of the loops; those shouldn't make a difference, as the additional types are illegal. (e.g., i128->i1 truncstores on PPC.) No functional change intended. Differential Revision: http://reviews.llvm.org/D6532 llvm-svn: 225421
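A toy legality table shows why keying on the memory type alone is not enough. This is a sketch only: `LoadExtTable`, `set_for_all_results`, the string type names, and the short list of result types are assumptions for illustration, not the real `TargetLowering` interface or the X86 backend's actual table.

```python
LEGAL, EXPAND = "Legal", "Expand"


class LoadExtTable:
    """Toy table keyed by (extension kind, result type, memory type)."""

    def __init__(self, default=LEGAL):
        # The default action is a choice made for this sketch.
        self._default = default
        self._actions = {}

    def set_action(self, ext_kind, result_vt, mem_vt, action):
        self._actions[(ext_kind, result_vt, mem_vt)] = action

    def get_action(self, ext_kind, result_vt, mem_vt):
        return self._actions.get((ext_kind, result_vt, mem_vt), self._default)


def set_for_all_results(table, ext_kind, mem_vt, action, result_vts):
    # Mirrors the mechanical change: wrap the old one-type call in a loop
    # over a range of result types, which preserves the previous behavior.
    for result_vt in result_vts:
        table.set_action(ext_kind, result_vt, mem_vt, action)


def build_example_table():
    # Hypothetical subset of integer vector result types.
    result_vts = ["v4i16", "v4i32", "v4i64"]
    table = LoadExtTable()
    set_for_all_results(table, "zext", "v4i8", EXPAND, result_vts)
    # With the result type in the key, one combination can be singled out.
    table.set_action("zext", "v4i32", "v4i8", LEGAL)
    return table
```

The last call is what the old single-type key could not express: the same memory type `v4i8` is legal for one result type and expanded for another.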
*	R600/SI: Commute instructions to enable more folding opportunities	Tom Stellard	2015-01-07	2	-19/+51
\| \| \| \|	llvm-svn: 225410
*	R600/SI: Only fold immediates that have one use	Tom Stellard	2015-01-07	1	-1/+8
\| \| \| \| \| \| \|	Folding the same immediate into multiple instructions will increase program size, which can hurt performance. llvm-svn: 225405
*	R600/SI: Remove VReg_32 register class	Tom Stellard	2015-01-07	13	-154/+152
\| \| \| \| \| \| \| \| \| \| \|	Use VGPR_32 register class instead. These two register classes were identical and having separate classes was causing SIInstrInfo::isLegalOperands() to be overly conservative in some cases. This change is necessary to prevent future patches from missing a folding opportunity in fneg-fabs.ll. llvm-svn: 225382
*	R600/SI: Add a V_MOV_B64 pseudo instruction	Tom Stellard	2015-01-07	3	-0/+38
\| \| \| \| \| \| \|	This is used to simplify the SIFoldOperands pass and make it easier to fold immediates. llvm-svn: 225373
*	R600/SI: Teach SIFoldOperands to split 64-bit constants when folding	Tom Stellard	2015-01-07	3	-25/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This allows folding of sequences like: s[0:1] = s_mov_b64 4 v_add_i32 v0, s0, v0 v_addc_u32 v1, s1, v1 into v_add_i32 v0, 4, v0 v_add_i32 v1, 0, v1 llvm-svn: 225369
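Splitting the 64-bit constant amounts to taking its low and high 32-bit halves, one per 32-bit instruction. A minimal sketch of that arithmetic follows; `split_imm64` is a hypothetical helper name, not a function from SIFoldOperands.

```python
from typing import Tuple


def split_imm64(imm: int) -> Tuple[int, int]:
    """Return (low 32 bits, high 32 bits) of a 64-bit immediate."""
    imm &= 0xFFFFFFFFFFFFFFFF  # treat the value as a 64-bit pattern
    return imm & 0xFFFFFFFF, imm >> 32
```

For the constant 4 in the sequence above this gives `(4, 0)`: the low half is folded into the instruction that read `s0`, and the high half into the one that read `s1`.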
*	R600/SI: Refactor SIFoldOperands to simplify immediate folding	Tom Stellard	2015-01-07	1	-25/+54
\| \| \| \| \| \|	This will make a future patch much less intrusive. llvm-svn: 225358
*	R600/SI: Add combine for isinfinite pattern	Matt Arsenault	2015-01-06	2	-0/+57
\| \| \| \|	llvm-svn: 225310
*	R600/SI: Pattern match isinf to v_cmp_class instructions	Matt Arsenault	2015-01-06	2	-0/+34
\| \| \| \|	llvm-svn: 225307
*	R600/SI: Add basic DAG combines for fp_class	Matt Arsenault	2015-01-06	2	-1/+50
\| \| \| \|	llvm-svn: 225306
*	R600/SI: Add class intrinsic	Matt Arsenault	2015-01-06	7	-5/+82
\| \| \| \|	llvm-svn: 225305
*	R600/SI: Insert s_waitcnt before s_barrier instructions.	Tom Stellard	2015-01-06	1	-1/+5
\| \| \| \| \| \| \|	This ensures that all memory operations are complete when all threads reach the barrier. llvm-svn: 225290
*	R600/SI: Fix dependency calculation for DS write instructions in SIInsertWaits	Tom Stellard	2015-01-06	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \|	In DS write instructions, the address operand comes before the value operand(s), which is reversed from every other instruction type. The SIInsertWaits pass assumed that the first use for each instruction was the value, so for DS writes it was protecting the address operand with s_waitcnt instructions when it should have been protecting the value operand. llvm-svn: 225289
*	R600/SI: Add a stub GCNTargetMachine	Tom Stellard	2015-01-06	8	-1/+46
\| \| \| \| \| \| \| \| \| \| \| \|	This is equivalent to the AMDGPUTargetMachine now, but it is the starting point for separating R600 and GCN functionality into separate targets. It is recommended that users start using the gcn triple for GCN-based GPUs, because using the r600 triple for these GPUs will be deprecated in the future. llvm-svn: 225277
*	R600/SI: Remove MachineFunction dump from AsmPrinter	Tom Stellard	2015-01-06	1	-17/+12
\| \| \| \| \| \| \|	The dump was dependent on a feature string, which meant that it couldn't be disabled or enabled on a per-compile basis. llvm-svn: 225275
*	Revert r225048: It broke ObjC on AArch64.	Lang Hames	2015-01-06	1	-1/+1
\| \| \| \| \| \|	I've filed http://llvm.org/PR22100 to track this issue. llvm-svn: 225228
*	Minor cleanup to all the switches after MatchInstructionImpl in all the ↵	Craig Topper	2015-01-03	1	-16/+15
\| \| \| \| \| \| \| \|	AsmParsers. Make sure they all have llvm_unreachable on the default path out of the switch. Remove unnecessary "default: break". Remove a 'return' after unreachable. Fix some indentation. llvm-svn: 225114
*	Add r224985 back with a fix.	Rafael Espindola	2014-12-31	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The issue was that AArch64 has additional restrictions on when local relocations can be used. We have to take those into consideration when deciding to put an L symbol in the symbol table or not. Original message: Remove doesSectionRequireSymbols. In an assembly expression like bar: .long L0 + 1 the intended semantics is that bar will contain a pointer one byte past L0. In sections that are merged by content (strings, 4 byte constants, etc), a single position in the section doesn't give the linker enough information. For example, it would not be able to tell a relocation must point to the end of a string, since that would look just like the start of the next. The solution used in ELF is to use relocations with symbols if there is a non-zero addend. In MachO before this patch we would just keep all symbols in some sections. This would miss some cases (only cstrings on x86_64 were implemented) and was inefficient since most relocations have an addend of 0 and can be represented without the symbol. This patch implements the non-zero addend logic for MachO too. llvm-svn: 225048
*	Revert "Remove doesSectionRequireSymbols."	Rafael Espindola	2014-12-31	1	-1/+1
\| \| \| \| \| \| \| \|	This reverts commit r224985. I am investigating why it made an Apple bot unhappy. llvm-svn: 225044
*	Remove doesSectionRequireSymbols.	Rafael Espindola	2014-12-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In an assembly expression like bar: .long L0 + 1 the intended semantics is that bar will contain a pointer one byte past L0. In sections that are merged by content (strings, 4 byte constants, etc), a single position in the section doesn't give the linker enough information. For example, it would not be able to tell a relocation must point to the end of a string, since that would look just like the start of the next. The solution used in ELF is to use relocations with symbols if there is a non-zero addend. In MachO before this patch we would just keep all symbols in some sections. This would miss some cases (only cstrings on x86_64 were implemented) and was inefficient since most relocations have an addend of 0 and can be represented without the symbol. This patch implements the non-zero addend logic for MachO too. llvm-svn: 224985
*	Enable (sext x) == C --> x == (trunc C) combine	Matt Arsenault	2014-12-21	1	-21/+2
\| \| \| \| \| \| \| \| \|	Extend the existing code which handles this for zext. This makes this more useful for targets with ZeroOrNegativeOne BooleanContent and obsoletes a custom combine SI uses for i1 setcc (sext(i1), 0, setne) since the constant will now be shrunk to i1. llvm-svn: 224691
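The rewrite `(sext x) == C --> x == (trunc C)` only preserves meaning when C is itself the sign extension of its truncated value. The brute-force check below illustrates that condition on small widths. It is a sketch with hypothetical helper names, and it does not show how the DAG combiner handles the constants for which the rewrite is unsound.

```python
def sign_extend(value: int, from_bits: int, to_bits: int) -> int:
    """Sign-extend the low from_bits of value into a to_bits-wide bit pattern."""
    value &= (1 << from_bits) - 1
    if value & (1 << (from_bits - 1)):
        value -= 1 << from_bits
    return value & ((1 << to_bits) - 1)


def truncate(value: int, to_bits: int) -> int:
    return value & ((1 << to_bits) - 1)


def combine_is_sound(c: int, narrow_bits: int, wide_bits: int) -> bool:
    """True if (sext x) == c agrees with x == trunc(c) for every narrow x."""
    narrow_c = truncate(c, narrow_bits)
    return all(
        (sign_extend(x, narrow_bits, wide_bits) == c) == (x == narrow_c)
        for x in range(1 << narrow_bits)
    )
```

With `narrow_bits=1` this covers the i1 case from the message: comparing against 0 or against the all-ones pattern shrinks cleanly to an i1 constant, while a constant such as 1 is not a sign-extended i1 value and the check reports the rewrite as unsound.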
*	R600: Remove outdated comment	Matt Arsenault	2014-12-19	1	-4/+0
\| \| \| \|	llvm-svn: 224648
*	R600/SI: Only form min/max with 1 use.	Matt Arsenault	2014-12-19	1	-1/+1
\| \| \| \| \| \| \|	If the condition is used for something else, this increases the number of instructions. llvm-svn: 224646
*	R600/SI: isLegalOperand() shouldn't check constant bus for SALU instructions	Tom Stellard	2014-12-19	1	-1/+1
\| \| \| \| \| \| \|	The constant bus restrictions only apply to VALU instructions. This enables SIFoldOperands to fold immediates into SALU instructions. llvm-svn: 224623
*	R600/SI: Make sure non-inline constants aren't folded into mubuf soffset operand	Tom Stellard	2014-12-19	4	-17/+25
\| \| \| \| \| \| \| \|	mubuf instructions now define the soffset field using the SCSrc_32 register class which indicates that only SGPRs and inline constants are allowed. llvm-svn: 224622
*	R600/SI: Fix f64 inline immediates	Matt Arsenault	2014-12-17	8	-46/+126
\| \| \| \|	llvm-svn: 224458
*	R600: Fix min/max matching problems with unordered compares	Matt Arsenault	2014-12-12	4	-50/+60
\| \| \| \| \| \| \| \|	The returned operand needs to be permuted for the unordered compares. Also fix incorrectly producing fmin_legacy / fmax_legacy for f64, which don't exist. llvm-svn: 224094
*	R600/SI: fmin/fmax_legacy are not associative	Matt Arsenault	2014-12-12	1	-2/+2
\| \| \| \|	llvm-svn: 224093
*	R600/SI: Don't promote f32 select to i32	Matt Arsenault	2014-12-12	2	-2/+5
\| \| \| \| \| \| \| \|	This is nice for the instruction patterns, but it complicates min / max matching. The select doesn't have the correct type and would require looking through the bitcasts for the real float operands. llvm-svn: 224092
*	Add target hook for whether it is profitable to reduce load widths	Matt Arsenault	2014-12-12	2	-0/+26
\| \| \| \| \| \| \| \|	Add an option to disable optimization to shrink truncated larger type loads to smaller type loads. On SI this prevents using scalar load instructions in some cases, since there are no scalar extloads. llvm-svn: 224084
*	R600/SI: Handle physical registers in getOpRegClass	Matt Arsenault	2014-12-11	1	-2/+7
\| \| \| \|	llvm-svn: 224079
*	R600/SI: Don't verify constant bus usage of flag ops	Matt Arsenault	2014-12-11	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \|	This was checking if pseudo-operands like the source modifiers were using the constant bus, which happened to work because all the values these operands can take are valid inline immediates. This fixes a later commit which starts checking the register class of the operands. llvm-svn: 224078
*	R600/SI: Use unordered equal instructions	Matt Arsenault	2014-12-11	2	-6/+2
\| \| \| \|	llvm-svn: 224067
*	R600/SI: Make more unordered comparisons legal	Matt Arsenault	2014-12-11	3	-18/+9
\| \| \| \| \| \| \|	This saves a second compare and an and / or by using the unordered comparison instructions. llvm-svn: 224066
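The saving can be seen on a single predicate. The sketch below uses "unordered or less than" as an assumed example (the message does not list which comparisons became legal) and models the expanded form (two compares plus an or) against a single negated compare.

```python
import math


def ult_expanded(a: float, b: float) -> bool:
    unordered = math.isnan(a) or math.isnan(b)  # first compare: either is NaN
    ordered_lt = a < b                          # second compare: ordered less-than
    return unordered or ordered_lt              # the extra "or"


def ult_single(a: float, b: float) -> bool:
    # One compare: "not greater-or-equal" is true for NaN inputs as well.
    return not (a >= b)


SAMPLES = [0.0, -1.5, 2.0, math.inf, -math.inf, math.nan]


def forms_agree() -> bool:
    return all(ult_expanded(a, b) == ult_single(a, b) for a in SAMPLES for b in SAMPLES)
```

Both forms agree on every sample pair, including those involving NaN, so a target with a native unordered compare needs only the single instruction.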
*	R600/SI: Use unordered not equal instructions	Matt Arsenault	2014-12-11	4	-10/+19
\| \| \| \|	llvm-svn: 224065
*	[CodeGen] Add print and verify pass after each MachineFunctionPass by default	Matthias Braun	2014-12-11	1	-26/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously print+verify passes were added in a very unsystematic way, which is annoying when debugging as you miss intermediate steps and allows bugs to stay unnoticed when no verification is performed. To make this change practical I added the possibility to explicitly disable verification. I used this option in all places where no verification was performed previously (because a lot of places actually don't pass the MachineVerifier). In the long term these problems should be fixed properly and verification enabled after each pass. I'll enable some more verification in subsequent commits. This is the 2nd attempt at this after realizing that PassManager::add() may actually delete the pass. llvm-svn: 224059
*	This reverts commit r224043 and r224042.	Rafael Espindola	2014-12-11	1	-21/+26
\| \| \| \| \| \|	check-llvm was failing. llvm-svn: 224045