bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-709/+0
	llvm-svn: 239657
*	R600: Use SIGN_EXTEND_INREG for SEXT loads	Jan Vesely	2015-05-26	1	-70/+39
	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 238229
*	[opaque pointer type] Add textual IR support for explicit type parameter to ↵	David Blaikie	2015-02-27	1	-46/+46
	load instruction

	Essentially the same as the GEP change in r230786.

	A similar migration script can be used to update test cases, though a few more test case improvements/changes were required this time around: (r229269-r229278)

	    import fileinput
	    import sys
	    import re

	    pat = re.compile(r"((?:=|:|^)\s*load (?:atomic )?(?:volatile )?(.*?))(| addrspace\(\d+\) *)\*($| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$)")

	    for line in sys.stdin:
	      sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line))

	Reviewers: rafael, dexonsmith, grosser
	Differential Revision: http://reviews.llvm.org/D7649
	llvm-svn: 230794
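The textual change described above can be illustrated with a much smaller rewrite. This is a minimal sketch, not the migration script from the commit: the pattern `LOAD_RE` and the helper `add_explicit_load_type` are hypothetical names, and the pattern only handles the simple `load <type>* <operand>` form, assuming the pointer operand follows the type after a single space.

```python
import re

# Hypothetical, simplified pattern (not the commit's regex):
#   group 1: "load " plus optional "atomic " / "volatile "
#   group 2: the pointee type
#   group 3: an optional " addrspace(N)" qualifier, which stays on the pointer
#   group 4: the rest of the line, starting at the space before the operand
LOAD_RE = re.compile(
    r"(\bload (?:atomic )?(?:volatile )?)(.*?)((?: addrspace\(\d+\))?)\*( .*)$"
)


def add_explicit_load_type(line):
    """Rewrite 'load T* %p' as 'load T, T* %p'; other lines pass through."""
    return LOAD_RE.sub(r"\1\2, \2\3*\4", line)


if __name__ == "__main__":
    for sample in (
        "%v = load i32* %p",
        "%v = load float addrspace(1)* %p",
        "%v = load i32** %pp",
    ):
        print(add_explicit_load_type(sample))
```

As with the getelementptr change, the address space stays on the pointer type and only the pointee type is repeated as the explicit parameter.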
*	[opaque pointer type] Add textual IR support for explicit type parameter to ↵	David Blaikie	2015-02-27	1	-5/+5
	getelementptr instruction

	One of several parallel first steps to remove the target type of pointers, replacing them with a single opaque pointer type.

	This adds an explicit type parameter to the gep instruction so that when the first parameter becomes an opaque pointer type, the type to gep through is still available to the instructions.

	* This doesn't modify gep operators, only instructions (operators will be handled separately)

	* Textual IR changes only. Bitcode (including upgrade) and changing the in-memory representation will be in separate changes.

	* geps of vectors are transformed as:
	    getelementptr <4 x float*> %x, ...
	  ->getelementptr float, <4 x float*> %x, ...
	  Then, once the opaque pointer type is introduced, this will ultimately look like:
	    getelementptr float, <4 x ptr> %x
	  with the unambiguous interpretation that it is a vector of pointers to float.

	* address spaces remain on the pointer, not the type:
	    getelementptr float addrspace(1)* %x
	  ->getelementptr float, float addrspace(1)* %x
	  Then, eventually:
	    getelementptr float, ptr addrspace(1) %x

	Importantly, the massive amount of test case churn has been automated by some crappy python code. I had to manually update a few test cases that wouldn't fit the script's model (r228970,r229196,r229197,r229198). The python script just massages stdin and writes the result to stdout, I then wrapped that in a shell script to handle replacing files, then using the usual find+xargs to migrate all the files.

	update.py:

	    import fileinput
	    import sys
	    import re

	    ibrep = re.compile(r"(^.*?[^%\w]getelementptr inbounds )(((?:<\d* x )?)(.*?)(| addrspace\(\d\)) *\*(|>)(?:$| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$))")
	    normrep = re.compile(       r"(^.*?[^%\w]getelementptr )(((?:<\d* x )?)(.*?)(| addrspace\(\d\)) *\*(|>)(?:$| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$))")

	    def conv(match, line):
	      if not match:
	        return line
	      line = match.groups()[0]
	      if len(match.groups()[5]) == 0:
	        line += match.groups()[2]
	      line += match.groups()[3]
	      line += ", "
	      line += match.groups()[1]
	      line += "\n"
	      return line

	    for line in sys.stdin:
	      if line.find("getelementptr ") == line.find("getelementptr inbounds"):
	        if line.find("getelementptr inbounds") != line.find("getelementptr inbounds ("):
	          line = conv(re.match(ibrep, line), line)
	      elif line.find("getelementptr ") != line.find("getelementptr ("):
	        line = conv(re.match(normrep, line), line)
	      sys.stdout.write(line)

	apply.sh:

	    for name in "$@"
	    do
	      python3 `dirname "$0"`/update.py < "$name" > "$name.tmp" && mv "$name.tmp" "$name"
	      rm -f "$name.tmp"
	    done

	The actual commands:

	    From llvm/src:
	    find test/ -name *.ll | xargs ./apply.sh
	    From llvm/src/tools/clang:
	    find test/ -name *.mm -o -name *.m -o -name *.cpp -o -name *.c | xargs -I '{}' ../../apply.sh "{}"
	    From llvm/src/tools/polly:
	    find test/ -name *.ll | xargs ./apply.sh

	After that, check-all (with llvm, clang, clang-tools-extra, lld, compiler-rt, and polly all checked out).

	The extra 'rm' in the apply.sh script is due to a few files in clang's test suite using interesting unicode stuff that my python script was throwing exceptions on. None of those files needed to be migrated, so it seemed sufficient to ignore those cases.

	Reviewers: rafael, dexonsmith, grosser
	Differential Revision: http://reviews.llvm.org/D7636
	llvm-svn: 230786
*	R600/SI: Remove the -CHECK suffix from all FileCheck prefixes in LIT tests	Marek Olsak	2015-02-03	1	-299/+299
	llvm-svn: 228040
*	R600/SI: Enable all tests that pass on VI without changes	Marek Olsak	2015-01-27	1	-0/+1
	llvm-svn: 227214
*	R600/SI: Add a stub GCNTargetMachine	Tom Stellard	2015-01-06	1	-1/+1
	This is equivalent to the AMDGPUTargetMachine now, but it is the starting point for separating R600 and GCN functionality into separate targets. It is recommended that users start using the gcn triple for GCN-based GPUs, because using the r600 triple for these GPUs will be deprecated in the future. llvm-svn: 225277
*	R600: Error on initializer for LDS.	Matt Arsenault	2014-11-13	1	-1/+1
	Also give a proper error for other address spaces. llvm-svn: 221917
*	R600/SI: Change all instruction assembly names to lowercase.	Tom Stellard	2014-11-05	1	-128/+128
	This matches the format produced by the AMD proprietary driver.

	//==================================================================//
	// Shell script for converting .ll test cases: (Pass the .ll files you want to convert to this script as arguments).
	//==================================================================//

	    ; This was necessary on my system so that A-Z in sed would match only
	    ; upper case. I'm not sure why.
	    export LC_ALL='C'

	    TEST_FILES="$*"

	    MATCHES=`grep -v Patterns SIInstructions.td | grep -o '"[A-Z0-9_]\+["e]' | grep -o '[A-Z0-9_]\+' | sort -r`

	    for f in $TEST_FILES; do

	      # Check that there are SI tests:
	      grep -q -e 'verde' -e 'bonaire' -e 'SI' -e 'tahiti' $f

	      if [ $? -eq 0 ]; then
	        for match in $MATCHES; do
	          sed -i -e "s/\([ :]$match\)/\L\1/" $f
	        done

	        # Try to get check lines with partial instruction names
	        sed -i 's/\(;[ ]*SI[A-Z\\-]*: \)\([A-Z_0-9]\+\)/\1\L\2/' $f
	      fi
	    done

	    sed -i -e 's/bb0_1/BB0_1/g' ../../../test/CodeGen/R600/infinite-loop.ll
	    sed -i -e 's/SI-NOT: bfe/SI-NOT: {{[^@]}}bfe/g' ../../../test/CodeGen/R600/llvm.AMDGPU.bfe.32.ll ../../../test/CodeGen/R600/sext-in-reg.ll
	    sed -i -e 's/exp_IEEE/EXP_IEEE/g' ../../../test/CodeGen/R600/llvm.exp2.ll
	    sed -i -e 's/numVgprs/NumVgprs/g' ../../../test/CodeGen/R600/register-count-comments.ll
	    sed -i 's/\(; CHECK[-NOT]*: \)\([A-Z_0-9]\+\)/\1\L\2/' ../../../test/CodeGen/R600/select64.ll ../../../test/CodeGen/R600/sgpr-copy.ll

	//==================================================================//
	// Shell script for converting .td files (run this last)
	//==================================================================//

	    export LC_ALL='C'
	    sed -i -e '/Patterns/!s/\("[A-Z0-9_]\+[ "e]\)/\L\1/g' SIInstructions.td
	    sed -i -e 's/"EXP/"exp/g' SIInstrInfo.td

	llvm-svn: 221350
*	R600/SI: Fix bug where immediates were being used in DS addr operands	Tom Stellard	2014-10-15	1	-0/+18
	The SelectDS1Addr1Offset complex pattern always tries to store constant lds pointers in the offset operand and store a zero value in the addr operand. Since the addr operand does not accept immediates, the zero value needs to first be copied to a register. This newly created zero value will not go through normal instruction selection, so we need to manually insert a V_MOV_B32_e32 in the complex pattern. This bug was hidden by the fact that if there was another zero value in the DAG that had not been selected yet, then the CSE done by the DAG would use the unselected node for the addr operand rather than the one that was just created. This would lead to the zero value being selected and the DAG automatically inserting a V_MOV_B32_e32 instruction. llvm-svn: 219848
*	R600/SI: Remove assertion in SIInstrInfo::areLoadsFromSameBasePtr()	Tom Stellard	2014-10-07	1	-0/+18
	Added a FIXME comment instead; we need to handle the case where the two DS instructions being compared have different numbers of operands. llvm-svn: 219236
*	R600: Call EmitFunctionHeader() in the AsmPrinter to populate the ELF symbol ↵	Tom Stellard	2014-10-01	1	-43/+43
	table llvm-svn: 218776
*	R600: Add dag combine for copy of an illegal type.	Matt Arsenault	2014-07-15	1	-4/+2
	This helps avoid redundant instructions to unpack and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031
*	R600: Promote i64 loads to v2i32	Tom Stellard	2014-07-02	1	-2/+1
	llvm-svn: 212216
*	R600/SI: Split global vector loads with more than 4 elements	Tom Stellard	2014-02-13	1	-85/+93
	llvm-svn: 201368
*	R600/SI: Initialize M0 and emit S_WQM_B64 whenever DS instructions are used	Tom Stellard	2014-02-10	1	-2/+17
	DS instructions that access local memory can only use addresses that are less than or equal to the value of M0. When M0 is uninitialized, we experience undefined behavior. This patch also changes the behavior to emit S_WQM_B64 on pixel shaders no matter what kind of DS instruction is used. llvm-svn: 201097
*	R600/SI: Add support for private address space load/store	Tom Stellard	2013-11-13	1	-2/+0
	Private address space is emulated using the register file with MOVRELS and MOVRELD instructions. llvm-svn: 194626
*	R600/SI: Prefer SALU instructions for bit shift operations	Tom Stellard	2013-11-13	1	-81/+81
	All shift operations will be selected as SALU instructions and then if necessary lowered to VALU instructions in the SIFixSGPRCopies pass. This allows us to do more operations on the SALU which will improve performance and is also required for implementing private memory using indirect addressing, since the private memory pointers must stay in the scalar registers. This patch includes some fixes from Matt Arsenault. llvm-svn: 194625
*	R600/SI: Change formatting of printed registers.	Matt Arsenault	2013-11-12	1	-11/+11
	Print the range of registers used with a single letter prefix. This better matches what the shader compiler produces and is overall less obnoxious than concatenating all of the subregister names together. Instead of SGPR0, it will print s0. Instead of SGPR0_SGPR1, it will print s[0:1] and so on. There doesn't appear to be a straightforward way to get the actual register info in the InstPrinter, so this parses the generated name to print with the new syntax. The required test changes are pretty nasty, and register matching regexes are now worse. Since there isn't a way to add to a variable in FileCheck, some of the tests now don't check the exact number of registers used, but I don't think that will be a real problem. llvm-svn: 194443
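The renaming rule in this message can be sketched in a few lines. This is an illustration only, not the InstPrinter code from the commit: `short_reg_name` is a hypothetical helper, and the `VGPR` to `v` mapping is an assumption added alongside the `SGPR` to `s` case that the message actually names.

```python
import re

# Single-letter prefixes. Only SGPR -> s appears in the commit message;
# VGPR -> v is an assumption made for this sketch.
_PREFIX = {"SGPR": "s", "VGPR": "v"}
_PART = re.compile(r"(SGPR|VGPR)(\d+)$")


def short_reg_name(name):
    """Turn a concatenated subregister name into the ranged form.

    'SGPR0' becomes 's0' and 'SGPR0_SGPR1' becomes 's[0:1]'. Names that do
    not fit the pattern are returned unchanged.
    """
    parts = name.split("_")
    matches = [_PART.match(part) for part in parts]
    if not all(matches):
        return name
    kinds = {m.group(1) for m in matches}
    if len(kinds) != 1:
        # Mixed register files are outside the scope of this sketch.
        return name
    prefix = _PREFIX[kinds.pop()]
    first = int(matches[0].group(2))
    last = int(matches[-1].group(2))
    if len(parts) == 1:
        return f"{prefix}{first}"
    return f"{prefix}[{first}:{last}]"


if __name__ == "__main__":
    for reg in ("SGPR0", "SGPR0_SGPR1", "VGPR4_VGPR5_VGPR6_VGPR7"):
        print(reg, "->", short_reg_name(reg))
```

The sketch takes the range from the first and last subregister only and does not check that the indices in between are contiguous.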
*	R600/SI: Use -verify-machineinstrs for most tests	Tom Stellard	2013-10-10	1	-1/+1
	We can't enable the verifier for tests with SI_IF and SI_ELSE, because these instructions are always followed by a COPY which copies their result to the next basic block. This violates the machine verifier's rule that non-terminators cannot follow terminators. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 192366
*	R600/SI: Don't emit S_WQM_B64 instruction for compute shaders	Tom Stellard	2013-09-05	1	-0/+13
	llvm-svn: 190077
*	SelectionDAG: Remove unnecessary uses of TargetLowering::getPointerTy()	Tom Stellard	2013-08-26	1	-0/+140
	If we have a binary operation like ISD::ADD, we can set the result type equal to the result type of one of its operands rather than using TargetLowering::getPointerTy(). Also, any use of DAG.getIntPtrConstant(C) as an operand for a binary operation can be replaced with: DAG.getConstant(C, OtherOperand.getValueType()); llvm-svn: 189227
*	R600: Add support for vector local memory loads	Tom Stellard	2013-08-26	1	-0/+14
	llvm-svn: 189226
*	R600: Add support for i8 and i16 local memory loads	Tom Stellard	2013-08-26	1	-0/+78
	llvm-svn: 189225
*	R600: Add support for global vector loads with element types less than 32-bits	Tom Stellard	2013-08-16	1	-0/+176
	Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188521
*	R600: Change the RAT instruction assembly names so they match the docs	Tom Stellard	2013-08-16	1	-6/+6
	Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188515
*	R600: Add 64-bit float load/store support	Tom Stellard	2013-08-01	1	-2/+1
	* Added R600_Reg64 class
	* Added T#Index#.XY registers definition
	* Added v2i32 register reads from parameter and global space
	* Added f32 and i32 elements extraction from v2f32 and v2i32
	* Added v2i32 -> v2f32 conversions
	Tom Stellard:
	- Mark vec2 operations as expand. The addition of a vec2 register class made them all legal.
	Patch by: Dmitry Cherkassov
	Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com>
	llvm-svn: 187582
*	R600: Treat CONSTANT_ADDRESS loads like GLOBAL_ADDRESS loads when necessary	Tom Stellard	2013-07-23	1	-25/+122
	These are really the same address space in hardware. The only difference is that CONSTANT_ADDRESS uses a special cache for faster access. When we are unable to use the constant kcache for some reason (e.g. smaller types or lack of indirect addressing) then the instruction selector must use GLOBAL_ADDRESS loads instead. llvm-svn: 187006
*	R600: Improve support for < 32-bit loads	Tom Stellard	2013-07-23	1	-0/+45
	Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186921
*	R600/SI: Add support for v2f32 loads	Tom Stellard	2013-07-18	1	-0/+14
	llvm-svn: 186615
*	R600/SI: Add support for 64-bit loads	Tom Stellard	2013-07-15	1	-0/+42
	https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 186339
*	R600: Add support for i32 loads from the constant address space on Cayman	Tom Stellard	2013-06-25	1	-0/+1
	Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 184821
*	R600/SI: Add support for global loads	Tom Stellard	2013-06-03	1	-3/+49
	llvm-svn: 183131
*	R600: Reorganize lit tests and document how they should be organized	Tom Stellard	2013-04-19	1	-0/+20
	llvm-svn: 179828