bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[AMDGPU] add LDS f32 intrinsics	Daniil Fukalov	2018-01-17	1	-0/+69
\| \| \| \| \| \| \| \| \| \| \| \|	Added llvm.amdgcn.atomic.{add|min|max}.f32 intrinsics to allow generating the ds_{add|min|max}[_rtn]_f32 instructions needed for OpenCL float atomics in LDS. Reviewed by: arsenm Differential Revision: https://reviews.llvm.org/D37985 llvm-svn: 322656
*	[AMDGPU] Add HW_REG_SH_MEM_BASES symbolic name for s_getreg_b32	Stanislav Mekhanoshin	2018-01-15	1	-3/+3
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D41617 llvm-svn: 322500
*	[AMDGPU] stop image_store being moved illegally	Tim Renouf	2018-01-12	2	-20/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: A recent change 321556: AMDGPU: Remove mayLoad/hasSideEffects from MIMG stores can allow the machine instruction scheduler to move an image store past an image load using the same descriptor. V2: Fixed by marking image ops as mayAlias and isAliased. This may be overly conservative, and we may need to revisit. V3: Reverted test change done on 321556. Reviewers: arsenm, nhaehnle, dstuttard Subscribers: llvm-commits, t-tye, yaxunl, wdng, kzhuravl Differential Revision: https://reviews.llvm.org/D41969 llvm-svn: 322419
*	AMDGPU/SI: Add d16 support for buffer intrinsics.	Changpeng Fang	2018-01-12	4	-0/+185
\| \| \| \| \| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D38906 Reviewers: Matt and Brian. llvm-svn: 322402
*	Make internal/private GVs implicitly dso_local.	Rafael Espindola	2018-01-11	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While updating clang tests for having clang set dso_local I noticed that: - There are a lot of tests to update. - Many of the updates are redundant. They are redundant because a GV is "obviously dso_local". This patch starts formalizing that a bit by requiring that internal and private GVs be dso_local too. Since they all are, we don't have to print dso_local to the textual representation, making it a bit more compact and easier to read. llvm-svn: 322317
*	[MIR] Repurposing '$' sigil used by external symbols. Replacing with '&'.	Puyan Lotfi	2018-01-10	2	-2/+2
\| \| \| \| \| \| \| \| \| \|	Planning to add support for named vregs. This puts us in a conundrum since physregs are named as well. To rectify this we need to use a sigil other than '%' for physregs in MIR. We've settled on using '$' for physregs but first we must repurpose it from external symbols using it, which is what this commit is all about. We think '&' will have familiar semantics for C/C++ users. llvm-svn: 322146
*	[SelectionDAG] Fixed f16-from-vector promotion problem	Tim Renouf	2018-01-09	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In the case of an fp_extend of v1f16 to v1f32 where the v1f16 is the result of a bitcast from i16, avoid creating an illegal fp16_to_fp where the input is not a vector and the result is a v1f32. V2: The fix is now to avoid vector scalarization creating a v1->scalar bitcast. Reviewers: srhines, t.p.northover Subscribers: nhaehnle, llvm-commits, dstuttard, t-tye, yaxunl, wdng, kzhuravl, arsenm Differential Revision: https://reviews.llvm.org/D41126 llvm-svn: 322120
*	[AMDGPU] Fixed incorrect uniform branch condition	Tim Renouf	2018-01-09	8	-13/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I had a case where multiple nested uniform ifs resulted in code that did v_cmp comparisons, combining the results with s_and_b64, s_or_b64 and s_xor_b64 and using the resulting mask in s_cbranch_vccnz, without first ensuring that bits for inactive lanes were clear. There was already code for inserting an "s_and_b64 vcc, exec, vcc" to clear bits for inactive lanes in the case that the branch is instruction selected as s_cbranch_scc1 and is then changed to s_cbranch_vccnz in SIFixSGPRCopies. I have added the same code into SILowerControlFlow for the case that the branch is instruction selected as s_cbranch_vccnz. This de-optimizes the code in some cases where the s_and is not needed, because vcc is the result of a v_cmp, or multiple v_cmp instructions combined by s_and/s_or. We should add a pass to re-optimize those cases. Reviewers: arsenm, kzhuravl Subscribers: wdng, yaxunl, t-tye, llvm-commits, dstuttard, timcorringham, nhaehnle Differential Revision: https://reviews.llvm.org/D41292 llvm-svn: 322119
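The masking step described above can be modelled with plain integers. This is a toy sketch, not compiler code: it assumes a wavefront is represented as a 64-bit integer with one bit per lane, and the helper names `clear_inactive_lanes` and `branch_taken_vccnz` are hypothetical.

```python
# Toy model: each mask is a 64-bit integer, one bit per lane (an assumption
# made for illustration; this is not how the backend represents masks).
WAVE_MASK = (1 << 64) - 1


def clear_inactive_lanes(vcc: int, exec_mask: int) -> int:
    """Model of `s_and_b64 vcc, exec, vcc`: keep only bits of active lanes."""
    return vcc & exec_mask & WAVE_MASK


def branch_taken_vccnz(vcc: int) -> bool:
    """Model of s_cbranch_vccnz: the branch is taken if any vcc bit is set."""
    return vcc != 0


exec_mask = 0b0011  # only lanes 0 and 1 are active
vcc = 0b0100        # stale bit left behind for inactive lane 2

# Without the s_and the stale bit makes the uniform branch go the wrong way.
print(branch_taken_vccnz(vcc))
# With the s_and only active lanes contribute to the decision.
print(branch_taken_vccnz(clear_inactive_lanes(vcc, exec_mask)))
```

The extra AND is redundant whenever vcc already has no bits set for inactive lanes, which is the de-optimization the commit message mentions.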
*	[CodeGen] Don't print register classes in -debug output	Francis Visoiu Mistrih	2018-01-09	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Since register classes and banks are already printed with the register definition, don't print it at the end of every instruction anymore. This follows MIR in this regard and is another step to the unification of the two formats. llvm-svn: 322086
*	StructurizeCFG: Fix broken backedge detection	Matt Arsenault	2018-01-03	2	-42/+88
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The work order was changed in r228186 from SCC order to RPO with an arbitrary sorting function. The sorting function attempted to move inner loop nodes earlier. This was apparently relying on an assumption that every block in a given loop / the same loop depth would be seen before visiting another loop. In the broken testcase, a block outside of the loop was encountered before moving onto another block in the same loop. The testcase would then structurize such that one block's unconditional successor could never be reached. Revert to plain RPO for the analysis phase. This fixes detecting edges as backedges that aren't really. The processing phase does use another visited set, and I'm unclear on whether the order there is as important. An arbitrary order doesn't work, and triggers some infinite loops. The reversed RPO list seems to work and is closer to the order that was used before, minus the arbitrary custom sorting. A few of the changed tests now produce smaller code, and a few are slightly worse looking. llvm-svn: 321751
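For readers unfamiliar with the term, backedge detection over a plain reverse post-order (RPO) can be sketched as follows. This is a generic illustration on a dictionary-based CFG, not the StructurizeCFG implementation; the function names and the example graph are assumptions.

```python
def reverse_post_order(succs, entry):
    """Return the blocks reachable from entry in reverse post-order."""
    seen, post = set(), []

    def dfs(block):
        seen.add(block)
        for succ in succs.get(block, []):
            if succ not in seen:
                dfs(succ)
        post.append(block)  # a block is emitted after all its successors

    dfs(entry)
    return post[::-1]


def back_edges(succs, entry):
    """Visit blocks in RPO; an edge to an already visited block is a backedge."""
    visited, edges = set(), []
    for block in reverse_post_order(succs, entry):
        visited.add(block)
        for succ in succs.get(block, []):
            if succ in visited:
                edges.append((block, succ))
    return edges


# Hypothetical CFG with a single loop: header -> body -> header.
loop_cfg = {"entry": ["header"], "header": ["body", "exit"],
            "body": ["header"], "exit": []}
print(back_edges(loop_cfg, "entry"))
```

With a custom sort layered on top of RPO, a block can be visited before one of its predecessors outside the loop, so a forward edge may be misread as a backedge, which is the failure mode the message describes.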
*	2nd attempt at "fixing" amdgpu tests after r321575	Philip Reames	2017-12-31	1	-26/+0
\| \| \| \| \| \|	The test needs to be changed; it was exercising UB and that likely wasn't the intent of the test author. I simply removed the checks because I have absolutely no idea what this test was trying to accomplish. With multiple check patterns, no explanation, and no familiarity on my part with the ISA a true fix is going to have to come from someone familiar with the target. llvm-svn: 321591
*	Test fix after r321575	Philip Reames	2017-12-30	1	-3/+2
\| \| \| \| \| \| \| \|	The test in question was checking for a particular interpretation of undefined behavior. Relax the test to check that we simply don't crash. Sorry for the breakage, I don't generally build AMDGPU locally and just saw the failure this morning. llvm-svn: 321589
*	AMDGPU: Remove mayLoad/hasSideEffects from MIMG stores	Matt Arsenault	2017-12-29	2	-5/+22
\| \| \| \| \| \| \|	Atomics still have hasSideEffects set on them because of the mess that is the memory properties. llvm-svn: 321556
*	[AMDGPU] Turn off MergeConsecutiveStores() before Instruction Selection for AMDGPU.	Mark Searles	2017-12-19	2	-1/+34
\| \| \| \| \| \| \| \|	Commit dbbb6c5fc3642987430866dffdf710df4f616ac7 turned on MergeConsecutiveStores() before Instruction Selection for all targets. Enough AMDGPU compiles go into an infinite loop ( MergeConsecutiveStores() merges two stores; LegalizeStoreOps() un-merges; MergeConsecutiveStores() re-merges, etc. ) to warrant turning it off until the issues can be addressed. Differential Revision: https://reviews.llvm.org/D41377 llvm-svn: 321100
*	[Memcpy Loop Lowering] Remove the fixed int8 lowering.	Sean Fertile	2017-12-18	1	-16/+8
\| \| \| \| \| \| \| \|	Switch over to the lowering that uses target supplied operand types. Differential Revision: https://reviews.llvm.org/D41201 llvm-svn: 320989
*	Recommit CodeGen: Fix assertion in machine inst sheduler due to llvm.dbg.value	Yaxun Liu	2017-12-15	1	-0/+106
\| \| \| \| \| \|	The regression on ppc64 was not due to this commit. llvm-svn: 320788
*	Revert CodeGen: Fix assertion in machine inst sheduler due to llvm.dbg.value	Yaxun Liu	2017-12-14	1	-106/+0
\| \| \| \| \| \|	This commit might have caused regression on ppc64. Revert it to verify that. llvm-svn: 320712
*	CodeGen: Fix assertion in machine inst sheduler due to llvm.dbg.value	Yaxun Liu	2017-12-13	1	-0/+106
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Two issues were found in the machine instruction scheduler when compiling ProRender with -g for the amdgcn target: (1) GCNScheduleDAGMILive::schedule tries to update LiveIntervals for DBG_VALUE, which it should not, since DBG_VALUE is not mapped in LiveIntervals. (2) When DBG_VALUE is the last instruction of an MBB, ScheduleDAGInstrs::buildSchedGraph and ScheduleDAGMILive::scheduleMI do not move RPTracker properly, which causes an assertion. This patch fixes both. Differential Revision: https://reviews.llvm.org/D41132 llvm-svn: 320650
*	[MachineOperand][MIR] Add isRenamable to MachineOperand.	Geoff Berry	2017-12-12	5	-23/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add isRenamable() predicate to MachineOperand. This predicate can be used by machine passes after register allocation to determine whether it is safe to rename a given register operand. Register operands that aren't marked as renamable may be required to be assigned their current register to satisfy constraints that are not captured by the machine IR (e.g. ABI or ISA constraints). Reviewers: qcolombet, MatzeB, hfinkel Subscribers: nemanjai, mcrosier, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D39400 llvm-svn: 320503
*	LSR: Check more intrinsic pointer operands	Matt Arsenault	2017-12-11	1	-0/+13
\| \| \| \|	llvm-svn: 320424
*	AMDGPU/GCN: Bring processors in sync with AMDGPUUsage	Konstantin Zhuravlyov	2017-12-08	32	-47/+44
\| \| \| \| \| \| \| \| \| \| \| \|	- Add gfx704 - Change bonaire to gfx704 - Remove gfx804 - Remove gfx901 - Remove gfx903 Differential Revision: https://reviews.llvm.org/D40046 llvm-svn: 320194
*	AMDGPU: image_getlod and image_getresinfo do not read memory	Matt Arsenault	2017-12-08	3	-5/+60
\| \| \| \|	llvm-svn: 320187
*	AMDGPU: Report Arg's Value name in metadata if kernel_arg_name metadata is not available	Konstantin Zhuravlyov	2017-12-08	4	-63/+126
\| \| \| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D40924 llvm-svn: 320176
*	[AMDGPU] Revert "[AMDGPU] Add options for waitcnt pass debugging; add instr count in debug output."	Mark Searles	2017-12-07	1	-41/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Patch caused a buildbot failure; http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/15733/steps/build_Lld/logs/stdio : lib/Target/AMDGPU/SIInsertWaitcnts.cpp:396:11: error: private field 'InstCnt' is not used [-Werror,-Wunused-private-field] int32_t InstCnt = 0; ^ 1 error generated. " This reverts commit 71627f79010aafe74fdcba901bba28dd7caa0869. llvm-svn: 320086
*	[AMDGPU] Add options for waitcnt pass debugging; add instr count in debug output.	Mark Searles	2017-12-07	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \| \|	-amdgpu-waitcnt-forcezero={1|0} Force all waitcnt instrs to be emitted as s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) -amdgpu-waitcnt-forceexp=<n> Force emit a s_waitcnt expcnt(0) before the first <n> instrs -amdgpu-waitcnt-forcelgkm=<n> Force emit a s_waitcnt lgkmcnt(0) before the first <n> instrs -amdgpu-waitcnt-forcevm=<n> Force emit a s_waitcnt vmcnt(0) before the first <n> instrs Differential Revision: https://reviews.llvm.org/D40091 llvm-svn: 320084
*	[AMDGPU] Add GCNHazardRecognizer::checkInlineAsmHazards() and GCNHazardRecognizer::checkVALUHazardsHelper().	Mark Searles	2017-12-07	1	-0/+24
\| \| \| \| \| \| \| \|	checkInlineAsmHazards() checks INLINEASM for hazards that we particularly care about (so not exhaustive); this patch adds a check for INLINEASM that defs vregs that hold data-to-be stored by immediately preceding store of more than 8 bytes. If the instr were not within an INLINEASM, this scenario would be handled by checkVALUHazard(). Add checkVALUHazardsHelper(), which will be called by both checkVALUHazards() and checkInlineAsmHazards(). Differential Revision: https://reviews.llvm.org/D40098 llvm-svn: 320083
*	[CodeGen] Use MachineOperand::print in the MIRPrinter for MO_Register.	Francis Visoiu Mistrih	2017-12-07	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Work towards the unification of MIR and debug output by refactoring the interfaces. For MachineOperand::print, keep a simple version that can be easily called from `dump()`, and a more complex one which will be called from both the MIRPrinter and MachineInstr::print. Add extra checks inside MachineOperand for detached operands (operands with getParent() == nullptr). https://reviews.llvm.org/D40836
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/kill: ([^ ]+) ([^ ]+)<def> ([^ ]+)/kill: \1 def \2 \3/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/kill: ([^ ]+) ([^ ]+) ([^ ]+)<def>/kill: \1 \2 def \3/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/kill: def ([^ ]+) ([^ ]+) ([^ ]+)<def>/kill: def \1 \2 def \3/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/<def>//g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<kill>/killed \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<imp-use,kill>/implicit killed \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<dead>/dead \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<def[ ]*,[ ]*dead>/dead \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<imp-def[ ]*,[ ]*dead>/implicit-def dead \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<imp-def>/implicit-def \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<imp-use>/implicit \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<internal>/internal \1/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" -o -name "*.s" \) -type f -print0 | xargs -0 sed -i '' -E 's/([^ ]+)<undef>/undef \1/g'
	llvm-svn: 320022
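The effect of these substitutions on a single line can be previewed with a small Python sketch. It mirrors a subset of the sed expressions from the message using the `re` module; the function name `rewrite_operand_flags` and the sample instruction line are made up for illustration and are not taken from the LLVM tree.

```python
import re

# Subset of the substitutions listed in the commit message, in the same
# relative order. Each pair is (pattern, replacement) in Python `re` syntax.
RULES = [
    (r"<def>", r""),
    (r"([^ ]+)<kill>", r"killed \1"),
    (r"([^ ]+)<imp-use,kill>", r"implicit killed \1"),
    (r"([^ ]+)<dead>", r"dead \1"),
    (r"([^ ]+)<imp-def>", r"implicit-def \1"),
    (r"([^ ]+)<imp-use>", r"implicit \1"),
    (r"([^ ]+)<undef>", r"undef \1"),
]


def rewrite_operand_flags(line: str) -> str:
    """Move trailing <flag> suffixes in front of the operand as keywords."""
    for pattern, replacement in RULES:
        line = re.sub(pattern, replacement, line)
    return line


# Hypothetical line in the old debug-output style.
sample = "%vgpr0<def> = V_MOV_B32_e32 %vgpr1<kill>, %exec<imp-use>"
print(rewrite_operand_flags(sample))
```

Trying the rules on a few lines first is cheaper than running `sed -i` across the tree, since the in-place edit cannot be undone without version control.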
*	AMDGPU Tests: Change a case to be run with -O0	Zvi Rackover	2017-12-06	2	-49/+45
\| \| \| \| \| \| \|	D40231 requires running this case with -O0 to prevent InstructionSimplify from transforming an extractelement with an undef index. llvm-svn: 319907
*	AMDGPU: Fix SDWA crash on inline asm	Matt Arsenault	2017-12-05	1	-0/+23
\| \| \| \| \| \| \| \|	This was only searching for explicit defs, and asserting for any implicit or variadic instruction defs, like inline asm. llvm-svn: 319826
*	AMDGPU: Fix infinite loop with dbg_value	Matt Arsenault	2017-12-05	1	-9/+24
\| \| \| \| \| \| \| \| \|	Surprisingly, SIOptimizeExecMaskingPreRA can loop infinitely in some cases with DBG_VALUE. Most tests using dbg_value are run at -O0, so don't run this pass. This seems to only happen when the value argument is undef. llvm-svn: 319808
*	AMDGPU: Fix crash when scheduling DBG_VALUE	Matt Arsenault	2017-12-05	1	-0/+333
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This calls handleMove with a DBG_VALUE instruction, which isn't tracked by LiveIntervals. I'm not sure this is the correct place to fix this. The generic scheduler seems to have more deliberate region selection that skips dbg_value. The test is also really hard to reduce. I haven't been able to figure out what exactly causes this particular case to try moving the dbg_value. llvm-svn: 319732
*	AMDGPU/EG: Add a new FeatureFMA and use it to selectively enable FMA instruction	Jan Vesely	2017-12-04	1	-1/+8
\| \| \| \| \| \| \| \| \|	Only used by pre-GCN targets v2: fix predicate setting for FMA_Common Differential Revision: https://reviews.llvm.org/D40692 llvm-svn: 319712
*	AMDGPU: Disable fp64 support on pre GCN asics	Jan Vesely	2017-12-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	It's not implemented. Passing the +fp64-fp16-denormal feature enables fp64 even on asics that don't support it. v2: fix hasFP64 query Differential Revision: https://reviews.llvm.org/D39931 llvm-svn: 319709
*	AMDGPU: Fix creating invalid copy when adjusting dmask	Matt Arsenault	2017-12-04	1	-0/+51
\| \| \| \| \| \| \| \| \|	Move the entire optimization to one place. Before it was possible to adjust dmask without changing the register class of the output instruction, since they were done in separate places. Fix all lane sizes and move all of the optimization into the DAG folding. llvm-svn: 319705
*	[CodeGen] Unify MBB reference format in both MIR and debug output	Francis Visoiu Mistrih	2017-12-04	32	-125/+125
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, print MBB references as '%bb.5'. The MIR printer prints the IR name of a MBB only for block definitions.
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)->getNumber/" << printMBBReference(\1)/g'
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)\.getNumber/" << printMBBReference(\1)/g'
	* find . \( -name "*.txt" -o -name "*.s" -o -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E 's/BB#([0-9]+)/%bb.\1/g'
	* grep -nr 'BB#' and fix
	Differential Revision: https://reviews.llvm.org/D40422 llvm-svn: 319665
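The test-file part of this rename is a single regular-expression substitution. A minimal Python equivalent is shown below; the helper name `rename_mbb_refs` and the sample text are illustrative assumptions.

```python
import re


def rename_mbb_refs(text: str) -> str:
    """Rewrite old-style 'BB#<n>' block references to '%bb.<n>'."""
    return re.sub(r"BB#([0-9]+)", r"%bb.\1", text)


# Hypothetical line of old-style debug output.
print(rename_mbb_refs("Successors according to CFG: BB#1 BB#12"))
```

References that are not followed by a number are left untouched, which is why the message ends with a manual `grep -nr 'BB#'` pass.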
*	[AMDGPU] SDWA: add support for PRESERVE into SDWA peephole.	Sam Kolton	2017-12-04	5	-10/+63
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Reviewers: arsenm, vpykhtin, rampitec Subscribers: kzhuravl, wdng, nhaehnle, mgorny, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D37817 llvm-svn: 319662
*	CodeGen: Fix SelectionDAGISel::LowerArguments for sret addr space	Yaxun Liu	2017-12-03	1	-8/+8
\| \| \| \| \| \| \| \| \| \| \|	SelectionDAGISel::LowerArguments assumes sret addr space is 0, which is not true for amdgcn---amdgiz target. This patch fixes that. Differential Revision: https://reviews.llvm.org/D40255 llvm-svn: 319630
*	CodeGen: Fix pointer info in SplitVecOp_EXTRACT_VECTOR_ELT/SplitVecRes_INSERT_VECTOR_ELT	Yaxun Liu	2017-12-02	10	-35/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Two issues found when doing codegen for splitting vector with non-zero alloca addr space: DAGTypeLegalizer::SplitVecRes_INSERT_VECTOR_ELT/SplitVecOp_EXTRACT_VECTOR_ELT uses dummy pointer info for creating SDStore. Since one pointer operand contains multiply and add, InferPointerInfo is unable to infer the correct pointer info, which ends up with a dummy pointer info for the target to lower store and results in isel failure. The fix is to introduce MachinePointerInfo::getUnknownStack to represent MachinePointerInfo which is known in alloca address space but without other information. TargetLowering::getVectorElementPointer uses value type of pointer in addr space 0 for multiplication of index and then add it to the pointer. However the pointer may be in an addr space which has different size than addr space 0. The fix is to use the pointer value type for index multiplication. Differential Revision: https://reviews.llvm.org/D39758 llvm-svn: 319622
*	[AMDGPU] SiFixSGPRCopies should not modify non-divergent PHI	Alexander Timofeev	2017-12-01	4	-9/+45
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D40556 llvm-svn: 319534
*	AMDGPU: Use carry-less adds in FI elimination	Matt Arsenault	2017-11-30	1	-19/+62
\| \| \| \|	llvm-svn: 319501
*	AMDGPU: Use gfx9 carry-less add/sub instructions	Matt Arsenault	2017-11-30	23	-258/+513
\| \| \| \|	llvm-svn: 319491
*	[CodeGen] Always use `printReg` to print registers in both MIR and debug output	Francis Visoiu Mistrih	2017-11-30	4	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, always use `printReg` to print all kinds of registers. Updated the tests using '_' instead of '%noreg' until we decide which one we want to be the default one. Differential Revision: https://reviews.llvm.org/D40421 llvm-svn: 319445
*	[CodeGen] Print "%vreg0" as "%0" in both MIR and debug output	Francis Visoiu Mistrih	2017-11-30	4	-13/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, avoid printing "vreg" for virtual registers (which is one of the current MIR possibilities). Basically:
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E "s/%vreg([0-9]+)/%\1/g"
	* grep -nr '%vreg' . and fix if needed
	* find . \( -name "*.mir" -o -name "*.cpp" -o -name "*.h" -o -name "*.ll" \) -type f -print0 | xargs -0 sed -i '' -E "s/ vreg([0-9]+)/ %\1/g"
	* grep -nr 'vreg[0-9]\+' . and fix if needed
	Differential Revision: https://reviews.llvm.org/D40420 llvm-svn: 319427
*	AMDGPU: Allow negative MUBUF vaddr for gfx9	Matt Arsenault	2017-11-30	2	-243/+138
\| \| \| \| \| \| \| \|	GFX9 does not enable bounds checking for the resource descriptors used for private access, so it should be OK to use vaddr with a potentially negative value. llvm-svn: 319393
*	AMDGPU: Use stricter regexes for add instructions	Matt Arsenault	2017-11-29	6	-66/+66
\| \| \| \| \| \| \|	Match the entire _co as one optional piece rather than a set of characters to match multiple times. llvm-svn: 319275
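The difference between the two kinds of pattern can be seen in a short Python sketch. The exact check patterns used in the tests are not shown in the message, so the two regexes below are assumptions chosen to illustrate the idea: a character class repeated any number of times versus `_co` as one optional group.

```python
import re

# Assumed "loose" form: any run of the characters '_', 'c', 'o'.
loose = re.compile(r"v_add[_co]*_u32")
# Assumed "strict" form: the literal suffix '_co', present or absent.
strict = re.compile(r"v_add(_co)?_u32")

for name in ["v_add_u32", "v_add_co_u32", "v_add_oc_c_u32"]:
    # fullmatch requires the whole string to match the pattern
    print(name, bool(loose.fullmatch(name)), bool(strict.fullmatch(name)))
```

Both forms accept the two real spellings, but only the loose form also accepts the nonsense name in the last row, so the strict form gives the test more power to catch a wrong instruction.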
*	AMDGPU: Select DS insts without m0 initialization	Matt Arsenault	2017-11-29	25	-524/+1623
\| \| \| \| \| \| \| \| \|	GFX9 stopped using m0 for most DS instructions. Select a different instruction without the use. I think this will be less error prone than trying to manually maintain m0 uses as needed. llvm-svn: 319270
*	AMDGPU: Enable IPRA	Matt Arsenault	2017-11-28	7	-14/+15
\| \| \| \|	llvm-svn: 319256
*	AMDGPU: Add num spilled s/vgprs to metadata	Konstantin Zhuravlyov	2017-11-28	1	-17/+125
\| \| \| \| \| \| \| \|	This was requested by tools. Differential Revision: https://reviews.llvm.org/D40321 llvm-svn: 319192
*	[CodeGen] Print register names in lowercase in both MIR and debug output	Francis Visoiu Mistrih	2017-11-28	3	-9/+9
\| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, always print registers as lowercase. * Only debug printing is affected. It now follows MIR. Differential Revision: https://reviews.llvm.org/D40417 llvm-svn: 319187
*	DAG: Legalize truncstores to illegal int types	Matt Arsenault	2017-11-28	1	-0/+56
\| \| \| \| \| \| \|	Truncate to a legal int type, and produce a new truncstore from a narrower type. llvm-svn: 319185