bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[AMDGPU] Allow folding of sgpr to vgpr copy	Stanislav Mekhanoshin	2019-10-23	1	-2/+3
\| \| \| \| \| \| \| \|	Potentially sgpr to sgpr copy should also be possible. That is however trickier because we may end up with a wrong register class at use because of xm0/xexec permutations. Differential Revision: https://reviews.llvm.org/D69280
*	[Mips] Use appropriate private label prefix based on Mips ABI	Mirko Brkusanin	2019-10-23	2	-2/+4
\| \| \| \| \| \| \| \| \| \|	MipsMCAsmInfo was using '$' prefix for Mips32 and '.L' for Mips64 regardless of -target-abi option. By passing MCTargetOptions to MCAsmInfo we can find out Mips ABI and pick appropriate prefix. Tags: #llvm, #clang, #lldb Differential Revision: https://reviews.llvm.org/D66795
*	[AMDGPU] Allow tied operand subreg folding	Stanislav Mekhanoshin	2019-10-22	1	-12/+0
\| \| \| \| \| \|	Turns out it makes sense, contrarily to what comment said. Differential Revision: https://reviews.llvm.org/D69287
*	AMDGPU/GlobalISel: Legalize fast unsafe FDIV	Austin Kerbow	2019-10-21	2	-6/+90
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69231 llvm-svn: 375460
*	AMDGPU: Select basic interp directly from intrinsics	Matt Arsenault	2019-10-21	5	-57/+29
\| \| \| \|	llvm-svn: 375457
*	AMDGPU: Use CopyToReg for interp intrinsic lowering	Matt Arsenault	2019-10-21	1	-16/+17
\| \| \| \| \| \| \|	This doesn't use the default value, so doesn't benefit from the hack to help optimize it. llvm-svn: 375450
*	AMDGPU: Erase redundant redefs of m0 in SIFoldOperands	Matt Arsenault	2019-10-21	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \|	Only handle simple inter-block redefs of m0 to the same value. This avoids interference from redefs of m0 in SILoadStoreOptimzer. I was initially teaching that pass to ignore redefs of m0, but having them not exist beforehand is much simpler. This is in preparation for deleting the current special m0 handling in SIFixSGPRCopies to allow the register coalescer to handle the difficult cases. llvm-svn: 375449
*	AMDGPU: Stop adding m0 implicit def to SGPR spills	Matt Arsenault	2019-10-21	1	-13/+2
\| \| \| \| \| \| \| \|	r375293 removed the SGPR spilling with scalar stores path, so this is no longer necessary. This also always had the defect of adding the def even when this path wasn't in use. llvm-svn: 375448
*	AMDGPU: Slightly restructure m0 init code	Matt Arsenault	2019-10-21	1	-13/+15
\| \| \| \| \| \| \|	This will allow using another operation to produce the glue in a future change. llvm-svn: 375447
*	[AMDGPU] Select AGPR in PHI operand legalization	Stanislav Mekhanoshin	2019-10-21	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a PHI defines AGPR legalize its operands to AGPR. At the moment we can get an AGPR PHI with VGPR operands. I am not aware of any problems as it seems to be handled gracefully in RA, but this is not right anyway. It also slightly decreases VGPR pressure in some cases because we do not have to a copy via VGPR. Differential Revision: https://reviews.llvm.org/D69206 llvm-svn: 375446
*	Use Align for TFL::TransientStackAlignment	Guillaume Chatelet	2019-10-21	4	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, dschuff, jyknight, sdardis, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, fedor.sergeev, jrtc27, atanasyan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69216 llvm-svn: 375398
*	Fix buildbot error in SIRegisterInfo.cpp.	Zinovy Nis	2019-10-20	1	-3/+4
\| \| \| \|	llvm-svn: 375373
*	AMDGPU: Increase vcc liveness scan threshold	Matt Arsenault	2019-10-20	1	-2/+4
\| \| \| \| \| \| \|	Avoids a test regression in a future patch. Also add debug printing on this case, so I waste less time debugging folds in the future. llvm-svn: 375367
*	AMDGPU: Split flat offsets that don't fit in DAG	Matt Arsenault	2019-10-20	3	-3/+96
\| \| \| \| \| \| \| \| \| \|	We handle it this way for some other address spaces. Since r349196, SILoadStoreOptimizer has been trying to do this. This is after SIFoldOperands runs, which can change the addressing patterns. It's simpler to just split this earlier. llvm-svn: 375366
*	AMDGPU: Fix missing OPERAND_IMMEDIATE	Matt Arsenault	2019-10-20	1	-12/+13
\| \| \| \|	llvm-svn: 375365
*	AMDGPU: Don't re-get the subtarget	Matt Arsenault	2019-10-20	1	-21/+9
\| \| \| \| \| \|	It's already available in the class. llvm-svn: 375363
*	AMDGPU: Don't error on calls to null or undef	Matt Arsenault	2019-10-20	1	-0/+9
\| \| \| \| \| \|	Calls to constants should probably be generally handled. llvm-svn: 375356
*	Prune a LegacyDivergenceAnalysis and MachineLoopInfo include each	Reid Kleckner	2019-10-19	2	-1/+4
\| \| \| \| \| \|	Now X86ISelLowering doesn't depend on many IR analyses. llvm-svn: 375320
*	Prune two MachineInstr.h includes, fix up deps	Reid Kleckner	2019-10-19	2	-1/+3
\| \| \| \| \| \| \| \| \| \|	MachineInstr.h included AliasAnalysis.h, which includes a world of IR constructs mostly unneeded in CodeGen. Prune it. Same for DebugInfoMetadata.h. Noticed with -ftime-trace. llvm-svn: 375311
*	[AMDGPU] move PHI nodes to AGPR class	Stanislav Mekhanoshin	2019-10-18	1	-5/+16
\| \| \| \| \| \| \| \| \|	If all uses of a PHI are in AGPR register class we should avoid unneeded copies via VGPRs. Differential Revision: https://reviews.llvm.org/D69200 llvm-svn: 375297
*	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	Jay Foad	2019-10-18	2	-156/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The implementation was never completed and never used except in tests. Reviewers: arsenm, mareko Subscribers: qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69163 llvm-svn: 375293
*	[GISel][CallLowering] Make isIncomingArgumentHandler a pure virtual method	Quentin Colombet	2019-10-18	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	The default implementation of isIncomingArgumentHandler could lead to generating incorrect code. Make it a pure virtual method, so that targets know they have to override it to produce correct code. NFC Differential Revision: https://reviews.llvm.org/D69187 llvm-svn: 375277
*	AMDGPU: Relax 32-bit SGPR register class	Matt Arsenault	2019-10-18	6	-34/+39
\| \| \| \| \| \| \| \| \| \| \|	Mostly use SReg_32 instead of SReg_32_XM0 for arbitrary values. This will allow the register coalescer to do a better job eliminating copies to m0. For GlobalISel, as a terrible hack, use SGPR_32 for things that should use SCC until booleans are solved. llvm-svn: 375267
*	AMDGPU: Fix SMEM WAR hazard for gfx10 readlane	Austin Kerbow	2019-10-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Hazard recognizer fails to see hazard with V_READLANE_B32_gfx10. Reviewers: rampitec Reviewed By: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69172 llvm-svn: 375265
*	[AMDGPU][MC][GFX10] Added sdwa/dpp versions of v_cndmask_b32	Dmitry Preobrazhensky	2019-10-18	2	-52/+80
\| \| \| \| \| \| \| \| \| \|	See https://bugs.llvm.org/show_bug.cgi?id=43608 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D69096 llvm-svn: 375241
*	[AMDGPU][MC][GFX9] Corrected parsing of v_cndmask_b32_sdwa	Dmitry Preobrazhensky	2019-10-18	2	-10/+22
\| \| \| \| \| \| \| \| \| \|	See https://bugs.llvm.org/show_bug.cgi?id=43607 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D69095 llvm-svn: 375231
*	[AMDGPU] drop getIsFP td helper	Stanislav Mekhanoshin	2019-10-17	3	-23/+13
\| \| \| \| \| \| \| \| \|	We already have isFloatType helper, and they are out of sync. Drop one and merge the type list. Differential Revision: https://reviews.llvm.org/D69138 llvm-svn: 375175
*	[AMDGPU] Improve code size cost model	Daniil Fukalov	2019-10-17	3	-3/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added estimation for zero size insertelement, extractelement and llvm.fabs operators. Updated inline/unroll parameters default values. Reviewers: rampitec, arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68881 llvm-svn: 375109
*	[Alignment][NFC] Use Align for TargetFrameLowering/Subtarget	Guillaume Chatelet	2019-10-17	6	-21/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jholewinski, arsenm, dschuff, jyknight, dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, jsji, Jim, lenary, s.egerton, pzheng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68993 llvm-svn: 375084
*	GlobalISel: Implement lower for G_SADDO/G_SSUBO	Matt Arsenault	2019-10-16	2	-3/+4
\| \| \| \| \| \| \|	Port directly from SelectionDAG, minus the path using ISD::SADDSAT/ISD::SSUBSAT. llvm-svn: 375042
*	[AMDGPU] Do not combine dpp mov reading physregs	Stanislav Mekhanoshin	2019-10-16	1	-0/+6
\| \| \| \| \| \| \| \|	We cannot be sure physregs will stay unchanged. Differential Revision: https://reviews.llvm.org/D69065 llvm-svn: 375033
*	[AMDGPU] Do not combine dpp with physreg def	Stanislav Mekhanoshin	2019-10-16	1	-0/+4
\| \| \| \| \| \| \| \|	We will remove dpp mov along with the physreg def otherwise. Differential Revision: https://reviews.llvm.org/D69063 llvm-svn: 375030
*	[AMDGPU] Supress unused sdwa insts generation	Stanislav Mekhanoshin	2019-10-16	3	-25/+55
\| \| \| \| \| \| \| \| \|	Do not generate non-existing sdwa instructions. It reduces the number of generated instructions by 185. Differential Revision: https://reviews.llvm.org/D69010 llvm-svn: 375016
*	[AMDGPU] Fix-up cases where writelane has 2 SGPR operands	David Stuttard	2019-10-16	2	-0/+87
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Even though writelane doesn't have the same constraints as other valu instructions it still can't violate the >1 SGPR operand constraint Due to later register propagation (e.g. fixing up vgpr operands via readfirstlane) changing writelane to only have a single SGPR is tricky. This implementation puts a new check after SIFixSGPRCopies that prevents multiple SGPRs being used in any writelane instructions. The algorithm used is to check for trivial copy prop of suitable constants into one of the SGPR operands and perform that if possible. If this isn't possible put an explicit copy of Src1 SGPR into M0 and use that instead (this is allowable for writelane as the constraint is for SGPR read-port and not constant-bus access). Reviewers: rampitec, tpr, arsenm, nhaehnle Reviewed By: rampitec, arsenm, nhaehnle Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, mgorny, yaxunl, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D51932 Change-Id: Ic7553fa57440f208d4dbc4794fc24345d7e0e9ea llvm-svn: 375004
*	[AMDGPU] Extend the SI Load/Store optimizer	Piotr Sobczak	2019-10-16	1	-13/+174
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extend the SI Load/Store optimizer to merge MIMG load instructions. Handle different flavours of image_load and image_sample instructions. When the instructions of the same subclass differ only in dmask, merge them and update dmask accordingly. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64911 llvm-svn: 374984
*	AMDGPU: Fix infinite searches in SIFixSGPRCopies	Austin Kerbow	2019-10-15	2	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Two conditions could lead to infinite loops when processing PHI nodes in SIFixSGPRCopies. The first condition involves a REG_SEQUENCE that uses registers defined by both a PHI and a COPY. The second condition arises when a physical register is copied to a virtual register which is then used in a PHI node. If the same virtual register is copied to the same physical register, the result is an endless loop. %0:sgpr_64 = COPY $sgpr0_sgpr1 %2 = PHI %0, %bb.0, %1, %bb.1 $sgpr0_sgpr1 = COPY %0 Reviewers: alex-t, rampitec, arsenm Reviewed By: rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68970 llvm-svn: 374944
*	[AMDGPU] Support mov dpp with 64 bit operands	Stanislav Mekhanoshin	2019-10-15	4	-0/+103
\| \| \| \| \| \| \| \| \| \|	We define mov/update dpp intrinsics as overloaded but do not support i64, which is a practically useful type. Fix the selection and lowering. Differential Revision: https://reviews.llvm.org/D68673 llvm-svn: 374910
*	[AMDGPU] Allow DPP combiner to work with REG_SEQUENCE	Stanislav Mekhanoshin	2019-10-15	1	-5/+54
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D68828 llvm-svn: 374908
*	[Alignment] Migrate Attribute::getWith(Stack)Alignment	Guillaume Chatelet	2019-10-15	9	-39/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet, jdoerfert Reviewed By: courbet Subscribers: arsenm, jvesely, nhaehnle, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D68792 llvm-svn: 374884
*	[Alignment][NFC] Remove dependency on GlobalObject::setAlignment(unsigned)	Guillaume Chatelet	2019-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, mehdi_amini, jvesely, nhaehnle, hiraditya, steven_wu, dexonsmith, dang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68944 llvm-svn: 374880
*	AMDGPU: Fix redundant setting of m0 for atomic load/store	Matt Arsenault	2019-10-14	1	-10/+7
\| \| \| \| \| \| \|	Atomic load/store would have their setting of m0 handled twice, which happened to be optimized out later. llvm-svn: 374801
*	[AMDGPU] Come back patch for the 'Assign register class for cross block ↵	Alexander Timofeev	2019-10-14	4	-125/+208
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	values according to the divergence.' Detailed description: After https://reviews.llvm.org/D59990 submit several issues were discovered. Changes in common code were preserved but AMDGPU specific part was reverted to keep the backend working correctly. Discovered issues were addressed in the following commits: https://reviews.llvm.org/D67662 https://reviews.llvm.org/D67101 https://reviews.llvm.org/D63953 https://reviews.llvm.org/D63731 This change brings back AMDGPU specific changes. Reviewed by: rampitec, arsenm Differential Revision: https://reviews.llvm.org/D68635 llvm-svn: 374767
*	[AMDGPU] link dpp pseudos and real instructions on gfx10	Stanislav Mekhanoshin	2019-10-11	3	-34/+28
\| \| \| \| \| \| \| \| \| \| \| \|	This defaults to zero fi operand, but we do not expose it anyway. Should we expose it later it needs to be added to the pseudo. This enables dpp combining on gfx10. Differential Revision: https://reviews.llvm.org/D68888 llvm-svn: 374604
*	[AMDGPU][MC][GFX9][GFX10] Corrected number of src operands for ↵	Dmitry Preobrazhensky	2019-10-11	1	-5/+18
\| \| \| \| \| \| \| \| \| \| \| \|	ds_[read/write]_addtid_b32 See https://bugs.llvm.org/show_bug.cgi?id=37941 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D68787 llvm-svn: 374561
*	[AMDGPU][MC][GFX6][GFX7][GFX10] Added instructions ↵	Dmitry Preobrazhensky	2019-10-11	1	-14/+29
\| \| \| \| \| \| \| \| \| \| \| \|	buffer_atomic_[fcmpswap/fmin/fmax]* See https://bugs.llvm.org/show_bug.cgi?id=28232 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D68788 llvm-svn: 374559
*	[AMDGPU][MC][GFX10] Enabled null for 64-bit dst operands	Dmitry Preobrazhensky	2019-10-11	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	See https://bugs.llvm.org/show_bug.cgi?id=43524 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D68785 llvm-svn: 374557
*	[AMDGPU][MC] Corrected parsing of optional operands	Dmitry Preobrazhensky	2019-10-11	1	-12/+6
\| \| \| \| \| \| \| \| \| \|	See https://bugs.llvm.org/show_bug.cgi?id=43486 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D68350 llvm-svn: 374553
*	AMDGPU: Move SelectFlatOffset back into AMDGPUISelDAGToDAG	Matt Arsenault	2019-10-11	3	-62/+43
\| \| \| \|	llvm-svn: 374495
*	[AMDGPU] Handle undef old operand in DPP combine	Stanislav Mekhanoshin	2019-10-10	1	-1/+3
\| \| \| \| \| \| \| \|	It was missing an undef flag. Differential Revision: https://reviews.llvm.org/D68813 llvm-svn: 374455
*	Revert "[AMDGPU] Run `unreachable-mbb-elimination` after isel to clean up PHIs."	Jay Foad	2019-10-10	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This has been superseded by "[AMDGPU]: PHI Elimination hooks added for custom COPY insertion." This reverts the code changes from commit 53f967f2bdb6aa7b08596880c3689d1ecad6f0ff but keeps the test case. Reviewers: hliao, arsenm, tpr, dstuttard Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68769 llvm-svn: 374347