bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	GlobalISel: Legalization for G_FMINNUM/G_FMAXNUM	Matt Arsenault	2019-07-10	2	-0/+1066
\| \| \| \|	llvm-svn: 365658
*	AMDGPU/GlobalISel: Add support for wide loads >= 256-bits	Tom Stellard	2019-07-10	3	-0/+548
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds support for the most commonly used wide load types: <8xi32>, <16xi32>, <4xi64>, and <8xi64> Reviewers: arsenm Reviewed By: arsenm Subscribers: hiraditya, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57399 llvm-svn: 365586
*	GlobalISel: Implement lower for G_FCOPYSIGN	Matt Arsenault	2019-07-09	2	-176/+671
\| \| \| \| \| \| \| \| \|	In SelectionDAG AMDGPU treated these as legal, but this was mostly because the bitcasts required for FP types were painful. Theoretically the bitpattern should eventually match to bfi, so don't bother trying to get the patterns to import. llvm-svn: 365583
*	AMDGPU/GlobalISel: Fix legality for G_BUILD_VECTOR	Matt Arsenault	2019-07-09	19	-192/+600
\| \| \| \|	llvm-svn: 365575
*	GlobalISel: Combine unmerge of merge with intermediate cast	Matt Arsenault	2019-07-09	1	-0/+484
\| \| \| \| \| \| \|	This eliminates some illegal intermediate vectors when operations are scalarized. llvm-svn: 365566
*	AMDGPU/GlobalISel: Prepare some tests for store selection	Matt Arsenault	2019-07-09	17	-124/+92
\| \| \| \| \| \| \| \| \| \| \|	Mostsly these would fail due to trying to use SI with a flat operation. Implementing global loads with MUBUF is more work than flat, so these won't be handled in the initial load selection. Others fail because store of s64 won't initially work, as the current set of patterns expect everything to be turned into v2i32. llvm-svn: 365493
*	AMDGPU/GlobalISel: Fix test	Matt Arsenault	2019-07-09	1	-4/+1
\| \| \| \|	llvm-svn: 365491
*	AMDGPU/GlobalISel: Legalize more concat_vectors	Matt Arsenault	2019-07-09	2	-14/+99
\| \| \| \|	llvm-svn: 365488
*	AMDGPU/GlobalISel: Improve regbankselect for icmp s16	Matt Arsenault	2019-07-09	2	-25/+366
\| \| \| \| \| \|	Account for 64-bit scalar eq/ne when available. llvm-svn: 365487
*	AMDGPU/GlobalISel: Make s16 G_ICMP legal	Matt Arsenault	2019-07-09	1	-151/+474
\| \| \| \|	llvm-svn: 365486
*	AMDGPU/GlobalISel: Select G_SUB	Matt Arsenault	2019-07-09	1	-0/+61
\| \| \| \|	llvm-svn: 365484
*	AMDGPU/GlobalISel: Select G_UNMERGE_VALUES	Matt Arsenault	2019-07-09	1	-0/+231
\| \| \| \|	llvm-svn: 365483
*	AMDGPU/GlobalISel: Select G_MERGE_VALUES	Matt Arsenault	2019-07-09	2	-0/+1303
\| \| \| \|	llvm-svn: 365482
*	GlobalISel: Fix widenScalar for pointer typed G_MERGE_VALUES	Matt Arsenault	2019-07-03	1	-0/+83
\| \| \| \|	llvm-svn: 365093
*	[AArch64][GlobalISel] Overhaul legalization & isel or shifts to select ↵	Amara Emerson	2019-07-03	1	-6/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	immediate forms. There are two main issues preventing us from generating immediate form shifts: 1) We have partial SelectionDAG imported support for G_ASHR and G_LSHR shift immediate forms, but they currently don't work because the amount type is expected to be an s64 constant, but we only legalize them to have homogenous types. To deal with this, first we introduce a custom legalizer to only custom legalize s32 shifts which have a constant operand into a s64. There is also an additional artifact combiner to fold zexts(g_constant) to a larger G_CONSTANT if it's legal, a counterpart to the anyext version committed in an earlier patch. 2) For G_SHL the importer can't cope with the pattern. For this I introduced an early selection phase in the arm64 selector to select these forms manually before the tablegen selector pessimizes it to a register-register variant. Differential Revision: https://reviews.llvm.org/D63910 llvm-svn: 364994
*	AMDGPU/GlobalISel: Try generated matcher with intrinsics	Matt Arsenault	2019-07-02	2	-0/+93
\| \| \| \|	llvm-svn: 364933
*	AMDGPU/GlobalISel: Select mul	Matt Arsenault	2019-07-02	1	-0/+78
\| \| \| \|	llvm-svn: 364932
*	GlobalISel: Define GINodeEquiv for G_UMULH/G_SMULH	Matt Arsenault	2019-07-02	2	-0/+170
\| \| \| \|	llvm-svn: 364931
*	AMDGPU/GlobalISel: Fix G_GEP with mixed SGPR/VGPR operands	Matt Arsenault	2019-07-02	2	-13/+13
\| \| \| \| \| \| \| \|	The register bank for the destination of the sample argument copy was wrong. We shouldn't be constraining each source to the result register bank. Allow constraining the original register to the right size. llvm-svn: 364928
*	AMDGPU/GlobalISel: Select G_FENCE	Matt Arsenault	2019-07-02	1	-0/+719
\| \| \| \| \| \| \|	Manually select to workaround tablegen emitter emitting checks for G_CONSTANT. llvm-svn: 364927
*	GlobalISel: Add G_FENCE	Matt Arsenault	2019-07-02	1	-0/+361
\| \| \| \| \| \| \|	The pattern importer is for some reason emitting checks for G_CONSTANT for the immediate operands. llvm-svn: 364926
*	GlobalISel: Try to widen merges with other merges	Matt Arsenault	2019-07-01	1	-18/+327
\| \| \| \| \| \| \| \|	If the requested source type an be used as a merge source type, create a merge of merges. This avoids creating large, illegal extensions and bit-ops directly to the result type. llvm-svn: 364841
*	AMDGPU/GlobalISel: Handle more input argument intrinsics	Matt Arsenault	2019-07-01	6	-10/+82
\| \| \| \|	llvm-svn: 364836
*	AMDGPU/GlobalISel: Lower kernarg segment ptr intrinsics	Matt Arsenault	2019-07-01	2	-19/+125
\| \| \| \|	llvm-svn: 364835
*	AMDGPU/GlobalISel: Legalize workgroup ID intrinsics	Matt Arsenault	2019-07-01	2	-0/+221
\| \| \| \|	llvm-svn: 364834
*	AMDGPU/GlobalISel: Legalize workitem ID intrinsics	Matt Arsenault	2019-07-01	3	-2/+95
\| \| \| \| \| \| \| \| \|	Tests don't cover the masked input path since non-kernel arguments aren't lowered yet. Test is copied directly from the existing test, with 2 additions. llvm-svn: 364833
*	AMDGPU/GlobalISel: Custom lower control flow intrinsics	Matt Arsenault	2019-07-01	2	-11/+188
\| \| \| \| \| \| \| \|	Replace the brcond for the 2 cases that act as branches. For now follow how the current system works, although I think we can eventually get rid of the pseudos. llvm-svn: 364832
*	AMDGPU/GlobalISel: Handle 16-bit SALU min/max	Matt Arsenault	2019-07-01	4	-60/+372
\| \| \| \| \| \| \| \| \|	This needs to be extended to s32, and expanded into cmp+select. This is relying on the fact that widenScalar happens to leave the instruction in place, but this isn't a guaranteed property of LegalizerHelper. llvm-svn: 364831
*	AMDGPU/GlobalISel: Lower SALU min/max to cmp+select	Matt Arsenault	2019-07-01	4	-80/+256
\| \| \| \| \| \| \|	Use a change observer to apply a register bank to the newly created intermediate result register. llvm-svn: 364830
*	AMDGPU/GlobalISel: Add tests for add legalization	Matt Arsenault	2019-07-01	1	-0/+87
\| \| \| \|	llvm-svn: 364828
*	AMDGPU/GlobalISel: Legalize s16 add/sub/mul	Matt Arsenault	2019-07-01	3	-58/+519
\| \| \| \| \| \| \|	If this is scalar, promote to s32. Use a new observer class to assign the register bank of newly created registers. llvm-svn: 364827
*	AMDGPU/GlobalISel: Fix allowing non-boolean conditions for G_SELECT	Matt Arsenault	2019-07-01	2	-1068/+2232
\| \| \| \| \| \| \| \| \|	The condition register bank must be scc or vcc so that a copy will be inserted, which will be lowered to a compare. Currently greedy unnecessarily forces using a VCC select. llvm-svn: 364825
*	AMDGPU/GlobalISel: RegBankSelect for sendmsg/sendmsghalt	Matt Arsenault	2019-07-01	2	-0/+64
\| \| \| \|	llvm-svn: 364819
*	AMDGPU/GlobalISel: Legalize s16 fcmp	Matt Arsenault	2019-07-01	1	-69/+252
\| \| \| \|	llvm-svn: 364817
*	AMDGPU/GlobalISel: RegBankSelect for DS ordered add/swap	Matt Arsenault	2019-07-01	2	-0/+142
\| \| \| \|	llvm-svn: 364811
*	AMDGPU/GlobalISel: RegBankSelect for amdgcn.writelane	Matt Arsenault	2019-07-01	1	-0/+98
\| \| \| \|	llvm-svn: 364808
*	AMDGPU/GlobalISel: Complete implementation of G_GEP	Matt Arsenault	2019-07-01	3	-30/+384
\| \| \| \| \| \| \| \|	Also works around tablegen defect in selecting add with unused carry, but if we have to manually select GEP, might as well handle add manually. llvm-svn: 364806
*	AMDGPU/GlobalISel: Select G_PHI	Matt Arsenault	2019-07-01	2	-0/+416
\| \| \| \|	llvm-svn: 364805
*	AMDGPU/GlobalISel: Try to select VOP3 form of add	Matt Arsenault	2019-07-01	1	-13/+26
\| \| \| \| \| \| \| \| \| \| \|	There are several things broken, but at least emit the right thing for gfx9. The import of the pattern with the unused carry out seems to not work. Needs a special class for clamp, because OperandWithDefaultOps doesn't really work. llvm-svn: 364804
*	AMDGPU/GlobalISel: RegBankSelect for readlane/readfirstlane	Matt Arsenault	2019-07-01	2	-0/+103
\| \| \| \|	llvm-svn: 364801
*	AMDGPU/GlobalISel: Implement select for 32-bit G_ADD	Tom Stellard	2019-07-01	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: hiraditya, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58804 llvm-svn: 364797
*	AMDGPU/GlobalISel: Select G_BRCOND for vcc	Matt Arsenault	2019-07-01	1	-11/+36
\| \| \| \|	llvm-svn: 364795
*	AMDGPU/GlobalISel: Select G_FRAME_INDEX	Matt Arsenault	2019-07-01	1	-0/+38
\| \| \| \|	llvm-svn: 364789
*	AMDGPU/GlobalISel: Make s16 select legal	Matt Arsenault	2019-07-01	4	-70/+244
\| \| \| \| \| \| \|	This is easy to handle and avoids legalization artifacts which are likely to obscure combines. llvm-svn: 364787
*	AMDGPU/GlobalISel: Select G_BRCOND for scc conditions	Matt Arsenault	2019-07-01	2	-0/+194
\| \| \| \|	llvm-svn: 364786
*	AMDGPU/GlobalISel: Tolerate copies with no type set	Matt Arsenault	2019-07-01	1	-0/+56
\| \| \| \| \| \| \|	isVCC has the same bug, but isn't used in a context where it can cause a problem. llvm-svn: 364784
*	AMDGPU/GlobalISel: Select src modifiers	Matt Arsenault	2019-07-01	2	-30/+191
\| \| \| \|	llvm-svn: 364782
*	AMDGPU/GlobalISel: Fix RegBankSelect for G_FCANONICALIZE	Matt Arsenault	2019-07-01	1	-0/+35
\| \| \| \|	llvm-svn: 364768
*	AMDGPU/GlobalISel: Fix RegBankSelect for G_BUILD_VECTOR	Matt Arsenault	2019-07-01	1	-0/+69
\| \| \| \|	llvm-svn: 364767
*	AMDGPU/GlobalISel: Fail on store to 32-bit address space	Matt Arsenault	2019-07-01	1	-3/+3
\| \| \| \|	llvm-svn: 364766