bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU/GlobalISel: Fix import of s_abs_i32 pattern	Matt Arsenault	2020-01-07	1	-1/+1
\|
*	AMDGPU/GlobalISel: Select llvm.amdgcn.wqm.vote	Matt Arsenault	2020-01-07	1	-2/+2
\|
*	AMDGPU: Only allow regs for s_movrel_{b32\|b64}	Matt Arsenault	2020-01-03	1	-2/+13
\| \| \| \| \|	This would incorrectly allowing folding immediates. These currently aren't selectable, but will be from GlobalISel soon.
*	[AMDGPU] deduplicate tablegen predicates	Stanislav Mekhanoshin	2019-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We are duplicating predicates if several parts of the combined predicate list contain the same condition. Added code to deduplicate the list. We have AssemblerPredicates and AssemblerPredicate in the PredicateControl, but we never use AssemblerPredicates with an actual list, so this one is dropped. This addresses the first part of the llvm bug 43886: https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69815
*	AMDGPU/GlobalISel: Allow selection of scalar min/max	Matt Arsenault	2019-09-21	1	-4/+4
\| \| \| \| \| \| \| \| \|	I believe all of the uniform/divergent pattern predicates are redundant and can be removed. The uniformity bit already influences the register class, and nothhing has broken when I've removed this and others. llvm-svn: 372450
*	Reapply r372285 "GlobalISel: Don't materialize immarg arguments to intrinsics"	Matt Arsenault	2019-09-19	1	-7/+7
\| \| \| \| \| \| \| \| \|	This reverts r372314, reapplying r372285 and the commits which depend on it (r372286-r372293, and r372296-r372297) This was missing one switch to getTargetConstant in an untested case. llvm-svn: 372338
*	Revert r372285 "GlobalISel: Don't materialize immarg arguments to intrinsics"	Hans Wennborg	2019-09-19	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This broke the Chromium build, causing it to fail with e.g. fatal error: error in backend: Cannot select: t362: v4i32 = X86ISD::VSHLI t392, Constant:i8<15> See llvm-commits thread of r372285 for details. This also reverts r372286, r372287, r372288, r372289, r372290, r372291, r372292, r372293, r372296, and r372297, which seemed to depend on the main commit. > Encode them directly as an imm argument to G_INTRINSIC. > > Since now intrinsics can now define what parameters are required to be > immediates, avoid using registers for them. Intrinsics could > potentially want a constant that isn't a legal register type. Also, > since G_CONSTANT is subject to CSE and legalization, transforms could > potentially obscure the value (and create extra work for the > selector). The register bank of a G_CONSTANT is also meaningful, so > this could throw off future folding and legalization logic for AMDGPU. > > This will be much more convenient to work with than needing to call > getConstantVRegVal and checking if it may have failed for every > constant intrinsic parameter. AMDGPU has quite a lot of intrinsics wth > immarg operands, many of which need inspection during lowering. Having > to find the value in a register is going to add a lot of boilerplate > and waste compile time. > > SelectionDAG has always provided TargetConstant for constants which > should not be legalized or materialized in a register. The distinction > between Constant and TargetConstant was somewhat fuzzy, and there was > no automatic way to force usage of TargetConstant for certain > intrinsic parameters. They were both ultimately ConstantSDNode, and it > was inconsistently used. It was quite easy to mis-select an > instruction requiring an immediate. For SelectionDAG, start emitting > TargetConstant for these arguments, and using timm to match them. > > Most of the work here is to cleanup target handling of constants. Some > targets process intrinsics through intermediate custom nodes, which > need to preserve TargetConstant usage to match the intrinsic > expectation. Pattern inputs now need to distinguish whether a constant > is merely compatible with an operand or whether it is mandatory. > > The GlobalISelEmitter needs to treat timm as a special case of a leaf > node, simlar to MachineBasicBlock operands. This should also enable > handling of patterns for some G_ instructions with immediates, like > G_FENCE or G_EXTRACT. > > This does include a workaround for a crash in GlobalISelEmitter when > ARM tries to uses "imm" in an output with a "timm" pattern source. llvm-svn: 372314
*	GlobalISel: Don't materialize immarg arguments to intrinsics	Matt Arsenault	2019-09-19	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Encode them directly as an imm argument to G_INTRINSIC. Since now intrinsics can now define what parameters are required to be immediates, avoid using registers for them. Intrinsics could potentially want a constant that isn't a legal register type. Also, since G_CONSTANT is subject to CSE and legalization, transforms could potentially obscure the value (and create extra work for the selector). The register bank of a G_CONSTANT is also meaningful, so this could throw off future folding and legalization logic for AMDGPU. This will be much more convenient to work with than needing to call getConstantVRegVal and checking if it may have failed for every constant intrinsic parameter. AMDGPU has quite a lot of intrinsics wth immarg operands, many of which need inspection during lowering. Having to find the value in a register is going to add a lot of boilerplate and waste compile time. SelectionDAG has always provided TargetConstant for constants which should not be legalized or materialized in a register. The distinction between Constant and TargetConstant was somewhat fuzzy, and there was no automatic way to force usage of TargetConstant for certain intrinsic parameters. They were both ultimately ConstantSDNode, and it was inconsistently used. It was quite easy to mis-select an instruction requiring an immediate. For SelectionDAG, start emitting TargetConstant for these arguments, and using timm to match them. Most of the work here is to cleanup target handling of constants. Some targets process intrinsics through intermediate custom nodes, which need to preserve TargetConstant usage to match the intrinsic expectation. Pattern inputs now need to distinguish whether a constant is merely compatible with an operand or whether it is mandatory. The GlobalISelEmitter needs to treat timm as a special case of a leaf node, simlar to MachineBasicBlock operands. This should also enable handling of patterns for some G_ instructions with immediates, like G_FENCE or G_EXTRACT. This does include a workaround for a crash in GlobalISelEmitter when ARM tries to uses "imm" in an output with a "timm" pattern source. llvm-svn: 372285
*	AMDGPU/GlobalISel: Select G_CTPOP	Matt Arsenault	2019-09-13	1	-1/+3
\| \| \| \|	llvm-svn: 371798
*	[AMDGPU] Mark s_barrier as having side effects but not accessing memory.	Jay Foad	2019-09-06	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes poor scheduling in a function containing a barrier and a few load instructions. Without this fix, ScheduleDAGInstrs::buildSchedGraph adds an artificial edge in the dependency graph from the barrier instruction to the exit node representing live-out latency, with a latency of about 500 cycles. Because of this it thinks the critical path through the graph also has a latency of about 500 cycles. And because of that it does not think that any of the load instructions are on the critical path, so it schedules them with no regard for their (80 cycle) latency, which gives poor results. Reviewers: arsenm, dstuttard, tpr, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67218 llvm-svn: 371192
*	Re-commit: [AMDGPU] Use S_DENORM_MODE for gfx10	Austin Kerbow	2019-08-06	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: During fdiv32 lowering use S_DENORM_MODE to select denorm mode in gfx10. Reviewers: arsenm, rampitec Reviewed By: arsenm, rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65620 llvm-svn: 367969
*	Revert "[AMDGPU] Use S_DENORM_MODE for gfx10"	Dmitri Gribenko	2019-08-05	1	-4/+1
\| \| \| \| \| \| \|	This reverts commit r367882. It broke the test MC/Disassembler/AMDGPU/gfx10_dasm_all.txt. llvm-svn: 367904
*	[AMDGPU] Use S_DENORM_MODE for gfx10	Austin Kerbow	2019-08-05	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: During fdiv32 lowering use S_DENORM_MODE to select denorm mode in gfx10. Reviewers: arsenm, rampitec Reviewed By: arsenm, rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65620 llvm-svn: 367882
*	AMDGPU: Use tablegen pattern for sendmsg intrinsics	Matt Arsenault	2019-08-01	1	-4/+3
\| \| \| \| \| \| \|	Since this now emits a direct copy to m0, SIFixSGPRCopies has to handle a physical register. llvm-svn: 367593
*	AMDGPU: Redefine setcc condition PatLeafs	Matt Arsenault	2019-07-19	1	-3/+3
\| \| \| \| \| \|	Avoid using custom code predicates. llvm-svn: 366609
*	AMDGPU/GlobalISel: Select G_ASHR	Matt Arsenault	2019-07-16	1	-2/+2
\| \| \| \|	llvm-svn: 366257
*	AMDGPU/GlobalISel: Select G_LSHR	Matt Arsenault	2019-07-16	1	-2/+2
\| \| \| \|	llvm-svn: 366256
*	AMDGPU/GlobalISel: Select G_SHL	Matt Arsenault	2019-07-16	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	I think this manages to not break the DAG handling with the divergent predicates because the stadalone divergent patterns end up with a higher priority than the pattern on the instruction definition. The 16-bit versions don't work yet. llvm-svn: 366254
*	AMDGPU: s_waitcnt field should be treated as unsigned	Matt Arsenault	2019-07-11	1	-1/+1
\| \| \| \| \| \| \|	Also make it an ImmLeaf, so it should work with global isel as well, which was part of the point of moving it in the first place. llvm-svn: 365842
*	[AMDGPU] Created a sub-register class for the return address operand in the ↵	Christudasan Devadasan	2019-07-09	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	return instruction. Function return instruction lowering, currently uses the fixed register pair s[30:31] for holding the return address. It can be any SGPR pair other than the CSRs. Created an SGPR pair sub-register class exclusive of the CSRs, and used this regclass while lowering the return instruction. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D63924 llvm-svn: 365512
*	AMDGPU: Move waitcnt intrinsic to instruction definition pattern	Matt Arsenault	2019-07-08	1	-12/+2
\| \| \| \|	llvm-svn: 365349
*	[AMDGPU] Fix for branch offset hardware workaround	Ryan Taylor	2019-06-26	1	-15/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes a hardware bug that makes a branch offset of 0x3f unsafe. This replaces the 32 bit branch with offset 0x3f to a 64 bit instruction that includes the same 32 bit branch and the encoding for a s_nop 0 to follow. The relaxer than modifies the offsets accordingly. Change-Id: I10b7aed99d651f8159401b01bb421f105fa6288e Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63494 llvm-svn: 364451
*	[AMDGPU] gfx10 wave32 patterns	Stanislav Mekhanoshin	2019-06-18	1	-3/+15
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D63511 llvm-svn: 363729
*	AMDGPU: Set isTrap on S_TRAP	Matt Arsenault	2019-06-14	1	-1/+4
\| \| \| \| \| \| \|	This seems to only be used for generating some kind of documentation, but might as well set it. llvm-svn: 363454
*	AMDGPU: Fix printing trailing whitespace after s_endpgm	Matt Arsenault	2019-06-14	1	-1/+1
\| \| \| \|	llvm-svn: 363384
*	[AMDGPU] gfx1010 base changes for wave32	Stanislav Mekhanoshin	2019-06-13	1	-0/+32
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D63293 llvm-svn: 363299
*	AMDGPU: Temporary drop s_mul_hi_i/u32 patterns	Konstantin Zhuravlyov	2019-05-28	1	-6/+2
\| \| \| \| \| \| \| \|	It introduces performance regressions in several applications. This has already been submitted downstream. llvm-svn: 361879
*	[AMDGPU][MC] Enabled labels with s_call_b64 and s_cbranch_i_fork	Dmitry Preobrazhensky	2019-05-17	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	See https://bugs.llvm.org/show_bug.cgi?id=41888 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D62016 llvm-svn: 361040
*	[AMDGPU] gfx1010 SOP instructions	Stanislav Mekhanoshin	2019-04-24	1	-131/+305
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D61080 llvm-svn: 359139
*	[AMDGPU] Sort out and rename multiple CI/VI predicates	Stanislav Mekhanoshin	2019-04-06	1	-3/+3
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D60346 llvm-svn: 357835
*	[AMDGPU] predicate and feature refactoring	Stanislav Mekhanoshin	2019-04-05	1	-17/+16
\| \| \| \| \| \| \| \| \|	We have done some predicate and feature refactoring lately but did not upstream it. This is to sync. Differential revision: https://reviews.llvm.org/D60292 llvm-svn: 357791
*	[AMDGPU] Enable code selection using `s_mul_hi_u32`/`s_mul_hi_i32`.	Michael Liao	2019-03-18	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59501 llvm-svn: 356405
*	[AMDGPU] Add support for immediate operand for S_ENDPGM	David Stuttard	2019-03-12	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add support for immediate operand in S_ENDPGM Change-Id: I0c56a076a10980f719fb2a8f16407e9c301013f6 Reviewers: alexshap Subscribers: qcolombet, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, tpr, t-tye, eraman, arphaman, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59213 llvm-svn: 355902
*	[AMDGPU][MC][GFX8+] Added syntactic sugar for 'vgpr index' operand of ↵	Dmitry Preobrazhensky	2019-02-27	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	instructions s_set_gpr_idx_on and s_set_gpr_idx_mode See bug 39331: https://bugs.llvm.org/show_bug.cgi?id=39331 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D58288 llvm-svn: 354969
*	AMDGPU: Correct definitions for bitset instructions	Matt Arsenault	2019-02-25	1	-12/+18
\| \| \| \| \| \| \|	These really read and write the result register, so these need a tied input. llvm-svn: 354809
*	Revert "AMDGPU/NFC: Cleanup subtarget predicates"	Konstantin Zhuravlyov	2019-02-22	1	-12/+12
\| \| \| \| \| \| \|	It breaks one of our downstream merges, so revert it temporarily while investigating failures downstream llvm-svn: 354700
*	AMDGPU/NFC: Cleanup subtarget predicates	Konstantin Zhuravlyov	2019-02-21	1	-12/+12
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D58522 llvm-svn: 354620
*	AMDGPU: Remove GCN features and predicates	Matt Arsenault	2019-02-08	1	-4/+0
\| \| \| \| \| \| \|	These are no longer necessary since the R600 tablegen files are split out now. llvm-svn: 353548
*	Update the file headers across all of the LLVM projects in the monorepo	Chandler Carruth	2019-01-19	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636
*	[AMDGPU][MC] Disabled use of 2 different literals with SOP2/SOPC instructions	Dmitry Preobrazhensky	2019-01-18	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	See bug 39319: https://bugs.llvm.org/show_bug.cgi?id=39319 Reviewers: artem.tamazov, arsenm, rampitec Differential Revision: https://reviews.llvm.org/D56847 llvm-svn: 351549
*	[AMDGPU] Add and update scalar instructions	Graham Sellers	2018-11-29	1	-8/+37
\| \| \| \| \| \| \| \| \|	This patch adds support for S_ANDN2, S_ORN2 32-bit and 64-bit instructions and adds splits to move them to the vector unit (for which there is no equivalent instruction). It modifies the way that the more complex scalar instructions are lowered to vector instructions by first breaking them down to sequences of simpler scalar instructions which are then lowered through the existing code paths. The pattern for S_XNOR has also been updated to apply inversion to one input rather than the output of the XOR as the result is equivalent and may allow leaving the NOT instruction on the scalar unit. A new tests for NAND, NOR, ANDN2 and ORN2 have been added, and existing tests now hit the new instructions (and have been modified accordingly). Differential: https://reviews.llvm.org/D54714 llvm-svn: 347877
*	[AMDGPU] Divergence driven instruction selection. Shift operations.	Alexander Timofeev	2018-10-01	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	Summary: This change enables VOP3 shifts to be explicitly selected dependent on the divergence. Differential Revision: https://reviews.llvm.org/D52559 Reviewers: rampitec llvm-svn: 343455
*	[AMDGPU] Divergence driven instruction selection. Part 1.	Alexander Timofeev	2018-09-21	1	-18/+27
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change is the first part of the AMDGPU target description change. The aim of it is the effective splitting the vector and scalar flows at the selection stage. Selection uses predicate functions based on the framework implemented earlier - https://reviews.llvm.org/D35267 Differential revision: https://reviews.llvm.org/D52019 Reviewers: rampitec llvm-svn: 342719
*	[AMDGPU][MC][GFX9] Added instructions s_mul_hi_32, s_lshl_add_u32	Dmitry Preobrazhensky	2018-04-09	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \|	See bugs 36841: https://bugs.llvm.org/show_bug.cgi?id=36841 36842: https://bugs.llvm.org/show_bug.cgi?id=36842 Differential Revision: https://reviews.llvm.org/D45251 Reviewers: artem.tamazov, arsenm, timcorringham llvm-svn: 329562
*	[AMDGPU][MC][GFX9] Added s_call_b64	Dmitry Preobrazhensky	2018-04-06	1	-0/+12
\| \| \| \| \| \| \| \| \|	See bug 36843: https://bugs.llvm.org/show_bug.cgi?id=36843 Differential Revision: https://reviews.llvm.org/D45268 Reviewers: artem.tamazov, arsenm, timcorringham llvm-svn: 329440
*	[AMDGPU][MC][GFX9] Added instruction s_endpgm_ordered_ps_done	Dmitry Preobrazhensky	2018-04-06	1	-0/+7
\| \| \| \| \| \| \| \| \|	See bug 36844: https://bugs.llvm.org/show_bug.cgi?id=36844 Differential Revision: https://reviews.llvm.org/D45313 Reviewers: artem.tamazov, arsenm, timcorringham llvm-svn: 329430
*	[AMDGPU][MC][GFX9] Added instructions saveexec, wrexec and bitreplicate	Dmitry Preobrazhensky	2018-04-06	1	-0/+21
\| \| \| \| \| \| \| \| \|	See bug 36840: https://bugs.llvm.org/show_bug.cgi?id=36840 Differential Revision: https://reviews.llvm.org/D45250 Reviewers: artem.tamazov, arsenm, timcorringham llvm-svn: 329419
*	AMDGPU: Introduce common SOP_Pseudo and VOP_Pseudo TableGen base classes	Nicolai Haehnle	2018-03-26	1	-18/+20
\| \| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D44820 Change-Id: I732979e2964006aa15d78a333d8886e6855f319a llvm-svn: 328496
*	AMDGPU: Add llvm.amdgcn.wqm.vote intrinsic	Marek Olsak	2017-10-24	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D38543 llvm-svn: 316426
*	Implement custom lowering for ISD::CTTZ_ZERO_UNDEF and ISD::CTTZ.	Wei Ding	2017-10-12	1	-2/+3
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D37348 llvm-svn: 315610