bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[AMDGPU][NFC] Simplify test file for amdgcn intrinsics	Hideto Ueno	2019-07-23	1	-5/+5
	Summary: Remove unchecked attribute in the call site and use FileCheck String Substitution for `convergent` check. Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64901 llvm-svn: 366781
*	AMDGPU: Fold readlane/readfirstlane calls	Matt Arsenault	2019-06-17	1	-0/+125
	llvm-svn: 363587
*	AMDGPU: Mark exp/exp.compr as inaccessiblememonly	Matt Arsenault	2019-06-17	1	-17/+21
	Should also be marked writeonly, but I think that would require splitting the version with done set to a separate intrinsic. Test change is only from renumbering the attribute group numbers, which for some reason the generated check lines consider. llvm-svn: 363560
*	AMDGPU: Fold readlane intrinsics of constants	Matt Arsenault	2019-06-14	1	-0/+56
	I'm not 100% sure about this, since I'm worried about IR transforms that might end up introducing divergence downstream once replaced with a constant, but I haven't come up with an example yet. llvm-svn: 363406
*	[AMDGPU] gfx1010 wave32 icmp/fcmp intrinsic changes for wave32	Stanislav Mekhanoshin	2019-06-13	1	-138/+136
	Differential Revision: https://reviews.llvm.org/D63301 llvm-svn: 363339
*	[NFC][InstCombine] Add unary FNeg tests to AMDGPU/amdgcn-intrinsics.ll	Cameron McInally	2019-05-31	1	-0/+21
	llvm-svn: 362255
*	[InstCombine] auto-generate test checks; NFC	Sanjay Patel	2019-05-20	1	-550/+907
	llvm-svn: 361181
*	Revert "Temporarily Revert "Add basic loop fusion pass.""	Eric Christopher	2019-04-17	1	-0/+2098
	The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552
*	Temporarily Revert "Add basic loop fusion pass."	Eric Christopher	2019-04-17	1	-2098/+0
	As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546
*	IR: Add immarg attribute	Matt Arsenault	2019-03-12	1	-41/+6
	This indicates an intrinsic parameter is required to be a constant, and should not be replaced with a non-constant value. Add the attribute to all AMDGPU and generic intrinsics that comments indicate it should apply to. I scanned other target intrinsics, but I don't see any obvious comments indicating which arguments are intended to be only immediates. This breaks one questionable testcase for the autoupgrade. I'm unclear on whether the autoupgrade is supposed to really handle declarations which were never valid. The verifier fails because the attributes now refer to a parameter past the end of the argument list. llvm-svn: 355981
*	AMDGPU: Add a fast path for icmp.i1(src, false, NE)	Marek Olsak	2019-01-15	1	-0/+193
	Summary: This allows moving the condition from the intrinsic to the standard ICmp opcode, so that LLVM can do simplifications on it. The icmp.i1 intrinsic is an identity for retrieving the SGPR mask. We can also get the mask from `and i1`, `or i1`, and `xor i1`. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D52060 llvm-svn: 351150
*	InstCombine: Avoid introducing poison values when lowering llvm.amdgcn.[us]bfe	Tom Stellard	2018-11-08	1	-23/+7
	Summary: When the 3rd argument to these intrinsics is zero, lowering them to shift instructions produces poison values, since we end up with shift amounts equal to the number of bits in the shifted value. This means we can only lower these intrinsics if we can prove that the 3rd argument is not zero. Reviewers: arsenm Reviewed By: arsenm Subscribers: bnieuwenhuizen, jvesely, wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D53739 llvm-svn: 346422
*	AMDGPU: Remove nan tests in class if src is nnan	Matt Arsenault	2018-08-28	1	-0/+33
	llvm-svn: 340850
*	AMDGPU: Stop producing icmp/fcmp intrinsics with invalid types	Matt Arsenault	2018-08-15	1	-0/+149
	llvm-svn: 339815
*	AMDGPU: Turn class x, p_zero|n_zero into fcmp oeq x, 0	Matt Arsenault	2018-08-10	1	-0/+7
	The library does use this for some reason. llvm-svn: 339461
*	Fix asserts in AMDGCN fmed3 folding by handling more cases of NaN	Matt Arsenault	2018-07-05	1	-3/+45
	Better NaN handling for AMDGCN fmed3. All operands are checked for NaN now. The checks were moved before the canonicalization to provide a better mapping from fclamp. Changed the behaviour of fmed3(x,y,NaN) to return max(x,y) instead of min(x,y) in light of this. Updated tests as a result and added some new cases to cover the fix. Patch by Alan Baker llvm-svn: 336375
*	[InstCombine] Recommit: Fold (x << y) >> y -> x & (-1 >> y)	Roman Lebedev	2018-06-15	1	-8/+8
	Summary: We already do it for splat constants, but not just values. Also, undef cases are mostly non-functional. The original commit was reverted because it broke tests for the AMDGPU backend, which I didn't check. Now the backend has been updated to recognize these new patterns, so we are good. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX Reviewers: spatel, craig.topper, mareko, bogner, rampitec, nhaehnle, arsenm Reviewed By: spatel, rampitec, nhaehnle Subscribers: wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D47980 llvm-svn: 334818
*	[AMDGPU] Optimize old value of v_mov_b32_dpp	Stanislav Mekhanoshin	2018-05-22	1	-0/+29
	We can eliminate the old value if bound_ctrl = 1 and row_mask = bank_mask = 0xf. This is an alternative implementation working with the intrinsic in InstCombine. Original review for past-ISel optimization: D46570. Differential Revision: https://reviews.llvm.org/D46596 llvm-svn: 332956
*	AMDGPU: Add intrinsics llvm.amdgcn.cvt.{pknorm.i16, pknorm.u16, pk.i16, pk.u16}	Marek Olsak	2018-01-31	1	-0/+108
	Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D41663 llvm-svn: 323908
*	AMDGPU: Add new intrinsic llvm.amdgcn.kill(i1)	Marek Olsak	2017-10-24	1	-0/+15
	Summary: Kill the thread if operand 0 == false. llvm.amdgcn.wqm.vote can be applied to the operand. Also allow kill in all shader stages. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D38544 llvm-svn: 316427
*	AMDGPU: Add llvm.amdgcn.wqm.vote intrinsic	Marek Olsak	2017-10-24	1	-0/+33
	Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D38543 llvm-svn: 316426
*	InstCombine: Move tests that use target intrinsics into subdirectories	Justin Bogner	2017-05-13	1	-0/+1540
	Tests with target intrinsics are inherently target specific, so it doesn't actually make sense to run them if we've excluded their target. llvm-svn: 302979
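
The fold named in the 2018-06-15 entry, (x << y) >> y -> x & (-1 >> y), can be checked numerically. The following is a minimal sketch rather than anything taken from the LLVM sources: it models i32 values as unsigned Python integers, treats >> as a logical shift, and the helper names (shl_then_lshr, masked, check_fold) are hypothetical. Shift amounts are kept below 32 because a shift by the full bit width yields poison in LLVM IR, which is the same hazard the 2018-11-08 entry describes for llvm.amdgcn.[us]bfe.

```python
MASK32 = 0xFFFFFFFF  # model i32 values as unsigned 32-bit integers


def shl_then_lshr(x: int, y: int) -> int:
    """(x << y) >> y with i32 wraparound and a logical right shift."""
    return ((x << y) & MASK32) >> y


def masked(x: int, y: int) -> int:
    """x & (-1 >> y), where -1 is the all-ones i32 and >> is logical."""
    return x & (MASK32 >> y)


def check_fold(samples) -> bool:
    """Return True if both forms agree for every sample and every y in [0, 31]."""
    # y == 32 is deliberately excluded: shifting an i32 by 32 is poison in LLVM IR.
    return all(
        shl_then_lshr(x, y) == masked(x, y)
        for x in samples
        for y in range(32)
    )
```

Calling check_fold on a handful of 32-bit values compares the two forms for every valid shift amount; it is a spot check on sample inputs, not a proof of the transform.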