bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[AMDGPU] set read_only access qualifier for pointers	Stanislav Mekhanoshin	2017-04-14	1	-3/+8
\| \| \| \| \| \| \| \| \| \|	If a kernel's pointer argument is known to be readonly set access qualifier accordingly. This allows RT not to flush caches before dispatches. Differential Revision: https://reviews.llvm.org/D32091 llvm-svn: 300362
*	[AMDGPU][MC] Corrected ds_write_src2_* to require one offset instead of two.	Dmitry Preobrazhensky	2017-04-14	1	-14/+2
\| \| \| \| \| \| \| \| \| \|	Fixed bug 32551: https://bugs.llvm.org//show_bug.cgi?id=32551 Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31809 llvm-svn: 300319
*	[AMDGPU][MC] Enabled constants for src operands of s_cbranch_g_fork	Dmitry Preobrazhensky	2017-04-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Fixed bug 32619: https://bugs.llvm.org//show_bug.cgi?id=32619 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D31973 llvm-svn: 300318
*	[AMDGPU] added SIInstrInfo::getAddNoCarry() helper	Stanislav Mekhanoshin	2017-04-14	4	-23/+44
\| \| \| \| \| \| \| \|	Addressed rest of post submit comments from D31993. Differential Revision: https://reviews.llvm.org/D32057 llvm-svn: 300288
*	AMDGPU/GFX9: Do not use v_pack_b32_f16 when packing	Konstantin Zhuravlyov	2017-04-13	1	-29/+15
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31819 llvm-svn: 300275
*	[IR] Make getParamAttributes take argument numbers, not ArgNo+1	Reid Kleckner	2017-04-13	2	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Add hasParamAttribute() and use it instead of hasAttribute(ArgNo+1, Kind) everywhere. The fact that the AttributeList index for an argument is ArgNo+1 should be a hidden implementation detail. NFC llvm-svn: 300272
*	Fix -Wunused-value warning	Reid Kleckner	2017-04-13	1	-6/+6
\| \| \| \|	llvm-svn: 300254
*	[AMDGPU] Combine DS operations with offsets bigger than byte	Stanislav Mekhanoshin	2017-04-13	1	-150/+166
\| \| \| \| \| \| \| \| \|	In many cases ds operations can be combined even if offsets do not fit into 8 bit encoding. What it takes is to adjust base address. Differential Revision: https://reviews.llvm.org/D31993 llvm-svn: 300227
*	AMDGPU : Fix common dominator of two incoming blocks terminates with uniform ↵	Wei Ding	2017-04-12	1	-2/+24
\| \| \| \| \| \| \| \|	branch issue. Differential Revision: http://reviews.llvm.org/D31350 llvm-svn: 300142
*	AMDGPU: Fix invalid copies when copying i1 to phys reg	Matt Arsenault	2017-04-12	3	-4/+30
\| \| \| \| \| \| \|	Insert a VReg_1 virtual register so the i1 workaround pass can handle it. llvm-svn: 300113
*	[AMDGPU] Generate range metadata for workitem id	Stanislav Mekhanoshin	2017-04-12	6	-24/+118
\| \| \| \| \| \| \| \| \|	If workgroup size is known inform llvm about range returned by local id and local size queries. Differential Revision: https://reviews.llvm.org/D31804 llvm-svn: 300102
*	[AMDGPU][MC] Added support for several VI-specific opcodes (s_wakeup, etc)	Dmitry Preobrazhensky	2017-04-12	3	-1/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Added support for VI: - s_endpgm_saved - s_wakeup - s_rfe_restore_b64 - v_perm_b32 Enabled for VI: - v_mov_fed_b32 - v_mov_fed_b32_e64 See bug 32593: https://bugs.llvm.org//show_bug.cgi?id=32593 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D31931 llvm-svn: 300076
*	[AMDGPU][MC] Corrected parsing of v_cmp_class* and v_cmpx_class*	Dmitry Preobrazhensky	2017-04-12	2	-2/+4
\| \| \| \| \| \| \| \| \| \|	Fixed bug 32565: https://bugs.llvm.org//show_bug.cgi?id=32565 Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31820 llvm-svn: 300073
*	[AMDGPU][MC] Corrected encoding of V_MQSAD_U32_U8 for CI	Dmitry Preobrazhensky	2017-04-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Corrected encoding of V_MQSAD_U32_U8 for CI See bug 32552: https://bugs.llvm.org//show_bug.cgi?id=32552 Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31810 llvm-svn: 300070
*	[AMDGPU][MC] Corrected ds_wrxchg2* to support two offsets	Dmitry Preobrazhensky	2017-04-12	1	-7/+21
\| \| \| \| \| \| \| \| \| \|	Fixed bug 28227: https://bugs.llvm.org//show_bug.cgi?id=28227 Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31808 llvm-svn: 300066
*	[AMDGPU][MC] Corrected src0 size for s_cbranch_join	Dmitry Preobrazhensky	2017-04-12	1	-1/+7
\| \| \| \| \| \| \| \| \| \|	Fix for bug 28159: https://bugs.llvm.org//show_bug.cgi?id=28159 Reviewers: vpykhtin, arsenm Differential Revision: https://reviews.llvm.org/D31595 llvm-svn: 300055
*	[AMDGPU] SDWA: make pass global	Sam Kolton	2017-04-12	1	-183/+175
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Remove checks for basic blocks. Reviewers: vpykhtin, rampitec, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31935 llvm-svn: 300040
*	[AMDGPU] Add a new pass to insert waitcnts. Leave under an option for testing.	Kannan Narayanan	2017-04-12	5	-1/+1881
\| \| \| \| \| \|	Based on comments in https://reviews.llvm.org/D31161. llvm-svn: 300023
*	AMDGPU: Insert wait at start of callee functions	Matt Arsenault	2017-04-11	1	-0/+14
\| \| \| \|	llvm-svn: 300000
*	AMDGPU: Refactor SIMachineFunctionInfo slightly	Matt Arsenault	2017-04-11	3	-16/+38
\| \| \| \| \| \|	Prepare for handling non-entry functions. llvm-svn: 299999
*	AMDGPU: Refactor argument lowering	Matt Arsenault	2017-04-11	10	-276/+375
\| \| \| \| \| \| \|	Split into smaller functions and prepare for handling non-entry functions. llvm-svn: 299998
*	AMDGPU: Fix folding reg_sequence into copy to phys reg	Matt Arsenault	2017-04-11	1	-0/+4
\| \| \| \| \| \| \|	This was producing an illegal reg_sequence defining a physical register with virtual register inputs. llvm-svn: 299997
*	AMDGPU: Prune unecessary include	Matt Arsenault	2017-04-11	1	-2/+0
\| \| \| \|	llvm-svn: 299996
*	[AMDGPU] Add A5 to data layout for amdgiz environment	Yaxun Liu	2017-04-11	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31589 llvm-svn: 299964
*	Remove unused functions. Remove static qualifier from functions in header ↵	Vassil Vassilev	2017-04-11	1	-10/+0
\| \| \| \| \| \|	files. NFC. llvm-svn: 299947
*	AMDGPU: Fix crash when disassembling VOP3 mac	Matt Arsenault	2017-04-10	10	-19/+23
\| \| \| \| \| \| \| \| \| \| \| \|	The unused dummy src2_modifiers is missing, so it crashes when trying to print it. I tried to fully remove src2_modifiers, but there are some irritations in the places where it is converted to mad since it starts to require modifying use lists while iterating over them. llvm-svn: 299861
*	AMDGPU: Actually write nops for writeNopData	Matt Arsenault	2017-04-08	1	-1/+14
\| \| \| \| \| \| \|	Before this was just writing 0s, which ends up looking like a v_cndmask_b32 v0, s0, v0, vcc. Write out an encoded s_nop instead. llvm-svn: 299816
*	[AMDGPU] Unroll more to eliminate phis and conditions	Stanislav Mekhanoshin	2017-04-07	1	-2/+52
\| \| \| \| \| \| \| \| \| \| \| \| \|	Increase threshold to unroll a loop which contains an "if" statement whose condition defined by a PHI belonging to the loop. This may help to eliminate if region and potentially even PHI itself, saving on both divergence and registers used for the PHI. Add a small bonus for each of such "if" statements. Differential Revision: https://reviews.llvm.org/D31693 llvm-svn: 299779
*	[AMDGPU][MC] Fix for Bug 28211 + LIT tests	Dmitry Preobrazhensky	2017-04-07	2	-36/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- corrected DS_GWS_* opcodes (see VI_Shader_Programming#16.pdf for detailed description) - address operand is not used - several opcodes have data operand - all opcodes have offset modifier - DS_AND_SRC2_B32: corrected typo in mnemo - DS_WRAP_RTN_F32 replaced with DS_WRAP_RTN_B32 - added CI/VI opcodes: - DS_CONDXCHG32_RTN_B64 - DS_GWS_SEMA_RELEASE_ALL - added VI opcodes: - DS_CONSUME - DS_APPEND - DS_ORDERED_COUNT Differential Revision: https://reviews.llvm.org/D31707 llvm-svn: 299767
*	[AMDGPU] Move SiShrinkInstruction and SDWAPeephole to SSAOptimization passes	Sam Kolton	2017-04-07	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Difference beetween PreRegAlloc() and MachineSSAOptimization() are that the former is run despite of -O0 optimization level. In my undestanding SiShrinkInstructions and SDWAPeephole shouldn't run when optimizations are disabled. With this change order of passes will not change. Reviewers: arsenm, vpykhtin, rampitec Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31705 llvm-svn: 299757
*	AMDGPU/GFX9: Fix shared and private aperture queries	Konstantin Zhuravlyov	2017-04-06	3	-14/+35
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31786 llvm-svn: 299727
*	AMDGPU: Diagnose illegal SGPR to VGPR copies	Matt Arsenault	2017-04-06	2	-3/+40
\| \| \| \| \| \| \| \| \| \|	This is possible in ways that are not compiler bugs, so stop asserting on them. This emits an extra error when emitting objects when it can't encode the new pseudo, but I'm not sure that matters. llvm-svn: 299712
*	AMDGPU: Replace fp16SrcZerosHighBits with a whitelist	Matt Arsenault	2017-04-06	1	-4/+50
\| \| \| \| \| \| \|	FCOPYSIGN is lowered to bit operations which don't clear the high bits. llvm-svn: 299708
*	[AMDGPU] Temporarily change constant address space from 4 to 2	Yaxun Liu	2017-04-06	4	-10/+8
\| \| \| \| \| \| \| \| \| \|	Our final address space mapping is to let constant address space to be 4 to match nvptx. However for now we will make it 2 to avoid unnecessary work in FE/BE/devlib about intrinsics returning constant pointers. Differential Revision: https://reviews.llvm.org/D31770 llvm-svn: 299690
*	AMDGPU: Stop using CCAssignToRegWithShadow	Matt Arsenault	2017-04-06	3	-30/+36
\| \| \| \| \| \| \|	This does not do what it is attempting to use it for and requires working around in LowerFormalArguments. llvm-svn: 299667
*	[AMDGPU] Eliminate barrier if workgroup size is not greater than wavefront size	Stanislav Mekhanoshin	2017-04-06	1	-0/+11
\| \| \| \| \| \| \| \| \| \|	If a workgroup size is known to be not greater than wavefront size the s_barrier instruction is not needed since all threads are guarantied to come to the same point at the same time. Differential Revision: https://reviews.llvm.org/D31731 llvm-svn: 299659
*	[AMDGPU] Resubmit SDWA peephole: enable by default	Sam Kolton	2017-04-06	2	-6/+5
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, rampitec, arsenm Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31671 llvm-svn: 299654
*	Revert r299536. [AMDGPU] SDWA peephole: enable by default.	Ivan Krasin	2017-04-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Reason: breaks multiple bots: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/3988 http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/1173 Original Review URL: https://reviews.llvm.org/D31671 llvm-svn: 299583
*	[AMDGPU][MC] Fix for Bug 28158 + LIT tests	Dmitry Preobrazhensky	2017-04-05	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Added support of the following instructions: - s_cbranch_cdbgsys - s_cbranch_cdbgsys_and_user - s_cbranch_cdbgsys_or_user - s_cbranch_cdbguser - s_setkill Reviewers: vpykhtin Differential Revision: https://reviews.llvm.org/D31469 llvm-svn: 299567
*	[AMDGPU][MC] Fix for Bug 28167 + LIT tests	Dmitry Preobrazhensky	2017-04-05	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Corrected src0 for v_writelane_b32: - Enabled inline constants and literals for SI/CI (VOP2) - Enabled inline constants for VI (VOP3) Reviewers: vpykhtin, arsenm https://reviews.llvm.org/D31463 llvm-svn: 299555
*	[AMDGPU] SDWA peephole: enable by default	Sam Kolton	2017-04-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, rampitec, arsenm Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31671 llvm-svn: 299536
*	Add MCContext argument to MCAsmBackend::applyFixup for error reporting	Alex Bradbury	2017-04-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A number of backends (AArch64, MIPS, ARM) have been using MCContext::reportError to report issues such as out-of-range fixup values in their TgtAsmBackend. This is great, but because MCContext couldn't easily be threaded through to the adjustFixupValue helper function from its usual callsite (applyFixup), these backends ended up adding an MCContext* argument and adding another call to applyFixup to processFixupValue. Adding an MCContext parameter to applyFixup makes this unnecessary, and even better - applyFixup can take a reference to MCContext rather than a potentially null pointer. Differential Revision: https://reviews.llvm.org/D30264 llvm-svn: 299529
*	AMDGPU: Remove legacy export intrinsic	Matt Arsenault	2017-04-04	2	-36/+0
\| \| \| \|	llvm-svn: 299444
*	AMDGPU: Remove legacy image intrinsics	Matt Arsenault	2017-04-04	2	-217/+0
\| \| \| \|	llvm-svn: 299443
*	AMDGPU: Remove llvm.SI.vs.load.input	Matt Arsenault	2017-04-03	6	-19/+0
\| \| \| \|	llvm-svn: 299391
*	AMDGPU: Remove legacy bfe intrinsics	Matt Arsenault	2017-04-03	5	-37/+14
\| \| \| \|	llvm-svn: 299372
*	[AMDGPU] Garbage collect now unused dead code. NFCI.	Davide Italiano	2017-04-01	1	-10/+0
\| \| \| \|	llvm-svn: 299310
*	[AMDGPU] Remove assumption that vector and scalar types do not alias	Stanislav Mekhanoshin	2017-03-31	1	-8/+0
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31547 llvm-svn: 299250
*	AMDGPU: Remove unnecessary ands when f16 is legal	Matt Arsenault	2017-03-31	6	-2/+57
\| \| \| \| \| \| \| \| \| \|	Add a new node to act as a fancy bitcast from f16 operations to i32 that implicitly zero the high 16-bits of the result. Alternatively could try making v2f16 legal and canonicalizing on build_vectors. llvm-svn: 299246
*	AMDGPU/R600: Fix amdgpu alias analysis pass.	Jan Vesely	2017-03-31	2	-5/+11
\| \| \| \| \| \| \| \| \|	R600 uses higher AS number to access kernel parameters Fixes: r298846 Differential Revision: https://reviews.llvm.org/D31520 llvm-svn: 299245