bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Move Value adjustment to applyFixup. NFC.	Rafael Espindola	2017-06-23	1	-2/+1
\| \| \| \|	llvm-svn: 306178
*	ARM: move some logic from processFixupValue to applyFixup.	Rafael Espindola	2017-06-23	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \|	processFixupValue is called on every relaxation iteration. applyFixup is only called once at the very end. applyFixup is then the correct place to do last minute changes and value checks. While here, do proper range checks again for fixup_arm_thumb_bl. We used to do it, but dropped because of thumb2. We now do it again, but use the thumb2 range. llvm-svn: 306177
*	AMDGPU/GlobalISel: Mark 32-bit G_AND as legal	Tom Stellard	2017-06-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34349 llvm-svn: 306112
*	[AMDGPU] Add intrinsics for tbuffer load and store - build error fix	David Stuttard	2017-06-22	1	-2/+1
\| \| \| \| \| \| \|	Variable was unused in non-debug build (used in assert) causing compile time warning and eventual build failure llvm-svn: 306034
*	[AMDGPU] Add intrinsics for tbuffer load and store	David Stuttard	2017-06-22	8	-121/+535
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Intrinsic already existed for llvm.SI.tbuffer.store Needed tbuffer.load and also re-implementing the intrinsic as llvm.amdgcn.tbuffer.* Added CodeGen tests for the 2 new variants added. Left the original llvm.SI.tbuffer.store implementation to avoid issues with existing code Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, tpr Differential Revision: https://reviews.llvm.org/D30687 llvm-svn: 306031
*	[AMDGPU] SDWA: remove support for VOP2 instructions that have only 64-bit ↵	Sam Kolton	2017-06-22	1	-11/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	encoding Summary: Despite that this instructions are listed in VOP2, they are treated as VOP3 in specs. They should not support SDWA. There are no real instructions for them, but there are pseudo instructions. Reviewers: arsenm, vpykhtin, cfang Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34403 llvm-svn: 305999
*	[AMDGPU] SDWA: add support for GFX9 in peephole pass	Sam Kolton	2017-06-22	6	-39/+127
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added support based on merged SDWA pseudo instructions. Now peephole allow one scalar operand, omod and clamp modifiers. Added several subtarget features for GFX9 SDWA. This diff also contains changes from D34026. Depends D34026 Reviewers: vpykhtin, rampitec, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34241 llvm-svn: 305986
*	[AMDGPU] Add FP_CLASS to the add/setcc combine	Stanislav Mekhanoshin	2017-06-21	1	-1/+3
\| \| \| \| \| \| \| \|	This is one of the nodes which also compile as v_cmp_*. Differential Revision: https://reviews.llvm.org/D34485 llvm-svn: 305970
*	Use a MutableArrayRef. NFC.	Rafael Espindola	2017-06-21	1	-4/+4
\| \| \| \|	llvm-svn: 305968
*	[AMDGPU] Combine add and adde, sub and sube	Stanislav Mekhanoshin	2017-06-21	2	-9/+81
\| \| \| \| \| \| \| \| \|	If one of the arguments of adde/sube is zero we can fold another add/sub into it. Differential Revision: https://reviews.llvm.org/D34374 llvm-svn: 305964
*	[AMDGPU] simplify add x, *ext (setcc) => addc\|subb x, 0, setcc	Stanislav Mekhanoshin	2017-06-21	4	-0/+59
\| \| \| \| \| \| \| \| \|	This simplification allows to avoid generating v_cndmask_b32 to serialize condition code between compare and use. Differential Revision: https://reviews.llvm.org/D34300 llvm-svn: 305962
*	[AMDGPU][MC][GFX9] Corrected VOP3P relevant code to fix disassembler failures	Dmitry Preobrazhensky	2017-06-21	4	-11/+6
\| \| \| \| \| \| \| \| \| \|	See Bug 33509: https://bugs.llvm.org//show_bug.cgi?id=33509 Reviewers: Sam Kolton, Artem Tamazov, Valery Pykhtin Differential Revision: https://reviews.llvm.org/D34360 llvm-svn: 305923
*	[AMDGPU][MC] Corrected V_QSAD instructions to check that dest register is ↵	Dmitry Preobrazhensky	2017-06-21	3	-5/+84
\| \| \| \| \| \| \| \| \| \| \| \|	different than any of the src See Bug 33279: https://bugs.llvm.org//show_bug.cgi?id=33279 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D34003 llvm-svn: 305915
*	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions	Sam Kolton	2017-06-21	15	-281/+323
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously there were two separate pseudo instruction for SDWA on VI and on GFX9. Created one pseudo instruction that is union of both of them. Added verifier to check that operands conform either VI or GFX9. Reviewers: dp, arsenm, vpykhtin Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, artem.tamazov Differential Revision: https://reviews.llvm.org/D34026 llvm-svn: 305886
*	AMDGPU: Allow vectorization of packed types	Matt Arsenault	2017-06-20	2	-8/+20
\| \| \| \|	llvm-svn: 305844
*	[AMDGPU] Fix illegal shrink of V_SUBB_U32 and V_ADDC_U32	Stanislav Mekhanoshin	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \| \|	If there is an immediate operand we shall not shrink V_SUBB_U32 and V_ADDC_U32, it does not fit e32 encoding. Differential Revison: https://reviews.llvm.org/D34291 llvm-svn: 305840
*	AMDGPU: Start adding global_* instructions	Matt Arsenault	2017-06-20	6	-6/+106
\| \| \| \|	llvm-svn: 305838
*	AMDGPU: Do operand folding in program order	Matt Arsenault	2017-06-20	1	-5/+3
\| \| \| \| \| \| \| \| \|	Before it was possible to partially fold use instructions before the defs. After the xor is folded into a copy, the same mov can end up in the fold list twice, so on the second attempt it will fail expecting to see a register to fold. llvm-svn: 305821
*	AMDGPU: Preserve undef when folding register operands	Matt Arsenault	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \|	If the source was a copy of an undef register, this would produce a read of an undefined register which is a verifier error. llvm-svn: 305816
*	[AMDGPU] Eliminate SGPR to VGPR copy when possible	Stanislav Mekhanoshin	2017-06-20	1	-0/+30
\| \| \| \| \| \| \| \|	SGPRs are generally cheaper, so try to use them over VGPRs. Differential Revision: https://reviews.llvm.org/D34130 llvm-svn: 305815
*	AMDGPU: Fix crash with undef vreg input operand	Matt Arsenault	2017-06-20	1	-1/+1
\| \| \| \|	llvm-svn: 305814
*	AMDGPU: Fix scratch wave offset relative FI expansion	Matt Arsenault	2017-06-19	1	-9/+20
\| \| \| \| \| \| \| \|	The offset may not be an inline immediate, so this needs to be materialized into a register. The post-RA run of SIShrinkInstructions is able to fold it later if it can. llvm-svn: 305761
*	[AMDGPU] Add infer address spaces pass before SROA	Stanislav Mekhanoshin	2017-06-19	1	-0/+8
\| \| \| \| \| \| \| \| \|	It adds it for the target after inlining but before SROA where we can get most out of it. Differential Revision: https://reviews.llvm.org/D34366 llvm-svn: 305759
*	AMDGPU: Cleanup CreateLiveInRegister	Matt Arsenault	2017-06-19	5	-34/+45
\| \| \| \|	llvm-svn: 305748
*	AMDGPU/GlobalISel: Mark G_BITCAST s32 <--> <2 x s16> legal	Tom Stellard	2017-06-19	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34129 llvm-svn: 305692
*	[AMDGPU] Testing commit access only, no real change	Alfred Huang	2017-06-15	1	-1/+1
\| \| \| \|	llvm-svn: 305523
*	DivergencyAnalysis patch for review	Alexander Timofeev	2017-06-15	3	-1/+15
\| \| \| \|	llvm-svn: 305494
*	[AMDGPU] Remove now dead defaultOffsetS13(). NFCI.	Davide Italiano	2017-06-13	1	-5/+0
\| \| \| \| \| \|	Fixes the GCC7 build with -Werror. llvm-svn: 305329
*	AMDGPU/GlobalISel: Mark 32-bit G_ADD as legal	Tom Stellard	2017-06-12	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D33992 llvm-svn: 305232
*	AMDGPU: Don't add same implicit use multiple times	Matt Arsenault	2017-06-12	1	-4/+2
\| \| \| \| \| \| \|	For the last component, the same register use was added as an implicit use and another implicit kill use. llvm-svn: 305205
*	AMDGPU: Teach isLegalAddressingMode about flat offsets	Matt Arsenault	2017-06-12	1	-3/+11
\| \| \| \| \| \| \|	Also fix reporting r+r as a valid addressing mode without offsets. llvm-svn: 305203
*	AMDGPU: Start selecting flat instruction offsets	Matt Arsenault	2017-06-12	2	-18/+42
\| \| \| \|	llvm-svn: 305201
*	AMDGPU: Verify that flat offsets aren't used pre-GFX9	Matt Arsenault	2017-06-12	1	-2/+11
\| \| \| \| \| \| \|	For convenience the operand is always present in the instruction, but it isn't valid to use except on GFX9. llvm-svn: 305200
*	AMDGPU: Start adding offset fields to flat instructions	Matt Arsenault	2017-06-12	5	-25/+94
\| \| \| \|	llvm-svn: 305194
*	Const correctness for TTI::getRegisterBitWidth	Daniel Neilson	2017-06-12	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The method TargetTransformInfo::getRegisterBitWidth() is declared const, but the type erasing implementation classes (TargetTransformInfo::Concept & TargetTransformInfo::Model) that were introduced by Chandler in https://reviews.llvm.org/D7293 do not have the method declared const. This is an NFC to tidy up the const consistency between TTI and its implementation. Reviewers: chandlerc, rnk, reames Reviewed By: reames Subscribers: reames, jfb, arsenm, dschuff, nemanjai, nhaehnle, javed.absar, sbc100, jgravelle-google, llvm-commits Differential Revision: https://reviews.llvm.org/D33903 llvm-svn: 305189
*	AMDGPU : Fix ISA Version Definitions.	Wei Ding	2017-06-10	4	-27/+99
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D28531 llvm-svn: 305137
*	[AMDGPU] Add intrinsics for alignbit and alignbyte instructions	Stanislav Mekhanoshin	2017-06-09	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D34046 llvm-svn: 305098
*	[AMDGPU] Fix for issue in alloca to vector promotion pass	David Stuttard	2017-06-09	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Alloca promotion pass not dealing with non-canonical input Added some additional checks so the pass simply backs-off forms it can't deal with (non-canonical) Also added some test cases in non-canonical form to check that it no longer crashes Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31710 llvm-svn: 305079
*	AMDGPU: Work around build special casing .inc files	Matt Arsenault	2017-06-08	3	-1/+7
\| \| \| \| \| \| \|	It complains because it assumes these were autogenerated files in the source directory. llvm-svn: 305005
*	AMDGPU: Use correct register names in inline assembly	Matt Arsenault	2017-06-08	3	-0/+410
\| \| \| \| \| \|	Fixes using physical registers in inline asm from clang. llvm-svn: 305004
*	[AMDGPU] Force qsads instrs to use different dest register than source registers	Mark Searles	2017-06-08	1	-0/+5
\| \| \| \| \| \| \| \|	The V_MQSAD_PK_U16_U8, V_QSAD_PK_U16_U8, and V_MQSAD_U32_U8 take more than 1 pass in hardware. For these three instructions, the destination registers must be different than all sources, so that the first pass does not overwrite sources for the following passes. Differential Revision: https://reviews.llvm.org/D33783 llvm-svn: 304998
*	[AMDGPU][MC] Corrected error message for s_waitcnt helpers	Dmitry Preobrazhensky	2017-06-07	1	-12/+16
\| \| \| \| \| \| \| \| \| \|	See Bug 32711: https://bugs.llvm.org//show_bug.cgi?id=32711 Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D33781 llvm-svn: 304922
*	AMDGPU/GlobalISel: Mark 32-bit G_SELECT as legal	Tom Stellard	2017-06-07	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D33949 llvm-svn: 304910
*	Move Object format code to lib/BinaryFormat.	Zachary Turner	2017-06-07	7	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \|	This creates a new library called BinaryFormat that has all of the headers from llvm/Support containing structure and layout definitions for various types of binary formats like dwarf, coff, elf, etc as well as the code for identifying a file from its magic. Differential Revision: https://reviews.llvm.org/D33843 llvm-svn: 304864
*	AMDGPU/NFC: Move amdgpu code object metadata to support	Konstantin Zhuravlyov	2017-06-06	3	-617/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31437 llvm-svn: 304812
*	[AMDGPU] Return correct value from SDWA pass	Stanislav Mekhanoshin	2017-06-06	1	-1/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D33927 llvm-svn: 304805
*	AMDGPU/GlobalISel: Mark 32-bit G_ICMP as legal	Tom Stellard	2017-06-06	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D33890 llvm-svn: 304797
*	Sort the remaining #include lines in include/... and lib/....	Chandler Carruth	2017-06-06	50	-81/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787
*	[llvm] Remove double semicolons	Mandeep Singh Grang	2017-06-06	3	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: craig.topper, arsenm, mehdi_amini Reviewed By: mehdi_amini Subscribers: mehdi_amini, wdng, nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33924 llvm-svn: 304767
*	AMDGPU: Remove deprecated and unused elf definitions	Konstantin Zhuravlyov	2017-06-05	5	-144/+0
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D33689 llvm-svn: 304737