bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[AMDGPU][MC] Fix for Bugs 28201, 28199, 28170 + LIT tests	Dmitry Preobrazhensky	2017-03-20	1	-8/+34
\| \| \| \| \| \| \| \| \| \|	This fix enables sp3 abs modifier with constants Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30825 llvm-svn: 298265
*	[AMDGPU][MC] Fix for Bugs 28200, 28202 + LIT tests	Dmitry Preobrazhensky	2017-03-20	1	-23/+85
\| \| \| \| \| \| \| \| \| \|	Fixed several related issues with VOP3 fp modifiers. Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30821 llvm-svn: 298255
*	[AMDGPU][MC] Fix for Bug 30829 + LIT tests	Dmitry Preobrazhensky	2017-03-03	1	-0/+132
\| \| \| \| \| \| \| \|	Added code to check constant bus restrictions for VOP formats (only one SGPR value or literal-constant may be used by the instruction). Note that the same checks are performed by SIInstrInfo::verifyInstruction (used by lowering code). Added LIT tests. llvm-svn: 296873
*	AMDGPU: Add VOP3P instruction format	Matt Arsenault	2017-02-27	1	-28/+284
\| \| \| \| \| \| \| \|	Add a few non-VOP3P but instructions related to packed. Includes hack with dummy operands for the benefit of the assembler llvm-svn: 296368
*	[AMDGPU] Runtime metadata fixes:	Konstantin Zhuravlyov	2017-02-27	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \|	- Verify that runtime metadata is actually valid runtime metadata when assembling, otherwise we could accept the following when assembling, but ocl runtime will reject it: .amdgpu_runtime_metadata { amd.MDVersion: [ 2, 1 ], amd.RandomUnknownKey, amd.IsaInfo: ... - Make IsaInfo optional, and always emit it. Differential Revision: https://reviews.llvm.org/D30349 llvm-svn: 296324
*	[AMDGPU] Add target information that is required by tools to metadata	Konstantin Zhuravlyov	2017-02-08	1	-14/+19
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28760#fb670e28 llvm-svn: 294449
*	AMDGPU: Fix missing static	Matt Arsenault	2017-02-07	1	-3/+4
\| \| \| \|	llvm-svn: 294281
*	AMDGPU: AsmParser cleanups	Matt Arsenault	2017-02-03	1	-17/+24
\| \| \| \| \| \|	Use typedef, remove unnecessary enum, line wraps. llvm-svn: 294039
*	[AMDGPU][mc] Fix AddressSanitizer leftover issue in gfx7_asm_all test	Artem Tamazov	2017-02-03	1	-5/+6
\| \| \| \| \| \|	Issue occurs when assembling "ds_ordered_count v0, v0 gds". llvm-svn: 294004
*	[AMDGPU][mc] Fix memory corruption uncovered by AddressSanitizer during ↵	Artem Tamazov	2017-01-27	1	-2/+0
\| \| \| \| \| \| \| \| \| \|	coverage/smoke Gfx7/8 testing. Coverage/smoke Gfx7/8 tests were committed r292922 but then reverted by r292974 due to AddressSanitizer failure, which is fixed by this patch. Tests to be re-committed soon. llvm-svn: 293338
*	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-01-21	1	-44/+65
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 292688
*	[AMDGPU] Add subtarget features for SDWA/DPP	Sam Kolton	2017-01-20	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28900 llvm-svn: 292596
*	[AMDGPU] Assembler: fix v_mac_f16 immediates	Sam Kolton	2017-01-17	1	-9/+16
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28802 llvm-svn: 292224
*	[AMDGPU] Assembler: SDWA/DPP should not accept scalar registers and ↵	Sam Kolton	2017-01-11	1	-29/+87
\| \| \| \| \| \| \| \| \| \| \| \|	immediate operands Reviewers: artem.tamazov, nhaustov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28157 llvm-svn: 291668
*	Reapply r291025 ("AMDGPU: Remove unneccessary intermediate vector")	Matt Arsenault	2017-01-09	1	-19/+33
\| \| \| \|	llvm-svn: 291460
*	Revert "Reapply r291025 ("AMDGPU: Remove unneccessary intermediate vector")"	Evgeniy Stepanov	2017-01-05	1	-33/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This reverts commit r291144. It breaks build bots. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/3270, http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fuzzer/builds/2058 lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp:1638:12: error: could not convert ‘(const unsigned int)(& Variants)’ from ‘const unsigned int’ to ‘llvm::ArrayRef<unsigned int>’ return Variants; Reviewers: eugenis, tstellarAMD Patch by Alex Shlyapnikov. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D28372 llvm-svn: 291168
*	Reapply r291025 ("AMDGPU: Remove unneccessary intermediate vector")	Matt Arsenault	2017-01-05	1	-19/+33
\| \| \| \| \| \|	Arrays are supposed to be static const llvm-svn: 291144
*	Revert r291025 ("AMDGPU: Remove unneccessary intermediate vector")	Richard Smith	2017-01-05	1	-22/+18
\| \| \| \| \| \| \|	This caused buildbot failures due to returning ArrayRefs referencing local (temporary) objects. llvm-svn: 291067
*	AMDGPU: Remove unneccessary intermediate vector	Matt Arsenault	2017-01-04	1	-18/+22
\| \| \| \|	llvm-svn: 291025
*	[AMDGPU][mc] Enable absolute expressions in .hsa_code_object_isa directive	Artem Tamazov	2016-12-29	1	-12/+17
\| \| \| \| \| \| \| \| \| \| \|	Among other stuff, this allows to use predefined .option.machine_version_major /minor/stepping symbols in the directive. Relevant test expanded at once (also file renamed for clarity). Differential Revision: https://reviews.llvm.org/D28140 llvm-svn: 290710
*	[AMDGPU][llvm-mc] Predefined symbols to access register counts ↵	Artem Tamazov	2016-12-27	1	-7/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(.kernel.{v\|s}gpr_count) The feature allows for conditional assembly, filling the entries of .amd_kernel_code_t etc. Symbols are defined with value 0 at the beginning of each kernel scope. After each register usage, the respective symbol is set to: value = max( value, ( register index + 1 ) ) Thus, at the end of scope the value represents a count of used registers. Kernel scopes begin at .amdgpu_hsa_kernel directive, end at the next .amdgpu_hsa_kernel (or EOF, whichever comes first). There is also dummy scope that lies from the beginning of source file til the first .amdgpu_hsa_kernel. Test added. Differential Revision: https://reviews.llvm.org/D27859 llvm-svn: 290608
*	[AMDGPU] Assembler: support SDWA and DPP for VOP2b instructions	Sam Kolton	2016-12-27	1	-3/+9
\| \| \| \| \| \| \| \| \| \|	Reviewers: nhaustov, artem.tamazov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28051 llvm-svn: 290599
*	[AMDGPU] Add pseudo SDWA instructions	Sam Kolton	2016-12-22	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is needed for later SDWA support in CodeGen. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27412 llvm-svn: 290338
*	[AMDGPU] Garbage collect dead code. NFCI.	Davide Italiano	2016-12-21	1	-15/+0
\| \| \| \|	llvm-svn: 290249
*	AMDGPU: [AMDGPU] Assembler: add .hsa_code_object_metadata directive for ↵	Sam Kolton	2016-12-19	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	functime metadata V2.0 Summary: Added pair of directives .hsa_code_object_metadata/.end_hsa_code_object_metadata. Between them user can put YAML string that would be directly put to the generated note. E.g.: ''' .hsa_code_object_metadata { amd.MDVersion: [ 2, 0 ] } .end_hsa_code_object_metadata ''' Based on D25046 Reviewers: vpykhtin, nhaustov, yaxunl, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, mgorny, tony-tye Differential Revision: https://reviews.llvm.org/D27619 llvm-svn: 290097
*	AMDGPU: Assembler support for vintrp instructions	Matt Arsenault	2016-12-15	1	-0/+78
\| \| \| \|	llvm-svn: 289866
*	Replace APFloatBase static fltSemantics data members with getter functions	Stephan Bergmann	2016-12-14	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	At least the plugin used by the LibreOffice build (<https://wiki.documentfoundation.org/Development/Clang_plugins>) indirectly uses those members (through inline functions in LLVM/Clang include files in turn using them), but they are not exported by utils/extract_symbols.py on Windows, and accessing data across DLL/EXE boundaries on Windows is generally problematic. Differential Revision: https://reviews.llvm.org/D26671 llvm-svn: 289647
*	AMDGPU: Fix handling of 16-bit immediates	Matt Arsenault	2016-12-10	1	-47/+158
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since 32-bit instructions with 32-bit input immediate behavior are used to materialize 16-bit constants in 32-bit registers for 16-bit instructions, determining the legality based on the size is incorrect. Change operands to have the size specified in the type. Also adds a workaround for a disassembler bug that produces an immediate MCOperand for an operand that is supposed to be OPERAND_REGISTER. The assembler appears to accept out of bounds immediates and truncates them, but this seems to be an issue for 32-bit already. llvm-svn: 289306
*	[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What ↵	Eugene Zelenko	2016-12-09	1	-30/+38
\| \| \| \| \| \|	You Use warnings; other minor fixes (NFC). llvm-svn: 289282
*	AMDGPU: Consolidate inline immediate predicate functions	Matt Arsenault	2016-12-05	1	-7/+16
\| \| \| \|	llvm-svn: 288718
*	AMDGPU: Minor assembler refactoring	Matt Arsenault	2016-12-05	1	-64/+71
\| \| \| \| \| \| \|	Fix return before else, check types for selecting fltSemantics, refactor immediate checks. llvm-svn: 288715
*	AMDGPU: Assembler support for exp	Matt Arsenault	2016-12-05	1	-2/+166
\| \| \| \| \| \| \| \|	compr is not currently parsed (or printed) correctly, but that should probably be fixed along with intrinsic changes. llvm-svn: 288698
*	AMDGPU: Change how exp is printed	Matt Arsenault	2016-12-05	1	-0/+16
\| \| \| \| \| \| \|	This is an improvement over a long list of unreadable numbers. A follow up patch will try to match how sc formats these. llvm-svn: 288697
*	AMDGPU: Clean up struct initializers	Matt Arsenault	2016-12-03	1	-8/+7
\| \| \| \|	llvm-svn: 288590
*	AMDGPU: Fix return after else	Matt Arsenault	2016-11-15	1	-8/+14
\| \| \| \|	llvm-svn: 287015
*	AMDGPU: Replace assert(false) with unreachable	Matt Arsenault	2016-11-15	1	-5/+5
\| \| \| \|	llvm-svn: 287013
*	[AMDGPU] Add f16 support (VI+)	Konstantin Zhuravlyov	2016-11-13	1	-12/+23
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25975 llvm-svn: 286753
*	[TableGen] Move OperandMatchResultTy enum to MCTargetAsmParser.h	Alex Bradbury	2016-11-01	1	-22/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As it stands, the OperandMatchResultTy is only included in the generated header if there is custom operand parsing. However, almost all backends make use of MatchOperand_Success and friends from OperandMatchResultTy for e.g. parseRegister. This is a pain when starting an AsmParser for a new backend that doesn't yet have custom operand parsing. Move the enum to MCTargetAsmParser.h. This patch is a prerequisite for D23563 Differential Revision: https://reviews.llvm.org/D23496 llvm-svn: 285705
*	AMDGPU: Whitespace fixes	Matt Arsenault	2016-11-01	1	-8/+8
\| \| \| \|	llvm-svn: 285659
*	[AMDGPU][MC][gfx8] Support 20-bit immediate offset in SMEM instructions.	Artem Tamazov	2016-10-31	1	-7/+14
\| \| \| \| \| \| \| \| \| \| \|	Fixes Bug 30808. Note that passing subtarget information to predicates seems too complicated, so gfx8-specific def smrd_offset_20 introduced. Old gfx6/7-specific def renamed to smrd_offset_8 for clarity. Lit tests updated. Differential Revision: https://reviews.llvm.org/D26085 llvm-svn: 285590
*	AMDGPU: Add instruction definitions for VGPR indexing	Matt Arsenault	2016-10-12	1	-0/+5
\| \| \| \| \| \| \|	VI added a second method of indexing into VGPRs besides using v_movrel* llvm-svn: 284027
*	[AMDGPU] Refactor waitcnt encoding	Konstantin Zhuravlyov	2016-10-11	1	-24/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	- Refactor bit packing/unpacking - Calculate bit mask given bit shift and bit width - Introduce function for decoding bits of waitcnt - Introduce function for encoding bits of waitcnt - Introduce function for getting waitcnt mask (instead of using bare numbers) - Introduce function fot getting max waitcnt(s) (instead of using bare numbers) Differential Revision: https://reviews.llvm.org/D25298 llvm-svn: 283919
*	Revert r283690, "MC: Remove unused entities."	Peter Collingbourne	2016-10-10	1	-1/+1
\| \| \| \|	llvm-svn: 283814
*	Move the global variables representing each Target behind accessor function	Mehdi Amini	2016-10-09	1	-2/+2
\| \| \| \| \| \| \| \|	This avoids "static initialization order fiasco" Differential Revision: https://reviews.llvm.org/D25412 llvm-svn: 283702
*	MC: Remove unused entities.	Peter Collingbourne	2016-10-09	1	-1/+1
\| \| \| \|	llvm-svn: 283691
*	[AMDGPU] Assembler: support v_mac_f32 DPP and SDWA. Move getNamedOperandIdx ↵	Sam Kolton	2016-10-07	1	-36/+90
\| \| \| \| \| \| \| \| \| \| \| \|	to AMDGPUBaseInfo.h Reviewers: artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25084 llvm-svn: 283560
*	[AMDGPU] Choose VMCNT, EXPCNT, LGKMCNT masks and shifts based on the isa version	Konstantin Zhuravlyov	2016-09-30	1	-6/+8
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D24973 llvm-svn: 282877
*	[AMDGPU][mc] Add support for absolute expressions in DPP modifiers.	Artem Tamazov	2016-09-22	1	-35/+22
\| \| \| \| \| \| \| \| \|	Also added range checking for DPP attributes. Assembler tests added as well. Differential Revision: https://reviews.llvm.org/D24755 llvm-svn: 282145
*	AMDGPU] Assembler: better support for immediate literals in assembler.	Sam Kolton	2016-09-09	1	-142/+365
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Prevously assembler parsed all literals as either 32-bit integers or 32-bit floating-point values. Because of this we couldn't support f64 literals. E.g. in instruction "v_fract_f64 v[0:1], 0.5", literal 0.5 was encoded as 32-bit literal 0x3f000000, which is incorrect and will be interpreted as 3.0517578125E-5 instead of 0.5. Correct encoding is inline constant 240 (optimal) or 32-bit literal 0x3FE00000 at least. With this change the way immediate literals are parsed is changed. All literals are always parsed as 64-bit values either integer or floating-point. Then we convert parsed literals to correct form based on information about type of operand parsed (was literal floating or binary) and type of expected instruction operands (is this f32/64 or b32/64 instruction). Here are rules how we convert literals: - We parsed fp literal: - Instruction expects 64-bit operand: - If parsed literal is inlinable (e.g. v_fract_f64_e32 v[0:1], 0.5) - then we do nothing this literal - Else if literal is not-inlinable but instruction requires to inline it (e.g. this is e64 encoding, v_fract_f64_e64 v[0:1], 1.5) - report error - Else literal is not-inlinable but we can encode it as additional 32-bit literal constant - If instruction expect fp operand type (f64) - Check if low 32 bits of literal are zeroes (e.g. v_fract_f64 v[0:1], 1.5) - If so then do nothing - Else (e.g. v_fract_f64 v[0:1], 3.1415) - report warning that low 32 bits will be set to zeroes and precision will be lost - set low 32 bits of literal to zeroes - Instruction expects integer operand type (e.g. s_mov_b64_e32 s[0:1], 1.5) - report error as it is unclear how to encode this literal - Instruction expects 32-bit operand: - Convert parsed 64 bit fp literal to 32 bit fp. Allow lose of precision but not overflow or underflow - Is this literal inlinable and are we required to inline literal (e.g. v_trunc_f32_e64 v0, 0.5) - do nothing - Else report error - Do nothing. We can encode any other 32-bit fp literal (e.g. v_trunc_f32 v0, 10000000.0) - Parsed binary literal: - Is this literal inlinable (e.g. v_trunc_f32_e32 v0, 35) - do nothing - Else, are we required to inline this literal (e.g. v_trunc_f32_e64 v0, 35) - report error - Else, literal is not-inlinable and we are not required to inline it - Are high 32 bit of literal zeroes or same as sign bit (32 bit) - do nothing (e.g. v_trunc_f32 v0, 0xdeadbeef) - Else - report error (e.g. v_trunc_f32 v0, 0x123456789abcdef0) For this change it is required that we know operand types of instruction (are they f32/64 or b32/64). I added several new register operands (they extend previous register operands) and set operand types to corresponding types: ''' enum OperandType { OPERAND_REG_IMM32_INT, OPERAND_REG_IMM32_FP, OPERAND_REG_INLINE_C_INT, OPERAND_REG_INLINE_C_FP, } ''' This is not working yet: - Several tests are failing - Problems with predicate methods for inline immediates - LLVM generated assembler parts try to select e64 encoding before e32. More changes are required for several AsmOperands. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, artem.tamazov Differential Revision: https://reviews.llvm.org/D22922 llvm-svn: 281050
*	[AMDGPU] Assembler: match e32 VOP instructions before e64.	Sam Kolton	2016-09-09	1	-23/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Split assembler match table in 4 tables with assembler variants: Default - all instructions except VOP3, SDWA and DPP - VOP3 - SDWA - DPP First match Default table then VOP3, SDWA and DPP. Reviewers: tstellarAMD, artem.tamazov, vpykhtin Subscribers: arsenm, wdng, nhaehnle, AMDGPU Differential Revision: https://reviews.llvm.org/D24252 llvm-svn: 281023