bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	AArch64: try to fix optimized build failure.	Tim Northover	2016-07-05	1	-1/+2
\| \| \| \| \| \| \| \| \|	I think the Ops filled out by Regex::match contain pointers into the temporary std::string returned by StringRef::upper. Its lifetime is extended by the call to match, but only until the end of that call (not to the uses of Ops later on). llvm-svn: 274586
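The lifetime problem described above can be sketched in standalone C++. This is only an illustration under assumptions: `View`, `upperCopy`, `firstField` and `firstFieldUpper` are hypothetical stand-ins (for a non-owning string reference, an upper-casing helper that returns by value, and a matcher that returns pointers into its input), not the LLVM APIs or the code touched by this commit.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Minimal non-owning view; a hypothetical stand-in for a string reference type.
struct View {
  const char *Data;
  std::size_t Size;
};

// Hypothetical stand-in for an upper-casing helper: returns a new std::string by value.
std::string upperCopy(const std::string &S) {
  std::string Out = S;
  for (char &C : Out)
    C = static_cast<char>(std::toupper(static_cast<unsigned char>(C)));
  return Out;
}

// Hypothetical stand-in for a matcher: the result points into Input's storage.
View firstField(const std::string &Input) {
  std::size_t End = Input.find('_');
  if (End == std::string::npos)
    End = Input.size();
  return View{Input.data(), End};
}

// Unsafe shape (left as a comment on purpose):
//   View V = firstField(upperCopy(Name));
//   use(V);  // the temporary string was destroyed at the end of the line above

// Safe shape: give the upper-cased string a name so it outlives the view.
std::string firstFieldUpper(const std::string &Name) {
  std::string Upper = upperCopy(Name); // owns the storage
  View V = firstField(Upper);          // points into Upper
  assert(V.Size <= Upper.size());
  return std::string(V.Data, V.Size);  // copy out while Upper is still alive
}
```

The unsafe shape often appears to work in unoptimized builds because the freed storage has not been reused yet, which would be consistent with the failure only showing up in an optimized build.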
*	[X86][AVX2] Simplified BROADCAST combining to avoid repeated matching attempts	Simon Pilgrim	2016-07-05	1	-12/+9
\| \| \| \|	llvm-svn: 274583
*	Fix an ordering problem in r274431	Manman Ren	2016-07-05	1	-1/+1
\| \| \| \|	llvm-svn: 274582
*	AMDGPU: Remove unnecessary string usage in AsmPrinter	Matt Arsenault	2016-07-05	2	-38/+49
\| \| \| \| \| \| \| \|	Registers are printed a lot, so don't create temporary std::strings. Using char instead of a string to an ostream saves a function call. llvm-svn: 274581
*	AArch64: TableGenerate system instruction operands.	Tim Northover	2016-07-05	10	-1959/+1255
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The way the named arguments for various system instructions are handled at the moment has a few problems:
	- Large-scale duplication between AArch64BaseInfo.h and AArch64BaseInfo.cpp
	- That weird Mapping class that I have no idea what I was on when I thought it was a good idea.
	- Searches are performed linearly through the entire list.
	- We print absolutely all registers in upper-case, even though some are canonically mixed case (SPSel for example).
	- The ARM ARM specifies sysregs in terms of 5 fields, but those are relegated to comments in our implementation, with a slightly opaque hex value indicating the canonical encoding LLVM will use.
	This adds a new TableGen backend to produce efficiently searchable tables, and switches AArch64 over to using that infrastructure. llvm-svn: 274576
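As a rough illustration of what an "efficiently searchable table" can look like, the sketch below keeps entries sorted by name and uses a binary search instead of a linear scan. Everything here is an assumption made for illustration: `SysRegEntry`, `Table`, `lookupByName`, the entry names and the encoding values are hypothetical, and this is not the output of the TableGen backend the commit adds.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <cstring>
#include <iterator>

// Hypothetical table row: a name and an opaque encoding value.
struct SysRegEntry {
  const char *Name;
  uint16_t Encoding;
};

// Must stay sorted by Name in strcmp order, otherwise the binary search
// below is invalid. Names and values are placeholders, not real registers.
static const SysRegEntry Table[] = {
    {"ALPHA", 0x1},
    {"BETA", 0x2},
    {"GAMMA", 0x3},
    {"OMEGA", 0x4},
};

// O(log n) lookup; returns nullptr when the name is not in the table.
const SysRegEntry *lookupByName(const char *Name) {
  const SysRegEntry *Begin = std::begin(Table);
  const SysRegEntry *End = std::end(Table);
  const SysRegEntry *It = std::lower_bound(
      Begin, End, Name, [](const SysRegEntry &E, const char *N) {
        return std::strcmp(E.Name, N) < 0;
      });
  if (It == End || std::strcmp(It->Name, Name) != 0)
    return nullptr;
  return It;
}
```

Generating such a table from a single description also removes the need to keep a header and a source file in sync by hand, which is the duplication the message complains about.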
*	Revert r259387: "AArch64: Implement missed conditional compare sequences."	Balaram Makam	2016-07-05	2	-47/+2
\| \| \| \| \| \| \|	This reverts commit r259387 because it inserts illegal code after legalization in some backends, for example those where an OR of type i64 is illegal. llvm-svn: 274573
*	[X86][AVX2] Add support for target shuffle combining to BROADCAST	Simon Pilgrim	2016-07-05	1	-6/+20
\| \| \| \| \| \|	Only support broadcast from vector register so far - memory folding support will have to wait. llvm-svn: 274572
*	[X86][AVX512] Fixed decoding of permd/permpd variable mask shuffles + enabled them for target shuffle combining	Simon Pilgrim	2016-07-05	3	-7/+10
\| \| \| \| \| \| \| \|	Corrected element mask masking to extract the bottom index bits (now matches the perm2 implementation but for unary inputs). llvm-svn: 274571
*	ARM: fix `-mlong-calls` for WoA	Saleem Abdulrasool	2016-07-05	1	-1/+1
\| \| \| \| \| \| \| \| \|	Not all code-paths set the relocation model to static for Windows. This currently breaks on Windows ARM with `-mlong-calls` when built with clang. Loosen the assertion to what it was previously. We would ideally ensure that all the configuration sets Windows to static relocation model. llvm-svn: 274570
*	AArch64: use correct SDValue # when looking for bitfield placement.	Tim Northover	2016-07-05	1	-2/+3
\| \| \| \| \| \| \| \| \| \|	The other use really does only care about the SDNode (it checks the opcode against a whitelist), but bitFieldPlacement can be misled if the node produces multiple results. Patch by Ismail Badawi. llvm-svn: 274567
*	AMDGPU: Fix folding SGPRs into madak/madmk src0	Matt Arsenault	2016-07-05	4	-6/+26
\| \| \| \| \| \| \| \| \| \|	Because of the special immediate operand, the constant bus is already used so SGPRs are never useful. r263212 changed the name of the immediate operand, which broke the verifier check for the restriction. llvm-svn: 274564
*	AMDGPU/SI: Remove address space query functions from AMDGPUDAGToDAGISel	Tom Stellard	2016-07-05	3	-156/+78
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: These have been replaced with TableGen code (except for isConstantLoad, which is still used for R600). The queries were broken for cases where MemOperand was a PseudoSourceValue. Reviewers: arsenm Subscribers: arsenm, kzhuravl, llvm-commits Differential Revision: http://reviews.llvm.org/D21684 llvm-svn: 274561
*	[AMDGPU] rename DS_1A1D_Off8_NORET to DS_1A2D_Off8_NORET as ds_write2xx use 2 source registers. NFC.	Valery Pykhtin	2016-07-05	2	-5/+5
\| \| \| \| \| \|	llvm-svn: 274556
*	[X86][AVX512] Remove vector BROADCAST builtins.	Simon Pilgrim	2016-07-05	1	-34/+0
\| \| \| \|	llvm-svn: 274555
*	[LLVM][INTRINSICS] adding intrinsics of CLFLUSHOPT	Michael Zuckerman	2016-07-05	1	-1/+1
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D21789 llvm-svn: 274553
*	[AMDGPU] Assembler: Fix parsing error with floating-point literals passed to integer instructions	Sam Kolton	2016-07-05	1	-6/+1
\| \| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D21972 llvm-svn: 274551
*	[mips][ias] Remove k_PhysReg since it's not possible to create an operand of this kind.	Daniel Sanders	2016-07-05	1	-20/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D21986 llvm-svn: 274547
*	[Thumb] Reapply r272251 with a fix for PR28348 (mk 2)	James Molloy	2016-07-05	1	-1/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The important thing I was missing was ensuring newly added constants were kept in topological order. Repositioning the node is correct if the constant is newly added (so it has no topological ordering) but wrong if it already existed - positioning it next in the worklist would break the topological ordering.
	Original commit message:
	[Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated
	If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead;
	    int i(int a) { return a & 0xfffffeec; }
	Used to produce:
	    ldr r1, [CONSTPOOL]
	    ands r0, r1
	    CONSTPOOL: 0xfffffeec
	And now produces:
	    movs r1, #255
	    adds r1, #20 ; Less costly immediate generation
	    bics r0, r1
	llvm-svn: 274543
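The arithmetic behind the AND-to-BIC rewrite can be checked in a few lines. This is a minimal sketch assuming 32-bit operands; `andImm` and `bicImm` are hypothetical helper names that model the two instruction sequences from the message, not code from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Models "ands r0, r1" with r1 loaded from the constant pool.
constexpr uint32_t andImm(uint32_t A) { return A & UINT32_C(0xfffffeec); }

// Models "movs r1, #255; adds r1, #20; bics r0, r1":
// BIC clears the bits set in its operand, i.e. A & ~Operand.
constexpr uint32_t bicImm(uint32_t A) {
  return A & ~(UINT32_C(255) + UINT32_C(20));
}

// 255 + 20 = 275 = 0x113, which is the bitwise complement of 0xfffffeec.
static_assert(static_cast<uint32_t>(~UINT32_C(0xfffffeec)) == UINT32_C(275),
              "the negated immediate is small enough to build with movs/adds");
```

The negated value needs only two short immediate instructions, whereas the original constant required a literal-pool load.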
*	Revert r274536: [mips][ias] Don't break apart and reconstruct StringRef's for k_Token. NFC.	Daniel Sanders	2016-07-05	1	-4/+6
\| \| \| \| \| \| \| \|	It turns out that MSVC requires this. llvm-svn: 274538
*	[mips][ias] Don't break apart and reconstruct StringRef's for k_Token. NFC.	Daniel Sanders	2016-07-05	1	-6/+4
\| \| \| \|	llvm-svn: 274536
*	[PowerPC] - Legalize vector types by widening instead of integer promotion	Nemanja Ivanovic	2016-07-05	3	-1/+100
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: http://reviews.llvm.org/D20443 It changes the legalization strategy for illegal vector types from integer promotion to widening. This only applies for vectors with elements of width that is a multiple of a byte since we have hardware support for vectors with 1, 2, 4, 8 and 16 byte elements. Integer promotion for vectors is quite expensive on PPC due to the sequence of breaking apart the vector, extending the elements and reconstituting the vector. Two of these operations are expensive. This patch causes between minor and major improvements in performance on most benchmarks. There are very few benchmarks whose performance regresses. These regressions can be handled in a subsequent patch with a DAG combine (similar to how this patch handles int -> fp conversions of illegal vector types). llvm-svn: 274535
*	AMDGPU/R600: Add PatFrags for selecting the correct vtx id for loads	Tom Stellard	2016-07-05	4	-45/+65
\| \| \| \| \| \| \| \| \|	This moves the r600 logic out of isGlobalLoad() and into the TableGen files. Differential Revision: http://reviews.llvm.org/D21710 llvm-svn: 274527
*	AMDGPU/SI: Remove hack for selecting < 32-bit loads to MUBUF instructions	Tom Stellard	2016-07-04	3	-15/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The isGlobalLoad() query was returning true for constant address space loads with memory types less than 32-bits, which is wrong. This logic has been replaced with PatFrag in the TableGen files, to provide the same functionality. Reviewers: arsenm Subscribers: arsenm, kzhuravl, llvm-commits Differential Revision: http://reviews.llvm.org/D21696 llvm-svn: 274521
*	[X86][AVX512] Add support for lowering shuffles to VSHUFPD	Simon Pilgrim	2016-07-04	1	-0/+5
\| \| \| \|	llvm-svn: 274520
*	[AVX512] Remove masked VPERMD/VPERMQ/VPERMILPS/VPERMILPD intrinsics. They were autoupgraded to native IR in r274506 and r274506.	Craig Topper	2016-07-04	1	-16/+0
\| \| \| \| \| \|	llvm-svn: 274519
*	AMDGPU/R600: Add indentation to VTX and TEX fetch asm strings	Jan Vesely	2016-07-04	1	-2/+2
\| \| \| \| \| \| \| \|	These are printed as part of Fetch clauses. Differential Revision: http://reviews.llvm.org/D21730 llvm-svn: 274517
*	Revert "[Thumb] Reapply r272251 with a fix for PR28348"	James Molloy	2016-07-04	1	-40/+1
\| \| \| \| \| \|	This reverts commit r274510 - it made green dragon unhappy. llvm-svn: 274512
*	[Thumb] Reapply r272251 with a fix for PR28348	James Molloy	2016-07-04	1	-1/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We were using DAG->getConstant instead of DAG->getTargetConstant. This meant that we could inadvertently increase the use count of a constant if stars aligned, which it did in this testcase. Increasing the use count of the constant could cause ISel to fall over (because DAGToDAG lowering assumed the constant had only one use!)
	Original commit message:
	[Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated
	If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead;
	    int i(int a) { return a & 0xfffffeec; }
	Used to produce:
	    ldr r1, [CONSTPOOL]
	    ands r0, r1
	    CONSTPOOL: 0xfffffeec
	And now produces:
	    movs r1, #255
	    adds r1, #20 ; Less costly immediate generation
	    bics r0, r1
	llvm-svn: 274510
*	[X86] Add shuffle mask rescaling helper function. NFCI.	Simon Pilgrim	2016-07-03	1	-12/+26
\| \| \| \|	llvm-svn: 274476
*	[X86][AVX2] Merge unary permute matching behind the same V2.isUndef() condition. NFCI.	Simon Pilgrim	2016-07-03	1	-9/+8
\| \| \| \| \| \|	llvm-svn: 274474
*	[X86][AVX512] Add support for 512-bit shuffle lowering to VPERMPD/VPERMQ	Simon Pilgrim	2016-07-03	1	-14/+39
\| \| \| \|	llvm-svn: 274473
*	[X86][AVX512] Add support for VPERMPD/VPERMQ masked shuffle comments	Simon Pilgrim	2016-07-03	1	-0/+16
\| \| \| \|	llvm-svn: 274469
*	[X86][AVX512] Add support for 512-bit shuffle decoding of VPERMPD/VPERMQ	Simon Pilgrim	2016-07-03	4	-26/+30
\| \| \| \|	llvm-svn: 274468
*	[X86][AVX] Renamed VPERMILPI shuffle comment macros to be more specific	Simon Pilgrim	2016-07-03	1	-27/+27
\| \| \| \|	llvm-svn: 274467
*	[X86][AVX512] Add support for VPALIGNR/PSHUFD/PSHUFHW/PSHUFLW masked shuffle comments	Simon Pilgrim	2016-07-03	1	-0/+16
\| \| \| \| \| \|	llvm-svn: 274466
*	[X86][AVX512] Add support for UNPCK masked shuffle comments	Simon Pilgrim	2016-07-03	1	-1/+51
\| \| \| \|	llvm-svn: 274464
*	[X86][AVX512] Add support for VPERM/VSHUF masked shuffle comments	Simon Pilgrim	2016-07-03	1	-0/+56
\| \| \| \|	llvm-svn: 274462
*	[X86][AVX512] Add support for PMOVZX masked shuffle comments	Simon Pilgrim	2016-07-03	1	-0/+34
\| \| \| \|	llvm-svn: 274461
*	[X86][AVX512] Add support for masked shuffle comments	Simon Pilgrim	2016-07-03	1	-2/+53
\| \| \| \| \| \| \| \| \| \|	This patch adds support for including the avx512 mask register information in the mask/maskz versions of shuffle instruction comments. This initial version just adds support for MOVDDUP/MOVSHDUP/MOVSLDUP to reduce the mass of test regenerations; other shuffle instructions can be added in due course. Differential Revision: http://reviews.llvm.org/D21953 llvm-svn: 274459
*	[X86][AVX512] Add support for lowering shuffles to VPERMILPS	Simon Pilgrim	2016-07-03	1	-0/+4
\| \| \| \|	llvm-svn: 274458
*	Fix spelling.	Simon Pilgrim	2016-07-02	1	-2/+2
\| \| \| \|	llvm-svn: 274451
*	[X86][AVX512] Add support for lowering shuffles to VPERMILPD	Simon Pilgrim	2016-07-02	1	-0/+11
\| \| \| \|	llvm-svn: 274450
*	[X86][AVX512] Add support for 512-bit PSHUFB lowering	Simon Pilgrim	2016-07-02	1	-2/+7
\| \| \| \|	llvm-svn: 274444
*	[X86][AVX512] Converted the MOVDDUP/MOVSLDUP/MOVSHDUP masked intrinsics to generic IR	Simon Pilgrim	2016-07-02	1	-18/+0
\| \| \| \| \| \|	llvm-svn: 274443
*	[Hexagon] Create global std::map lazily.	Benjamin Kramer	2016-07-02	1	-3/+3
\| \| \| \| \| \| \| \|	This could of course be a simple binary search with no global state involved at all if someone cares enough. Just don't make everyone linking the hexagon backend pay for it on process startup and shutdown. llvm-svn: 274437
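One common way to create a map lazily in C++ is a function-local static, sketched below. The log does not say which idiom the commit actually uses, so treat this purely as an illustration; `getTable`, `lookup` and the table contents are hypothetical.

```cpp
#include <cassert>
#include <map>
#include <string>

// The map is constructed the first time getTable() is called, not at
// process startup. Its destructor is only registered once it has been
// constructed, so programs that never call this pay nothing at shutdown.
const std::map<std::string, int> &getTable() {
  static const std::map<std::string, int> Table = {
      {"v4", 4}, {"v5", 5}, {"v55", 55}, {"v60", 60}, // placeholder entries
  };
  return Table;
}

// Returns the mapped value, or -1 when the key is absent.
int lookup(const std::string &Key) {
  const std::map<std::string, int> &T = getTable();
  std::map<std::string, int>::const_iterator It = T.find(Key);
  return It == T.end() ? -1 : It->second;
}
```

A namespace-scope `std::map`, by contrast, is constructed during static initialization in every process that links the code, whether or not the table is ever consulted.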
*	[X86][AVX512] Add support for lowering shuffles to MOVDDUP/MOVSLDUP/MOVSHDUP	Simon Pilgrim	2016-07-02	1	-0/+19
\| \| \| \|	llvm-svn: 274436
*	Use arrays or initializer lists to feed ArrayRefs instead of SmallVector where possible.	Benjamin Kramer	2016-07-02	7	-44/+24
\| \| \| \| \| \| \| \|	No functionality change intended. llvm-svn: 274431
*	[SystemZ] Move misplaced SystemZ::TDC to non-memory opcode range.	Marcin Koscielnicki	2016-07-02	2	-7/+7
\| \| \| \|	llvm-svn: 274417
*	AMDGPU: Add feature for unaligned access	Matt Arsenault	2016-07-01	5	-12/+32
\| \| \| \|	llvm-svn: 274398
*	AMDGPU: Expand unaligned accesses early	Matt Arsenault	2016-07-01	2	-21/+48
\| \| \| \| \| \| \| \|	Due to visit order problems, in the case of an unaligned copy the legalized DAG fails to eliminate extra instructions introduced by the expansion of both unaligned parts. llvm-svn: 274397