bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86][SSE] Update MOVNTDQA non-temporal loads to generic implementation (LLVM)	Simon Pilgrim	2017-04-14	2	-12/+6
\| \| \| \| \| \| \| \| \| \|	MOVNTDQA non-temporal aligned vector loads can be correctly represented using generic builtin loads, allowing us to remove the existing x86 intrinsics. Clang companion patch: D31766. Differential Revision: https://reviews.llvm.org/D31767 llvm-svn: 300325
*	This patch closes PR#32216: Better testing of schedule model instruction ↵	Andrew V. Tischenko	2017-04-14	3	-9/+14
\| \| \| \| \| \| \| \|	latencies/throughputs. The details are here: https://reviews.llvm.org/D30941 llvm-svn: 300311
*	Use methods to access data stored with frame instructions	Serge Pavlov	2017-04-13	5	-20/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	Instructions CALLSEQ_START..CALLSEQ_END and their target dependent counterparts keep data like frame size, stack adjustment etc. These data are accessed by getOperand using hard coded indices, which is error prone. This change implements the access by special methods, which improve readability and allow changing data representation without massive changes of index values. Differential Revision: https://reviews.llvm.org/D31953 llvm-svn: 300196
*	[X86] Added missing mayLoad/mayStore attributes to some X86 instructions.	Ayman Musa	2017-04-13	7	-19/+55
\| \| \| \| \| \| \| \| \|	While working on automatically generating the X86 memory folding tables, this missing information was encountered. This is preparatory work for a future patch that automates these tables. Differential Revision: https://reviews.llvm.org/D31714 llvm-svn: 300190
*	[X86] Change instructions names to keep consistency with the naming ↵	Ayman Musa	2017-04-13	1	-2/+2
\| \| \| \| \| \| \| \|	convention. NFC Differential Revision: https://reviews.llvm.org/D31743 llvm-svn: 300184
*	Fix the bootstrap failure caused by r299986.	Easwaran Raman	2017-04-12	1	-0/+4
\| \| \| \|	llvm-svn: 300069
*	[GlobalISel][X86] support G_CONSTANT selection.	Igor Breger	2017-04-12	1	-1/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: [GlobalISel][X86] support G_CONSTANT selection. Add regbank select tests. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: llvm-commits, dberris, rovka, kristof.beyls Differential Revision: https://reviews.llvm.org/D31974 llvm-svn: 300057
*	[SystemZ] TargetTransformInfo cost functions implemented.	Jonas Paulsson	2017-04-12	2	-7/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	getArithmeticInstrCost(), getShuffleCost(), getCastInstrCost(), getCmpSelInstrCost(), getVectorInstrCost(), getMemoryOpCost(), getInterleavedMemoryOpCost() implemented. Interleaved access vectorization enabled. BasicTTIImpl::getCastInstrCost() improved to check for legal extending loads, in which case the cost of the z/sext instruction becomes 0. Review: Ulrich Weigand, Renato Golin. https://reviews.llvm.org/D29631 llvm-svn: 300052
*	[x86] Relax the check in areLoadsFromSameBasePtr	Easwaran Raman	2017-04-11	1	-19/+16
\| \| \| \| \| \| \| \| \|	Check if the scale operand is identical (doesn't have to be 1) and do not check the chain operand. Differential revision: https://reviews.llvm.org/D31833 llvm-svn: 299986
*	[X86] Create the correct ADC/SBB SDNode when lowering add.	Davide Italiano	2017-04-11	1	-2/+4
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31911 llvm-svn: 299973
*	Module::getOrInsertFunction is using C-style vararg instead of variadic ↵	Serge Guelton	2017-04-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	templates. From a user perspective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's no type checking on the arguments. The variadic template is an obvious solution to both issues. Differential Revision: https://reviews.llvm.org/D31070 llvm-svn: 299949
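The two problems named in this commit message (the nullptr sentinel and the missing type checking) can be illustrated with a small standalone sketch. This is not the LLVM API: `Type`, `countParamsVararg`, `countParamsVariadic`, and `AllTypePtr` are hypothetical names invented for the illustration, and the helpers only count their arguments.

```cpp
#include <cassert>
#include <cstdarg>
#include <cstddef>
#include <type_traits>

// Hypothetical stand-in for a parameter type; not an LLVM class.
struct Type { int id; };

// C-style vararg: the caller must append a null sentinel, and the compiler
// cannot check that every argument really is a Type*.
inline std::size_t countParamsVararg(Type *first, ...) {
  std::size_t n = 0;
  va_list args;
  va_start(args, first);
  for (Type *t = first; t != nullptr; t = va_arg(args, Type *))
    ++n;
  va_end(args);
  return n;
}

// Compile-time check that every type in the pack is Type* (C++11 style).
template <typename... Ts> struct AllTypePtr : std::true_type {};
template <typename T, typename... Ts>
struct AllTypePtr<T, Ts...>
    : std::integral_constant<bool, std::is_same<T, Type *>::value &&
                                       AllTypePtr<Ts...>::value> {};

// Variadic template: no sentinel is needed and each argument is type-checked.
template <typename... ArgsTy>
inline std::size_t countParamsVariadic(ArgsTy...) {
  static_assert(AllTypePtr<ArgsTy...>::value, "all arguments must be Type*");
  return sizeof...(ArgsTy);
}

// Usage: the vararg form needs an explicitly typed null terminator.
inline void varargVersusVariadicExample() {
  Type a{1}, b{2};
  assert(countParamsVararg(&a, &b, static_cast<Type *>(nullptr)) == 2);
  assert(countParamsVariadic(&a, &b) == 2);
}
```

Passing a non-`Type*` argument to `countParamsVariadic` fails at compile time, whereas the vararg version would accept it silently.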
*	Revert "Turn some C-style vararg into variadic templates"	Diana Picus	2017-04-11	1	-1/+1
\| \| \| \| \| \| \|	This reverts commit r299925 because it broke the buildbots. See e.g. http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/6008 llvm-svn: 299928
*	Turn some C-style vararg into variadic templates	Serge Guelton	2017-04-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Module::getOrInsertFunction is using C-style vararg instead of variadic templates. From a user perspective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's no type checking on the arguments. The variadic template is an obvious solution to both issues. llvm-svn: 299925
*	[X86][MMX] Add fast-isel support for MMX non-temporal writes	Simon Pilgrim	2017-04-10	1	-0/+4
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31754 llvm-svn: 299852
*	Use PMADDWD to expand reduction in a loop	Dehao Chen	2017-04-07	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: PMADDWD can help improve 8/16 bit integer multiply-add operation performance for cases like: for (int i = 0; i < count; i++) a += x[i] * y[i]; Reviewers: wmi, davidxl, hfinkel, RKSimon, zvi, mkuper Reviewed By: mkuper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31679 llvm-svn: 299776
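As a rough illustration of why PMADDWD fits this loop, the sketch below models the instruction in plain C++ rather than using intrinsics. The function names are hypothetical, the model assumes a 128-bit operation over eight 16-bit elements, and it assumes the products do not overflow 32 bits. It says nothing about the code the patch actually generates.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// The reduction loop from the commit message, with 16-bit inputs.
inline int32_t dotScalar(const int16_t *x, const int16_t *y,
                         std::size_t count) {
  int32_t a = 0;
  for (std::size_t i = 0; i < count; i++)
    a += int32_t(x[i]) * int32_t(y[i]);
  return a;
}

// Model of one 128-bit PMADDWD: eight 16-bit products, with each adjacent
// pair of products summed into one of four 32-bit lanes.
inline void pmaddwdModel(const int16_t *x, const int16_t *y, int32_t out[4]) {
  for (int lane = 0; lane < 4; ++lane)
    out[lane] = int32_t(x[2 * lane]) * int32_t(y[2 * lane]) +
                int32_t(x[2 * lane + 1]) * int32_t(y[2 * lane + 1]);
}

// The same reduction, eight elements per step. Assumes count % 8 == 0.
inline int32_t dotPairwise(const int16_t *x, const int16_t *y,
                           std::size_t count) {
  int32_t acc[4] = {0, 0, 0, 0};
  for (std::size_t i = 0; i < count; i += 8) {
    int32_t lanes[4];
    pmaddwdModel(x + i, y + i, lanes);
    for (int lane = 0; lane < 4; ++lane)
      acc[lane] += lanes[lane];
  }
  // Final horizontal add of the four partial sums.
  return acc[0] + acc[1] + acc[2] + acc[3];
}

inline void pmaddwdExample() {
  const int16_t x[8] = {1, 2, 3, 4, 5, 6, 7, 8};
  const int16_t y[8] = {8, 7, 6, 5, 4, 3, 2, 1};
  assert(dotScalar(x, y, 8) == dotPairwise(x, y, 8));
}
```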
*	[GlobalISel] implement narrowing for G_CONSTANT.	Igor Breger	2017-04-07	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: [GlobalISel] implement narrowing for G_CONSTANT. Reviewers: bogner, zvi, t.p.northover Reviewed By: t.p.northover Subscribers: llvm-commits, dberris, rovka, kristof.beyls Differential Revision: https://reviews.llvm.org/D31744 llvm-svn: 299772
*	[X86] Revert r299387 due to AVX legalization infinite loop.	Michael Kuperstein	2017-04-06	1	-55/+1
\| \| \| \|	llvm-svn: 299720
*	Revert "Turn some C-style vararg into variadic templates"	Mehdi Amini	2017-04-06	1	-3/+4
\| \| \| \| \| \|	This reverts commit r299699, the examples need to be updated. llvm-svn: 299702
*	Turn some C-style vararg into variadic templates	Mehdi Amini	2017-04-06	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Module::getOrInsertFunction is using C-style vararg instead of variadic templates. From a user perspective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's no type checking on the arguments. The variadic template is an obvious solution to both issues. Patch by: Serge Guelton <serge.guelton@telecom-bretagne.eu> Differential Revision: https://reviews.llvm.org/D31070 llvm-svn: 299699
*	[globalisel][tablegen] Move <Target>InstructionSelector declarations to ↵	Daniel Sanders	2017-04-06	5	-76/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	anonymous namespaces Summary: This resolves the issue of tablegen-erated includes in the headers for non-GlobalISel builds in a simpler way than before. Reviewers: qcolombet, ab Reviewed By: ab Subscribers: igorb, ab, mgorny, dberris, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30998 llvm-svn: 299637
*	[X86 TTI] Implement LSV hook	Keno Fischer	2017-04-05	2	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: LSV wants to know the maximum size that can be loaded to a vector register. On X86, this always matches the maximum register width. Implement this accordingly and add a test to make sure that LSV can vectorize up to the maximum permissible width on X86. Reviewers: delena, arsenm Reviewed By: arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D31504 llvm-svn: 299589
*	[DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to ↵	Sanjay Patel	2017-04-05	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	bitwise logic+setcc (PR32401) This is a generic combine enabled via target hook to reduce icmp logic as discussed in: https://bugs.llvm.org/show_bug.cgi?id=32401 It's likely that other targets will want to enable this hook for scalar transforms, and there are probably other patterns that can use bitwise logic to reduce comparisons. Note that we are missing an IR canonicalization for these patterns, and we will probably prefer the pair-of-compares form in IR (shorter, more likely to fold). Differential Revision: https://reviews.llvm.org/D31483 llvm-svn: 299542
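The title of this commit suggests rewriting a pair of equality compares as bitwise logic followed by a single compare. One way to read that, stated here as an assumption rather than as the exact DAG pattern in the patch, is that `(a == b) && (c == d)` is equivalent to `((a ^ b) | (c ^ d)) == 0`, and the or-of-setne form is its negation. The function names below are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Pair-of-compares form.
constexpr bool andOfEq(uint32_t a, uint32_t b, uint32_t c, uint32_t d) {
  return (a == b) && (c == d);
}

// Bitwise logic plus one compare: each xor is zero exactly when its operands
// are equal, so the or is zero exactly when both pairs are equal.
constexpr bool andOfEqFolded(uint32_t a, uint32_t b, uint32_t c, uint32_t d) {
  return ((a ^ b) | (c ^ d)) == 0;
}

constexpr bool orOfNe(uint32_t a, uint32_t b, uint32_t c, uint32_t d) {
  return (a != b) || (c != d);
}

constexpr bool orOfNeFolded(uint32_t a, uint32_t b, uint32_t c, uint32_t d) {
  return ((a ^ b) | (c ^ d)) != 0;
}

// Exhaustively compares both forms over a small range of inputs.
inline void checkSmallRange() {
  for (uint32_t a = 0; a < 4; ++a)
    for (uint32_t b = 0; b < 4; ++b)
      for (uint32_t c = 0; c < 4; ++c)
        for (uint32_t d = 0; d < 4; ++d) {
          assert(andOfEq(a, b, c, d) == andOfEqFolded(a, b, c, d));
          assert(orOfNe(a, b, c, d) == orOfNeFolded(a, b, c, d));
        }
}
```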
*	[X86][SSE] Renamed combine to make it clear that it only handles the vector ↵	Simon Pilgrim	2017-04-05	1	-4/+5
\| \| \| \| \| \|	shift by immediate opcodes. NFCI llvm-svn: 299532
*	Add MCContext argument to MCAsmBackend::applyFixup for error reporting	Alex Bradbury	2017-04-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A number of backends (AArch64, MIPS, ARM) have been using MCContext::reportError to report issues such as out-of-range fixup values in their TgtAsmBackend. This is great, but because MCContext couldn't easily be threaded through to the adjustFixupValue helper function from its usual callsite (applyFixup), these backends ended up adding an MCContext* argument and adding another call to applyFixup to processFixupValue. Adding an MCContext parameter to applyFixup makes this unnecessary, and even better - applyFixup can take a reference to MCContext rather than a potentially null pointer. Differential Revision: https://reviews.llvm.org/D30264 llvm-svn: 299529
*	[X86] Relax assert in broadcast-of-subvector lowering.	Ahmed Bougacha	2017-04-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before r294774, there was a problem when lowering broadcasts to use 128-bit subvectors. When we looked through a bitcast to find the broadcast input, we'd keep using the original type, so you'd end up with things like: (v8f32 (broadcast (v4f32 (extract_subvector (v8i32 V), ...)) )) r294774 fixed it to always emit subvectors with the scalar type of the original source. It also introduced some asserts, to check that we use scalars with the same size, and vectors with the same number of elements. The scalar size equality is checked earlier when looking through bitcasts, and is a useful assert. However, the number of elements don't have to be identical: we're always going to extract a 128-bit subvector, and we can have different size inputs if we looked through a concat_vector to find a 256-bit source. Relax the overzealous assert. Replace it with a check of the original source vector being 256 or 512 bits. If it's 128 bits, we can't extract_subvector from it. Fixes PR32371. llvm-svn: 299490
*	[x86] remove dead select-of-constants transform; NFCI	Sanjay Patel	2017-04-04	1	-12/+0
\| \| \| \| \| \| \| \|	https://reviews.llvm.org/D30537 / https://reviews.llvm.org/rL296977 added these transforms and other related transforms to the generic DAGCombiner (with a hook that x86 sets to true), so these patterns should not exist by the time we reach the target-specific combiner hook. llvm-svn: 299448
*	[X86][MS-compatibility] Allow named synonyms for MS-assembly operators	Coby Tayree	2017-04-04	1	-0/+27
\| \| \| \| \| \| \| \| \| \|	This patch enhances X86AsmParser's immediate expression parsing abilities to include named synonyms for selected binary/unary bitwise operators: {and,shl,shr,or,xor,not}, ultimately achieving better MS-compatibility. MASM reference: https://msdn.microsoft.com/en-us/library/94b6khh4.aspx Differential Revision: D31277 llvm-svn: 299439
*	Strip trailing whitespace	Simon Pilgrim	2017-04-04	1	-4/+4
\| \| \| \|	llvm-svn: 299438
*	[X86][LLVM] Converting __mm{\|256\|512}_movm_epi{8\|16\|32\|64} LLVMIR call into ↵	Michael Zuckerman	2017-04-04	1	-12/+0
\| \| \| \| \| \| \| \| \| \| \|	generic intrinsics. This patch is part one of two reviews, one for clang and the other for LLVM. The patch deletes the back-end intrinsics and adds support for them in the auto upgrade. Differential Revision: https://reviews.llvm.org/D31393 llvm-svn: 299432
*	[X86] Add 64 bit pattern matching for PSADBW	Oren Ben Simhon	2017-04-04	1	-13/+41
\| \| \| \| \| \| \| \| \|	The PSADBW pattern currently supports only the 32 bit IR pattern and only the GLT (greater than) comparison. The patch extends the pattern to also catch the 64 bit IR pattern and includes all other comparison types (not only GLT). Differential Revision: https://reviews.llvm.org/D31577 llvm-svn: 299425
*	[X86][SSE] Lower BUILD_VECTOR with repeated elts as BUILD_VECTOR + ↵	Simon Pilgrim	2017-04-03	1	-1/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	VECTOR_SHUFFLE It can be costly to transfer from the GPRs to the XMM registers and can prevent loads merging. This patch splits vXi16/vXi32/vXi64 BUILD_VECTORS that use the same operand in multiple elements into a BUILD_VECTOR with only a single insertion of each of those elements and then performs a unary shuffle to duplicate the values. There are a couple of minor regressions this patch unearths due to some missing MOVDDUP/BROADCAST folds that I will address in a future patch. Note: Now that vector shuffle lowering and combining is pretty good we should be reusing that instead of duplicating so much in LowerBUILD_VECTOR - this is the first of several patches to address this. Differential Revision: https://reviews.llvm.org/D31373 llvm-svn: 299387
*	x86 interrupt calling convention: re-align stack pointer on 64-bit if an ↵	Amjad Aboud	2017-04-03	2	-2/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	error code was pushed The x86_64 ABI requires that the stack is 16 byte aligned on function calls. Thus, the 8-byte error code, which is pushed by the CPU for certain exceptions, leads to a misaligned stack. This results in bugs such as Bug 26413, where misaligned movaps instructions are generated. This commit fixes the misalignment by adjusting the stack pointer in these cases. The adjustment is done at the beginning of the prologue generation by subtracting another 8 bytes from the stack pointer. These additional bytes are popped again in the function epilogue. Fixes Bug 26413 Patch by Philipp Oppermann. Differential Revision: https://reviews.llvm.org/D30049 llvm-svn: 299383
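The alignment arithmetic behind this fix can be worked through in a few lines. The sketch assumes, beyond what the commit message states, that the CPU aligns the stack to 16 bytes before pushing a 40-byte interrupt frame (SS, RSP, RFLAGS, CS, RIP) and that a normal function entry has the stack pointer at 8 modulo 16 because of the pushed return address. All names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Assumption: SS, RSP, RFLAGS, CS and RIP are pushed, 8 bytes each.
constexpr uint64_t kInterruptFrameBytes = 40;
// From the commit message: the error code is 8 bytes.
constexpr uint64_t kErrorCodeBytes = 8;

// Stack pointer seen on handler entry, given a 16-byte aligned starting point.
constexpr uint64_t entryStackPointer(uint64_t alignedTop, bool hasErrorCode) {
  return alignedTop - kInterruptFrameBytes -
         (hasErrorCode ? kErrorCodeBytes : 0);
}

// Extra prologue adjustment described in the commit: another 8 bytes are
// subtracted only when an error code was pushed.
constexpr uint64_t prologueAdjustment(bool hasErrorCode) {
  return hasErrorCode ? 8 : 0;
}

// A normal function entry has sp % 16 == 8 (16-byte aligned before the call
// pushed the return address).
constexpr bool matchesCallEntryAlignment(uint64_t sp) { return sp % 16 == 8; }

inline void interruptAlignmentExample() {
  const uint64_t top = 0x7000; // arbitrary 16-byte aligned address
  // Without an error code the entry alignment already matches.
  assert(matchesCallEntryAlignment(entryStackPointer(top, false)));
  // With an error code it is off by 8 until the prologue adjusts it.
  assert(!matchesCallEntryAlignment(entryStackPointer(top, true)));
  assert(matchesCallEntryAlignment(entryStackPointer(top, true) -
                                   prologueAdjustment(true)));
}
```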
*	[APInt] Move isMask and isShiftedMask out of APIntOps and into the APInt ↵	Craig Topper	2017-04-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	class. Implement them without memory allocation for multiword This moves the isMask and isShiftedMask functions to be class methods. They now use the MathExtras.h function for single word size and leading/trailing zeros/ones or countPopulation for the multiword size. The previous implementation made multiple temporary memory allocations to do the bitwise arithmetic operations to match the MathExtras.h implementation. Differential Revision: https://reviews.llvm.org/D31565 llvm-svn: 299362
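For readers unfamiliar with the two predicates, here is a standalone sketch for a single 64-bit word. It uses the usual bit tricks for the single-word case and a counting-based check that mirrors the leading/trailing-bit idea the message mentions for multiword values. The function names are hypothetical and this is not the APInt code itself.

```cpp
#include <cassert>
#include <cstdint>

// True if v is a non-empty run of ones starting at bit 0 (e.g. 0x00FF).
constexpr bool isMask64(uint64_t v) { return v != 0 && ((v + 1) & v) == 0; }

// True if v is a single non-empty run of ones anywhere (e.g. 0x0FF0).
// Filling the trailing zeros with ones must yield a plain mask.
constexpr bool isShiftedMask64(uint64_t v) {
  return v != 0 && isMask64((v - 1) | v);
}

inline unsigned countTrailingOnes(uint64_t v) {
  unsigned n = 0;
  while (v & 1) {
    ++n;
    v >>= 1;
  }
  return n;
}

inline unsigned countLeadingZeros(uint64_t v) {
  unsigned n = 0;
  for (uint64_t bit = 1ULL << 63; bit != 0 && (v & bit) == 0; bit >>= 1)
    ++n;
  return n;
}

// Counting-based variant: a mask is all trailing ones followed only by
// leading zeros, so the two counts must cover the whole width.
inline bool isMaskByCounting(uint64_t v) {
  const unsigned ones = countTrailingOnes(v);
  return ones > 0 && ones + countLeadingZeros(v) == 64;
}

inline void maskExample() {
  assert(isMask64(0xFF) && isMaskByCounting(0xFF));
  assert(!isMask64(0x0FF0) && isShiftedMask64(0x0FF0));
  assert(!isShiftedMask64(0x0F0F));
}
```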
*	[X86][MMX] Improve support for folding fptosi from XMM to MMX	Simon Pilgrim	2017-04-02	1	-0/+10
\| \| \| \|	llvm-svn: 299338
*	[X86][MMX] Simplify tablegen patterns by always combining MOVDQ2Q from v2i64	Simon Pilgrim	2017-04-02	1	-1/+2
\| \| \| \|	llvm-svn: 299336
*	[X86][MMX] Added support for subvector extraction to MMX register	Simon Pilgrim	2017-04-02	1	-2/+4
\| \| \| \|	llvm-svn: 299335
*	[AVX-512] Update lowering for gather/scatter prefetch intrinsics to match ↵	Craig Topper	2017-03-31	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	the immediate encodings the frontend uses based on the _MM_HINT_T0/T1 constant values in clang's headers. Our _MM_HINT_T0/T1 constant values are 3/2 which matches gcc, but not icc or Intel documentation. Interestingly gcc had this same bug in their implementation of the gather/scatter builtins at one point too. Fixes PR32411. llvm-svn: 299234
*	[DAGCombiner] Add vector demanded elements support to ComputeNumSignBits	Simon Pilgrim	2017-03-31	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently ComputeNumSignBits returns the minimum number of sign bits for all elements of vector data, when we may only be interested in one/some of the elements. This patch adds a DemandedElts argument that allows us to specify the elements we actually care about. The original ComputeNumSignBits implementation calls with a DemandedElts demanding all elements to match current behaviour. Scalar types set this to 1. I've only added support for BUILD_VECTOR and EXTRACT_VECTOR_ELT so far, all others will default to demanding all elements but can be updated in due course. Followup to D25691. Differential Revision: https://reviews.llvm.org/D31311 llvm-svn: 299219
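The effect of a demanded-elements mask can be shown with a toy model over constant 32-bit vector elements. This is only an illustration of the idea: the function names are hypothetical, the mask is a plain `uint64_t` rather than an APInt, and the vector is assumed to have at most 64 elements.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Number of leading bits equal to the sign bit, counting the sign bit itself.
inline unsigned numSignBits(int32_t v) {
  const uint32_t u = static_cast<uint32_t>(v);
  const uint32_t sign = u >> 31;
  unsigned n = 1;
  for (int bit = 30; bit >= 0 && ((u >> bit) & 1) == sign; --bit)
    ++n;
  return n;
}

// Minimum sign-bit count over only the elements selected by demandedElts
// (bit i set means element i is demanded).
inline unsigned minSignBits(const std::vector<int32_t> &elts,
                            uint64_t demandedElts) {
  unsigned result = 32;
  for (std::size_t i = 0; i < elts.size(); ++i)
    if ((demandedElts >> i) & 1)
      result = std::min(result, numSignBits(elts[i]));
  return result;
}

inline void demandedEltsExample() {
  const std::vector<int32_t> v = {1, -1, 0x7FFFFFFF, 0};
  // Demanding every element is limited by the worst element.
  assert(minSignBits(v, 0xF) == 1);
  // Demanding only elements 1 and 3 gives a much stronger answer.
  assert(minSignBits(v, 0xA) == 32);
}
```

Demanding all elements reproduces the conservative answer, while a narrower mask can report more sign bits, which is the benefit the commit describes.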
*	[DAGCombiner] Add vector demanded elements support to ↵	Simon Pilgrim	2017-03-31	2	-0/+2
\| \| \| \| \| \| \| \| \| \|	computeKnownBitsForTargetNode Follow up to D25691, this sets up the plumbing necessary to support vector demanded elements support in known bits calculations in target nodes. Differential Revision: https://reviews.llvm.org/D31249 llvm-svn: 299201
*	[AVX-512] Fix bad comment from r299112. NFC	Craig Topper	2017-03-30	1	-1/+2
\| \| \| \|	llvm-svn: 299114
*	[AVX-512] Fix another case where fastisel was generating a GR8 to VK1 copy. ↵	Craig Topper	2017-03-30	1	-2/+12
\| \| \| \| \| \| \| \|	This time after calls returning i1. Fixes PR32472. llvm-svn: 299112
*	Spelling mistakes in comments. NFCI.	Simon Pilgrim	2017-03-30	1	-2/+2
\| \| \| \| \| \|	Based on corrections mentioned in patch for clang for PR27635 llvm-svn: 299072
*	Spelling mistakes in comments. NFCI.	Simon Pilgrim	2017-03-30	1	-15/+15
\| \| \| \|	llvm-svn: 299069
*	[X86IselLowering] Remove extraneous semicolon. NFCI.	Davide Italiano	2017-03-29	1	-1/+1
\| \| \| \| \| \|	Unbreaks the build with GCC -Werror. llvm-svn: 299030
*	[X86] Tidied up comment - we don't custom lower add/sub i64 on i686 anymore. ↵	Simon Pilgrim	2017-03-29	1	-1/+2
\| \| \| \| \| \|	NFCI. llvm-svn: 299004
*	Spelling mistakes in comments. NFCI.	Simon Pilgrim	2017-03-29	1	-5/+5
\| \| \| \|	llvm-svn: 299000
*	[X86][AVX2] Prevent unary interleaving patterns from calling ↵	Simon Pilgrim	2017-03-29	1	-3/+4
\| \| \| \| \| \|	lowerVectorShuffleAsSplitOrBlend (PR32453) llvm-svn: 298993
*	[X86] Removed old comment. NFCI.	Simon Pilgrim	2017-03-29	1	-2/+1
\| \| \| \| \| \|	No longer makes sense as the previous opcode mnemonic it was referring to is long gone. llvm-svn: 298988
*	Move the x86 cpu feature rtm from Haswell to Skylake matching clang commit ↵	Eric Christopher	2017-03-29	1	-1/+1
\| \| \| \| \| \|	r298956. llvm-svn: 298986
*	[AVX-512] Remove explicit KMOVWrk from isel patterns. COPY_TO_REGCLASS to ↵	Craig Topper	2017-03-29	1	-8/+8
\| \| \| \| \| \|	GR32 is enough. llvm-svn: 298985