bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[mips] Support for +abs2008 attribute	Aleksandar Beserminji	2019-01-28	7	-5/+91
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instruction abs.[ds] is not generating correct result when working with NaNs for revisions prior mips32r6 and mips64r6. To generate a sequence which always produce a correct result, but also to allow user more control on how his code is compiled, attribute +abs2008 is added, so user can choose legacy or 2008. By default legacy mode is used on revisions prior R6. Mips32r6 and mips64r6 use abs2008 mode by default. Differential Revision: https://reviews.llvm.org/D35983 llvm-svn: 352370
*	[AMDGPU] Add intrinsics for 16 bit interpolation	Tim Corringham	2019-01-28	6	-3/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added the intrinsics llvm.amdgcn.interp.p1.f16() and llvm.amdgcn.interp.p2.f16() and related LIT test. The p1 intrinsic generates code appropriate for both 16 and 32 bank LDS. Reviewers: #amdgpu, dstuttard, arsenm, tpr Reviewed By: #amdgpu, arsenm Subscribers: jvesely, mgorny, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46754 llvm-svn: 352357
*	[MIPS GlobalISel] Select sub	Petar Avramovic	2019-01-28	3	-2/+70
\| \| \| \| \| \| \| \| \|	Lower G_USUBO and G_USUBE. Add narrowScalar for G_SUB. Legalize and select G_SUB for MIPS 32. Differential Revision: https://reviews.llvm.org/D53416 llvm-svn: 352351
*	[DebugInfo][DAG] Avoid re-ordering of DBG_VALUEs	Jeremy Morse	2019-01-28	1	-21/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch improves the placement of DBG_VALUEs when by SelectionDAG, which as documented in PR40427 can go very wrong. At the core of this is ProcessSourceNode, which assumes the last instruction in a BB is the start of the last processed IR instruction, which isn't always true. Instead, use a helper function to call InstrEmitter::EmitNode, that records before-and-after iterators and determines the first of any new instruction created during emission. This is passed to ProcessSourceNode, which can then make more elightened decisions about ordering for DBG_VALUE placement. Differential revision: https://reviews.llvm.org/D57163 llvm-svn: 352350
*	[ARM GlobalISel] Support integer division for Thumb2	Diana Picus	2019-01-28	1	-19/+21
\| \| \| \| \| \| \| \| \|	Support G_SDIV, G_UDIV, G_SREM and G_UREM. The only significant difference between arm and thumb mode is that we need to check a different subtarget feature. llvm-svn: 352346
*	[X86] Add new variadic avx512 compress/expand intrinsics that use vXi1 types ↵	Craig Topper	2019-01-28	3	-74/+26
\| \| \| \| \| \| \| \|	for the mask argument. Remove and autoupgrade the old intrinsics llvm-svn: 352343
*	[AArch64][GlobalISel] Teach RBS about G_FNEG default mapping.	Amara Emerson	2019-01-28	1	-0/+1
\| \| \| \|	llvm-svn: 352340
*	[AArch64][GlobalISel] Add some missing vector support for FP arithmetic ops.	Amara Emerson	2019-01-28	1	-2/+2
\| \| \| \| \| \| \|	Moved the fneg lowering legalization test from AArch64 to X86, as we want to specify that it's already legal. llvm-svn: 352338
*	[AArch64][GlobalISel] Add some vector support for fp <-> int conversions.	Amara Emerson	2019-01-28	2	-2/+6
\| \| \| \| \| \|	Some unrelated, but benign, test changes as well due to the test update script. llvm-svn: 352337
*	GlobalISel: Don't reduce elements for atomic load/store	Matt Arsenault	2019-01-27	1	-1/+9
\| \| \| \| \| \| \|	This is invalid for the same reason as in the narrowScalar handling for load. llvm-svn: 352334
*	[x86] add restriction for lowering to vpermps	Sanjay Patel	2019-01-27	1	-2/+19
\| \| \| \| \| \| \| \| \|	This transform was added with rL351346, and we had an escape for shufps, but we also want one for unpckps vs. vpermps because vpermps doesn't take an immediate shuffle index operand. llvm-svn: 352333
*	GlobalISel: Factor fewerElementVectors into separate functions	Matt Arsenault	2019-01-27	1	-156/+170
\| \| \| \|	llvm-svn: 352332
*	[X86][SSE] Add UNDEF handling to combineSelect ISD::USUBSAT matching (PR40083)	Simon Pilgrim	2019-01-27	1	-5/+7
\| \| \| \|	llvm-svn: 352330
*	[X86][SSE] Permit UNDEFs in combineAddToSUBUS matching (PR40083)	Simon Pilgrim	2019-01-27	1	-3/+4
\| \| \| \|	llvm-svn: 352328
*	[COFF] Add new relocation types.	Martin Storsjo	2019-01-27	2	-0/+6
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D57291 llvm-svn: 352324
*	[x86] refactor logic in lowerShuffleWithUndefHalf	Sanjay Patel	2019-01-27	1	-28/+49
\| \| \| \| \| \| \| \|	Although this is longer code, this is no-functional-change-intended. The goal is to untangle the conditions under which we bail out, so that's easier to adjust. llvm-svn: 352320
*	GlobalISel: Verify load/store has a pointer input	Matt Arsenault	2019-01-27	1	-1/+6
\| \| \| \| \| \| \|	I expected this to be automatically verified, but it seems nothing uses that the type index was declared as a "ptype" llvm-svn: 352319
*	Re-apply "r351584: "GlobalISel: Verify g_zextload and g_sextload""	Amara Emerson	2019-01-27	1	-1/+14
\| \| \| \| \| \| \|	I reverted it originally due to a bot failing. The underlying bug has been fixed as of r352311. llvm-svn: 352312
*	[AArch64][GlobalISel] Fix the G_EXTLOAD combiner creating non-extending ↵	Amara Emerson	2019-01-27	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \|	illegal instructions. This fixes loads like 's1 = load %p (load 1 from %p)' being combined with an extend into an illegal 's8 = g_extload %p (load 1 from %p)' which doesn't do any extension, by avoiding touching those < s8 size loads. This bug was uncovered by a verifier update r351584, which I reverted it to keep the bots green. llvm-svn: 352311
*	Revert "Add support for prefix-only CLI options"	Thomas Preud'homme	2019-01-27	1	-14/+5
\| \| \| \| \| \|	This reverts commit r351038. llvm-svn: 352310
*	[X86] Add some missing blsr patterns	Gabor Buella	2019-01-27	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The add+and sequence followed by a branch can happen e.g. when looping over the set bits of an integer: ``` while (x != 0) { func(x & ~x); x &= x - 1; } ``` Reviewed By: ctopper Differential Revision: https://reviews.llvm.org/D57296 llvm-svn: 352306
*	[X86] Add a pattern for (i64 (and (anyext def32:), 0x00000000FFFFFFFF)) to ↵	Craig Topper	2019-01-27	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	produce SUBREG_TO_REG def32 here means the producing instruction zeroed bits 63:32. We already do this for zext, but it looks like we can get an and+anyext sometimes. Spotted in the diffs from D33587. llvm-svn: 352303
*	GlobalISel: Fix typo in assert messages	Matt Arsenault	2019-01-27	1	-2/+2
\| \| \| \|	llvm-svn: 352301
*	GlobalISel: Implement narrowScalar for mul	Matt Arsenault	2019-01-27	2	-0/+48
\| \| \| \|	llvm-svn: 352300
*	GlobalISel: fewerElementsVector for intrinsic_trunc/intrinsic_round	Matt Arsenault	2019-01-27	2	-2/+5
\| \| \| \|	llvm-svn: 352298
*	AMDGPU/GlobalISel: Use scalarize instead of clampMaxNumElements	Matt Arsenault	2019-01-26	1	-2/+1
\| \| \| \|	llvm-svn: 352297
*	[GlobalISel][IRTranslator] Fix crash on translation of fneg.	Amara Emerson	2019-01-26	1	-1/+1
\| \| \| \| \| \| \|	When the fneg IR instruction was added the code to do translation wasn't tested, and tried to get an invalid operand. llvm-svn: 352296
*	AMDGPU/GlobalISel: Legalize more bit ops	Matt Arsenault	2019-01-26	2	-4/+10
\| \| \| \|	llvm-svn: 352295
*	AMDGPU/GlobalISel: Widen small uaddo/usubo	Matt Arsenault	2019-01-26	1	-1/+2
\| \| \| \|	llvm-svn: 352294
*	[ValueTracking] Look through casts when determining non-nullness	Johannes Doerfert	2019-01-26	1	-0/+22
\| \| \| \| \| \| \| \| \| \|	Bitcast and certain Ptr2Int/Int2Ptr instructions will not alter the value of their operand and can therefore be looked through when we determine non-nullness. Differential Revision: https://reviews.llvm.org/D54956 llvm-svn: 352293
*	[X86] combineAddOrSubToADCOrSBB/combineCarryThroughADD - use oneuse for ↵	Simon Pilgrim	2019-01-26	1	-2/+3
\| \| \| \| \| \| \| \| \| \|	entire SDNode Fix issue noted in D57281 that only tested the one use for the SDValue (the result flag), not the entire SUB. I've added the getNode() to make it clearer what is intended than just the -> redirection. llvm-svn: 352291
*	[X86] combineCarryThroughADD - add support for X86::COND_A commutations ↵	Simon Pilgrim	2019-01-26	1	-6/+25
\| \| \| \| \| \| \| \| \| \|	(PR24545) As discussed on PR24545, we should try to commute X86::COND_A 'icmp ugt' cases to X86::COND_B 'icmp ult' to more optimally bind the carry flag output to a SBB instruction. Differential Revision: https://reviews.llvm.org/D57281 llvm-svn: 352289
*	[X86] Fold X86ISD::SBB(ISD::SUB(X,Y),0) -> X86ISD::SBB(X,Y) (PR25858)	Simon Pilgrim	2019-01-26	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	We often generate X86ISD::SBB(X, 0) for carry flag arithmetic. I had tried to create test cases for the ADC equivalent (which often uses the same pattern) but haven't managed to find anything yet. Differential Revision: https://reviews.llvm.org/D57169 llvm-svn: 352288
*	[X86][SSE] Generalized unsigned compares to support nonsplat constant ↵	Simon Pilgrim	2019-01-26	1	-7/+10
\| \| \| \| \| \|	vectors (PR39859) llvm-svn: 352283
*	[x86] add helper for creating a half-width shuffle; NFC	Sanjay Patel	2019-01-26	1	-28/+39
\| \| \| \| \| \| \| \| \| \|	This reduces a bit of duplication between the combining and lowering places that use it, but the primary motivation is to make it easier to rearrange the lowering logic and solve PR40434: https://bugs.llvm.org/show_bug.cgi?id=40434 llvm-svn: 352280
*	[X86] Remove and autoupgrade vpconflict intrinsics that take a mask and ↵	Craig Topper	2019-01-26	2	-12/+16
\| \| \| \| \| \| \| \|	passthru argument. We have unmasked versions as of r352172 llvm-svn: 352270
*	Revert r352255 "[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked ↵	Craig Topper	2019-01-26	2	-16/+4
\| \| \| \| \| \| \| \|	loads in the type legalizer" This might be breaking an lldb windows buildbot. llvm-svn: 352268
*	[X86] Remove GCCBuiltins from 512-bit cvt(u)qqtops, cvt(u)qqtopd, and ↵	Craig Topper	2019-01-26	2	-39/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	cvt(u)dqtops intrinsics. Add new variadic uitofp/sitofp with rounding mode intrinsics. Summary: See clang patch D56998 for a full description. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56999 llvm-svn: 352266
*	[WebAssembly][NFC] Group SIMD-related ISel configuration	Thomas Lively	2019-01-26	1	-59/+45
\| \| \| \| \| \| \| \| \| \|	Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57263 llvm-svn: 352262
*	[PowerPC] Update Vector Costs for P9	Nemanja Ivanovic	2019-01-26	5	-12/+59
\| \| \| \| \| \| \| \| \| \| \| \| \|	For the power9 CPU, vector operations consume a pair of execution units rather than one execution unit like a scalar operation. Update the target transform cost functions to reflect the higher cost of vector operations when targeting Power9. Patch by RolandF. Differential revision: https://reviews.llvm.org/D55461 llvm-svn: 352261
*	[X86] Add DAG combine to merge vzext_movl with the various fp<->int ↵	Craig Topper	2019-01-26	3	-84/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	conversion operations that only write the lower 64-bits of an xmm register and zero the rest. Summary: We have isel patterns for this, but we're missing some load patterns and all broadcast patterns. A DAG combine seems like a better fit for this. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56971 llvm-svn: 352260
*	[SelectionDAG][X86] Don't use SEXTLOAD for promoting masked loads in the ↵	Craig Topper	2019-01-26	2	-4/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	type legalizer Summary: I'm not sure why we were using SEXTLOAD. EXTLOAD seems more appropriate since we don't care about the upper bits. This patch changes this and then modifies the X86 post legalization combine to emit a extending shuffle instead of a sign_extend_vector_inreg. Could maybe use an any_extend_vector_inreg, but I just did what we already do in LowerLoad. I think we can actually get rid of this code entirely if we switch to -x86-experimental-vector-widening-legalization. On AVX512 targets I think we might be able to use a masked vpmovzx and not have to expand this at all. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D57186 llvm-svn: 352255
*	[NFC] Test commit : fix typo.	Alexey Lapshin	2019-01-25	1	-1/+1
\| \| \| \|	llvm-svn: 352248
*	[RISCV] Add target DAG combine for bitcast fabs/fneg on RV32FD	Alex Bradbury	2019-01-25	1	-3/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	DAGCombiner::visitBITCAST will perform: fold (bitconvert (fneg x)) -> (xor (bitconvert x), signbit) fold (bitconvert (fabs x)) -> (and (bitconvert x), (not signbit)) As shown in double-bitmanip-dagcombines.ll, this can be advantageous. But RV32FD doesn't use bitcast directly (as i64 isn't a legal type), and instead uses RISCVISD::SplitF64. This patch adds an equivalent DAG combine for SplitF64. llvm-svn: 352247
*	[llvm] Opt-in flag for X86DiscriminateMemOps	Mircea Trofin	2019-01-25	2	-1/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently, if an instruction with a memory operand has no debug information, X86DiscriminateMemOps will generate one based on the first line of the enclosing function, or the last seen debug info. This may cause confusion in certain debugging scenarios. The long term approach would be to use the line number '0' in such cases, however, that brings in challenges: the base discriminator value range is limited (4096 values). For the short term, adding an opt-in flag for this feature. See bug 40319 (https://bugs.llvm.org/show_bug.cgi?id=40319) Reviewers: dblaikie, jmorse, gbedwell Reviewed By: dblaikie Subscribers: aprantl, eraman, hiraditya Differential Revision: https://reviews.llvm.org/D57257 llvm-svn: 352246
*	[GlobalISel][AArch64][NFC] Fix incorrect comment in selectUnmergeValues	Jessica Paquette	2019-01-25	1	-1/+1
\| \| \| \| \| \|	s/scalar/vector/ llvm-svn: 352243
*	Revert rL352238.	Alina Sbirlea	2019-01-25	1	-2/+2
\| \| \| \|	llvm-svn: 352241
*	[WarnMissedTransforms] Set default to 1.	Alina Sbirlea	2019-01-25	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Set default value for retrieved attributes to 1, since the check is against 1. Eliminates the warning noise generated when the attributes are not present. Reviewers: sanjoy Subscribers: jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D57253 llvm-svn: 352238
*	Reapply: [RISCV] Set isAsCheapAsAMove for ADDI, ORI, XORI, LUI	Ana Pazos	2019-01-25	3	-3/+18
\| \| \| \| \| \|	This reapplies commit r352010 with RISC-V test fixes. llvm-svn: 352237
*	[MBP] Don't move bottom block before header if it can't reduce taken branches	Guozhi Wei	2019-01-25	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If bottom of block BB has only one successor OldTop, in most cases it is profitable to move it before OldTop, except the following case: -->OldTop<- \| . \| \| . \| \| . \| ---Pred \| \| \| BB----- Move BB before OldTop can't reduce the number of taken branches, this patch detects this case and prevent the moving. Differential Revision: https://reviews.llvm.org/D57067 llvm-svn: 352236