bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[DAGCombiner] don't try to extract a fraction of a vector binop and crash ↵	Sanjay Patel	2018-12-05	1	-10/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	(PR39893) Because we're potentially peeking through a bitcast in this transform, we need to use overall bitwidths rather than number of elements to determine when it's safe to proceed. Should fix: https://bugs.llvm.org/show_bug.cgi?id=39893 llvm-svn: 348383
*	[TargetLowering] Remove ISD::ANY_EXTEND/ANY_EXTEND_VECTOR_INREG opcodes from ↵	Simon Pilgrim	2018-12-05	1	-2/+0
\| \| \| \| \| \| \| \|	SimplifyDemandedVectorElts These have no test coverage and the KnownZero flags can't be guaranteed unlike SIGN/ZERO_EXTEND cases. llvm-svn: 348361
*	[SelectionDAG] Initial support for FSHL/FSHR funnel shift opcodes (PR39467)	Simon Pilgrim	2018-12-05	6	-38/+149
\| \| \| \| \| \| \| \| \| \|	This is an initial patch to add a minimum level of support for funnel shifts to the SelectionDAG and to begin wiring it up to the X86 SHLD/SHRD instructions. Some partial legalization code has been added to handle the case for 'SlowSHLD' where we want to expand instead and I've added a few DAG combines so we don't get regressions from the existing DAG builder expansion code. Differential Revision: https://reviews.llvm.org/D54698 llvm-svn: 348353
*	Remove superfluous comments. NFCI.	Simon Pilgrim	2018-12-05	1	-44/+38
\| \| \| \| \| \|	As requested in D54698. llvm-svn: 348350
*	[TargetLowering] SimplifyDemandedVectorElts - don't alter DemandedElts mask	Simon Pilgrim	2018-12-05	1	-2/+3
\| \| \| \| \| \| \| \|	Fix potential issue with the ISD::INSERT_VECTOR_ELT case tweaking the DemandedElts mask instead of using a local copy - so later uses of the mask use the tweaked version..... Noticed while investigating adding zero/undef folding to SimplifyDemandedVectorElts and the altered DemandedElts mask was causing mismatches. llvm-svn: 348348
*	[SelectionDAG] Split very large token factors for loads into 64k chunks.	Amara Emerson	2018-12-05	2	-3/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There's a 64k limit on the number of SDNode operands, and some very large functions with 64k or more loads can cause crashes due to this limit being hit when a TokenFactor with this many operands is created. To fix this, create sub-tokenfactors if we've exceeded the limit. No test case as it requires a very large function. rdar://45196621 Differential Revision: https://reviews.llvm.org/D55073 llvm-svn: 348324
*	[SelectionDAG] Redefine isGAPlusOffset in terms of unwrapAddress. NFCI.	Nirav Dave	2018-12-04	1	-1/+4
\| \| \| \|	llvm-svn: 348288
*	[TargetLowering] expandFP_TO_UINT - avoid FPE due to out of range conversion ↵	Simon Pilgrim	2018-12-04	1	-11/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR17686) PR17686 demonstrates that for some targets FP exceptions can fire in cases where the FP_TO_UINT is expanded using a FP_TO_SINT instruction. The existing code converts both the inrange and outofrange cases using FP_TO_SINT and then selects the result, this patch changes this for 'strict' cases to pre-select the FP_TO_SINT input and the offset adjustment. The X87 cases don't need the strict flag but generates much nicer code with it.... Differential Revision: https://reviews.llvm.org/D53794 llvm-svn: 348251
*	[TargetLowering] Add SimplifyDemandedVectorElts support to EXTEND opcodes	Simon Pilgrim	2018-12-04	2	-0/+23
\| \| \| \| \| \| \| \|	Add support for ISD::_EXTEND and ISD::_EXTEND_VECTOR_INREG opcodes. The extra broadcast in trunc-subvector.ll will be fixed in an upcoming patch. llvm-svn: 348246
*	[DAGCombiner] narrow truncated vector binops when legal	Sanjay Patel	2018-12-03	1	-7/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the smallest vector enhancement I could find to D54640. Here, we're allowing narrowing to only legal vector ops because we'll see regressions without that. All of the test diffs are wins from what I can tell. With AVX/AVX512, we can shrink ymm/zmm ops to xmm. x86 vector multiplies are the problem case that we're avoiding due to the patchwork ISA, and it's not clear to me if we can dance around those regressions using TLI hooks or if we need preliminary patches to plug those holes. Differential Revision: https://reviews.llvm.org/D55126 llvm-svn: 348195
*	[X86] Add DAG combine to combine a v8i32->v8i16 truncate with a packuswb ↵	Craig Topper	2018-12-03	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	that truncates v8i16->v8i8. Summary: Under -x86-experimental-vector-widening-legalization, fp_to_uint/fp_to_sint with a smaller than 128 bit vector type results are custom type legalized by promoting the result to a 128 bit vector by promoting the elements, inserting an assertzext/assertsext, then truncating back to original type. The truncate will be further legalizdd to a pack shuffle. In the case of a v8i8 result type, we'll end up with a v8i16 fp_to_sint. This will need to be further legalized during vector op legalization by promoting to v8i32 and then truncating again. Under avx2 this produces good code with two pack instructions, but Under avx512 this will result in a truncate instruction and a packuswb instruction. But we should be able to get away with a single truncate instruction. The other option is to promote all the way to vXi32 result type during the first type legalization. But in some experimentation that seemed to require more work to produce good code for other configurations. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D54836 llvm-svn: 348158
*	[SelectionDAG] fold constant with undef vector per element	Sanjay Patel	2018-12-02	1	-10/+16
\| \| \| \| \| \| \| \| \| \| \| \|	This makes the SDAG behavior consistent with the way we do this in IR. It's possible that we were getting the wrong answer before. For example, 'xor undef, undef --> 0' but 'xor undef, C' --> undef. But the most practical improvement is likely as shown in the tests here - for FP, we were overconstraining undef lanes to NaN, and that can prevent vector simplifications/narrowing (see D51553). llvm-svn: 348090
*	[DAGCombiner] guard against an oversized shift crash	Sanjay Patel	2018-12-02	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change prevents the crash noted in the post-commit comments for rL347478 : http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181119/605166.html We can't guarantee that an oversized shift amount is folded away, so we have to check for it. Note that I committed an incomplete fix for that crash with: rL347502 But as discussed here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181126/605679.html ...we have to try harder. So I'm not sure how to expose the bug now (and apparently no fuzzers have found a way yet either). On the plus side, we have discovered that we're missing real optimizations by not simplifying nodes sooner, so the earlier fix still has value, and there's likely more value in extending that so we can simplify more opcodes and simplify when doing RAUW and/or putting nodes on the combiner worklist. Differential Revision: https://reviews.llvm.org/D54954 llvm-svn: 348089
*	[SelectionDAG] Improve SimplifyDemandedBits to SimplifyDemandedVectorElts ↵	Simon Pilgrim	2018-12-01	1	-5/+3
\| \| \| \| \| \| \| \| \| \| \| \|	simplification D52935 introduced the ability for SimplifyDemandedBits to call SimplifyDemandedVectorElts through BITCASTs if the demanded bit mask entirely covered the sub element. This patch relaxes this to demanding an element if we need any bit from it. Differential Revision: https://reviews.llvm.org/D54761 llvm-svn: 348073
*	AMDGPU: Fix various issues around the VirtReg2Value mapping	Nicolai Haehnle	2018-11-30	1	-2/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The VirtReg2Value mapping is crucial for getting consistently reliable divergence information into the SelectionDAG. This patch fixes a bunch of issues that lead to incorrect divergence info and introduces tight assertions to ensure we don't regress: 1. VirtReg2Value is generated lazily; there were some cases where a lookup was performed before all relevant virtual registers were created, leading to an out-of-sync mapping. Those cases were: - Complex code to lower formal arguments that generated CopyFromReg nodes from live-in registers (fixed by never querying the mapping for live-in registers). - Code that generates CopyToReg for formal arguments that are used outside the entry basic block (fixed by never querying the mapping for Register nodes, which don't need the divergence info anyway). 2. For complex values that are lowered to a sequence of registers, all registers must be reflected in the VirtReg2Value mapping. I am not adding any new tests, since I'm not actually aware of any bugs that these problems are causing with trunk as-is. However, I recently added a test case (in r346423) which fails when D53283 is applied without this change. Also, the new assertions should provide most of the effective test coverage. There is one test change in sdwa-peephole.ll. The underlying issue is that since the divergence info is now correct, the DAGISel will select V_OR_B32 directly instead of S_OR_B32. This leads to an extra COPY which affects the behavior of MachineLICM in a way that ends up with the S_MOV_B32 with the constant in a different basic block than the V_OR_B32, which is presumably what defeats the peephole. Reviewers: alex-t, arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D54340 llvm-svn: 348049
*	[SelectionDAG] fold FP binops with 2 undef operands to undef	Sanjay Patel	2018-11-30	1	-2/+4
\| \| \| \|	llvm-svn: 348016
*	[CodeGen] Prefer static frame index for STATEPOINT liveness args	Than McIntosh	2018-11-30	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If a given liveness arg of STATEPOINT is at a fixed frame index (e.g. a function argument passed on stack), prefer to use this fixed location even the address is also in a register. If we use the register it will generate a spill, which is not necessary since the fixed frame index can be directly recorded in the stack map. Patch by Cherry Zhang <cherryyz@google.com>. Reviewers: thanm, niravd, reames Reviewed By: reames Subscribers: cherryyz, reames, anna, arphaman, llvm-commits Differential Revision: https://reviews.llvm.org/D53889 llvm-svn: 347998
*	TableGen/ISel: Allow PatFrag predicate code to access captured operands	Nicolai Haehnle	2018-11-30	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This simplifies writing predicates for pattern fragments that are automatically re-associated or commuted. For example, a followup patch adds patterns for fragments of the form (add (shl $x, $y), $z) to the AMDGPU backend. Such patterns are automatically commuted to (add $z, (shl $x, $y)), which makes it basically impossible to refer to $x, $y, and $z generically in the PredicateCode. With this change, the PredicateCode can refer to $x, $y, and $z simply as `Operands[i]`. Test confirmed that there are no changes to any of the generated files when building all (non-experimental) targets. Change-Id: I61c00ace7eed42c1d4edc4c5351174b56b77a79c Reviewers: arsenm, rampitec, RKSimon, craig.topper, hfinkel, uweigand Subscribers: wdng, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D51994 llvm-svn: 347992
*	[SelectionDAG] Support result type promotion for FLT_ROUNDS_	Alex Bradbury	2018-11-30	2	-0/+10
\| \| \| \| \| \| \| \| \|	For targets where i32 is not a legal type (e.g. 64-bit RISC-V), LegalizeIntegerTypes must promote the result of ISD::FLT_ROUNDS_. Differential Revision: https://reviews.llvm.org/D53820 llvm-svn: 347986
*	[SelectionDAG] Support promotion of PREFETCH operands	Alex Bradbury	2018-11-30	2	-0/+15
\| \| \| \| \| \| \| \| \|	For targets where i32 is not a legal type (e.g. 64-bit RISC-V), LegalizeIntegerTypes must promote the operands of ISD::PREFETCH. Differential Revision: https://reviews.llvm.org/D53281 llvm-svn: 347980
*	[SelectionDAG] Support promotion of FRAMEADDR/RETURNADDR operands	Alex Bradbury	2018-11-30	2	-0/+10
\| \| \| \| \| \| \| \| \|	For targets where i32 is not a legal type (e.g. 64-bit RISC-V), LegalizeIntegerTypes must promote the operand. Differential Revision: https://reviews.llvm.org/D53279 llvm-svn: 347978
*	[TargetLowering][RISCV] Introduce isSExtCheaperThanZExt hook and implement ↵	Alex Bradbury	2018-11-30	2	-10/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for RISC-V DAGTypeLegalizer::PromoteSetCCOperands currently prefers to zero-extend operands when it is able to do so. For some targets this is more expensive than a sign-extension, which is also a valid choice. Introduce the isSExtCheaperThanZExt hook and use it in the new SExtOrZExtPromotedInteger helper. On RISC-V, we prefer sign-extension for FromTy == MVT::i32 and ToTy == MVT::i64, as it can be performed using a single instruction. Differential Revision: https://reviews.llvm.org/D52978 llvm-svn: 347977
*	[DAGCombiner] narrow truncated binops	Sanjay Patel	2018-11-29	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The motivating case for this is shown in: https://bugs.llvm.org/show_bug.cgi?id=32023 and the corresponding rot16.ll regression tests. Because x86 scalar shift amounts are i8 values, we can end up with trunc-binop-trunc sequences that don't get folded in IR. As the TODO comments suggest, there will be regressions if we extend this (for x86, we mostly seem to be missing LEA opportunities, but there are likely vector folds missing too). I think those should be considered existing bugs because this is the same transform that we do as an IR canonicalization in instcombine. We just need more tests to make those visible independent of this patch. Differential Revision: https://reviews.llvm.org/D54640 llvm-svn: 347917
*	[SelectionDAG][AArch64][X86] Move legalization of vector MULHS/MULHU from ↵	Craig Topper	2018-11-29	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	LegalizeDAG to LegalizeVectorOps I believe we should be legalizing these with the rest of vector binary operations. If any custom lowering is required for these nodes, this will give the DAG combine between LegalizeVectorOps and LegalizeDAG to run on the custom code before constant build_vectors are lowered in LegalizeDAG. I've moved MULHU/MULHS handling in AArch64 from Lowering to isel. Moving the lowering earlier caused build_vector+extract_subvector simplifications to kick in which made the generated code worse. Differential Revision: https://reviews.llvm.org/D54276 llvm-svn: 347902
*	[LegalizeVectorTypes][X86][ARM][AArch64][PowerPC] Don't use ↵	Craig Topper	2018-11-26	1	-7/+2
\| \| \| \| \| \| \| \| \| \| \| \|	SplitVecOp_TruncateHelper for FP_TO_SINT/UINT. SplitVecOp_TruncateHelper tries to promote the result type while splitting FP_TO_SINT/UINT. It then concatenates the result and introduces a truncate to the original result type. But it does this without inserting the AssertZExt/AssertSExt that the regular result type promotion would insert. Nor does it turn FP_TO_UINT into FP_TO_SINT the way normal result type promotion for these operations does. This is bad on X86 which doesn't support FP_TO_SINT until AVX512. This patch disables the use of SplitVecOp_TruncateHelper for these operations and just lets normal promotion handle it. I've tweaked a couple things in X86ISelLowering to avoid a few obvious regressions there. I believe all the changes on X86 are improvements. The other targets look neutral. Differential Revision: https://reviews.llvm.org/D54906 llvm-svn: 347593
*	[SelectionDAG] Teach BaseIndexOffset::match to unwrap the base after looking ↵	Craig Topper	2018-11-26	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	through an add/or We might find a target specific node that needs to be unwrapped after we look through an add/or. Otherwise we get inconsistent results if one pointer is just X86WrapperRIP and the other is (add X86WrapperRIP, C) Differential Revision: https://reviews.llvm.org/D54818 llvm-svn: 347591
*	[x86] limit transform for select-of-fp-constants	Sanjay Patel	2018-11-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	This should likely be adjusted to limit this transform further, but these diffs should be clear wins. If we have blendv/conditional move, then we should assume those are cheap ops. The loads become independent of the compare, so those can be speculated before we need to use the values in the blend/mov. llvm-svn: 347526
*	[SelectionDAG] move constant or splat functions to common location	Sanjay Patel	2018-11-25	2	-39/+28
\| \| \| \| \| \| \| \|	rL347502 moved the null sibling, so we should group all of these together. I'm not sure why these aren't methods of the SDValue class itself, but that's another patch if that's possible. llvm-svn: 347523
*	[DAG] consolidate shift simplifications	Sanjay Patel	2018-11-23	2	-74/+58
\| \| \| \| \| \| \| \| \| \|	...and use them to avoid creating obviously undef values as discussed in the post-commit thread for r347478. The diffs in vector div/rem show that we were missing real optimizations by creating bogus shift nodes. llvm-svn: 347502
*	[LegalizeVectorTypes] Don't use SplitVecOp_TruncateHelper if we're heading ↵	Craig Topper	2018-11-23	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \|	towards scalarizing the type. This code takes a truncate, fp_to_int, or int_to_fp with a legal result type and an input type that needs to be split and enlarges the elements in the result type before doing the split. Then inserts a follow up truncate or fp_round after concatenating the two halves back together. But if the input type of the original op is being split on its way to ultimately being scalarized we're just going to end up building a vector from scalars and then truncating or rounding it in the vector register. Seems kind of silly to enlarge the result element type of the operation only to end up with scalar code and then building a vector with large elements only to make the elements smaller again in the vector register. Seems better to just try to get away producing smaller result types in the scalarized code. The X86 test case that changes is a pretty contrived test case that exists because of a bug we used to have in our AVG matching code. I think the code is better now, but its not realistic anyway. llvm-svn: 347482
*	[LegalizeVectorTypes] Have SplitVecOp_TruncateHelper fall back to ↵	Craig Topper	2018-11-22	1	-1/+7
\| \| \| \| \| \| \| \| \| \|	SplitVecOp_UnaryOp if splitting the output type would be a legal type. SplitVecOp_TruncateHelper tries to introduce a multilevel truncate to avoid scalarization. But if splitting the result type would still be a legal type we don't need to do that. The comment block at the top of the function implied that this was already implemented. I looked back through the history and it doesn't look to have ever been checked. llvm-svn: 347479
*	[DAGCombiner] form 'not' ops ahead of shifts (PR39657)	Sanjay Patel	2018-11-22	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We fail to canonicalize IR this way (prefer 'not' ops to arbitrary 'xor'), but that would not matter without this patch because DAGCombiner was reversing that transform. I think we need this transform in the backend regardless of what happens in IR to catch cases where the shift-xor is formed late from GEP or other ops. https://rise4fun.com/Alive/NC1 Name: shl Pre: (-1 << C2) == C1 %shl = shl i8 %x, C2 %r = xor i8 %shl, C1 => %not = xor i8 %x, -1 %r = shl i8 %not, C2 Name: shr Pre: (-1 u>> C2) == C1 %sh = lshr i8 %x, C2 %r = xor i8 %sh, C1 => %not = xor i8 %x, -1 %r = lshr i8 %not, C2 https://bugs.llvm.org/show_bug.cgi?id=39657 llvm-svn: 347478
*	[DAGCombiner] refactor select-of-FP-constants transform	Sanjay Patel	2018-11-21	1	-53/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This transform needs to be limited. We are converting to a constant pool load very early, and we are turning loads that are independent of the select condition (and therefore speculatable) into a dependent non-speculatable load. We may also be transferring a condition code from an FP register to integer to create that dependent load. llvm-svn: 347424
*	[DAGCombiner] reduce code duplication; NFC	Sanjay Patel	2018-11-21	1	-33/+30
\| \| \| \|	llvm-svn: 347410
*	[TargetLowering] SimplifyDemandedBits - only reduce known bits for integer ↵	Simon Pilgrim	2018-11-21	1	-1/+3
\| \| \| \| \| \| \| \|	constants Avoids fuzzing crash found by Mikael Holmén. llvm-svn: 347393
*	Test commit: Delete trailing space in comment	Nikita Popov	2018-11-21	1	-1/+1
\| \| \| \|	llvm-svn: 347385
*	[DAGCombiner] look through bitcasts when trying to narrow vector binops	Sanjay Patel	2018-11-20	1	-13/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is another step in vector narrowing - a follow-up to D53784 (and hoping to eventually squash potential regressions seen in D51553). The x86 test diffs are wins, but the AArch64 diff is probably not. That problem already exists independent of this patch (see PR39722), but it went unnoticed in the previous patch because there were no regression tests that showed the possibility. The x86 diff in i64-mem-copy.ll is close. Given the frequency throttling concerns with using wider vector ops, an extra extract to reduce vector width is the right trade-off at this level of codegen. Differential Revision: https://reviews.llvm.org/D54392 llvm-svn: 347356
*	[DAGCombine] Add calls to SimplifyDemandedVectorElts from ↵	Simon Pilgrim	2018-11-20	2	-1/+5
\| \| \| \| \| \| \| \|	visitINSERT_SUBVECTOR (PR37989) This uncovered an off-by-one typo in SimplifyDemandedVectorElts's INSERT_SUBVECTOR handling as its bounds check was bailing on safe indices. llvm-svn: 347313
*	[TargetLowering] Improve SimplifyDemandedVectorElts/SimplifyDemandedBits support	Simon Pilgrim	2018-11-20	1	-0/+17
\| \| \| \| \| \| \| \| \| \|	For bitcast nodes from larger element types, add the ability for SimplifyDemandedVectorElts to call SimplifyDemandedBits by merging the elts mask to a bits mask. I've raised https://bugs.llvm.org/show_bug.cgi?id=39689 to deal with the few places where SimplifyDemandedBits's lack of vector handling is a problem. Differential Revision: https://reviews.llvm.org/D54679 llvm-svn: 347301
*	[SelectionDAG] Compute known bits and num sign bits for live out vector ↵	Craig Topper	2018-11-20	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	registers. Use it to add AssertZExt/AssertSExt in the live in basic blocks Summary: We already support this for scalars, but it was explicitly disabled for vectors. In the updated test cases this allows us to see the upper bits are zero to use less multiply instructions to emulate a 64 bit multiply. This should help with this ispc issue that a coworker pointed me to https://github.com/ispc/ispc/issues/1362 Reviewers: spatel, efriedma, RKSimon, arsenm Reviewed By: spatel Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D54725 llvm-svn: 347287
*	[DAGCombiner] reduce code duplication in visitXOR; NFC	Sanjay Patel	2018-11-20	1	-32/+29
\| \| \| \|	llvm-svn: 347278
*	Implement computeKnownBits for scalar_to_vector	Stanislav Mekhanoshin	2018-11-19	1	-0/+13
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D54728 llvm-svn: 347274
*	[DAGCombine] SimplifyNodeWithTwoResults - ensure same legalization for LO/HI ↵	Simon Pilgrim	2018-11-19	1	-8/+6
\| \| \| \| \| \| \| \| \| \|	operands (PR21207) Consistently use (!LegalOperations \|\| isOperationLegalOrCustom) for all node pairs. Differential Revision: https://reviews.llvm.org/D53478 llvm-svn: 347255
*	[TargetLowering] expandFP_TO_UINT - improve fp16 support	Simon Pilgrim	2018-11-19	1	-10/+18
\| \| \| \| \| \| \| \| \| \|	As discussed on D53794, for float types with ranges smaller than the destination integer type, then we should be able to just use a regular FP_TO_SINT opcode. I thought we'd need to provide MSA test cases for very small integer types as well (fp16 -> i8 etc.), but it turns out that promotion will kick in so they're unnecessary. Differential Revision: https://reviews.llvm.org/D54703 llvm-svn: 347251
*	[SelectionDAG] simplify vector select with undef operand(s)	Sanjay Patel	2018-11-19	2	-5/+12
\| \| \| \|	llvm-svn: 347227
*	[SelectionDAG] simplify select FP with undef condition	Sanjay Patel	2018-11-19	1	-1/+1
\| \| \| \|	llvm-svn: 347212
*	[SelectionDAG] add simplifySelect() to reduce code duplication; NFC	Sanjay Patel	2018-11-19	2	-36/+27
\| \| \| \| \| \|	This should be extended to handle FP and vectors in follow-up patches. llvm-svn: 347210
*	[DAG] add undef simplifications for select nodes	Sanjay Patel	2018-11-18	2	-14/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Sadly, this duplicates (twice) the logic from InstSimplify. There might be some way to at least share the DAG versions of the code, but copying the folds seems to be the standard method to ensure that we don't miss these folds. Unlike in IR, we don't run DAGCombiner to fixpoint, so there's no way to ensure that we do these kinds of simplifications unless the code is repeated at node creation time and during combines. There were other tests that would become worthless with this improvement that I changed as pre-commits: rL347161 rL347164 rL347165 rL347166 rL347167 I'm not sure how to salvage the remaining tests (diffs in this patch). So the x86 tests verify that the new code is working as intended. The AMDGPU test is actually similar to my motivating case: we have some undef value that has survived to machine IR in an x86 test, and then it gets folded in some weird way, or we crash if we don't transfer the undef flag. But we would have been better off never getting to that point by doing these simplifications. This will lead back to PR32023 someday... https://bugs.llvm.org/show_bug.cgi?id=32023 llvm-svn: 347170
*	[SelectionDAG] simplify code; NFC	Sanjay Patel	2018-11-18	1	-6/+5
\| \| \| \|	llvm-svn: 347160
*	Use llvm::copy. NFC	Fangrui Song	2018-11-17	1	-3/+3
\| \| \| \|	llvm-svn: 347126