bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[ConstantRange] Rename isWrappedSet() to isUpperWrapped()	Nikita Popov	2019-03-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Split out from D59749. The current implementation of isWrappedSet() doesn't do what it says on the tin, and treats ranges like [X, Max] as wrapping, because they are represented as [X, 0) when using half-inclusive ranges. This also makes it inconsistent with the semantics of isSignWrappedSet(). This patch renames isWrappedSet() to isUpperWrapped(), in preparation for the introduction of a new isWrappedSet() method with corrected behavior. llvm-svn: 357107
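The distinction described above can be sketched on plain integers. This is an illustration only: the function names are hypothetical Python stand-ins, and the "corrected" predicate is an assumption (a range whose upper bound is 0 is treated as non-wrapping), not code taken from the patch.

```python
# Half-open unsigned ranges [lower, upper) over 8-bit values.
# Hypothetical helpers; not the LLVM ConstantRange API.

def is_upper_wrapped(lower: int, upper: int) -> bool:
    # Old isWrappedSet() behaviour as described: any lower > upper counts,
    # including [X, 0), which is how [X, Max] is represented.
    return lower > upper

def is_wrapped_set(lower: int, upper: int) -> bool:
    # Assumed corrected behaviour: [X, 0) ends exactly at Max and does not
    # actually wrap around, so it is excluded.
    return lower > upper and upper != 0
```

For example, the range [200, 0) covers 200..255: it is "upper wrapped" but, under the assumption above, not a wrapped set.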
*	[DAGCombiner] Unify Lifetime and memory Op aliasing.	Nirav Dave	2019-03-27	2	-79/+120
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rework BaseIndexOffset and isAlias to fully work with lifetime nodes and fold in lifetime alias analysis. This is mostly NFC. Reviewers: courbet Reviewed By: courbet Subscribers: hiraditya, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59794 llvm-svn: 357070
*	[DAGCombine] Refactor GatherAllAliases. NFCI.	Nirav Dave	2019-03-27	1	-65/+66
\| \| \| \|	llvm-svn: 357069
*	Re-commit r355490 "[CodeGen] Omit range checks from jump tables when ↵	Hans Wennborg	2019-03-27	2	-55/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	lowering switches with unreachable default" Original commit by Ayonam Ray. This commit adds a regression test for the issue discovered in the previous commit: that the range check for the jump table can only be omitted if the fall-through destination of the jump table is unreachable, which isn't necessarily true just because the default of the switch is unreachable. This addresses the missing optimization in PR41242. > During the lowering of a switch that would result in the generation of a > jump table, a range check is performed before indexing into the jump > table, for the switch value being outside the jump table range and a > conditional branch is inserted to jump to the default block. In case the > default block is unreachable, this conditional jump can be omitted. This > patch implements omitting this conditional branch for unreachable > defaults. > > Differential Revision: https://reviews.llvm.org/D52002 > Reviewers: Hans Wennborg, Eli Freidman, Roman Lebedev llvm-svn: 357067
*	[DAGCombiner] Don't allow addcarry if the carry producer is illegal.	Jonas Paulsson	2019-03-27	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	getAsCarry() checks that the input argument is a carry-producing node before allowing a transformation to addcarry. This patch adds a check to make sure that the carry-producing node is legal. If it is not, it may not remain in a form that is manageable by the target backend. The test case caused a compilation failure during instruction selection for this reason on SystemZ. Patch by Ulrich Weigand. Review: Sanjay Patel https://reviews.llvm.org/D59822 llvm-svn: 357052
*	[SDAG] add simplifications for FP at node creation time	Sanjay Patel	2019-03-26	1	-0/+27
\| \| \| \| \| \| \| \|	We have the folds for fadd/fsub/fmul already in DAGCombiner, so it may be possible to remove that code if we can guarantee that these ops are zapped before they can exist. llvm-svn: 357029
*	[DAG] Avoid smart constructor-based dangling nodes.	Nirav Dave	2019-03-26	2	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Various SelectionDAG non-combine operations (e.g. the getNode smart constructor and legalization) may leave dangling nodes by applying optimizations or not fully pruning unused result values. This can result in nodes that are never added to the worklist and therefore can not be pruned. Add a node inserter as the current node deleter to make sure such nodes have the chance of being pruned. Many minor changes, mostly positive. llvm-svn: 356996
*	[TargetLowering] Add SimplifyDemandedBits support for ISD::INSERT_VECTOR_ELT	Simon Pilgrim	2019-03-26	2	-3/+45
\| \| \| \| \| \| \| \| \| \| \| \|	This helps us relax the extension of a lot of scalar elements before they are inserted into a vector. It exposes an issue in DAGCombiner::convertBuildVecZextToZext as some/all the zero-extensions may be relaxed to ANY_EXTEND, so we need to handle that case to avoid a couple of AVX2 VPMOVZX test regressions. Once this is in it should be easier to fix a number of remaining failures to fold loads into VBROADCAST nodes. Differential Revision: https://reviews.llvm.org/D59484 llvm-svn: 356989
*	Fix nondeterminism introduced in r353954	Yi Kong	2019-03-26	2	-2/+3
\| \| \| \| \| \| \| \| \| \|	DenseMap iteration order is not guaranteed, use MapVector instead. Fix provided by srhines. Differential Revision: https://reviews.llvm.org/D59807 llvm-svn: 356988
*	[SelectionDAG] Add icmp UNDEF handling to SelectionDAG::FoldSetCC	Simon Pilgrim	2019-03-25	1	-3/+19
\| \| \| \| \| \| \| \| \| \|	First half of PR40800, this patch adds DAG undef handling to icmp instructions to match the behaviour in llvm::ConstantFoldCompareInstruction and SimplifyICmpInst. This permits constant folding of vector comparisons where some elements had been reduced to UNDEF (by SimplifyDemandedVectorElts etc.). This involved a lot of tweaking to reduced tests as bugpoint loves to reduce icmp arguments to undef. Differential Revision: https://reviews.llvm.org/D59363 llvm-svn: 356938
*	[LegalizeDAG] Expand i16 bswap directly to a rotate by 8 instead of relying ↵	Craig Topper	2019-03-24	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	on DAG combine. An i16 bswap can be implemented with an i16 rotate by 8. We previously emitted a shift and OR sequence that DAG combine should be able to turn back into rotate. But we might as well go there directly. If rotate isn't legal, LegalizeDAG should further legalize it to either the opposite rotate, or the shift and OR pattern. I don't know of any way to get the existing DAG combine reliance to fail. So I don't know any way to add new tests for this that wouldn't have worked previously. llvm-svn: 356860
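The equivalence this commit relies on can be checked on plain integers. A minimal sketch, with hypothetical helper names that model 16-bit values as Python ints:

```python
# Hypothetical 16-bit helpers; not LLVM code.

def bswap16(x: int) -> int:
    # Shift-and-OR form: swap the low and high bytes.
    return ((x & 0xFF) << 8) | ((x >> 8) & 0xFF)

def rotl16(x: int, n: int) -> int:
    # Rotate a 16-bit value left by n bits.
    n %= 16
    return ((x << n) | (x >> (16 - n))) & 0xFFFF
```

Because both bytes move by exactly 8 bits, rotating left or right by 8 gives the same result, which is why either rotate direction works as a fallback.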
*	[TargetLowering] SimplifyDemandedBits trunc(srl(x, C1)) - early out for out ↵	Simon Pilgrim	2019-03-22	1	-19/+19
\| \| \| \| \| \|	of range C1. NFCI. llvm-svn: 356810
*	[DAGCombiner] Use getTokenFactor in a few more cases.	Florian Hahn	2019-03-21	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SDNodes can only have 64k operands and for some inputs (e.g. large number of stores), we can reach this limit when creating TokenFactor nodes. This patch is a follow up to D56740 and updates a few more places that potentially can create TokenFactors with too many operands. Reviewers: efriedma, craig.topper, aemerson, RKSimon Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D59156 llvm-svn: 356668
*	[DAGCombine] SimplifySelectCC - call FoldSetCC with the setcc result type	Simon Pilgrim	2019-03-21	1	-2/+3
\| \| \| \| \| \| \| \|	We were calling FoldSetCC with the compare operand type instead of the result type. Found by OSS-Fuzz #13838 (https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=13838) llvm-svn: 356667
*	[SelectionDAG] Add scalarization of ABS node (PR41149)	Simon Pilgrim	2019-03-21	1	-0/+1
\| \| \| \| \| \| \| \|	Patch by: @ikulagin (Ivan Kulagin) Differential Revision: https://reviews.llvm.org/D59577 llvm-svn: 356656
*	Remove out of date comment. NFCI.	Simon Pilgrim	2019-03-20	1	-1/+0
\| \| \| \| \| \|	DAGCombiner::convertBuildVecZextToZext just requires the extractions to be sequential, they don't have to start from 0'th index. llvm-svn: 356552
*	Fix for ABS legalization on PPC buildbot.	Simon Pilgrim	2019-03-19	1	-0/+1
\| \| \| \|	llvm-svn: 356498
*	Allow unordered loads to be considered invariant in CodeGen	Philip Reames	2019-03-19	1	-0/+5
\| \| \| \| \| \| \| \| \| \|	The actual code change is fairly straightforward, but exercising it isn't. First, it turned out we weren't adding the appropriate flags in SelectionDAG. Second, it turned out that we've got some optimization gaps, so obvious test cases don't work. My first attempt (in atomic-unordered.ll) points out a deficiency in our peephole-opt folding logic which I plan to fix separately. Instead, I'm exercising this through MachineLICM. Differential Revision: https://reviews.llvm.org/D59375 llvm-svn: 356494
*	[DAGCombine] Fix a miscompile when reducing BUILD_VECTORs to a shuffle	Justin Bogner	2019-03-19	1	-11/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In r311255 we added a case where we split vectors whose elements are all derived from the same input vector so that we could shuffle it more efficiently. In doing so, createBuildVecShuffle was taught to adjust for the fact that all indices would be based off of the first vector when this happens, but it's possible for the code that checked that to fire incorrectly if we happen to have a BUILD_VECTOR of extracts from subvectors and don't hit this new optimization. Instead of trying to detect if we've split the vector by checking if we have extracts from the same base vector, we can just pass that information into createBuildVecShuffle, avoiding the miscompile. Differential Revision: https://reviews.llvm.org/D59507 llvm-svn: 356476
*	[SelectionDAG] Handle unary SelectPatternFlavor for ABS case in ↵	Simon Pilgrim	2019-03-19	5	-38/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SelectionDAGBuilder::visitSelect These changes are related to PR37743 and include: SelectionDAGBuilder::visitSelect handles the unary SelectPatternFlavor::SPF_ABS case to build ABS node. Delete the redundant recognizer of the integer ABS pattern from the DAGCombiner. Add promoting the integer ABS node in the LegalizeIntegerType. Expand-based legalization of integer result for the ABS nodes. Expand-based legalization of ABS vector operations. Add some integer abs testcases for different type sizes for Thumb arch. Add the custom ABS expanding and change the SAD pattern recognizer for X86 arch: The i64 result of the ABS is expanded to: tmp = (SRA, Hi, 31) Lo = (UADDO tmp, Lo) Hi = (XOR tmp, (ADDCARRY tmp, Hi, Lo:1)) Lo = (XOR tmp, Lo) The "detectZextAbsDiff" function is changed for the recognition of pattern with the ABS node. Given an ABS node, detect the following pattern: (ABS (SUB (ZERO_EXTEND a), (ZERO_EXTEND b))). Change integer abs testcases for codegen with the ABS node support for AArch64. Indicate that the ABS is legal for the i64 type when the NEON is supported. Change the integer abs testcases to show changing of codegen. Add combine and legalization of ABS nodes for Thumb arch. Extend 'matchSelectPattern' to recognize the ABS patterns with ICMP_SGE condition. For discussion, see https://bugs.llvm.org/show_bug.cgi?id=37743 Patch by: @ikulagin (Ivan Kulagin) Differential Revision: https://reviews.llvm.org/D49837 llvm-svn: 356468
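The i64 expansion listed in this message (SRA, UADDO, ADDCARRY, XOR) can be modelled on two 32-bit halves. The sketch below is a plain-integer illustration under that reading; the function names are hypothetical and it is not the X86 lowering code itself.

```python
MASK32 = 0xFFFFFFFF
MASK64 = 0xFFFFFFFFFFFFFFFF

def abs_i64_expanded(lo: int, hi: int) -> tuple:
    # tmp = (SRA Hi, 31): all ones if the value is negative, else zero.
    tmp = MASK32 if hi & 0x80000000 else 0
    # Lo = (UADDO tmp, Lo): 32-bit add that also produces a carry-out.
    lo_sum = tmp + lo
    carry = lo_sum >> 32
    lo_sum &= MASK32
    # (ADDCARRY tmp, Hi, carry): add the high halves plus the carry.
    hi_sum = (tmp + hi + carry) & MASK32
    # Final XORs with tmp complete the (x + s) ^ s identity for abs.
    return tmp ^ lo_sum, tmp ^ hi_sum

def abs_i64(x: int) -> int:
    # Hypothetical wrapper: split a Python int into halves and rejoin.
    u = x & MASK64
    lo, hi = abs_i64_expanded(u & MASK32, u >> 32)
    return (hi << 32) | lo
```

As with any two's-complement abs, the minimum signed value maps back to its own bit pattern.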
*	[TargetLowering] Add code size information on isFPImmLegal. NFC	Adhemerval Zanella	2019-03-18	2	-39/+63
\| \| \| \| \| \| \| \| \| \| \|	This allows better code size for aarch64 floating point materialization in a future patch. Reviewers: evandro Differential Revision: https://reviews.llvm.org/D58690 llvm-svn: 356389
*	[DAG] Cleanup unused node in SimplifySelectCC.	Nirav Dave	2019-03-18	1	-8/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Delete temporarily constructed node uses for analysis after its use, holding onto original input nodes. Ideally this would be rewritten without making nodes, but this appears relatively complex. Reviewers: spatel, RKSimon, craig.topper Subscribers: jdoerfert, hiraditya, deadalnix, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57921 llvm-svn: 356382
*	[DebugInfo] Ignore bitcasts when lowering stack arg dbg.values	David Stenberg	2019-03-18	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Look past bitcasts when looking for parameter debug values that are described by frame-index loads in `EmitFuncArgumentDbgValue()`. In the attached test case we would be left with an undef `DBG_VALUE` for the parameter without this patch. A similar fix was done for parameters passed in registers in D13005. This fixes PR40777. Reviewers: aprantl, vsk, jmorse Reviewed By: aprantl Subscribers: bjope, javed.absar, jdoerfert, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D58831 llvm-svn: 356363
*	[CodeGen] Prepare for introduction of v3 and v5 MVTs	Tim Renouf	2019-03-17	3	-2/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AMDGPU would like to have MVTs for v3i32, v3f32, v5i32, v5f32. This commit does not add them, but makes preparatory changes: * Exclude non-legal non-power-of-2 vector types from ComputeRegisterProp mechanism in TargetLoweringBase::getTypeConversion. * Cope with SETCC and VSELECT for odd-width i1 vector when the other vectors are legal type. Some of this patch is from Matt Arsenault, also of AMD. Differential Revision: https://reviews.llvm.org/D58899 Change-Id: Ib5f23377dbef511be3a936211a0b9f94e46331f8 llvm-svn: 356350
*	[DAGCombine] Fold (x & ~y) \| y patterns	Nikita Popov	2019-03-17	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fold (x & ~y) \| y and its four commuted variants to x \| y. This pattern can in particular appear when a vselect c, x, -1 is expanded to (x & ~c) \| (-1 & c) and combined to (x & ~c) \| c. This change has some overlap with D59066, which avoids creating a vselect of this form in the first place during uaddsat expansion. Differential Revision: https://reviews.llvm.org/D59174 llvm-svn: 356333
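The fold is a pure bitwise identity, so it can be verified exhaustively on a small width. A minimal sketch on 8-bit values, with hypothetical function names:

```python
MASK8 = 0xFF

def unfolded(x: int, y: int) -> int:
    # (x & ~y) | y, with ~y truncated to 8 bits.
    return (x & (~y & MASK8)) | y

def folded(x: int, y: int) -> int:
    # x | y
    return x | y

def select_expanded(c: int, x: int) -> int:
    # Bitwise expansion of "vselect c, x, -1": (x & ~c) | (-1 & c).
    return (x & (~c & MASK8)) | (MASK8 & c)
```

Every bit set in y is set in the result regardless of x, so masking x with ~y first changes nothing.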
*	[TargetLowering] improve the default expansion of uaddsat/usubsat	Sanjay Patel	2019-03-17	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a subset of what was proposed in: D59006 ...and may overlap with test changes from: D59174 ...but it seems like a good general optimization to turn selects into bitwise-logic when possible because we never know exactly what can happen at this stage of DAG combining depending on how the target has defined things. Differential Revision: https://reviews.llvm.org/D59066 llvm-svn: 356332
*	[DAGCombine] combineShuffleOfScalars - handle non-zero SCALAR_TO_VECTOR ↵	Simon Pilgrim	2019-03-16	1	-2/+2
\| \| \| \| \| \| \| \|	indices (PR41097) rL356292 reduces the size of scalar_to_vector if we know the upper bits are undef - which means that shuffles may find they are suddenly referencing scalar_to_vector elements other than zero - so make sure we handle this as undef. llvm-svn: 356327
*	[WebAssembly] Make rethrow take an except_ref type argument	Heejin Ahn	2019-03-16	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In the new wasm EH proposal, `rethrow` takes an `except_ref` argument. This change was missing in r352598. This patch adds `llvm.wasm.rethrow.in.catch` intrinsic. This is an intrinsic that will eventually be lowered to wasm `rethrow` instruction, but this intrinsic can appear only within a catchpad or a cleanuppad scope. Also this intrinsic needs to be invokable - otherwise EH pad successor for it will not be correctly generated in clang. This also adds lowering logic for this intrinsic in `SelectionDAGBuilder::visitInvoke`. This routine is basically a specialized and simplified version of `SelectionDAGBuilder::visitTargetIntrinsic`, but we can't use it because it is only for `CallInst`s. This deletes the previous `llvm.wasm.rethrow` intrinsic and related tests, which was meant to be used within a `__cxa_rethrow` library function. Turned out this needs some more logic, so the intrinsic for this purpose will be added later. LateEHPrepare takes a result value of `catch` and inserts it into matching `rethrow` as an argument. `RETHROW_IN_CATCH` is a pseudo instruction that serves as a link between `llvm.wasm.rethrow.in.catch` and the real wasm `rethrow` instruction. To generate a `rethrow` instruction, we need an `except_ref` argument, which is generated from `catch` instruction. But `catch` instructions are added in LateEHPrepare pass, so we use `RETHROW_IN_CATCH`, which takes no argument, until we are able to correctly lower it to `rethrow` in LateEHPrepare. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59352 llvm-svn: 356316
*	[SelectionDAG] Add SimplifyDemandedBits handling for ISD::SCALAR_TO_VECTOR	Simon Pilgrim	2019-03-15	1	-0/+13
\| \| \| \| \| \|	Fixes a lot of constant folding mismatches between i686 and x86_64 llvm-svn: 356273
*	[DAGCombiner] Fix Comment. NFC.	Nirav Dave	2019-03-13	1	-1/+1
\| \| \| \|	llvm-svn: 356069
*	[DAGCombiner] If a TokenFactor would be merged into its user, consider the ↵	Nirav Dave	2019-03-13	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	user later. Summary: A number of optimizations are inhibited by single-use TokenFactors not being merged into the TokenFactor using it. This change makes us consider whether we can do the merge immediately. Most test changes here are due to the change in visitation causing minor reorderings and associated reassociation of paired memory operations. CodeGen tests with non-reordering changes: X86/aligned-variadic.ll -- memory-based add folded into stored leaq value. X86/constant-combiners.ll -- Optimizes out overlap between stores. X86/pr40631_deadstore_elision -- folds constant byte store into preceding quad word constant store. Reviewers: RKSimon, craig.topper, spatel, efriedma, courbet Reviewed By: courbet Subscribers: dylanmckay, sdardis, nemanjai, jvesely, nhaehnle, javed.absar, eraman, hiraditya, kbarton, jrtc27, atanasyan, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59260 llvm-svn: 356068
*	Re-land r354244 "[DAGCombiner] Eliminate dead stores to stack."	Clement Courbet	2019-03-13	1	-0/+89
\| \| \| \| \| \|	Always check candidates for hasOtherUses(), not only stores. llvm-svn: 356050
*	[DAG] Move integer setcc %x, %x folding into FoldSetCC	Simon Pilgrim	2019-03-13	2	-5/+6
\| \| \| \| \| \| \| \| \| \|	First step towards PR40800 - I intend to move the float case in a separate future patch. I had to tweak the (overly reduced) thumb2 test and the x86 widening test change is annoying (no longer rematerializable) but we should address this separately. Differential Revision: https://reviews.llvm.org/D59244 llvm-svn: 356040
*	[CodeGen] Add MMOs to statepoint nodes during SelectionDAG	Philip Reames	2019-03-12	1	-13/+42
\| \| \| \| \| \| \| \| \| \| \| \|	The existing statepoint lowering code does something odd; it adds machine memory operands post instruction selection. This was copied from the stackmap/patchpoint implementation, but appears to be non-idiomatic. This change is largely NFC. It moves the MMO creation logic into SelectionDAG building. It ends up not quite being NFC because the size of the stack slot is reflected in the MMO. The old code blindly used pointer size for the MMO size, which appears to have always been incorrect for larger values. It just happened nothing actually relied on the MMOs, so it worked out okay. For context, I'm planning on removing the MOVolatile flag from these in a future commit, and then removing the MOStore flag from deopt spill slots in a separate one. Doing so is motivated by a small test case where we should be able to better schedule spill slots, but don't do so due to a memory use/def implied by the statepoint. Differential Revision: https://reviews.llvm.org/D59106 llvm-svn: 355953
*	[SDAG] Expand pow2 mulo using shifts	Nikita Popov	2019-03-12	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \|	Expand MULO with constant power of two operand into a shift. The overflow is checked with (x << shift) >> shift == x, where the right shift will be logical for umulo and arithmetic for smulo (with exception for multiplications by signed_min). Differential Revision: https://reviews.llvm.org/D59041 llvm-svn: 355937
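The overflow check described here can be modelled on 8-bit integers. This is a sketch under the assumption that the multiplier is 2**shift and, for the signed case, is not signed_min (so shift is at most 6 for 8 bits). All names are hypothetical.

```python
BITS = 8
MASK = (1 << BITS) - 1

def to_signed(v: int) -> int:
    # Interpret an 8-bit pattern as a two's-complement value.
    return v - (1 << BITS) if v & (1 << (BITS - 1)) else v

def umulo_pow2(x: int, shift: int) -> tuple:
    # Unsigned: shift left, then logical shift right and compare with x.
    res = (x << shift) & MASK
    return res, (res >> shift) != x

def smulo_pow2(x: int, shift: int) -> tuple:
    # Signed: same idea, but the right shift is arithmetic.
    # x is the 8-bit pattern of the signed operand; shift < BITS - 1.
    res = (x << shift) & MASK
    back = (to_signed(res) >> shift) & MASK
    return res, back != x
```

The round trip loses information exactly when significant bits were shifted out, which is the overflow condition for the multiplication.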
*	[DAGCombine] Pull out repeated demanded bitmask generation. NFCI.	Simon Pilgrim	2019-03-12	1	-10/+9
\| \| \| \|	llvm-svn: 355932
*	[SDAG][AArch64] Legalize VECREDUCE	Nikita Popov	2019-03-11	7	-6/+336
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes https://bugs.llvm.org/show_bug.cgi?id=36796. Implement basic legalizations (PromoteIntRes, PromoteIntOp, ExpandIntRes, ScalarizeVecOp, WidenVecOp) for VECREDUCE opcodes. There are more legalizations missing (esp float legalizations), but there's no way to test them right now, so I'm not adding them. This also includes a few more changes to make this work somewhat reasonably: * Add support for expanding VECREDUCE in SDAG. Usually experimental.vector.reduce is expanded prior to codegen, but if the target does have native vector reduce, it may of course still be necessary to expand due to legalization issues. This uses a shuffle reduction if possible, followed by a naive scalar reduction. * Allow the result type of integer VECREDUCE to be larger than the vector element type. For example we need to be able to reduce a v8i8 into a (nominally) i32 result type on AArch64. * Use the vector operand type rather than the scalar result type to determine the action, so we can control exactly which vector types are supported. Also change the legalize vector op code to handle operations that only have vector operands, but no vector results, as is the case for VECREDUCE. * Default VECREDUCE to Expand. On AArch64 (only target using VECREDUCE), explicitly specify for which vector types the reductions are supported. This does not handle anything related to VECREDUCE_STRICT_*. Differential Revision: https://reviews.llvm.org/D58015 llvm-svn: 355860
*	[DAG] FoldSetCC - reuse valuetype + ensure it's simple.	Simon Pilgrim	2019-03-11	1	-4/+3
\| \| \| \|	llvm-svn: 355847
*	[DAG] Move SetCC NaN handling into FoldSetCC	Simon Pilgrim	2019-03-11	2	-79/+78
\| \| \| \|	llvm-svn: 355845
*	[DAG] TargetLowering::SimplifySetCC - call FoldSetCC early to handle ↵	Simon Pilgrim	2019-03-11	1	-13/+6
\| \| \| \| \| \| \| \|	constant/commute folds. Noticed while looking at PR40800 (and also D57921) llvm-svn: 355828
*	Remove redundant extractBooleanFlip argument. NFC	Amaury Sechet	2019-03-11	1	-3/+5
\| \| \| \|	llvm-svn: 355794
*	Recommit r355224 "[TableGen][SelectionDAG][X86] Add specific isel matchers ↵	Craig Topper	2019-03-10	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for immAllZerosV/immAllOnesV. Remove bitcasts from X86 patterns that are no longer necessary." Includes a fix to emit a CheckOpcode for build_vector when immAllZerosV/immAllOnesV is used as a pattern root. This means it can't be used to look through bitcasts when used as a root, but that's probably ok. This extra CheckOpcode will ensure that the first match in the isel table will be a SwitchOpcode which is needed by the caching optimization in the ISel Matcher. Original commit message: Previously we had build_vector PatFrags that called ISD::isBuildVectorAllZeros/Ones. Internally the ISD::isBuildVectorAllZeros/Ones look through bitcasts, but we aren't able to take advantage of that in isel. Instead we have to canonicalize the types of the all zeros/ones build_vectors and insert bitcasts. Then we have to pattern match those exact bitcasts. By emitting specific matchers for these 2 nodes, we can make isel look through any bitcasts without needing to explicitly match them. We should also be able to remove the canonicalization to vXi32 from lowering, but I've left that for a follow up. This removes something like 40,000 bytes from the X86 isel table. Differential Revision: https://reviews.llvm.org/D58595 llvm-svn: 355784
*	Refactor isBooleanFlip into extractBooleanFlip so that users do not depend ↵	Amaury Sechet	2019-03-09	1	-19/+28
\| \| \| \| \| \|	on the pattern matched. NFC llvm-svn: 355769
*	DAG: Don't try to cluster loads with tied inputs	Matt Arsenault	2019-03-08	1	-1/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This avoids breaking possible value dependencies when sorting loads by offset. AMDGPU has some load instructions that write into the high or low bits of the destination register, and have a tied input for the other input bits. These can easily have the same base pointer, but be a swizzle so the high address load needs to come first. This was inserting glue forcing the opposite ordering, producing a cycle the InstrEmitter would assert on. It may be potentially expensive to look for the dependency between the other loads, so just skip any where this could happen. Fixes bug 40936 by reverting r351379, which added a hacky attempt to fix this by adding chains in this case, which I think was just working around broken glue before the InstrEmitter. The core of the patch is re-implementing the fix for that problem. llvm-svn: 355728
*	[DAGCombiner] fold (add (add (xor a, -1), b), 1) -> (sub b, a)	Amaury Sechet	2019-03-08	1	-4/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This pattern is sometimes created after legalization. Reviewers: efriedma, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58874 llvm-svn: 355716
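The fold in the subject line follows from the two's-complement identity ~a == -a - 1, so (~a + b) + 1 == b - a modulo the bit width. A minimal sketch on 8-bit values, with hypothetical function names:

```python
MASK8 = 0xFF

def add_xor_form(a: int, b: int) -> int:
    # (add (add (xor a, -1), b), 1), truncated to 8 bits.
    not_a = a ^ MASK8          # xor with -1 is bitwise NOT
    return ((not_a + b) + 1) & MASK8

def sub_form(a: int, b: int) -> int:
    # (sub b, a), truncated to 8 bits.
    return (b - a) & MASK8
```

The check below is exhaustive for 8 bits; the same argument holds for any width since only modular arithmetic is involved.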
*	[DAGCombine] Merge visitSMULO+visitUMULO into visitMULO. NFCI.	Simon Pilgrim	2019-03-08	1	-17/+8
\| \| \| \|	llvm-svn: 355690
*	[DAGCombine] Merge visitSADDO+visitUADDO into visitADDO. NFCI.	Simon Pilgrim	2019-03-08	1	-48/+24
\| \| \| \|	llvm-svn: 355689
*	[DAGCombine] Merge visitSSUBO+visitUSUBO into visitSUBO. NFCI.	Simon Pilgrim	2019-03-08	1	-33/+8
\| \| \| \|	llvm-svn: 355688
*	[PS4] Emit a trap after a stack-protector fail call.	Paul Robinson	2019-03-06	1	-0/+6
\| \| \| \|	llvm-svn: 355542
*	[DAGCombine] Improve select (not Cond), N1, N2 -> select Cond, N2, N1 fold	Simon Pilgrim	2019-03-06	1	-9/+15
\| \| \| \| \| \| \| \|	Move the x86 combine from D58974 into the DAGCombine VSELECT code and update the SELECT version to use the isBooleanFlip helper as well. Requested by @spatel on D59006 llvm-svn: 355533