bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[SelectionDAG] Add asserts to verify the vectorness of input and output ↵	Craig Topper	2019-05-02	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	types of TRUNCATE/ZERO_EXTEND/ANY_EXTEND/SIGN_EXTEND agree As a result of the underlying cause of PR41678 we created an ANY_EXTEND node with a scalar result type and v1i1 input type. Ideally we would have asserted for this instead of letting it go through to instruction selection and generate bad machine IR Differential Revision: https://reviews.llvm.org/D61463 llvm-svn: 359836
*	[DAGCombiner] try repeated fdiv divisor transform before building estimate ↵	Sanjay Patel	2019-05-02	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(2nd try) The original patch was committed at rL359398 and reverted at rL359695 because of infinite looping. This includes a fix to check for a vector splat of "1.0" to avoid the infinite loop. Original commit message: This was originally part of D61028, but it's an independent diff. If we try the repeated divisor reciprocal transform before producing an estimate sequence, then we have an opportunity to use scalar fdiv. On x86, the trade-off is 1 divss vs. 5 vector FP ops in the default estimate sequence. On recent chips (Skylake, Ryzen), the full-precision division is only 3 cycle throughput, so that's probably the better perf default option and avoids problems from x86's inaccurate estimates. The last 2 tests show that users still have the option to override the defaults by using the function attributes for reciprocal estimates, but those patterns are potentially made faster by converting the vector ops (including ymm ops) to scalar math. Differential Revision: https://reviews.llvm.org/D61149 llvm-svn: 359793
*	[SelectionDAG] remove constant folding limitations based on FP exceptions	Sanjay Patel	2019-05-02	1	-26/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We don't have FP exception limits in the IR constant folder for the binops (apart from strict ops), so it does not make sense to have them here in the DAG either. Nothing else in the backend tries to preserve exceptions (again outside of strict ops), so I don't see how this could have ever worked for real code that cares about FP exceptions. There are still cases (examples: unary opcodes in SDAG, FMA in IR) where we are trying (at least partially) to preserve exceptions without even asking if the target supports FP exceptions. Those should be corrected in subsequent patches. Real support for FP exceptions requires several changes to handle the constrained/strict FP ops. Differential Revision: https://reviews.llvm.org/D61331 llvm-svn: 359791
*	Revert "[DAGCombiner] try repeated fdiv divisor transform before building ↵	Sanjay Patel	2019-05-01	1	-3/+3
\| \| \| \| \| \| \| \| \|	estimate" This reverts commit fb9a5307a94e6f1f850e4d89f79103b123f16279 (rL359398) because it can cause an infinite loop due to opposing combines. llvm-svn: 359695
*	DAG: allow DAG pointer size different from memory representation.	Tim Northover	2019-05-01	2	-44/+120
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In preparation for supporting ILP32 on AArch64, this modifies the SelectionDAG builder code so that pointers are allowed to have a larger type when "live" in the DAG compared to memory. Pointers get zero-extended whenever they are loaded, and truncated prior to stores. In addition, a few not quite so obvious locations need updating: * A GEP that has not been marked inbounds needs to enforce the IR-documented 2s-complement wrapping at the memory pointer size. Inbounds GEPs are undefined if they overflow the address space, so no additional operations are needed. * Signed comparisons would give incorrect results if performed on the zero-extended values. This shouldn't affect CodeGen for now, but will become active when the AArch64 ILP32 support is committed. llvm-svn: 359676
*	[SelectionDAG] remove div-by-zero constant folding restriction	Sanjay Patel	2019-04-30	1	-7/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We don't have this restriction in IR, so it should not be here either simply out of consistency. Code that wants to handle FP exceptions is expected to use the 'strict' variants of these nodes. We don't get the frem case because frem by 0.0 produces NaN (invalid), and that's the remaining check here (so the removed check for frem was dead code AFAIK). This is the only place in SDAG that uses "HasFPExceptions", so I think we should remove that entirely as a follow-up patch. llvm-svn: 359566
*	[TargetLowering] findOptimalMemOpLowering. NFCI.	Sjoerd Meijer	2019-04-30	2	-123/+119
\| \| \| \| \| \| \| \| \| \|	This was a local static function in SelectionDAG, which I've promoted to TargetLowering so that I can reuse it to estimate the cost of a memory operation in D59787. Differential Revision: https://reviews.llvm.org/D59766 llvm-svn: 359543
*	[TargetLowering] Change getOptimalMemOpType to take a function attribute list	Sjoerd Meijer	2019-04-30	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \|	The MachineFunction wasn't used in getOptimalMemOpType, but more importantly, this allows reuse of findOptimalMemOpLowering that is calling getOptimalMemOpType. This is the groundwork for the changes in D59766 and D59787, that allows implementation of TTI::getMemcpyCost. Differential Revision: https://reviews.llvm.org/D59785 llvm-svn: 359537
*	[DAGCombiner] Do not generate ISD::ADDE node if adde is not legal for the ↵	Zi Xuan Wu	2019-04-30	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	target when combining an ISD::TRUNC node Do not combine (trunc adde(X, Y, Carry)) into (adde trunc(X), trunc(Y), Carry) if adde is not legal for the target. This holds even in the type-legalize phase, because adde is special and will not be legalized later in the operation-legalize phase. This fixes: PR40922 https://bugs.llvm.org/show_bug.cgi?id=40922 Differential Revision: https://reviews.llvm.org//D60854 llvm-svn: 359532
*	[DAG] Refactor DAGCombiner::ReassociateOps	Bjorn Pettersson	2019-04-29	1	-45/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extract the logic for doing reassociations from DAGCombiner::reassociateOps into a helper function DAGCombiner::reassociateOpsCommutative, and use that helper to trigger reassociation on the original operand order, or the commuted operand order. Codegen is not identical since the operand order will be different when doing the reassociations for the commuted case. That causes some unfortunate churn in some test cases. Apart from that this should be NFC. Reviewers: spatel, craig.topper, tstellar Reviewed By: spatel Subscribers: dmgreen, dschuff, jvesely, nhaehnle, javed.absar, sbc100, jgravelle-google, hiraditya, aheejin, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61199 llvm-svn: 359476
*	[DAGCombiner] try repeated fdiv divisor transform before building estimate	Sanjay Patel	2019-04-28	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was originally part of D61028, but it's an independent diff. If we try the repeated divisor reciprocal transform before producing an estimate sequence, then we have an opportunity to use scalar fdiv. On x86, the trade-off is 1 divss vs. 5 vector FP ops in the default estimate sequence. On recent chips (Skylake, Ryzen), the full-precision division is only 3 cycle throughput, so that's probably the better perf default option and avoids problems from x86's inaccurate estimates. The last 2 tests show that users still have the option to override the defaults by using the function attributes for reciprocal estimates, but those patterns are potentially made faster by converting the vector ops (including ymm ops) to scalar math. Differential Revision: https://reviews.llvm.org/D61149 llvm-svn: 359398
*	[DAGCombine] Cleanup visitEXTRACT_SUBVECTOR. NFCI.	Simon Pilgrim	2019-04-26	1	-10/+11
\| \| \| \| \| \|	Use ArrayRef::slice, reduce some rather awkward long lines for legibility and run clang-format. llvm-svn: 359326
*	[X86][SSE] Disable shouldFoldConstantShiftPairToMask for btver1/btver2 ↵	Simon Pilgrim	2019-04-26	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	targets (PR40758) As detailed on PR40758, Bobcat/Jaguar can perform vector immediate shifts on the same pipes as vector ANDs with the same latency - so it doesn't make sense to replace a shl+lshr with a shift+and pair as it requires an additional mask (with the extra constant pool, loading and register pressure costs). Differential Revision: https://reviews.llvm.org/D61068 llvm-svn: 359293
*	[SelectionDAG][X86] Use stack load/store in PromoteIntRes_BITCAST when the ↵	Craig Topper	2019-04-25	1	-15/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	input needs to be split and the output type is a vector. We had special case handling here, but it uses a scalar any_extend for the promotion then bitcasts to the final type. This won't split up the input data into multiple promoted elements like we need. This patch falls back to doing the conversion through memory. Fixes PR41594 which I believe was reflected in the bitcast-vector-bool.ll changes. The changes to vector-half-conversions.ll are fixing a previously unknown miscompile from this issue. Differential Revision: https://reviews.llvm.org/D61114 llvm-svn: 359219
*	Recommitting r358783 and r358786 "[MS] Emit S_HEAPALLOCSITE debug info" with ↵	Amy Huang	2019-04-24	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes for buildbot error (undefined assembler label). Summary: This emits labels around heapallocsite calls and S_HEAPALLOCSITE debug info in codeview. Currently only changes FastISel, so emitting labels still needs to be implemented in SelectionDAG. Reviewers: rnk Subscribers: aprantl, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D61083 llvm-svn: 359149
*	[DAGCombiner] scale repeated FP divisor by splat factor	Sanjay Patel	2019-04-24	1	-3/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we have a vector FP division with a splatted divisor, use the existing transform that converts 'x/y' into 'x * (1.0/y)' to allow more conversions. This can then potentially be converted into a scalar FP division by existing combines (rL358984) as seen in the tests here. That can be a potentially big perf difference if scalar fdiv has better timing (including avoiding possible frequency throttling for vector ops). Differential Revision: https://reviews.llvm.org/D61028 llvm-svn: 359147
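The rewrite of 'x/y' into 'x * (1.0/y)' for a splatted divisor can be illustrated outside the DAG. The following is a minimal sketch, not the DAGCombiner code: `divideBySplat` is a hypothetical helper name, and it assumes the caller accepts reciprocal math, since multiplying by a rounded reciprocal can differ from a true division in the last bit.

```cpp
#include <array>
#include <cstddef>

// Hypothetical helper: divide every lane of x by the same scalar y.
// One scalar division produces the reciprocal; the lanes then only multiply.
inline std::array<float, 4> divideBySplat(const std::array<float, 4>& x, float y) {
  const float recip = 1.0f / y;  // the single scalar fdiv
  std::array<float, 4> out{};
  for (std::size_t i = 0; i < x.size(); ++i) {
    out[i] = x[i] * recip;       // replaces a per-lane x[i] / y
  }
  return out;
}
```

With a power-of-two divisor such as 4.0f the reciprocal is exact, so the result matches lane-by-lane division; for other divisors the two forms may round differently.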
*	Add "const" in GetUnderlyingObjects. NFC	Bjorn Pettersson	2019-04-24	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Both the input Value pointer and the returned Value pointers in GetUnderlyingObjects are now declared as const. It turned out that all current (in-tree) uses of GetUnderlyingObjects were trivial to update, being satisfied with having those Value pointers declared as const. Actually, in the past several of the users had to use const_cast, just because of ValueTracking not providing a version of GetUnderlyingObjects with "const" Value pointers. With this patch we get rid of those const casts. Reviewers: hfinkel, materi, jkorous Reviewed By: jkorous Subscribers: dexonsmith, jkorous, jholewinski, sdardis, eraman, hiraditya, jrtc27, atanasyan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61038 llvm-svn: 359072
*	Revert "[MS] Emit S_HEAPALLOCSITE debug info" because of ToTWin64(db)	Amy Huang	2019-04-23	1	-6/+0
\| \| \| \| \| \| \| \| \|	buildbot failure. This reverts commit d07d6d617713bececf57f3547434dd52f0f13f9e and c774f687b6880484a126ed3e3d737e74c926f0ae. llvm-svn: 359034
*	Use llvm::stable_sort	Fangrui Song	2019-04-23	1	-1/+1
\| \| \| \| \| \|	While touching the code, simplify if feasible. llvm-svn: 358996
*	[DAGCombiner] generalize binop-of-splats scalarization	Sanjay Patel	2019-04-23	1	-46/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we only match build vectors, we can miss some patterns that use shuffles as seen in the affected tests. Note that the underlying calls within getSplatSourceVector() have the potential for compile-time explosion because of exponential recursion looking through binop opcodes, but currently the list of supported opcodes is very limited. Both of those problems should be addressed in follow-up patches. llvm-svn: 358984
*	[DAGCombiner] Combine OR as ADD when no common bits are set	Bjorn Pettersson	2019-04-23	1	-16/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The DAGCombiner is rewriting (canonicalizing) an ISD::ADD with no common bits set in the operands as an ISD::OR node. This could sometimes result in "missing out" on some combines that normally are performed for ADD. To be more specific this could happen if we already have rewritten an ADD into OR, and later (after legalizations or combines) we expose patterns that could have been optimized if we had seen the OR as an ADD (e.g. reassociations based on ADD). To make the DAG combiner less sensitive to if ADD or OR is used for these "no common bits set" ADD/OR operations we now apply most of the ADD combines also to an OR operation, when value tracking indicates that the operands have no common bits set. Reviewers: spatel, RKSimon, craig.topper, kparzysz Reviewed By: spatel Subscribers: arsenm, rampitec, lebedev.ri, jvesely, nhaehnle, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59758 llvm-svn: 358965
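The "no common bits set" condition is what makes ADD and OR interchangeable: when no bit position is set in both operands, the addition never produces a carry. A minimal sketch of that arithmetic fact follows; the function names are hypothetical and are not taken from the DAGCombiner.

```cpp
#include <cassert>
#include <cstdint>

// True when no bit position is set in both operands.
constexpr bool haveNoCommonBits(uint32_t a, uint32_t b) {
  return (a & b) == 0;
}

// Under that precondition the sum needs no carries, so OR computes it.
inline uint32_t addViaOr(uint32_t a, uint32_t b) {
  assert(haveNoCommonBits(a, b) && "OR only equals ADD without common bits");
  return a | b;
}
```

For example, 0xF0 and 0x0F share no bits, so `addViaOr(0xF0, 0x0F)` equals `0xF0 + 0x0F`; for 3 and 1 the precondition fails, and indeed `3 | 1` is 3 while `3 + 1` is 4.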
*	[SelectionDAG] move splat util functions up from x86 lowering	Sanjay Patel	2019-04-22	1	-0/+52
\| \| \| \| \| \| \| \| \| \|	This was supposed to be NFC, but the change in SDLoc definitions causes instruction scheduling changes. There's nothing x86-specific in this code, and it can likely be used from DAGCombiner's simplifyVBinOp(). llvm-svn: 358930
*	[TargetLowering][AMDGPU][X86] Improve SimplifyDemandedBits bitcast handling	Simon Pilgrim	2019-04-22	1	-1/+25
\| \| \| \| \| \| \| \| \| \| \| \|	This patch adds support for BigBitWidth -> SmallBitWidth bitcasts, splitting the DemandedBits/Elts accordingly. The AMDGPU backend needed an extra (srl (and x, c1 << c2), c2) -> (and (srl x, c2), c1) combine to encourage BFE creation. I investigated putting this in DAGCombine but it caused a lot of noise on other targets - some improvements, some regressions. The X86 changes are all definite wins. Differential Revision: https://reviews.llvm.org/D60462 llvm-svn: 358887
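The AMDGPU combine mentioned in this message rests on a bit identity for logical shifts: masking with a shifted constant and then shifting right selects the same bits as shifting first and masking with the unshifted constant. A minimal sketch on 32-bit unsigned values, with hypothetical function names and assuming a shift amount below 32:

```cpp
#include <cstdint>

// (srl (and x, c1 << c2), c2): mask with the shifted constant, then shift.
constexpr uint32_t maskThenShift(uint32_t x, uint32_t c1, unsigned c2) {
  return (x & (c1 << c2)) >> c2;
}

// (and (srl x, c2), c1): shift first, then mask with the plain constant.
constexpr uint32_t shiftThenMask(uint32_t x, uint32_t c1, unsigned c2) {
  return (x >> c2) & c1;
}
```

The second form leaves a shift followed by a contiguous low mask, which is the shape of a bitfield extract.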
*	[DAGCombiner] make variable name less ambiguous; NFC	Sanjay Patel	2019-04-22	1	-4/+4
\| \| \| \|	llvm-svn: 358886
*	[DAGCombiner] prepare shuffle-of-splat to handle more patterns; NFC	Sanjay Patel	2019-04-22	1	-11/+16
\| \| \| \|	llvm-svn: 358884
*	[MS] Emit S_HEAPALLOCSITE debug info	Amy Huang	2019-04-19	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This emits labels around heapallocsite calls and S_HEAPALLOCSITE debug info in codeview. Currently only changes FastISel, so emitting labels still needs to be implemented in SelectionDAG. Reviewers: hans, rnk Subscribers: aprantl, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D60800 llvm-svn: 358783
*	[SelectionDAG] soften splat mask assert/unreachable (PR41535)	Sanjay Patel	2019-04-19	1	-1/+4
\| \| \| \| \| \| \| \|	These are general queries, so they should not die when given a degenerate input like an all undef mask. Callers should be able to deal with an op that will eventually be simplified away. llvm-svn: 358761
*	[DAGCombine] Add SimplifyDemandedBits helper that handles demanded elts mask ↵	Simon Pilgrim	2019-04-17	1	-4/+13
\| \| \| \| \| \| \| \|	as well The other SimplifyDemandedBits helpers become wrappers to this new demanded elts variant. llvm-svn: 358585
*	[ScheduleDAGRRList] Recompute topological ordering on demand.	Florian Hahn	2019-04-17	1	-24/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently there is a single point in ScheduleDAGRRList, where we actually query the topological order (besides init code). Currently we are recomputing the order after adding a node (which does not have predecessors) and then we add predecessors edge-by-edge. We can avoid adding edges one-by-one after we added a new node. In that case, we can just rebuild the order from scratch after adding the edges to the DAG and avoid all the updates to the ordering. Also, we can delay updating the DAG until we query the DAG, if we keep a list of added edges. Depending on the number of updates, we can either apply them when needed or recompute the order from scratch. This brings the geomean compile time of CTMark with -O1 down 0.3% on X86, with no regressions. Reviewers: MatzeB, atrick, efriedma, niravd, paquette Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D60125 llvm-svn: 358583
*	[TargetLowering] Rename preferShiftsToClearExtremeBits and ↵	Simon Pilgrim	2019-04-16	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	shouldFoldShiftPairToMask (PR41359) As discussed on PR41359, this patch renames the pair of shift-mask target feature functions to make their purposes more obvious. shouldFoldShiftPairToMask -> shouldFoldConstantShiftPairToMask preferShiftsToClearExtremeBits -> shouldFoldMaskToVariableShiftPair llvm-svn: 358526
*	[DAGCombiner] Add missing flag to addressing mode check	Luis Marques	2019-04-16	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	The checks in `canFoldInAddressingMode` tested for addressing modes that have a base register but didn't set the `HasBaseReg` flag to true (it's false by default). This patch fixes that. Although the omission of the flag was technically incorrect it had no known observable impact, so no tests were changed by this patch. Differential Revision: https://reviews.llvm.org/D60314 llvm-svn: 358502
*	DAG: propagate ConsecutiveRegs flags to returns too.	Tim Northover	2019-04-15	1	-0/+18
\| \| \| \| \| \| \| \| \| \|	Arguments already have a flag to inform backends when they have been split up. The AArch64 arm64_32 ABI makes use of these on return types too, so that code emitted for armv7k can be ABI-compliant. There should be no CodeGen changes yet, just making more information available. llvm-svn: 358399
*	DAG: propagate whether an arg is a pointer for CallingConv decisions.	Tim Northover	2019-04-15	2	-5/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The arm64_32 ABI specifies that pointers (despite being 32-bits) should be zero-extended to 64-bits when passed in registers for efficiency reasons. This means that the SelectionDAG needs to be able to tell the backend that an argument was originally a pointer, which is implemented here. Additionally, some memory intrinsics need to be declared as taking an i8* instead of an iPTR. There should be no CodeGen change yet, but it will be triggered when AArch64 backend support for ILP32 is added. llvm-svn: 358398
*	[SelectionDAG] Use KnownBits::computeForAddSub/computeForAddCarry	Bjorn Pettersson	2019-04-15	1	-58/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Use KnownBits::computeForAddSub/computeForAddCarry in SelectionDAG::computeKnownBits when doing value tracking for addition/subtraction. This should improve the precision of the known bits, as we only used to make a simple estimate of known zeroes. The KnownBits support functions are also able to deduce bits that are known to be one in the result. Reviewers: spatel, RKSimon, nikic, lebedev.ri Reviewed By: nikic Subscribers: nikic, javed.absar, lebedev.ri, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60460 llvm-svn: 358372
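How known bits can propagate through an addition, including bits known to be one, can be sketched with plain 32-bit masks. This is a simplified illustration under stated assumptions, not the LLVM KnownBits implementation: `Known32` and `knownBitsOfAdd` are hypothetical names, the width is fixed at 32 bits, and there is no incoming carry.

```cpp
#include <cstdint>

// Hypothetical record: a set bit in Zero means that bit is known to be 0,
// a set bit in One means it is known to be 1. Unknown bits are set in neither.
struct Known32 {
  uint32_t Zero;
  uint32_t One;
};

// Known bits of LHS + RHS with no incoming carry.
inline Known32 knownBitsOfAdd(Known32 LHS, Known32 RHS) {
  // Sum with every unknown bit taken as 1, and with every unknown bit as 0.
  const uint32_t PossibleSumZero = ~LHS.Zero + ~RHS.Zero;
  const uint32_t PossibleSumOne = LHS.One + RHS.One;
  // Carry-in bits that are the same in both extreme sums are known.
  const uint32_t CarryKnownZero = ~(PossibleSumZero ^ LHS.Zero ^ RHS.Zero);
  const uint32_t CarryKnownOne = PossibleSumOne ^ LHS.One ^ RHS.One;
  // A result bit is known only if both operand bits and its carry-in are.
  const uint32_t Known = (CarryKnownZero | CarryKnownOne) &
                         (LHS.Zero | LHS.One) & (RHS.Zero | RHS.One);
  return {~PossibleSumZero & Known, PossibleSumOne & Known};
}
```

Adding a value whose low bit is known to be one to the constant 1 yields a result whose low bit is known to be zero, while all higher bits stay unknown.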
*	[DAGCombiner] narrow shuffle of concatenated vectors	Sanjay Patel	2019-04-12	1	-0/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	// shuffle (concat X, undef), (concat Y, undef), Mask --> // concat (shuffle X, Y, Mask0), (shuffle X, Y, Mask1) The ARM changes with 'vtrn' and narrowed 'vuzp' are improvements. The x86 changes look neutral or better. There's one test with an extra instruction, but that could be reversed for a subtarget with the right attributes. But by default, we want to avoid the 256-bit op when possible (in my motivating benchmark, a handful of ymm ops sprinkled into a sequence of xmm ops are triggering frequency throttling on Haswell resulting in significantly worse perf). Differential Revision: https://reviews.llvm.org/D60545 llvm-svn: 358291
*	[TargetLowering][X86] Teach SimplifyDemandedBits to use ShrinkDemandedOp on ↵	Craig Topper	2019-04-12	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	ISD::SHL nodes. If the upper bits of the SHL result aren't used, we might be able to use a narrower shift. For example, on X86 this can turn a 64-bit into 32-bit enabling a smaller encoding. Differential Revision: https://reviews.llvm.org/D60358 llvm-svn: 358257
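The reason a narrower shift is safe when the upper result bits are unused: the low 32 bits of a left shift depend only on the low 32 bits of the input. A minimal sketch with hypothetical function names, assuming a shift amount below 32:

```cpp
#include <cstdint>

// 64-bit shift, then keep only the low 32 bits of the result.
constexpr uint32_t lowHalfOfWideShift(uint64_t x, unsigned k) {
  return static_cast<uint32_t>(x << k);
}

// Truncate first, then shift in 32 bits: same low half, narrower operation.
constexpr uint32_t narrowShift(uint64_t x, unsigned k) {
  return static_cast<uint32_t>(x) << k;
}
```

Both functions return the same value for any input when `k` is below 32, which is what lets a 64-bit shift be replaced by a 32-bit one.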
*	[DAGCombiner] refactor narrowing of extracted vector binop; NFC	Sanjay Patel	2019-04-11	1	-20/+19
\| \| \| \| \| \| \|	There's a TODO comment about handling patterns with insert_subvector, and we do want to match that. llvm-svn: 358187
*	[DAGCombiner][x86] scalarize inserted vector FP ops	Sanjay Patel	2019-04-11	1	-0/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	// bo (build_vec ...undef, x, undef...), (build_vec ...undef, y, undef...) --> // build_vec ...undef, (bo x, y), undef... The lifetime of the nodes in these examples is different for variables versus constants, but they are all build vectors briefly, so I'm proposing to catch them in this form to handle all of the leading examples in the motivating test file. Before we have build vectors, we might have insert_vector_element. After that, we might have scalar_to_vector and constant pool loads. It's going to take more work to ensure that FP vector operands are getting simplified with undef elements, so this transform can apply more widely. In a non-loose FP environment, we are likely simplifying FP elements to NaN values rather than undefs. We also need to allow more opcodes down this path. Eg, we don't handle FP min/max flavors yet. Differential Revision: https://reviews.llvm.org/D60514 llvm-svn: 358172
*	Revert rL357745: [SelectionDAG] Compute known bits of CopyFromReg	David Green	2019-04-10	1	-20/+0
\| \| \| \| \| \| \| \| \| \|	Certain optimisations from ConstantHoisting and CGP rely on Selection DAG not seeing through to the constant in other blocks. Revert this patch while we come up with a better way to handle that. I will try to follow this up with some better tests. llvm-svn: 358113
*	[DAGCombiner][X86][SystemZ] Canonicalize SSUBO with immediate RHS to SADDO ↵	Craig Topper	2019-04-09	1	-0/+8
\| \| \| \| \| \| \| \| \| \|	by negating the immediate. This lines up with what we do for regular subtract and it matches up better with X86 assumptions in isel patterns that add with immediate is more canonical than sub with immediate. Differential Revision: https://reviews.llvm.org/D60020 llvm-svn: 358027
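Rewriting a signed subtract-with-overflow of an immediate as an add of the negated immediate preserves both the wrapped result and the overflow flag. The sketch below models the two operations in portable C++ by widening to 64 bits; the names `Checked32`, `ssubo`, and `saddo` are hypothetical stand-ins for the DAG nodes, and the equivalence assumes the immediate is not the minimum 32-bit value, whose negation is not representable.

```cpp
#include <cstdint>
#include <limits>

struct Checked32 {
  int32_t value;   // result wrapped to 32 bits
  bool overflow;   // true if the exact result did not fit
};

// Wrap an exact 64-bit result to 32 bits and record whether it overflowed.
inline Checked32 fromWide(int64_t wide) {
  const bool overflow = wide < std::numeric_limits<int32_t>::min() ||
                        wide > std::numeric_limits<int32_t>::max();
  const uint32_t low = static_cast<uint32_t>(static_cast<uint64_t>(wide));
  return {static_cast<int32_t>(low), overflow};
}

inline Checked32 ssubo(int32_t x, int32_t c) {
  return fromWide(int64_t{x} - int64_t{c});
}

inline Checked32 saddo(int32_t x, int32_t c) {
  return fromWide(int64_t{x} + int64_t{c});
}
```

For any immediate `c` other than the minimum value, `ssubo(x, c)` and `saddo(x, -c)` agree in both fields.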
*	[TargetLowering] SimplifyDemandedBits - add ISD::INSERT_SUBVECTOR support	Simon Pilgrim	2019-04-09	1	-0/+39
\| \| \| \|	llvm-svn: 358019
*	[TargetLowering] SimplifyDemandedBits - Remove GetDemandedSrcMask lambda. NFCI.	Simon Pilgrim	2019-04-09	1	-28/+21
\| \| \| \| \| \|	An older version of this could return false but now that this always succeeds we can just inline and simplify it. llvm-svn: 357999
*	[TargetLowering] SimplifyDemandedBits - call SimplifyDemandedBits in bitcast ↵	Simon Pilgrim	2019-04-09	1	-6/+16
\| \| \| \| \| \| \| \|	handling When bitcasting from a source op to a larger bitwidth op, split the demanded bits and OR them on top of one another and demand those merged bits in the SimplifyDemandedBits call on the source op. llvm-svn: 357992
*	[TargetLowering] SimplifyDemandedBits - use DemandedElts in bitcast handling	Simon Pilgrim	2019-04-08	1	-12/+13
\| \| \| \| \| \|	Be more selective in the SimplifyDemandedBits -> SimplifyDemandedVectorElts bitcast call based on the demanded elts. llvm-svn: 357942
*	[DAG] Pull out ComputeNumSignBits call to make debugging easier. NFCI.	Simon Pilgrim	2019-04-07	1	-2/+2
\| \| \| \|	llvm-svn: 357861
*	[SelectionDAG] Add fcmp UNDEF handling to SelectionDAG::FoldSetCC	Simon Pilgrim	2019-04-05	1	-3/+8
\| \| \| \| \| \| \| \| \| \|	Second half of PR40800, this patch adds DAG undef handling to fcmp instructions to match the behavior in llvm::ConstantFoldCompareInstruction, this permits constant folding of vector comparisons where some elements had been reduced to UNDEF (by SimplifyDemandedVectorElts etc.). This involves a lot of tweaking to reduced tests as bugpoint loves to reduce fcmp arguments to undef........ Differential Revision: https://reviews.llvm.org/D60006 llvm-svn: 357765
*	[DAGCombiner][x86] scalarize splatted vector FP ops	Sanjay Patel	2019-04-05	1	-2/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are a variety of vector patterns that may be profitably reduced to a scalar op when scalar ops are performed using a subset (typically, the first lane) of the vector register file. For x86, this is true for float/double ops and element 0 because insert/extract is just a sub-register rename. Other targets should likely enable the hook in a similar way. Differential Revision: https://reviews.llvm.org/D60150 llvm-svn: 357760
*	[SelectionDAG] Compute known bits of CopyFromReg	Piotr Sobczak	2019-04-05	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach SelectionDAG how to compute known bits of ISD::CopyFromReg if the virtual reg used has one def only. This can be particularly useful when calling isBaseWithConstantOffset() with the ISD::CopyFromReg argument, as more optimizations may get enabled in the result. Also add a missing truncation on X86, found by testing of this patch. Change-Id: Id1c9fceec862d118c54a5b53adf72ada5d6daefa Reviewers: bogner, craig.topper, RKSimon Reviewed By: RKSimon Subscribers: lebedev.ri, nemanjai, jvesely, nhaehnle, javed.absar, jsji, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59535 llvm-svn: 357745
*	[FastISel] Fix crash for gc.relocate lowering	Serguei Katkov	2019-04-05	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Safepoint lowering checks that all gc.relocates observed in a safepoint have been lowered. However, FastISel is able to skip dead gc.relocates. To resolve this issue we just ignore dead gc.relocates in the check. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D60184 llvm-svn: 357742
*	[IR] Refactor attribute methods in Function class (NFC)	Evandro Menezes	2019-04-04	4	-11/+11
\| \| \| \| \| \| \| \|	Rename the functions that query the optimization kind attributes. Differential revision: https://reviews.llvm.org/D60287 llvm-svn: 357731