bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[SelectionDAG] computeKnownBits - use ashrInPlace on known bits of ISD::SRA input. NFCI.	Simon Pilgrim	2017-11-01	1	-11/+3
	llvm-svn: 317087
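As a rough illustration of the change above (not the LLVM code itself): for an arithmetic shift right by a constant, the known-zero and known-one masks of the input can each be arithmetically shifted by the same amount, because a known sign bit stays known in every position it is copied into. The `KnownBits32` struct and the helper names below are hypothetical stand-ins for LLVM's `KnownBits` and `APInt::ashrInPlace`, restricted to 32-bit values.

```cpp
#include <cassert>
#include <cstdint>

// Toy stand-in for LLVM's KnownBits (hypothetical, 32-bit only): a set bit in
// Zero/One means the corresponding bit of the value is known to be 0/1.
struct KnownBits32 {
  uint32_t Zero;
  uint32_t One;
};

// Arithmetic shift right on a raw 32-bit mask, written out by hand so it does
// not depend on how the compiler shifts negative signed integers.
uint32_t ashr32(uint32_t Bits, unsigned Amt) {
  assert(Amt < 32 && "shift amount must be smaller than the bit width");
  // If the top bit is set, the vacated high bits are filled with ones.
  const uint32_t Fill = (Bits & 0x80000000u) ? ~(0xFFFFFFFFu >> Amt) : 0u;
  return (Bits >> Amt) | Fill;
}

// Known bits of (sra X, Amt), given the known bits of X: shift both masks
// arithmetically so that a known sign bit is replicated into the high bits.
KnownBits32 knownBitsOfSra(KnownBits32 In, unsigned Amt) {
  return KnownBits32{ashr32(In.Zero, Amt), ashr32(In.One, Amt)};
}
```

For example, if only the sign bit is known zero and bit 4 is known one, shifting by 4 gives known-zero bits 31..27 and known-one bit 0.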
*	[DAGCombiner] Fix typos in comments. NFC	Craig Topper	2017-11-01	1	-2/+2
	llvm-svn: 317072
*	Fix unused variable warnings. NFCI.	Simon Pilgrim	2017-10-30	1	-3/+0
	llvm-svn: 316964
*	[SelectionDAG] Tidyup computeKnownBits extension/truncation cases. NFCI.	Simon Pilgrim	2017-10-30	1	-17/+4
	We don't need to extend/truncate the Known structure before calling computeKnownBits - it will reset at the start of the function.
	llvm-svn: 316962
*	Create instruction classes for identifying any atomicity of memory intrinsic. (NFC)	Daniel Neilson	2017-10-30	1	-4/+3
	Summary:
	For reference, see: http://lists.llvm.org/pipermail/llvm-dev/2017-August/116589.html

	This patch fleshes out the instruction class hierarchy with respect to atomic and non-atomic memory intrinsics. With this change, the relevant part of the class hierarchy becomes:

	IntrinsicInst
	  -> MemIntrinsicBase (methods-only class)
	    -> MemIntrinsic (non-atomic intrinsics)
	      -> MemSetInst
	      -> MemTransferInst
	        -> MemCpyInst
	        -> MemMoveInst
	    -> AtomicMemIntrinsic (atomic intrinsics)
	      -> AtomicMemSetInst
	      -> AtomicMemTransferInst
	        -> AtomicMemCpyInst
	        -> AtomicMemMoveInst
	    -> AnyMemIntrinsic (both atomicities)
	      -> AnyMemSetInst
	      -> AnyMemTransferInst
	        -> AnyMemCpyInst
	        -> AnyMemMoveInst

	This involves some class renaming:
	    ElementUnorderedAtomicMemCpyInst -> AtomicMemCpyInst
	    ElementUnorderedAtomicMemMoveInst -> AtomicMemMoveInst
	    ElementUnorderedAtomicMemSetInst -> AtomicMemSetInst
	A script for doing this renaming in downstream trees is included below.

	An example of where the Any* classes should be used in LLVM is when reasoning about the effects of an instruction (ex: aliasing).

	---
	Script for renaming AtomicMem* classes:
	    PREFIXES="[<,([:space:]]"
	    CLASSES="MemIntrinsic|MemTransferInst|MemSetInst|MemMoveInst|MemCpyInst"
	    SUFFIXES="[;)>,[:space:]]"

	    REGEX="(${PREFIXES})ElementUnorderedAtomic(${CLASSES})(${SUFFIXES})"
	    REGEX2="visitElementUnorderedAtomic(${CLASSES})"

	    FILES=$( grep -E "(${REGEX}|${REGEX2})" -r . | tr ':' ' ' | awk '{print $1}' | sort | uniq )

	    SED_SCRIPT="s~${REGEX}~\1Atomic\2\3~g"
	    SED_SCRIPT2="s~${REGEX2}~visitAtomic\1~g"

	    for f in $FILES; do
	      echo "Processing: $f"
	      sed -i ".bak" -E "${SED_SCRIPT};${SED_SCRIPT2};${EA_SED_SCRIPT};${EA_SED_SCRIPT2}" $f
	    done

	Reviewers: sanjoy, deadalnix, apilipenko, anna, skatkov, mkazantsev
	Reviewed By: sanjoy
	Subscribers: hfinkel, jholewinski, arsenm, sdardis, nhaehnle, JDevlieghere, javed.absar, llvm-commits
	Differential Revision: https://reviews.llvm.org/D38419
	llvm-svn: 316950
*	[SelectionDAG] Add VSELECT demanded elts support to computeKnownBits	Simon Pilgrim	2017-10-30	1	-4/+4
	llvm-svn: 316947
*	[SelectionDAG] Add VSELECT support to computeKnownBits	Simon Pilgrim	2017-10-30	1	-0/+1
	llvm-svn: 316944
*	[SelectionDAG] Add SELECT demanded elts support to ComputeNumSignBits	Simon Pilgrim	2017-10-30	1	-4/+5
	llvm-svn: 316933
*	[SelectionDAG] Add SEXT/AND/XOR/Or demanded elts support to ComputeNumSignBits	Simon Pilgrim	2017-10-29	1	-7/+11
	llvm-svn: 316875
*	[SelectionDAG] Add SRA/SHL demanded elts support to ComputeNumSignBits	Simon Pilgrim	2017-10-29	1	-3/+29
	Introduce a isConstOrDemandedConstSplat helper function that can recognise a constant splat build vector for at least the demanded elts we care about.
	llvm-svn: 316866
*	[SelectionDAG] Add support for INSERT_SUBVECTOR to computeKnownBits	Simon Pilgrim	2017-10-28	1	-0/+34
	llvm-svn: 316847
*	[SelectionDAG] Support 'bit preserving' floating points bitcasts on computeKnownBits/ComputeNumSignBits	Simon Pilgrim	2017-10-28	1	-7/+15
	For cases where we know the floating point representations match the bitcasted integer equivalent, allow bitcasting to these types. This is especially useful for the X86 floating point compare results which return all/zero bits but as a floating point type.
	Differential Revision: https://reviews.llvm.org/D39289
	llvm-svn: 316831
*	[DAGCombine] Don't combine sext with extload if sextload is not supported and extload has multi users	Guozhi Wei	2017-10-27	1	-1/+5
	In DAGCombiner::visitSIGN_EXTEND_INREG, sext can be combined with extload even if sextload is not supported by the target. If sext is the only user of extload, this makes no big difference: no harm, no benefit. If extload has more than one user, the combined sextload may block extload from combining with other zexts, causing extra zext instructions to be generated, as demonstrated by the attached test case.
	This patch adds the constraint that when sextload is not supported by the target, sext can only be combined with extload if it is the only user of extload.
	Differential Revision: https://reviews.llvm.org/D39108
	llvm-svn: 316802
*	DAG: Fold fma (fneg x), K, y -> fma x, -K, y	Matt Arsenault	2017-10-27	1	-0/+8
	llvm-svn: 316753
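A small sketch of why the fold above is value-preserving, assuming K is a constant so that its negation can be computed once at compile time: the products (-x)*K and x*(-K) are the same real number, so the fused result rounds identically. The function names are illustrative only and do not come from the LLVM sources.

```cpp
#include <cassert>
#include <cmath>

// Original form: fma (fneg x), K, y
double fmaNegOperand(double X, double K, double Y) {
  return std::fma(-X, K, Y);
}

// Folded form: fma x, -K, y (the negation moves onto the constant)
double fmaNegConstant(double X, double K, double Y) {
  return std::fma(X, -K, Y);
}

// Asserts that both forms agree for one finite (non-NaN) input triple.
void checkFold(double X, double K, double Y) {
  assert(fmaNegOperand(X, K, Y) == fmaNegConstant(X, K, Y));
}
```

The folded form needs no separate negate instruction at run time, which is the presumable motivation for the combine.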
*	Add subclass data to the FoldingSetNode for MemIntrinsicSDNodes.	Sean Fertile	2017-10-27	1	-0/+2
	Not having the subclass data on a MemIntrinsicSDNode means it was possible to try to fold 2 nodes with the same operands but differing MMO flags. This would trip an assertion when trying to refine the alignment between the 2 MachineMemOperands.
	Differential Revision: https://reviews.llvm.org/D38898
	llvm-svn: 316737
*	DAG: Fix creating select with wrong condition type	Matt Arsenault	2017-10-25	1	-1/+10
	This code added in r297930 assumed that it could create a select with a condition type that is just an integer bitcast of the selected type. For AMDGPU any vselect is going to be scalarized (although the vector types are legal), and all select conditions must be i1 (the same as getSetCCResultType).
	This logic doesn't really make sense to me, but there's never really been a consistent policy in what the select condition mask type is supposed to be. Try to extend the logic for skipping the transform for condition types that aren't setccs. It doesn't seem quite right to me though, but checking conditions that seem more sensible (like whether the vselect is going to be expanded) doesn't work since this seems to depend on that also.
	llvm-svn: 316554
*	Implement salvageDebugInfo functionality for SelectionDAG.	Adrian Prantl	2017-10-24	2	-0/+35
	Similar to how llvm::salvageDebugInfo hooks into InstCombine, this adds a hook that can be invoked before an SDNode that is associated with an SDDbgValue is erased to capture the effect of the deleted node in a DIExpression.
	The motivating example is an SDDbgValue attached to an ADD operation that gets folded into a LOAD+OFFSET operation.
	rdar://problem/32121503
	llvm-svn: 316525
*	Use range-based for loop. NFC	Adrian Prantl	2017-10-24	1	-5/+2
	llvm-svn: 316496
*	Use range-based-for. NFC	Adrian Prantl	2017-10-24	1	-6/+5
	llvm-svn: 316485
*	Doxygenify comments.	Adrian Prantl	2017-10-24	1	-26/+25
	llvm-svn: 316466
*	[SelectionDAG] Add VSELECT support to ComputeNumSignBits	Simon Pilgrim	2017-10-24	1	-0/+1
	llvm-svn: 316457
*	Fix buildbot breakage	George Burgess IV	2017-10-23	1	-0/+1
	SP is only used in an assert. Caused by r316374.
	llvm-svn: 316377
*	Don't crash when we see unallocatable registers in clobbers	George Burgess IV	2017-10-23	3	-15/+34
	This fixes a bug where we'd crash given code like the test-case from https://bugs.llvm.org/show_bug.cgi?id=30792 . Instead, we let the offending clobber silently slide through.
	This doesn't fully fix said bug, since the assembler will still complain the moment it sees a crypto/fp/vector op, and we still don't diagnose calls that require vector regs.
	Differential Revision: https://reviews.llvm.org/D39030
	llvm-svn: 316374
*	[DAGCombine] Permit combining of shuffles of equivalent splat BUILD_VECTORs	Simon Pilgrim	2017-10-23	1	-5/+15
	combineShuffleOfScalars is very conservative about shuffled BUILD_VECTORs that can be combined together.
	This patch adds one additional case - if both BUILD_VECTORs represent splats of the same scalar value but with different UNDEF elements, then we should create a single splat BUILD_VECTOR, sharing only the UNDEF elements defined by the shuffle mask.
	Differential Revision: https://reviews.llvm.org/D38696
	llvm-svn: 316331
*	[SelectionDAG] Use dyn_cast without cast.	Florian Hahn	2017-10-21	1	-2/+2
	llvm-svn: 316258
*	[SelectionDAG] Use isa to silence unused variable warning (NFC).	Florian Hahn	2017-10-21	1	-1/+1
	llvm-svn: 316257
*	[SelectionDAG] Don't subject ConstantSDNodes to the depth limit in computeKnownBits and ComputeNumSignBits.	Craig Topper	2017-10-21	1	-10/+13
	We don't need to do any additional recursion, we just need to analyze the APInt stored in the node. This matches what the ValueTracking versions do for IR.
	llvm-svn: 316256
*	[SelectionDAG] Don't subject ISD::Constant to the depth limit in TargetLowering::SimplifyDemandedBits.	Craig Topper	2017-10-21	1	-5/+7
	Summary:
	We shouldn't recurse any further but it doesn't mean we shouldn't be able to give the known bits for a constant. The caller would probably like that we always return the right answer for a constant RHS. This matches what InstCombine does in this case.
	I don't have a test case because this showed up while trying to revive D31724.
	Reviewers: RKSimon, spatel
	Reviewed By: RKSimon
	Subscribers: arsenm, llvm-commits
	Differential Revision: https://reviews.llvm.org/D38967
	llvm-svn: 316255
*	[SelectionDAG] Add a check to getVectorShuffle to ensure that the only negative index we allow is -1.	Craig Topper	2017-10-19	1	-1/+2
	llvm-svn: 316183
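A minimal sketch of the kind of check described above, assuming the usual shuffle-mask convention in which -1 marks an undefined lane and other indices select from the concatenation of two input vectors of `NumElts` lanes each. The upper bound and the function name `isValidShuffleMask` are assumptions made for illustration; the commit message itself only states the rule about negative indices.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Returns true if every mask entry is either -1 (undef lane) or a valid index
// into the 2 * NumElts lanes of the two concatenated shuffle inputs.
bool isValidShuffleMask(const std::vector<int> &Mask, int NumElts) {
  assert(NumElts > 0 && "vectors must have at least one element");
  return std::all_of(Mask.begin(), Mask.end(), [NumElts](int M) {
    return M >= -1 && M < 2 * NumElts;
  });
}
```

With four-element inputs, `{0, 5, -1, 3}` passes, while a mask containing -2 or 8 is rejected.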
*	Untabify.	NAKAMURA Takumi	2017-10-18	1	-1/+1
	llvm-svn: 316079
*	[DAGCombine] Add SCALAR_TO_VECTOR undef handling to simplifyShuffleMask.	Simon Pilgrim	2017-10-17	1	-2/+6
	This allows us to simplify later visitVECTOR_SHUFFLE optimizations such as combineShuffleOfScalars.
	Noticed whilst working on D38696
	llvm-svn: 316017
*	Use the return value of UpdateNodeOperands(); in some cases, UpdateNodeOperands() modifies the node in-place and using the return value isn’t strictly necessary.	Mark Searles	2017-10-16	1	-1/+1
	However, it does not necessarily modify the node, but may return a resultant node if it already exists in the DAG. See comments in UpdateNodeOperands(). In that case, the return value must be used to avoid such scenarios as an infinite loop (node is assumed to have been updated, so added back to the worklist, and re-processed; however, node hasn’t changed so it is once again passed to UpdateNodeOperands(), assumed modified, added back to worklist; cycle infinitely repeats).
	Differential Revision: https://reviews.llvm.org/D38466
	llvm-svn: 315957
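The behaviour described above can be sketched with a toy uniquing container. This is an assumption-laden model, not the SelectionDAG implementation: `ToyDAG`, `Node`, `getNode`, and `updateOperands` are hypothetical names, and nodes are reduced to a pair of integer operands. The point it shows is that an update may hand back a different, pre-existing node and leave the input untouched, so a caller has to continue with the returned pointer.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <utility>
#include <vector>

// A node identified purely by its two operands (toy model).
struct Node {
  int Lhs;
  int Rhs;
};

class ToyDAG {
  std::vector<std::unique_ptr<Node>> Storage;    // owns all nodes
  std::map<std::pair<int, int>, Node *> Unique;  // one node per operand pair

public:
  // Returns the unique node for (L, R), creating it on first request.
  Node *getNode(int L, int R) {
    const auto Key = std::make_pair(L, R);
    auto It = Unique.find(Key);
    if (It != Unique.end())
      return It->second;
    Storage.push_back(std::make_unique<Node>(Node{L, R}));
    Node *N = Storage.back().get();
    Unique[Key] = N;
    return N;
  }

  // If a node with operands (L, R) already exists, returns it and leaves N
  // unmodified. Otherwise rewrites N in place and returns N.
  Node *updateOperands(Node *N, int L, int R) {
    assert(N && "node must not be null");
    const auto NewKey = std::make_pair(L, R);
    auto It = Unique.find(NewKey);
    if (It != Unique.end())
      return It->second;
    Unique.erase(std::make_pair(N->Lhs, N->Rhs));
    N->Lhs = L;
    N->Rhs = R;
    Unique[NewKey] = N;
    return N;
  }
};
```

A caller that ignores the result of `updateOperands` and keeps re-queuing `N` would see the same unchanged node on every pass, which is the infinite-loop scenario the commit message describes.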
*	Add iterator range MachineRegisterInfo::liveins(), adopt users, NFC	Krzysztof Parzyszek	2017-10-16	1	-4/+3
	llvm-svn: 315927
*	ISel type legalizer: debug messages. NFC.	Sjoerd Meijer	2017-10-16	2	-4/+17
	Minor addition and follow up of r314773 and r311533: this adds more debug messages to the type legalizer. For each node, it dumps legalization info for results and operands nodes, rather than just the final legalized node.
	Differential Revision: https://reviews.llvm.org/D38726
	llvm-svn: 315904
*	Reverting r315590; it did not include changes for llvm-tblgen, which is causing link errors for several people.	Aaron Ballman	2017-10-15	3	-8/+8
	Error LNK2019 unresolved external symbol "public: void __cdecl `anonymous namespace'::MatchableInfo::dump(void)const " (?dump@MatchableInfo@?A0xf4f1c304@@QEBAXXZ) referenced in function "public: void __cdecl `anonymous namespace'::AsmMatcherEmitter::run(class llvm::raw_ostream &)" (?run@AsmMatcherEmitter@?A0xf4f1c304@@QEAAXAEAVraw_ostream@llvm@@@Z) llvm-tblgen D:\llvm\2017\utils\TableGen\AsmMatcherEmitter.obj 1
	llvm-svn: 315854
*	DAG: Add opcode and source type to isFPExtFree	Matt Arsenault	2017-10-13	1	-235/+253
	This is only currently used for mad/fma transforms. This is the only case where it should be used for AMDGPU, so add an opcode to be sure.
	llvm-svn: 315740
*	DAG: Add flags to dumps	Matt Arsenault	2017-10-13	1	-0/+30
	llvm-svn: 315690
*	[SelectionDAG] Cleanup the SIGN_EXTEND_INREG handling in computeKnownBits. NFCI	Craig Topper	2017-10-13	1	-26/+14
	Use less temporary APInts. Use bit counting more. Don't call getScalarSizeInBits so many places, just capture it once.
	llvm-svn: 315671
*	[SelectionDAG] Fix typo in comment. NFC	Craig Topper	2017-10-13	1	-1/+1
	llvm-svn: 315670
*	[SelectionDAG] Correct the early out in SelectionDAG::getZeroExtendInReg to work properly for vector types.	Craig Topper	2017-10-13	1	-1/+1
	I don't know if we ever hit this case or not. Turning it into an assert only fired on expanding some atomic operation in a SystemZ lit test.
	llvm-svn: 315648
*	[SelectionDAG] Const-correct the DemandedMask argument to one of the overloads of SimplifyDemandedBits. NFC	Craig Topper	2017-10-12	1	-1/+1
	llvm-svn: 315641
*	[SelectionDAG] Simplify the ISD::SIGN_EXTEND/ZERO_EXTEND handling to use less temporary APInts by counting bits instead. NFCI	Craig Topper	2017-10-12	1	-25/+11
	llvm-svn: 315628
*	Implement custom lowering for ISD::CTTZ_ZERO_UNDEF and ISD::CTTZ.	Wei Ding	2017-10-12	1	-4/+15
	Differential Revision: http://reviews.llvm.org/D37348
	llvm-svn: 315610
*	[dump] Remove NDEBUG from test to enable dump methods [NFC]	Don Hinton	2017-10-12	3	-8/+8
	Summary:
	Add LLVM_FORCE_ENABLE_DUMP cmake option, and use it along with LLVM_ENABLE_ASSERTIONS to set LLVM_ENABLE_DUMP.
	Remove NDEBUG and only use LLVM_ENABLE_DUMP to enable dump methods.
	Move definition of LLVM_ENABLE_DUMP from config.h to llvm-config.h so it'll be picked up by public headers.
	Differential Revision: https://reviews.llvm.org/D38406
	llvm-svn: 315590
*	Revert r307036 because of PR34919.	Wei Mi	2017-10-12	1	-92/+0
	llvm-svn: 315540
*	[DAGCombiner] convert insertelement of bitcasted vector into shuffle	Sanjay Patel	2017-10-11	1	-3/+62
	Eg:
	insert v4i32 V, (v2i16 X), 2 --> shuffle v8i16 V', X', {0,1,2,3,8,9,6,7}
	This is a generalization of the IR fold in D38316 to handle insertion into a non-undef vector. We may want to abandon that one if we can't find value in squashing the more specific pattern sooner.
	We're using the existing legal shuffle target hook to avoid AVX512 horror with vXi1 shuffles.
	There may be room for improvement in the shuffle lowering here, but that would be follow-up work.
	Differential Revision: https://reviews.llvm.org/D38388
	llvm-svn: 315460
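To make the mask in the example above concrete, here is a sketch that rebuilds it from three numbers. It assumes the narrow source vector has already been widened to the full narrow-lane count, so its lanes are addressed starting at index `NumWideElts * Ratio` in the shuffle. The function and parameter names are hypothetical and not taken from the DAGCombiner source.

```cpp
#include <cassert>
#include <vector>

// NumWideElts: element count of the destination vector (4 for v4i32).
// Ratio:       narrow lanes per wide element (2 for i32 built from i16).
// InsertIdx:   index of the wide element being overwritten.
std::vector<int> buildInsertShuffleMask(int NumWideElts, int Ratio,
                                        int InsertIdx) {
  assert(NumWideElts > 0 && Ratio > 0 && "sizes must be positive");
  assert(InsertIdx >= 0 && InsertIdx < NumWideElts && "index out of range");
  const int NumNarrowElts = NumWideElts * Ratio;
  std::vector<int> Mask(NumNarrowElts);
  // Start from the identity mask: every lane comes from the first operand.
  for (int I = 0; I < NumNarrowElts; ++I)
    Mask[I] = I;
  // Redirect the lanes of the inserted element to the low lanes of the
  // second operand, whose indices begin at NumNarrowElts.
  for (int J = 0; J < Ratio; ++J)
    Mask[InsertIdx * Ratio + J] = NumNarrowElts + J;
  return Mask;
}
```

Calling `buildInsertShuffleMask(4, 2, 2)` reproduces the mask `{0,1,2,3,8,9,6,7}` from the commit message.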
*	[TargetLowering] Correctly track NumFixedArgs field of CallLoweringInfo	Alex Bradbury	2017-10-11	1	-0/+1
	The NumFixedArgs field of CallLoweringInfo is used by TargetLowering::LowerCallTo to determine whether a given argument is passed using the vararg calling convention or not (specifically, to set IsFixed for each ISD::OutputArg).
	Firstly, CallLoweringInfo::setLibCallee and CallLoweringInfo::setCallee both incorrectly set NumFixedArgs based on the _previous_ args list. Secondly, TargetLowering::LowerCallTo failed to increment NumFixedArgs when modifying the argument list so a pointer is passed for the return value.
	If your backend uses the IsFixed property or directly accesses NumFixedArgs, it is _possible_ this change could result in codegen changes (although the previous behaviour would have been incorrect). No such cases have been identified during code review for any in-tree architecture.
	Differential Revision: https://reviews.llvm.org/D37898
	llvm-svn: 315457
*	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).	Eugene Zelenko	2017-10-10	2	-65/+109
	llvm-svn: 315380
*	[DAGCombine] Fix for shuffle to vector extend for non power 2 vectors	David Stuttard	2017-10-10	1	-0/+3
	Summary:
	See https://llvm.org/PR33743 for more details
	It seems that for non-power of 2 vector sizes, the algorithm can produce non-matching sizes for input and result causing an assert.
	This usually isn't a problem as the isAnyExtend check will weed these out, but in some cases (most often with lots of undefined values for the mask indices) it can pass this check for non power of 2 vectors.
	Adding in an extra check that ensures that bit size will match for the result and input (as required)
	Subscribers: nhaehnle
	Differential Revision: https://reviews.llvm.org/D35241
	llvm-svn: 315307
*	Rename OptimizationDiagnosticInfo.* to OptimizationRemarkEmitter.*	Adam Nemet	2017-10-09	1	-1/+1
	Sync it up with the name of the class actually defined here. This has been bothering me for a while...
	llvm-svn: 315249