bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SelectionDAG] ComputeNumSignBits - cleanup ROTL/ROTR wrapping to match ↵	Simon Pilgrim	2017-09-14	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	DAGCombine etc. Use RotAmt.urem(VTBits) instead of AND(RotAmt, VTBits - 1) TBH I don't expect non-power-of-2 types to be created, but it makes the logic clearer and matches what we do in other rotation combines. llvm-svn: 313245
*	Add llvm.codeview.annotation to implement MSVC __annotation	Reid Kleckner	2017-09-05	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This intrinsic represents a label with a list of associated metadata strings. It is modelled as reading and writing inaccessible memory so that it won't be removed as dead code. I think the intention is that the annotation strings should appear at most once in the debug info, so I marked it noduplicate. We are allowed to inline code with annotations as long as we strip the annotation, but that can be done later. Reviewers: majnemer Subscribers: eraman, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D36904 llvm-svn: 312569
*	Add ‘llvm.experimental.constrained.fma‘ Intrinsic.	Wei Ding	2017-08-24	1	-0/+6
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D36335 llvm-svn: 311629
*	[SelectionDAG] Make ISD::isConstantSplatVector always return an element ↵	Craig Topper	2017-08-22	1	-5/+3
\| \| \| \| \| \| \| \| \| \| \| \|	sized APInt. This partially reverts r311429 in favor of making ISD::isConstantSplatVector do something not confusing. Turns out the only other user of it was also having to deal with the weird property of it returning a smaller size. So rather than continue to deal with this quirk everywhere, just make the interface do something sane. Differential Revision: https://reviews.llvm.org/D37039 llvm-svn: 311510
*	[SelectionDAG] Add getNode debug messages	Sjoerd Meijer	2017-08-22	1	-8/+37
\| \| \| \| \| \| \| \| \| \|	This adds debug messages to various functions that create new SDValue nodes. This is e.g. useful to have during legalization, as otherwise it can prints legalization info of nodes that did not appear in the dumps before. Differential Revision: https://reviews.llvm.org/D36984 llvm-svn: 311444
*	[X86] Prevent several calls to ISD::isConstantSplatVector from returning a ↵	Craig Topper	2017-08-22	1	-4/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	narrower APInt than the original scalar type ISD::isConstantSplatVector can shrink to the smallest splat width. But we don't check the size of the resulting APInt at all. This can cause us to misinterpret the results. This patch just adds a flag to prevent the APInt from changing width. Fixes PR34271. Differential Revision: https://reviews.llvm.org/D36996 llvm-svn: 311429
*	[Debug info] Transfer DI to fragment expressions for split integer values.	Jonas Devlieghere	2017-08-18	1	-5/+7
\| \| \| \| \| \| \| \| \| \| \|	This patch teaches the SDag type legalizer how to split up debug info for integer values that are split into a hi and lo part. (re-commit) Differential Revision: https://reviews.llvm.org/D36805 llvm-svn: 311181
*	Revert "[Debug info] Transfer DI to fragment expressions for split integer ↵	Jonas Devlieghere	2017-08-17	1	-10/+8
\| \| \| \| \| \| \| \|	values." This reverts commit r311102. llvm-svn: 311111
*	[Debug info] Transfer DI to fragment expressions for split integer values.	Jonas Devlieghere	2017-08-17	1	-8/+10
\| \| \| \| \| \| \| \| \|	This patch teaches the SDag type legalizer how to split up debug info for integer values that are split into a hi and lo part. Differential Revision: https://reviews.llvm.org/D36805 llvm-svn: 311102
*	[SelectionDAG] combine vextract (v1iX extract_subvector(vNiX, Idx))	Elad Cohen	2017-08-14	1	-0/+9
\| \| \| \| \| \| \| \| \|	into vextract(vNiX,Idx) when creating vextract with getNode(). This case appeared in AVX512 after fixing pr33349 in r310552. Differential revision: https://reviews.llvm.org/D36571 llvm-svn: 310828
*	[X86] Keep dependencies when constructing loads in combineStore	Nirav Dave	2017-08-10	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Preserve chain dependecies between old and new loads constructed to prevent loads from reordering below later stores. Fixes PR34088. Reviewers: craig.topper, spatel, RKSimon, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36528 llvm-svn: 310604
*	[SelectionDAG] Allow constant folding for implicitly truncating BUILD_VECTOR ↵	Guy Blank	2017-08-10	1	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	nodes. In FoldConstantArithmetic, handle BUILD_VECTOR nodes that do implicit truncation on the elements. This is similar to what is done in FoldConstantVectorArithmetic. Differential Revision: https://reviews.llvm.org/D36506 llvm-svn: 310593
*	DAG: Provide access to Pass instance from SelectionDAG	Matt Arsenault	2017-08-03	1	-1/+3
\| \| \| \| \| \|	This allows accessing an analysis pass during lowering. llvm-svn: 309991
*	[SelectionDAG][X86] CombineBT - more aggressively determine demanded bits	Simon Pilgrim	2017-07-29	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \|	This patch is in 2 parts: 1 - replace combineBT's use of SimplifyDemandedBits (hasOneUse only) with SelectionDAG::GetDemandedBits to more aggressively determine the lower bits used by BT. 2 - update SelectionDAG::GetDemandedBits to support ANY_EXTEND - if the demanded bits are only in the non-extended portion, then peek through and demand from the source value and then ANY_EXTEND that if we found a match. Differential Revision: https://reviews.llvm.org/D35896 llvm-svn: 309486
*	Remove the unused dbg.value offset from SelectionDAG (NFC)	Adrian Prantl	2017-07-28	1	-11/+9
\| \| \| \| \| \| \|	Followup to r309426. rdar://problem/33580047 llvm-svn: 309436
*	[DAG] Move DAGCombiner::GetDemandedBits to SelectionDAG	Simon Pilgrim	2017-07-25	1	-3/+54
\| \| \| \| \| \| \| \|	This patch moves the DAGCombiner::GetDemandedBits function to SelectionDAG::GetDemandedBits as a first step towards making it easier for targets to get to the source of any demanded bits without the limitations of SimplifyDemandedBits. Differential Revision: https://reviews.llvm.org/D35841 llvm-svn: 308983
*	Enhance synchscope representation	Konstantin Zhuravlyov	2017-07-11	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	OpenCL 2.0 introduces the notion of memory scopes in atomic operations to global and local memory. These scopes restrict how synchronization is achieved, which can result in improved performance. This change extends existing notion of synchronization scopes in LLVM to support arbitrary scopes expressed as target-specific strings, in addition to the already defined scopes (single thread, system). The LLVM IR and MIR syntax for expressing synchronization scopes has changed to use syncscope("<scope>"), where <scope> can be "singlethread" (this replaces singlethread keyword), or a target-specific name. As before, if the scope is not specified, it defaults to CrossThread/System scope. Implementation details: - Mapping from synchronization scope name/string to synchronization scope id is stored in LLVM context; - CrossThread/System and SingleThread scopes are pre-defined to efficiently check for known scopes without comparing strings; - Synchronization scope names are stored in SYNC_SCOPE_NAMES_BLOCK in the bitcode. Differential Revision: https://reviews.llvm.org/D21723 llvm-svn: 307722
*	Rewrite areNonVolatileConsecutiveLoads to use BaseIndexOffset	Nirav Dave	2017-07-05	1	-39/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Relanding after rewriting undef.ll test to avoid host-dependant endianness. As discussed in D34087, rewrite areNonVolatileConsecutiveLoads using generic checks. Also, propagate missing local handling from there to BaseIndexOffset checks. Tests of note: * test/CodeGen/X86/build-vector* - Improved. * test/CodeGen/BPF/undef.ll - Improved store alignment allows an additional store merge * test/CodeGen/X86/clear_upper_vector_element_bits.ll - This is a case we already do not handle well. Here, the DAG is improved, but scheduling causes a code size degradation. Reviewers: RKSimon, craig.topper, spatel, andreadb, filcab Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D34472 llvm-svn: 307114
*	Revert "[DAG] Rewrite areNonVolatileConsecutiveLoads to use BaseIndexOffset"	Nirav Dave	2017-06-30	1	-8/+39
\| \| \| \| \| \| \|	This reverts commit r306819 which appears be exposing underlying issues in a stage1 ppc64be build llvm-svn: 306820
*	[DAG] Rewrite areNonVolatileConsecutiveLoads to use BaseIndexOffset	Nirav Dave	2017-06-30	1	-39/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As discussed in D34087, rewrite areNonVolatileConsecutiveLoads using generic checks. Also, propagate missing local handling from there to BaseIndexOffset checks. Tests of note: * test/CodeGen/X86/build-vector* - Improved. * test/CodeGen/BPF/undef.ll - Improved store alignment allows an additional store merge * test/CodeGen/X86/clear_upper_vector_element_bits.ll - This is a case we already do not handle well. Here, the DAG is improved, but scheduling causes a code size degradation. Reviewers: RKSimon, craig.topper, spatel, andreadb, filcab Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D34472 llvm-svn: 306819
*	[SelectionDAG] set dereferenceable flag when expanding memcpy/memmove	Hiroshi Inoue	2017-06-24	1	-8/+25
\| \| \| \| \| \| \| \| \| \|	When SelectionDAG expands memcpy (or memmove) call into a sequence of load and store instructions, it disregards dereferenceable flag even the source pointer is known to be dereferenceable. This results in an assertion failure if SelectionDAG commonizes a load instruction generated for memcpy with another load instruction for the source pointer. This patch makes SelectionDAG to set the dereferenceable flag for the load instructions properly to avoid the assertion failure. Differential Revision: https://reviews.llvm.org/D34467 llvm-svn: 306209
*	[DAG] add helper to bind memop chains; NFCI	Sanjay Patel	2017-06-12	1	-0/+18
\| \| \| \| \| \| \| \| \| \|	This step is just intended to reduce code duplication rather than change any functionality. A follow-up would be to replace PPCTargetLowering::spliceIntoChain() usage with this new helper. Differential Revision: https://reviews.llvm.org/D33649 llvm-svn: 305192
*	Prevent RemoveDeadNodes from deleted already deleted node.	Nirav Dave	2017-06-09	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This prevents against assertion errors like PR32659 which occur from a replacement deleting a node after it's been added to the list argument of RemoveDeadNodes. The specific failure from PR32659 does not currently happen, but it is still potentially possible. The underlying cause is that the callers of the change dfunction builds up a list of nodes to delete after having moved their uses and it possible that a move of a later node will cause a previously deleted nodes to be deleted. Reviewers: bkramer, spatel, davide Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33731 llvm-svn: 305070
*	[DAG] Move SelectionDAG::isCommutativeBinOp to TargetLowering.	Simon Pilgrim	2017-06-07	1	-3/+3
\| \| \| \| \| \| \| \|	This will allow commutation of target-specific DAG nodes in future patches Differential Revision: https://reviews.llvm.org/D33882 llvm-svn: 304911
*	Sort the remaining #include lines in include/... and lib/....	Chandler Carruth	2017-06-06	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787
*	[llvm] Remove double semicolons	Mandeep Singh Grang	2017-06-06	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: craig.topper, arsenm, mehdi_amini Reviewed By: mehdi_amini Subscribers: mehdi_amini, wdng, nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33924 llvm-svn: 304767
*	[CodeGen] Fix Windows builds which treat warnings as errors, broken in r304621.	Eugene Zelenko	2017-06-03	1	-1/+1
\| \| \| \|	llvm-svn: 304627
*	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-06-03	1	-59/+62
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 304621
*	[ARM] Fix lowering of misaligned memcpy/memset	John Brawn	2017-05-26	1	-12/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently getOptimalMemOpType returns i32 for large enough sizes without checking for alignment, leading to poor code generation when misaligned accesses aren't permitted as we generate a word store then later split it up into byte stores. This means we inadvertantly go over the MaxStoresPerMemcpy limit and for memset we splat the memset value into a word then immediately split it up again. Fix this by leaving it up to FindOptimalMemOpLowering to figure out which type to use, but also fix a bug there where it wasn't correctly checking if misaligned memory accesses are allowed. Differential Revision: https://reviews.llvm.org/D33442 llvm-svn: 303990
*	Add constrained intrinsics for some libm-equivalent operations	Andrew Kaylor	2017-05-25	1	-0/+57
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D32319 llvm-svn: 303922
*	SimplifyLibCalls: Optimize wcslen	Matthias Braun	2017-05-19	1	-19/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Refactor the strlen optimization code to work for both strlen and wcslen. This especially helps with programs in the wild where people pass L"string"s to const std::wstring& function parameters and the wstring constructor gets inlined. This also fixes a lingerind API problem/bug in getConstantStringInfo() where zeroinitializers would always give you an empty string (without a length) back regardless of the actual length of the initializer which did not work well in the TrimAtNul==false causing the PR mentioned below. Note that the fixed getConstantStringInfo() needed fixes to SelectionDAG memcpy lowering and may lead to some cases for out-of-bounds zeroinitializer accesses not getting optimized anymore. So some code with UB may produce out of bound memory reads now instead of just producing zeros. The refactoring "accidentally" fixes http://llvm.org/PR32124 Differential Revision: https://reviews.llvm.org/D32839 llvm-svn: 303461
*	[SelectionDAG] Added support for EXTRACT_SUBVECTOR/CONCAT_VECTORS ↵	Simon Pilgrim	2017-05-13	1	-7/+29
\| \| \| \| \| \|	demandedelts in ComputeNumSignBits llvm-svn: 302997
*	[SelectionDAG] Add VECTOR_SHUFFLE support to ComputeNumSignBits	Simon Pilgrim	2017-05-13	1	-0/+34
\| \| \| \|	llvm-svn: 302993
*	[ValueTracking] Remove const_casts on several calls to computeKnownBits and ↵	Craig Topper	2017-05-13	1	-2/+1
\| \| \| \| \| \|	ComputeSignBit. NFC llvm-svn: 302991
*	[KnownBits] Add bit counting methods to KnownBits struct and use them where ↵	Craig Topper	2017-05-12	1	-30/+25
\| \| \| \| \| \| \| \| \| \| \| \|	possible This patch adds min/max population count, leading/trailing zero/one bit counting methods. The min methods return answers based on bits that are known without considering unknown bits. The max methods give answers taking into account the largest count that unknown bits could give. Differential Revision: https://reviews.llvm.org/D32931 llvm-svn: 302925
*	Use SDValue::getOperand() helper. NFCI.	Simon Pilgrim	2017-05-12	1	-16/+14
\| \| \| \|	llvm-svn: 302896
*	Strip trailing whitespace. NFCI.	Simon Pilgrim	2017-05-11	1	-1/+1
\| \| \| \|	llvm-svn: 302784
*	Introduce experimental generic intrinsics for horizontal vector reductions.	Amara Emerson	2017-05-09	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	- This change allows targets to opt-in to using them instead of the log2 shufflevector algorithm. - The SLP and Loop vectorizers have the common code to do shuffle reductions factored out into LoopUtils, and now have a unified interface for generating reductions regardless of the preference of the target. LoopUtils now uses TTI to determine what kind of reductions the target wants to handle. - For CodeGen, basic legalization support is added. Differential Revision: https://reviews.llvm.org/D30086 llvm-svn: 302514
*	[KnownBits] Add wrapper methods for setting and clear all bits in the ↵	Craig Topper	2017-05-05	1	-6/+3
\| \| \| \| \| \| \| \| \| \|	underlying APInts in KnownBits. This adds routines for reseting KnownBits to unknown, making the value all zeros or all ones. It also adds methods for querying if the value is zero, all ones or unknown. Differential Revision: https://reviews.llvm.org/D32637 llvm-svn: 302262
*	[SelectionDAG] Improve known bits support for CTPOP.	Craig Topper	2017-05-04	1	-1/+4
\| \| \| \| \| \|	This is based on the same concept from ValueTracking's version of computeKnownBits. llvm-svn: 302110
*	[KnownBits] Add zext, sext, and trunc methods to KnownBits	Craig Topper	2017-05-03	1	-32/+16
\| \| \| \| \| \| \| \|	This patch adds zext, sext, and trunc methods to KnownBits and uses them where possible. Differential Revision: https://reviews.llvm.org/D32784 llvm-svn: 302088
*	[SelectionDAG] Improve support for promotion of <1 x fX> floating point ↵	Simon Pilgrim	2017-05-02	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \|	argument types (PR31088) PR31088 demonstrated that we were assuming that only integers require promotion from <1 x iX> types, when in fact float types may require it as well - in this case half floats. This patch adds support for extension/truncation for both integer and float types. Differential Revision: https://reviews.llvm.org/D32391 llvm-svn: 301910
*	[SelectionDAG] Use known ones to provide a better bound for the known zeros ↵	Craig Topper	2017-05-01	1	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	for CTTZ/CTLZ operations. This is the SelectionDAG version of D32521. If know where at least one 1 is located in the input to these intrinsics we can place an upper bound on the number of bits needed to represent the count and thus increase the number of known zeros in the output. I think we can also refine this further for CTTZ_UNDEF/CTLZ_UNDEF by assuming that the answer will never be BitWidth. I've left this out for now because it caused other test failures across multiple targets. Usually because of turning ADD into OR based on this new information. I'll fix CTPOP in a future patch. Differential Revision: https://reviews.llvm.org/D32692 llvm-svn: 301806
*	Generalize the specialized flag-carrying SDNodes by moving flags into SDNode.	Amara Emerson	2017-05-01	1	-56/+26
\| \| \| \| \| \| \| \|	This removes BinaryWithFlagsSDNode, and flags are now all passed by value. Differential Revision: https://reviews.llvm.org/D32527 llvm-svn: 301803
*	Do not legalize large add with addc/adde, introduce addcarry and do it with ↵	Amaury Sechet	2017-04-30	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	uaddo/addcarry Summary: As per discution on how to get better codegen an large int legalization, it became clear that using a glue for the carry was preventing several desirable optimizations. Passing the carry down as a value allow for more flexibility. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D29872 llvm-svn: 301775
*	[APInt] Replace calls to setBits with more specific calls to setBitsFrom and ↵	Craig Topper	2017-04-30	1	-1/+1
\| \| \| \| \| \|	setLowBits where possible. llvm-svn: 301768
*	[KnownBits] Add methods for determining if the known bits represent a ↵	Craig Topper	2017-04-29	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	negative/nonnegative number and add methods for changing the negative/nonnegative state Summary: This patch adds isNegative, isNonNegative for querying whether the sign bit is known. It also adds makeNegative and makeNonNegative for controlling the sign bit. Reviewers: RKSimon, spatel, davide Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32651 llvm-svn: 301747
*	[APInt] Add clearSignBit method. Use it and setSignBit in a few places. NFCI	Craig Topper	2017-04-28	1	-1/+1
\| \| \| \|	llvm-svn: 301656
*	[DAGCombiner] Add ComputeNumSignBits vector demanded elements support to ↵	Simon Pilgrim	2017-04-28	1	-1/+39
\| \| \| \| \| \| \| \|	ASHR and INSERT_VECTOR_ELT (reapplied) Reapplied r299221 after fix for nondeterminism in ThinLTO builder (rL301599), with extra check for implicit truncation of inserted element. llvm-svn: 301644
*	[ValueTracking] Convert computeKnownBitsFromRangeMetadata to use KnownBits ↵	Craig Topper	2017-04-28	1	-1/+1
\| \| \| \| \| \|	struct. llvm-svn: 301626