bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[DAG] LegalizeVectorTypes - reduce scope of local variables. NFCI.	Simon Pilgrim	2019-05-28	1	-4/+2
\| \| \| \| \| \|	Move the element index/count variables into the block where they are actually used - appeases cppcheck and helps avoid shadow variable warnings. llvm-svn: 361821
*	Stop undef fragments from closing non-overlapping fragments	David Stenberg	2019-05-28	2	-14/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When DwarfDebug::buildLocationList() encountered an undef debug value, it would truncate all open values, regardless of whether they overlapped. This patch changes that so it only truncates overlapping fragments. This change unearthed a bug that I had introduced in D57511, which I have fixed in this patch. The code in DebugHandlerBase that changes labels for parameter debug values could break DwarfDebug's assumption that the labels for the entries in the debug value history are monotonically increasing. Before this patch, that bug could result in location list entries whose ending address was lower than the beginning address, and with the changes for undef debug values that this patch introduces it could trigger an assertion, due to attempting to emit location list entries with empty ranges. A reproducer for the bug is added in param-reg-const-mix.mir. Reviewers: aprantl, jmorse, probinson Reviewed By: aprantl Subscribers: javed.absar, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D62379 llvm-svn: 361820
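The overlap rule described in this message can be pictured with a small sketch. It is not the DwarfDebug code: fragments are modelled as hypothetical `(offset_in_bits, size_in_bits)` tuples, and `fragments_overlap` and `close_on_undef` are names invented for illustration.

```python
def fragments_overlap(a, b):
    """True if two (offset_in_bits, size_in_bits) fragments share any bit."""
    a_off, a_size = a
    b_off, b_size = b
    return a_off < b_off + b_size and b_off < a_off + a_size


def close_on_undef(open_fragments, undef_fragment):
    """Split the open fragments into (closed, still_open) for one undef value.

    Only fragments overlapping the undef fragment are closed; the rest stay
    open, which is the behaviour the commit message describes.
    """
    closed = [f for f in open_fragments if fragments_overlap(f, undef_fragment)]
    still_open = [f for f in open_fragments if not fragments_overlap(f, undef_fragment)]
    return closed, still_open


# Two 32-bit halves of a 64-bit variable; the upper half becomes undef.
closed, still_open = close_on_undef([(0, 32), (32, 32)], (32, 32))
print(closed, still_open)
```

Under the old behaviour described above, both halves would have been closed; in this model only the upper half is.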
*	MIR: Fix printer crashing on dead CSR frame indexes	Matt Arsenault	2019-05-28	1	-0/+3
\| \| \| \|	llvm-svn: 361819
*	[x86] fix 256-bit vector store splitting to honor 'volatile'	Sanjay Patel	2019-05-28	1	-14/+30
\| \| \| \| \| \| \| \| \| \| \|	Forking this out of the discussion in D62498 (and assuming that will be committed later, so adding the helper function here). The LangRef says: "the backend should never split or merge target-legal volatile load/store instructions." Differential Revision: https://reviews.llvm.org/D62506 llvm-svn: 361815
*	[X86] Custom lower CONCAT_VECTORS of v2i1	Benjamin Kramer	2019-05-28	2	-7/+3
\| \| \| \| \| \| \|	The generic legalizer cannot handle this. Add an assert instead of silently miscompiling vectors with elements smaller than 8 bits. llvm-svn: 361814
*	[NFC] Test commit, delete trailing whitespace	Graham Hunter	2019-05-28	1	-1/+1
\| \| \| \|	llvm-svn: 361813
*	Re-commit r357452 (take 2): "SimplifyCFG SinkCommonCodeFromPredecessors: ↵	Hans Wennborg	2019-05-28	1	-14/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Also sink function calls without used results (PR41259)" This was reverted in r360086 as it was suspected of causing mysterious test failures internally. However, it was never concluded that this patch was the root cause. > The code was previously checking that candidates for sinking had exactly > one use or were a store instruction (which can't have uses). This meant > we could sink call instructions only if they had a use. > > That limitation seemed a bit arbitrary, so this patch changes it to > "instruction has zero or one use" which seems more natural and removes > the need to special-case stores. > > Differential revision: https://reviews.llvm.org/D59936 llvm-svn: 361811
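The change in the sinking condition can be illustrated with a toy model. This is only a sketch of the use-count check quoted above, not the SimplifyCFG code; the `Inst` class and both predicate names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Inst:
    opcode: str    # e.g. "call", "store", "add"
    num_uses: int  # number of users of the instruction's result


def sinkable_before(inst):
    """Old rule: exactly one use, or a store (which cannot have uses)."""
    return inst.num_uses == 1 or inst.opcode == "store"


def sinkable_after(inst):
    """New rule: zero or one use, with no special case for stores."""
    return inst.num_uses <= 1


unused_call = Inst("call", 0)
store = Inst("store", 0)
print(sinkable_before(unused_call), sinkable_after(unused_call))
```

A call whose result is unused fails the old check and passes the new one, while stores pass both.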
*	[CorrelatedValuePropagation] Fix prof branch_weights metadata handling for ↵	Yevgeny Rouban	2019-05-28	1	-56/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	SwitchInst This patch fixes the CorrelatedValuePropagation pass to keep prof branch_weights metadata of SwitchInst consistent. It makes use of SwitchInstProfUpdateWrapper. New tests are added. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D62126 llvm-svn: 361808
*	[X86] X86CmovConverterPass::collectCmovCandidates - fix uninitialized ↵	Simon Pilgrim	2019-05-28	1	-1/+2
\| \| \| \| \| \|	variable warnings. NFCI. llvm-svn: 361804
*	[AArch64][SVE2] Asm: support SVE2 Floating Point Convert Group	Cullen Rhodes	2019-05-28	2	-0/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Patch adds support for the following instructions: SVE2 floating-point convert precision: * FCVTXNT, FCVTNT, FCVTLT The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62382 llvm-svn: 361801
*	[AArch64][SVE2] Asm: support SVE2 Crypto Extensions Group	Cullen Rhodes	2019-05-28	2	-0/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Patch adds support for the following instructions: SVE2 crypto constructive binary operations: * SM4EKEY, RAX1 SVE2 crypto destructive binary operations: * AESE, AESD, SM4E SVE2 crypto unary operations: * AESMC, AESIMC AESE, AESD, AESMC and AESIMC are enabled with +sve2-aes. SM4E and SM4EKEY are enabled with +sve2-sm4. RAX1 is enabled with +sve2-sha3. The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D62307 llvm-svn: 361797
*	[AArch64][SVE2] Asm: support SVE2 Histogram Computation Groups	Cullen Rhodes	2019-05-28	2	-0/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Patch adds support for the following instructions: SVE2 histogram generation (segment): * HISTSEG SVE2 histogram generation (vector): * HISTCNT The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62306 llvm-svn: 361796
*	[AArch64][SVE2] Asm: support SVE2 Misc Group	Cullen Rhodes	2019-05-28	2	-0/+98
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Patch adds support for the following instructions: SVE2 bitwise exclusive-or interleaved: * EORBT, EORTB SVE2 bitwise permute: * BEXT, BDEP, BGRP SVE2 bitwise shift left long: * SSHLLB, SSHLLT, USHLLB, USHLLT SVE2 integer add/subtract interleaved long: * SADDLBT, SSUBLBT, SSUBLTB BDEP, BEXT and BGRP are enabled with SVE2 feature +bitperm, all other instructions in this group are enabled with +sve2. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62304 llvm-svn: 361795
*	[InlineCost] Fix a couple comments. NFC	Craig Topper	2019-05-28	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	Replace "unary operator" with "unary instruction" in visitUnaryInstruction since we now have a UnaryOperator class which might need its own visit function. Fix a copy/paste in visitCastInst that appears to have been copied from visitPtrToInt. llvm-svn: 361794
*	[CostModel] Add really basic support for being able to query the cost of the ↵	Craig Topper	2019-05-28	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	FNeg instruction. Summary: This reuses the getArithmeticInstrCost, but passes dummy values of the second operand flags. The X86 costs are wrong and can be improved in a follow up. I just wanted to stop it from reporting an unknown cost first. Reviewers: RKSimon, spatel, andrew.w.kaylor, cameron.mcinally Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62444 llvm-svn: 361788
*	llvm-undname: Remove unreachable statement	Nico Weber	2019-05-28	1	-1/+0
\| \| \| \|	llvm-svn: 361786
*	llvm-undname: Extract demangleMD5Name() method; no behavior change	Nico Weber	2019-05-27	1	-30/+34
\| \| \| \|	llvm-svn: 361783
*	[RuntimeDyld][ARM] Fix an incorrect assertion condition.	Lang Hames	2019-05-27	1	-1/+1
\| \| \| \| \| \|	Fixes https://llvm.org/PR42036 llvm-svn: 361782
*	RegAllocFast: Set MayLiveAcrossBlocks when allocating uses	Matt Arsenault	2019-05-27	1	-1/+27
\| \| \| \| \| \| \| \| \| \|	Setting mayLiveOut based only on use instructions after allocating the def block did not work if the use block was allocated before the def block, since the virtual register uses were already removed. Fixes bug 41973. llvm-svn: 361781
*	[SelectionDAG] fold concat of extract subvectors	Sanjay Patel	2019-05-27	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is derived from the related fold for build vectors. We also have a version of this in DAGCombiner. The benefit of having this fold at node creation time is (1) efficiency and (2) preventing infinite looping from creating patterns that should not exist in the first place. Currently, the inf-loop could happen with MergeConsecutiveStores() because it naively creates concat of extracts when forming a wider vector store. That could fight with target-specific store narrowing. llvm-svn: 361780
*	[SelectionDAG] fix formatting and redundant comments; NFC	Sanjay Patel	2019-05-27	1	-7/+6
\| \| \| \| \| \| \| \| \|	There's a possible missing fold here for extracting from the same source vector. It's similar to a check that we use to squash a build vector with all extracted elements from the same source vector. llvm-svn: 361778
*	[SelectionDAG] Enhance the simplification of `copyto` from `implicit-def`.	Michael Liao	2019-05-27	2	-31/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: - The current implementation simplifies the case where the source of `copyto` is `implicit-def`ed. However, it only works when that `implicit-def` is single-used since it detects that from `implicit-def` and cannot determine which destination vreg should be used if there are multiple uses. - This patch moves that detection to the point where `copyto` is being emitted. If that `copyto`'s source is defined from `implicit-def`, it simplifies it. Hence, it works even when that `implicit-def` is multi-used. - Apart from simplifying the internal IR, it won't improve the quality of code generation. However, it helps to detect `implicit-def` in a straightforward manner in some passes, such as `si-i1-copies`. A test case is added. Reviewers: sunfish, nhaehnle Subscribers: jvesely, hiraditya, asbirlea, llvm-commits, yaxunl Tags: #llvm Differential Revision: https://reviews.llvm.org/D62342 llvm-svn: 361777
*	[AMDGPU] Fix for the address sanitizer failure. Fixing typo	Alexander Timofeev	2019-05-27	1	-1/+1
\| \| \| \|	llvm-svn: 361776
*	Include what you use in AArch64AsmBackend.cpp	Dmitri Gribenko	2019-05-27	1	-1/+4
\| \| \| \| \| \| \| \| \| \|	AArch64AsmBackend.cpp was not using any APIs from AArch64.h, and was only including it for transitive dependencies. Doing so is problematic from an include-what-you-use perspective, but it is also a layering issue (it creates a dependency cycle between the primary AArch64 target library and the MCTargetDesc library). llvm-svn: 361774
*	[SelectionDAG] GetDemandedBits - add demanded elements wrapper implementation	Simon Pilgrim	2019-05-27	1	-1/+15
\| \| \| \| \| \|	The DemandedElts variable is pretty much inert at the moment - the original GetDemandedBits implementation calls it with an 'all ones' DemandedElts value so the function is active and behaves exactly as it used to. llvm-svn: 361773
*	[LLParser] Fix uninitialized flag variable warnings. NFCI.	Simon Pilgrim	2019-05-27	1	-2/+2
\| \| \| \| \| \|	Fixes a large number of warnings in the scan-build report on llvm builds. llvm-svn: 361772
*	[AMDGPU] Fix for the address sanitizer failure caused by the following ↵	Alexander Timofeev	2019-05-27	1	-1/+3
\| \| \| \| \| \| \| \|	commit: 1a8b2ea611cf4ca7cb09562e0238cfefa27c05b5 Divergence driven ISel. Assign register class for cross block values according to the divergence. llvm-svn: 361770
*	[AMDGPU][MC] Enabled constant expressions as operands of s_waitcnt	Dmitry Preobrazhensky	2019-05-27	1	-36/+28
\| \| \| \| \| \| \| \| \| \|	See bug 40820: https://bugs.llvm.org/show_bug.cgi?id=40820 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D61017 llvm-svn: 361763
*	[MustExecute] Improve MustExecute to correctly handle loop nest	Xing Xue	2019-05-27	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Given the loop nest `for.outer: br for.inner`; `for.inner: LI <loop invariant load instruction>`; `for.inner.latch: br for.inner, for.outer.latch`; `for.outer.latch: br for.outer, for.outer.exit`. LI is a loop-invariant load instruction that post-dominates for.outer, so LI should be able to move out of the loop nest. However, there is a bug in allLoopPathsLeadToBlock(). The current algorithm of allLoopPathsLeadToBlock(): 1. Get all the transitive predecessors of the basic block LI belongs to (for.inner) ==> for.outer, for.inner.latch. 2. If any successors of any of the predecessors are not for.inner or for.inner's predecessors, then return false. 3. Return true. Although for.inner.latch is for.inner's predecessor, for.inner dominates for.inner.latch, which means that if for.inner.latch is ever executed, for.inner must have been executed as well. It should not return false for cases like this. Author: Whitney (committed by xingxue) Reviewers: kbarton, jdoerfert, Meinersbur, hfinkel, fhahn Reviewed By: jdoerfert Subscribers: hiraditya, jsji, llvm-commits, etiotto, bmahjour Tags: #LLVM Differential Revision: https://reviews.llvm.org/D62418 llvm-svn: 361762
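The three steps in this message, and the effect of skipping predecessors that the block dominates, can be reproduced on the loop nest from the summary. The following is a rough model, not the LLVM implementation: the CFG is a plain successor map, `for.outer` is treated as the entry block, and every function name plus the `skip_dominated` switch is an assumption made for illustration.

```python
def predecessors_of(succs):
    """Invert a successor map into a predecessor map."""
    preds = {b: set() for b in succs}
    for block, targets in succs.items():
        for t in targets:
            preds[t].add(block)
    return preds


def dominators(succs, entry):
    """Textbook iterative dominator sets: dom[b] holds the blocks dominating b."""
    preds = predecessors_of(succs)
    dom = {b: set(succs) for b in succs}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for b in succs:
            if b == entry:
                continue
            incoming = [dom[p] for p in preds[b]]
            new = {b} | (set.intersection(*incoming) if incoming else set())
            if new != dom[b]:
                dom[b], changed = new, True
    return dom


def transitive_predecessors(succs, header, bb):
    """Step 1: predecessors of bb, without walking past the loop header."""
    preds = predecessors_of(succs)
    seen, work = set(), [bb]
    while work:
        for p in preds[work.pop()]:
            if p not in seen:
                seen.add(p)
                if p != header:
                    work.append(p)
    seen.discard(bb)
    return seen


def all_loop_paths_lead_to_block(succs, header, bb, skip_dominated):
    """Steps 2 and 3; skip_dominated=True models the fix described above."""
    dom = dominators(succs, header)
    preds = transitive_predecessors(succs, header, bb)
    for p in preds:
        if skip_dominated and bb in dom[p]:
            continue  # bb dominates p, so bb already ran whenever p runs
        if any(s != bb and s not in preds for s in succs[p]):
            return False
    return True


CFG = {
    "for.outer": ["for.inner"],
    "for.inner": ["for.inner.latch"],
    "for.inner.latch": ["for.inner", "for.outer.latch"],
    "for.outer.latch": ["for.outer", "for.outer.exit"],
    "for.outer.exit": [],
}
print(all_loop_paths_lead_to_block(CFG, "for.outer", "for.inner", False))
print(all_loop_paths_lead_to_block(CFG, "for.outer", "for.inner", True))
```

Without the dominance check, `for.inner.latch` has a successor (`for.outer.latch`) outside the predecessor set, so the query fails; with it, the latch is skipped and the query succeeds.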
*	Test commit (NFC)	Nikola Prica	2019-05-27	1	-0/+1
\| \| \| \| \| \|	Add blank line. llvm-svn: 361761
*	[ARM GlobalISel] Cleanup CallLowering a bit	Diana Picus	2019-05-27	2	-22/+13
\| \| \| \| \| \| \|	We never actually use the Offsets produced by ComputeValueVTs, so remove them until we need them. llvm-svn: 361755
*	Revert r361356: "[MIR] Add simple PRE pass to MachineCSE"	David L. Jones	2019-05-27	1	-113/+9
\| \| \| \| \| \| \| \|	This is problematic on buildbots, as discussed here: https://reviews.llvm.org/rL361356 It seems like the plan already was to revert, but that hasn't happened yet. llvm-svn: 361746
*	llvm-undname: Make demangling of MD5 names more robust	Nico Weber	2019-05-27	1	-3/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Demangler::parse() for MD5 names would: 1. Put all remaining text into the MD5 name sight unseen 2. Not modify MangledName This meant that if the demangler recursively called parse() (e.g. in demangleLocallyScopedNamePiece()), every recursive call that started on an MD5 name would add all remaining bytes to the output buffer but only advance the input by a byte. For valid inputs, MD5 types are never (well, see comments for 2 exceptions) nested, but for invalid input this could cause memory use quadratic in the input size. llvm-svn: 361744
*	[LoopInterchange] Fix handling of LCSSA nodes defined in headers and latches.	Florian Hahn	2019-05-26	1	-22/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code to preserve LCSSA PHIs currently only properly supports reduction PHIs and PHIs for values defined outside the latches. This patch improves the LCSSA PHI handling to cover PHIs for values defined in the latches. Fixes PR41725. Reviewers: efriedma, mcrosier, davide, jdoerfert Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D61576 llvm-svn: 361743
*	[BPF] generate R_BPF_NONE relocation for BTF DataSec variables	Yonghong Song	2019-05-26	1	-10/+22
\| \| \| \| \| \| \| \| \| \| \|	The variables in BTF DataSec type encode in-section offset. R_BPF_NONE should be generated instead of R_BPF_64_32. Signed-off-by: Yonghong Song <yhs@fb.com> Differential Revision: https://reviews.llvm.org/D62460 llvm-svn: 361742
*	[AMDGPU] Divergence driven ISel. Assign register class for cross block ↵	Alexander Timofeev	2019-05-26	13	-135/+213
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	values according to the divergence. Details: To make instruction selection really divergence driven it is necessary to assign the correct register classes to the cross block values beforehand. On divergent targets, the same value type requires different register classes depending on the value's divergence. Reviewers: rampitec, nhaehnle Differential Revision: https://reviews.llvm.org/D59990 This commit was reverted because of the build failure. The reason was a malformed patch. Build failure fixed. llvm-svn: 361741
*	[MCA][Scheduler] Improved critical memory dependency computation.	Andrea Di Biagio	2019-05-26	1	-6/+16
\| \| \| \| \| \| \| \|	This fixes a problem where back-pressure increases caused by register dependencies were not correctly notified if execution was also delayed by memory dependencies. llvm-svn: 361740
*	[SelectionDAG] GetDemandedBits - cleanup to more closely match ↵	Simon Pilgrim	2019-05-26	1	-16/+21
\| \| \| \| \| \| \| \|	SimplifyDemandedBits. NFCI. Prep work before adding demanded elts support. llvm-svn: 361739
*	[SelectionDAG] MaskedValueIsZero - add demanded elements implementation	Simon Pilgrim	2019-05-26	1	-2/+15
\| \| \| \| \| \|	Will be used in an upcoming patch but I've updated the original implementation to call this to ensure test coverage. llvm-svn: 361738
*	[MCA] Refactor the logic that computes the critical memory dependency info. NFCI	Andrea Di Biagio	2019-05-26	3	-25/+74
\| \| \| \| \| \| \| \|	CriticalRegDep has been renamed CriticalDependency, and it is now used by class Instruction to store information about the critical register dependency and the critical memory dependency. No functional change intended. llvm-svn: 361737
*	[SimplifyCFG] back out all SwitchInst commits	Shawn Landden	2019-05-26	2	-91/+72
\| \| \| \| \| \| \| \|	They caused the sanitizer builds to fail. My suspicion is the change to countLeadingZeros(). llvm-svn: 361736
*	[X86][SSE] Add shuffle combining support for ISD::ANY_EXTEND_VECTOR_INREG	Simon Pilgrim	2019-05-26	4	-13/+23
\| \| \| \| \| \|	Reuses what we already have in place for ISD::ZERO_EXTEND_VECTOR_INREG just with a different sentinel llvm-svn: 361734
*	Revert rL361731 : [LLParser] Fix uninitialized variable warnings. NFCI.	Simon Pilgrim	2019-05-26	1	-3/+3
\| \| \| \| \| \| \| \|	These 3 variables cause quite a few warnings in the scan-build report on llvm. ........ Revert accidental commit. llvm-svn: 361732
*	[LLParser] Fix uninitialized variable warnings. NFCI.	Simon Pilgrim	2019-05-26	1	-3/+3
\| \| \| \| \| \|	These 3 variables cause quite a few warnings in the scan-build report on llvm. llvm-svn: 361731
*	[InstCombine] prevent crashing with invalid extractelement index	Sanjay Patel	2019-05-26	1	-2/+3
\| \| \| \| \| \| \|	This was found/reduced from a fuzzer report: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=14956 llvm-svn: 361729
*	[SimplifyCFG] ReduceSwitchRange: Improve on the case where the SubThreshold ↵	Shawn Landden	2019-05-26	1	-14/+24
\| \| \| \| \| \|	doesn't trigger llvm-svn: 361728
*	[SimplifyCFG] Run ReduceSwitchRange unconditionally, generalize	Shawn Landden	2019-05-26	1	-56/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than gating on "isSwitchDense" (resulting in necessarily sparse lookup tables even when they were generated), always run this quite cheap transform. This transform is useful not just for generating tables. LowerSwitch also wants this: read LowerSwitch.cpp:257. Be careful to not generate worse code, by introducing a SubThreshold heuristic. Instead of just sorting by signed, generalize the finding of the best base. And now that it is run unconditionally, do not replicate its functionality in SwitchToLookupTable (which could use a Sub when having a hole is smaller, hence the SubThreshold heuristic located in a single place). This simplifies SwitchToLookupTable, and fixes some ugly corner cases due to the use of signed numbers, such as a table containing i16 32768 and 32769, of which 32769 would be interpreted as -32768, and now the code thinks the table is size 65536. (We still use unconditional subtraction when building a single-register mask, but I think this whole block should go when the more general sparse map is added, which doesn't leave empty holes in the table.) And the reason test4 and test5 did not trigger was documented wrong: it was because they were not considered sufficiently "dense". Also, fix generation of invalid LLVM-IR: shl by bit-width. llvm-svn: 361727
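The kind of signed-number corner case mentioned here can be worked through numerically. The sketch below is only arithmetic, not SimplifyCFG code; the helper names are hypothetical, and the case values 32767 and 32768 are picked for illustration because they straddle the i16 sign boundary (they are not the values named in the message).

```python
def as_signed(value, bits):
    """Reinterpret an unsigned `bits`-wide value as two's complement."""
    sign_bit = 1 << (bits - 1)
    return value - (1 << bits) if value & sign_bit else value


def table_span(values):
    """Number of table slots needed to cover min..max inclusive."""
    return max(values) - min(values) + 1


cases = [32767, 32768]  # adjacent when read as unsigned i16
unsigned_span = table_span(cases)
signed_span = table_span([as_signed(v, 16) for v in cases])
print(unsigned_span, signed_span)
```

Read as unsigned, the two cases are neighbours and need two slots; read as signed, 32768 becomes -32768 and the apparent table covers the whole 16-bit range.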
*	[SimplifyCFG] NFC, remove GCD that was only used for powers of two	Shawn Landden	2019-05-26	1	-12/+10
\| \| \| \| \| \| \| \| \| \|	and replace with an equivalent countTrailingZeros. GCD is much more expensive than this, with repeated division. This depends on D60823 llvm-svn: 361726
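Why a trailing-zero count can stand in for a GCD when only its power-of-two part matters: the largest power of two dividing every value is `1 << ctz(v1 | v2 | ...)`. The sketch below checks that identity; the function names and the 64-bit default width are assumptions for illustration, not the names used in SimplifyCFG.

```python
from functools import reduce
from math import gcd


def count_trailing_zeros(x, width=64):
    """Number of low zero bits in x; `width` is returned for x == 0."""
    if x == 0:
        return width
    return (x & -x).bit_length() - 1


def common_shift(values):
    """Largest s such that every value is divisible by 2**s."""
    return count_trailing_zeros(reduce(lambda a, b: a | b, values, 0))


def gcd_shift(values):
    """Same quantity via the GCD, which needs repeated division."""
    return count_trailing_zeros(reduce(gcd, values))


for vals in ([8, 24, 40], [12, 20], [6, 18]):
    print(vals, common_shift(vals), gcd_shift(vals))
```

The OR-based version needs one pass of bitwise operations and a single bit count, which matches the cost argument made in the message.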
*	[Support] make countLeadingZeros() and countTrailingZeros() return unsigned	Shawn Landden	2019-05-26	2	-12/+13
\| \| \| \| \| \| \| \| \|	This matches countLeadingOnes() and countTrailingOnes(), and APInt's countLeadingZeros() and countTrailingZeros(). (as well as __builtin_clzll()) llvm-svn: 361724
*	[ValueTracking] Base computeOverflowForUnsignedMul() on ConstantRange code; NFCI	Nikita Popov	2019-05-26	1	-68/+34
\| \| \| \| \| \| \| \|	The implementations in ValueTracking and ConstantRange are equally powerful; reuse the one in ConstantRange, which will make this easier to extend. llvm-svn: 361723
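A range-based unsigned multiply overflow check can be sketched as follows. This is a simplified model under stated assumptions: ranges are non-wrapping inclusive `(lo, hi)` pairs, and the function name and string results are invented here rather than taken from ConstantRange.

```python
def unsigned_mul_overflow(lhs, rhs, bits):
    """Classify overflow of lhs * rhs for inclusive (lo, hi) unsigned ranges."""
    limit = (1 << bits) - 1
    lhs_lo, lhs_hi = lhs
    rhs_lo, rhs_hi = rhs
    if lhs_lo * rhs_lo > limit:
        return "always"  # even the smallest product does not fit
    if lhs_hi * rhs_hi > limit:
        return "may"     # only some products fit
    return "never"       # the largest product fits


print(unsigned_mul_overflow((0, 15), (0, 15), 8))
print(unsigned_mul_overflow((0, 16), (0, 16), 8))
print(unsigned_mul_overflow((16, 32), (16, 32), 8))
```

Because unsigned multiplication is monotonic in both operands, checking the two corner products is enough to classify the whole range.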