bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix MachineInstr::findRegisterUseOperandIdx subreg checks	Stanislav Mekhanoshin	2018-11-12	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The function only checks that the instruction reads a super-register containing the requested physical register. If a sub-register is being read, that is also a use of the super-register, so this adds the missing check. In particular, MI->readsRegister() is broken because of the missing check. The resulting check is essentially regsOverlap(). Differential Revision: https://reviews.llvm.org/D54128 llvm-svn: 346686
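The difference between the old containment test and an overlap test can be shown with a toy model. This is only a sketch under stated assumptions: `ToyReg`, `toyRegsOverlap` and `toyIsSuperOrSelf` are hypothetical names, and registers are modelled as bit masks of the units they cover. None of this is LLVM's actual register API.

```cpp
#include <cstdint>

// Hypothetical toy model: a physical register is a bit mask of the
// register units it covers. This is not LLVM's TargetRegisterInfo.
struct ToyReg {
  std::uint32_t Units;
};

// Narrower condition: A must fully contain B (A is B or a super-register
// of B). A read of a sub-register of B is missed by this test.
inline bool toyIsSuperOrSelf(ToyReg A, ToyReg B) {
  return (A.Units & B.Units) == B.Units;
}

// Overlap condition: A and B share at least one unit, so reads of
// super-registers and of sub-registers are both detected.
inline bool toyRegsOverlap(ToyReg A, ToyReg B) {
  return (A.Units & B.Units) != 0;
}
```

With a wide register covering units `0x3` and its low half covering `0x1`, a read of the low half fails the containment test against the wide register but passes the overlap test.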
*	[CostModel][X86] Add SHLD/SHRD scalar funnel shift costs	Simon Pilgrim	2018-11-12	1	-2/+11
\| \| \| \| \| \|	The costs match the typical reg-reg cases - the RMW case can be a lot slower but we don't model that at this level llvm-svn: 346683
*	[MachineOutliner][NFC] Early exit pruning when candidates don't share an MBB	Jessica Paquette	2018-11-12	1	-0/+8
\| \| \| \| \| \| \| \| \| \|	There's no way they can overlap in this case. This can save a few iterations when the candidate is close to the beginning of a MachineBasicBlock. It's particularly useful when the average length of a MachineBasicBlock in the program is small. llvm-svn: 346682
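The early exit can be sketched as follows. The names `ToyCandidate` and `toyCountOverlapsBefore` are hypothetical, and the sketch assumes candidates are sorted by start index and that each block occupies its own contiguous range of indices. The outliner's real data structures are not reproduced here.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical candidate: the block it lives in plus an inclusive index range.
struct ToyCandidate {
  int BlockId;
  unsigned Start;
  unsigned End;
};

// Count earlier candidates that overlap Cands[I], walking backwards.
// Assumption: Cands is sorted by Start and blocks use disjoint index ranges,
// so once an earlier candidate sits in another block, nothing before it
// can overlap and the walk stops.
inline unsigned toyCountOverlapsBefore(const std::vector<ToyCandidate> &Cands,
                                       std::size_t I) {
  unsigned Count = 0;
  for (std::size_t J = I; J-- > 0;) {
    if (Cands[J].BlockId != Cands[I].BlockId)
      break; // early exit: different block, no overlap possible
    if (Cands[J].End >= Cands[I].Start)
      ++Count;
  }
  return Count;
}
```

The saving is largest for a candidate near the start of its block, since the walk stops after very few steps.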
*	[MachineOutliner][NFC] Put suffix tree in buildCandidateList	Jessica Paquette	2018-11-12	1	-6/+5
\| \| \| \| \| \|	It's only used there, so it doesn't make much sense to have it in runOnModule. llvm-svn: 346681
*	[DWARFv5] Emit split type units in .debug_info.dwo.	Paul Robinson	2018-11-12	1	-4/+7
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D54350 llvm-svn: 346674
*	[CostModel][X86] SK_ExtractSubvector is cheap if the (legal) subvector is ↵	Simon Pilgrim	2018-11-12	1	-5/+13
\| \| \| \| \| \|	aligned within the source vector llvm-svn: 346664
*	[SystemZ::TTI] Improve accuracy of costs for vector fp <-> int conversions	Jonas Paulsson	2018-11-12	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Improve getCastInstrCost() by respecting the different types of Src and Dst for vector integer <-> fp conversions. This means that extracting from integer becomes more expensive (by the extraction penalty), and the extraction from fp becomes cheaper (no longer has a false extraction penalty). Review: Ulrich Weigand https://reviews.llvm.org/D54423 llvm-svn: 346663
*	[VectorUtils] add funnel-shifts to the list of vectorizable intrinsics	Sanjay Patel	2018-11-12	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This just identifies the intrinsics as candidates for vectorization. It does not mean we will attempt to vectorize under normal conditions (the test file is forcing vectorization). The cost model must be fixed to show that the transform is profitable in general. Allowing vectorization with these intrinsics is required to avoid potential regressions from canonicalizing to the intrinsics from generic IR: https://bugs.llvm.org/show_bug.cgi?id=37417 llvm-svn: 346661
*	[VectorUtils] reorder list of vectorizable intrinsics; NFC	Sanjay Patel	2018-11-12	1	-10/+9
\| \| \| \| \| \| \|	We need to add funnel-shifts to this list, so clean up the random order before it gets worse. llvm-svn: 346660
*	[CostModel] Add more realistic SK_ExtractSubvector generic costs.	Simon Pilgrim	2018-11-12	1	-1/+2
\| \| \| \| \| \| \| \|	Instead of defaulting to a cost = 1, expand to element extract/insert like we do for other shuffles. This exposes an issue in LoopVectorize which could call SK_ExtractSubvector with a scalar subvector type. llvm-svn: 346656
*	[RISCV] Support .option relax and .option norelax	Alex Bradbury	2018-11-12	7	-99/+177
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This extends the .option support from D45864 to enable/disable the relax feature flag from D44886. During parsing of the relax/norelax directives, the RISCV::FeatureRelax feature bits of the SubtargetInfo stored in the AsmParser are updated appropriately to reflect whether relaxation is currently enabled in the parser. When an instruction is parsed, the parser checks if relaxation is currently enabled and if so, gets a handle to the AsmBackend and sets the ForceRelocs flag. The AsmBackend uses a combination of the original RISCV::FeatureRelax feature bits set by e.g. -mattr=+/-relax and the ForceRelocs flag to determine whether to emit relocations for symbol and branch diffs. Diff relocations should therefore be omitted only if the relax flag was not set on the command line and no instruction was ever parsed in a section with relaxation enabled; this ensures correct diffs are emitted. Differential Revision: https://reviews.llvm.org/D46423 Patch by Lewis Revill. llvm-svn: 346655
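The interaction between the command-line flag, the directives and the sticky ForceRelocs flag can be modelled in a few lines. This is a sketch only: `ToyBackendState`, `ToyParser` and `toyShouldEmitDiffRelocs` are hypothetical stand-ins and do not mirror the real RISCVAsmParser or RISCVAsmBackend classes.

```cpp
#include <string>

// Hypothetical stand-in for the state the backend consults.
struct ToyBackendState {
  bool RelaxOnCommandLine = false; // models -mattr=+relax
  bool ForceRelocs = false;        // sticky: set once, never cleared
};

// Hypothetical stand-in for the parser's view of the relax feature.
struct ToyParser {
  ToyBackendState &Backend;
  bool RelaxEnabled;

  explicit ToyParser(ToyBackendState &B)
      : Backend(B), RelaxEnabled(B.RelaxOnCommandLine) {}

  // Models ".option relax" / ".option norelax".
  void option(const std::string &Name) {
    if (Name == "relax")
      RelaxEnabled = true;
    else if (Name == "norelax")
      RelaxEnabled = false;
  }

  // Any instruction parsed while relaxation is on forces relocations.
  void parseInstruction() {
    if (RelaxEnabled)
      Backend.ForceRelocs = true;
  }
};

// Diff relocations are skipped only if neither source ever asked for relax.
inline bool toyShouldEmitDiffRelocs(const ToyBackendState &S) {
  return S.RelaxOnCommandLine || S.ForceRelocs;
}
```

In this model, a later `.option norelax` does not undo the effect of an instruction already parsed with relaxation enabled, which matches the "ever parsed" wording in the message.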
*	[DAGCombiner] Fix load-store forwarding of indexed loads.	Nirav Dave	2018-11-12	1	-3/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Handle extra output from indexed loads in cases where we wish to forward a load value directly from a preceding store. Fixes PR39571. Reviewers: peter.smith, rengolin Subscribers: javed.absar, hiraditya, arphaman, llvm-commits Differential Revision: https://reviews.llvm.org/D54265 llvm-svn: 346654
*	Add an OptimizerLast EP	Philip Pfaffe	2018-11-12	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: It turns out that we need an OptimizerLast PassBuilder extension point after all. I missed the relevance of this EP the first time. By legacy PM magic, function passes added at this EP get added to the last _Function_ PM, which is a feature we lost when dropping this EP for the new PM. A key difference between this and the legacy PassManager's OptimizerLast callback is that this extension point is not triggered at O0. Extensions to the O0 pipeline should append their passes to the end of the overall pipeline. Differential Revision: https://reviews.llvm.org/D54374 llvm-svn: 346645
*	[LICM] Hoist guards from non-header blocks	Max Kazantsev	2018-11-12	3	-11/+39
\| \| \| \| \| \| \| \| \| \| \|	This patch relaxes overconservative checks on whether or not we could write memory before we execute an instruction. This allows us to hoist guards out of loops even if they are not in the header block. Differential Revision: https://reviews.llvm.org/D50891 Reviewed By: fedor.sergeev llvm-svn: 346643
*	[GCOV] Add options to filter files which must be instrumented.	Calixte Denizet	2018-11-12	1	-2/+82
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When collecting code coverage, many files (such as those coming from /usr/include) are removed when post-processing gcno/gcda, so they don't need to be instrumented or to appear in gcno/gcda at all. The goal of the patch is to be able to filter the files we want to instrument. There are several advantages to doing that: - improve speed (no instrumentation overhead on files we don't care about) - reduce gcno/gcda size - it gives the possibility to easily instrument only a few files (e.g. ones modified in a patch) without changing the build system - need to accept this patch to be enabled in clang: https://reviews.llvm.org/D52034 Reviewers: marco-c, vsk Reviewed By: marco-c Subscribers: llvm-commits, sylvestre.ledru Differential Revision: https://reviews.llvm.org/D52033 llvm-svn: 346641
*	[SystemZ] Replicate the load with most uses in buildVector()	Jonas Paulsson	2018-11-12	1	-8/+11
\| \| \| \| \| \| \| \| \| \| \|	Iterate over all elements and count the number of uses among them for each used load. Then make sure to REPLICATE the load which has the most uses in order to minimize the number of needed element insertions. Review: Ulrich Weigand https://reviews.llvm.org/D54322 llvm-svn: 346637
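The counting step described above can be sketched as follows. `toyPickLoadToReplicate` is a hypothetical helper, and loads are represented by plain integer ids with -1 meaning "this element is not fed by a load". The actual SystemZ lowering works on SelectionDAG nodes, which this sketch does not model.

```cpp
#include <map>
#include <vector>

// Given the load id feeding each vector element (-1 if the element is not a
// load), return the id used by the most elements, or -1 if there is none.
// Replicating that load leaves the fewest elements to insert afterwards.
inline int toyPickLoadToReplicate(const std::vector<int> &ElementLoadIds) {
  std::map<int, unsigned> UseCounts;
  for (int Id : ElementLoadIds)
    if (Id >= 0)
      ++UseCounts[Id];

  int Best = -1;
  unsigned BestCount = 0;
  for (const auto &Entry : UseCounts) {
    if (Entry.second > BestCount) {
      Best = Entry.first;
      BestCount = Entry.second;
    }
  }
  return Best;
}
```

For a four-element vector whose elements come from loads 7, 3, 7 and a non-load value, the helper picks load 7, so only two further element insertions would be needed in this model.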
*	[GC] Remove unused configuration variable	Philip Reames	2018-11-12	1	-6/+1
\| \| \| \| \| \|	The custom root mechanism didn't actually do anything. ShadowStackGC, the only one which used it, just removed the gcroots before they reached the normal lowering in SelectionDAG. As a result, the state flag had no value. llvm-svn: 346632
*	[GC] Minor style modernization	Philip Reames	2018-11-12	1	-44/+43
\| \| \| \|	llvm-svn: 346631
*	[GCRoot] Remove some unnecessary complexity	Philip Reames	2018-11-11	2	-50/+33
\| \| \| \| \| \| \| \| \|	The GCStrategy provides three configuration options which are largely redundant. 1) Support for conditionally lowering gcread and gcwrite to loads and stores. This is redundant since any GC which wished to use these abstractions would lower them out of existence before the built-in lowering anyway. As such, there's no need for the lowering to be conditional. 2) Conditional initialization for allocas marked via gcroot. Semantically, roots have to be initialized before first potential use. Arguably, the frontend really should have responsibility for that, but the old API allowed the frontend to ignore this detail. Only one builtin GC used the non-initializing mode. Since no one to my knowledge actually uses the ErlangGC strategy, I decided the slight pessimization was worth the simplicity. If that turns out to be problematic, we can always improve the insertion algorithm to detect more existing initializing stores. llvm-svn: 346621
*	[IPSCCP,PM] Preserve PDT in the new pass manager.	Florian Hahn	2018-11-11	2	-7/+8
\| \| \| \| \| \| \| \| \| \|	Reviewers: kuhar, chandlerc, NutshellySima, brzycki Reviewed By: NutshellySima, brzycki Differential Revision: https://reviews.llvm.org/D54317 llvm-svn: 346618
*	[DWARF] Change pubnames to use DWARFSection instead of StringRef	Fangrui Song	2018-11-11	2	-25/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The debug_info_offset values in .debug_{,gnu_}pub{name,types} may be relocated. Change it to DWARFSection so that we can get relocated values. Reviewers: ruiu, dblaikie, grimar, JDevlieghere Reviewed By: JDevlieghere Subscribers: aprantl, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D54375 llvm-svn: 346615
*	Make initializeOutputStream() return false on error and true on success.	Nico Weber	2018-11-11	2	-10/+10
\| \| \| \| \| \| \| \|	As discussed in https://reviews.llvm.org/D52104 Differential Revision: https://reviews.llvm.org/D52143 llvm-svn: 346606
*	[X86] Use DAG.getConstant instead of getZeroVector.	Craig Topper	2018-11-11	1	-1/+1
\| \| \| \|	llvm-svn: 346605
*	[Support] Make error banner optional in logAllUnhandledErrors	Jonas Devlieghere	2018-11-11	8	-12/+12
\| \| \| \| \| \| \| \|	In a lot of places an empty string was passed as the ErrorBanner to logAllUnhandledErrors. This patch makes that argument optional to simplify the call sites. llvm-svn: 346604
*	[X86] Replace calls to getOnesVector/getZeroVector with getConstant.	Craig Topper	2018-11-11	1	-2/+2
\| \| \| \| \| \|	getConstant will create a BUILD_VECTOR for us and use a legal type if necessary. So just create the simple node and let BUILD_VECTOR legalization do the canonicalization. llvm-svn: 346603
*	[DAGCombiner] Make tryToFoldExtendOfConstant return an SDValue instead of an ↵	Craig Topper	2018-11-10	1	-14/+14
\| \| \| \| \| \| \| \|	SDNode*. NFC Removes the need to call getNode internally and to recreate an SDValue after the call. llvm-svn: 346600
*	[InstCombine] simplify code for merging stores; NFCI	Sanjay Patel	2018-11-10	2	-51/+28
\| \| \| \|	llvm-svn: 346596
*	[x86] allow vector load narrowing with multi-use values	Sanjay Patel	2018-11-10	4	-6/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a long-awaited follow-up suggested in D33578. Since then, we've picked up even more opportunities for vector narrowing from changes like D53784, so there are a lot of test diffs. Apart from 2-3 strange cases, these are all wins. I've structured this to be no-functional-change-intended for any target except for x86 because I couldn't tell if AArch64, ARM, and AMDGPU would improve or not. All of those targets have existing regression tests (4, 4, 10 files respectively) that would be affected. Also, Hexagon overrides the shouldReduceLoadWidth() hook, but doesn't show any regression test diffs. The trade-off is deciding if an extra vector load is better than a single wide load + extract_subvector. For x86, this is almost always better (on paper at least) because we often can fold loads into subsequent ops and not increase the official instruction count. There's also some unknown -- but potentially large -- benefit from using narrower vector ops if wide ops are implemented with multiple uops and/or frequency throttling is avoided. Differential Revision: https://reviews.llvm.org/D54073 llvm-svn: 346595
*	[X86] Remove unused variable	Benjamin Kramer	2018-11-10	1	-1/+0
\| \| \| \|	llvm-svn: 346592
*	[X86] Remove apparently unneeded code from combineVSZext.	Craig Topper	2018-11-10	1	-50/+0
\| \| \| \| \| \| \| \|	No lit tests fail with this code removed. This is a pre-commit for D54346. llvm-svn: 346590
*	[CostModel][X86] SK_ExtractSubvector costs must only be tested for vector ↵	Simon Pilgrim	2018-11-10	1	-1/+1
\| \| \| \| \| \|	types (PR39615) llvm-svn: 346589
*	[GC] Rename a header for consistency	Philip Reames	2018-11-10	3	-3/+3
\| \| \| \|	llvm-svn: 346588
*	[X86][BdVer2] Fix loads/stores throughput for Piledriver (PR39465)	Roman Lebedev	2018-11-10	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	There are two AGU units, and per 1cy, there can be either two loads, or a load and a store; but not two stores, or two loads and a store. Additionally, loads shouldn't affect the store scheduler and vice versa. (but should affect the PdEX scheduler.) Required rL346545. Fixes https://bugs.llvm.org/show_bug.cgi?id=39465 llvm-svn: 346587
*	[ThinLTO] Internalize readonly globals	Eugene Leviant	2018-11-10	10	-48/+289
\| \| \| \| \| \| \| \| \|	This patch allows internalising globals if all accesses to them (from live functions) are from non-volatile load instructions Differential revision: https://reviews.llvm.org/D49362 llvm-svn: 346584
*	[X86] Use a MOVSX instruction instead of a MOVZX instruction in isel for an ↵	Craig Topper	2018-11-10	1	-0/+9
\| \| \| \| \| \| \| \|	any_extend of the remainder from an 8-bit sdivrem. The sdivrem will emit its own MOVSX to move %ah to the low byte of a register. By using a MOVSX for an any_extend this allows a post-isel peephole to merge them. llvm-svn: 346581
*	Fix DragonFlyBSD build	David Carlier	2018-11-10	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	Reviewers: rnk, thakis Reviewed By: krytarowski Differential Revision: https://reviews.llvm.org/D54363 llvm-svn: 346577
*	RegAllocFast: Further cleanups; NFC	Matthias Braun	2018-11-10	1	-210/+217
\| \| \| \|	llvm-svn: 346576
*	[X86] In LowerHorizontalByteSum, emit vector_shuffle nodes instead of ↵	Craig Topper	2018-11-10	1	-5/+5
\| \| \| \| \| \| \| \| \| \|	directly using X86ISD::UNPCKL/X86ISD::UNPCKH. This gives shuffle lowering the freedom to use zero_extend_vector_inreg for the unpckl shuffle. Shuffle combining usually makes this swap later, but apparently not when AVX512 is enabled. While there, also use DAG.getConstant to create a 0 vector instead of using the helper that forces a specific BUILD_VECTOR. I don't think that helper is usually needed. We're basically free to create a constant build_vector anytime and it will be legalized on its own. llvm-svn: 346574
*	[WebAssembly] Update bleeding-edge cpu features	Thomas Lively	2018-11-10	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D54362 llvm-svn: 346570
*	[GC] Simplify linking of GC builtin GC strategies	Philip Reames	2018-11-09	1	-6/+2
\| \| \| \|	llvm-svn: 346569
*	[ARM64] [Windows] Handle funclets	Eli Friedman	2018-11-09	9	-20/+243
\| \| \| \| \| \| \| \| \| \| \| \|	This patch adds support for funclets in frame lowering and ISel lowering. Together with D50288 and D50166, it enables C++ exception handling. Patch by Sanjin Sijaric, with some fixes by me. Differential Revision: https://reviews.llvm.org/D51524 llvm-svn: 346568
*	[SelectionDAG] Fix a -Wparentheses warning from gcc in an assert. NFC	Craig Topper	2018-11-09	1	-2/+2
\| \| \| \| \| \|	gcc wants parentheses around the logical OR since there is a logical AND for the string. llvm-svn: 346564
*	[ARM] Add MemOperand to LDRcp to enable DCE.	Eli Friedman	2018-11-09	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \|	LDRcp should be deleted during register coalescing when the dest register is dead. Without a MemOp, a dead LDRcp leaves behind a dead constant pool value that references a non-existent label. Patch by Yin Ma. Differential Revision: https://reviews.llvm.org/D54173 llvm-svn: 346563
*	[JumpThreading] Fix exponential time algorithm computing known values.	Eli Friedman	2018-11-09	1	-19/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ComputeValueKnownInPredecessors has a "visited" set to prevent infinite loops, since a value can be visited more than once. However, the implementation didn't prevent the algorithm from taking exponential time. Instead of removing elements from the RecursionSet one at a time, we should keep around the whole set until ComputeValueKnownInPredecessors finishes, then discard it. The testcase is synthetic because I was having trouble effectively reducing the original. But it's basically the same idea. Instead of failing, we could theoretically cache the result instead. But I don't think it would help substantially in practice. Differential Revision: https://reviews.llvm.org/D54239 llvm-svn: 346562
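The effect of erasing set entries one at a time versus keeping the whole set can be reproduced on a toy graph. This is a sketch under stated assumptions: `ToyGraph`, `toyDiamondChain`, `toyVisitErasing` and `toyVisitKeeping` are hypothetical names, and the traversal only counts visits rather than computing known values as ComputeValueKnownInPredecessors does.

```cpp
#include <set>
#include <vector>

// Hypothetical value graph: G[N] lists the operands of node N.
using ToyGraph = std::vector<std::vector<int>>;

// Chain of Depth diamonds: each top node has two operands, and both of
// them lead to the next top node.
inline ToyGraph toyDiamondChain(int Depth) {
  ToyGraph G(3 * Depth + 1);
  for (int K = 0; K < Depth; ++K) {
    G[3 * K] = {3 * K + 1, 3 * K + 2};
    G[3 * K + 1] = {3 * K + 3};
    G[3 * K + 2] = {3 * K + 3};
  }
  return G;
}

// Old shape: a node leaves the set when its recursion unwinds. This stops
// infinite loops, but shared nodes are re-explored on every path.
inline unsigned long toyVisitErasing(const ToyGraph &G, int N,
                                     std::set<int> &Active) {
  if (!Active.insert(N).second)
    return 0;
  unsigned long Visits = 1;
  for (int Op : G[N])
    Visits += toyVisitErasing(G, Op, Active);
  Active.erase(N);
  return Visits;
}

// New shape: the set lives for the whole query, so each node is explored
// at most once; a repeat visit simply gives up on that node.
inline unsigned long toyVisitKeeping(const ToyGraph &G, int N,
                                     std::set<int> &Seen) {
  if (!Seen.insert(N).second)
    return 0;
  unsigned long Visits = 1;
  for (int Op : G[N])
    Visits += toyVisitKeeping(G, Op, Seen);
  return Visits;
}
```

On a chain of D diamonds the erasing variant makes 2^(D+2) - 3 visits, while the keeping variant makes 3D + 1, one per node. The keeping variant gives up on a repeat visit rather than caching a result, which corresponds to the "instead of failing, we could cache" remark in the message.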
*	[X86] Move the promotion of v16i16->v16i8 for avx512f but not avx512bw from ↵	Craig Topper	2018-11-09	2	-8/+18
\| \| \| \| \| \| \| \| \| \|	lowering to isel. Change to use vpmovzx instead of vpmovsx. With avx512f but not avx512bw we need to extend to v16i32 then truncate that to v16i8. Previously we emitted both nodes during lowering, but I'm trying to switch to using target independent nodes and with that switched the extend+truncate wou This patch changes the implementation to what will be necessary with that patch which helps minimize test diffs. llvm-svn: 346552
*	[AArch64] Support HiSilicon's TSV110 processor	Bryan Chan	2018-11-09	4	-1/+35
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: t.p.northover, SjoerdMeijer, kristof.beyls Reviewed By: kristof.beyls Subscribers: olista01, javed.absar, kristof.beyls, kristina, llvm-commits Differential Revision: https://reviews.llvm.org/D53908 llvm-svn: 346546
*	[MS demangler] Use a slightly shorter unmangling for mangled strings.	Nico Weber	2018-11-09	1	-5/+4
\| \| \| \| \| \| \| \| \| \|	Before: const wchar_t * {L"%"} Now: L"%" See also PR39593. Differential Revision: https://reviews.llvm.org/D54294 llvm-svn: 346544
*	[Hexagon] Fix some -Wunused-function with LLVM_DUMP_METHOD and -Wunused-variable	Fangrui Song	2018-11-09	2	-4/+9
\| \| \| \|	llvm-svn: 346543
*	[DWARFv5] Emit normal type units in .debug_info comdats.	Paul Robinson	2018-11-09	2	-5/+11
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D54282 llvm-svn: 346540
*	[X86] Turn X86ISD::VSEXT into X86ISD::VZEXT if the upper bits aren't demanded.	Craig Topper	2018-11-09	1	-0/+12
\| \| \| \| \| \| \| \|	This makes X86ISD::VSEXT more similar to ISD::SIGN_EXTEND and ISD::ZERO_EXTEND. I'm hoping to replace X86ISD::VSEXT/VZEXT with target independent nodes. Making the target specific nodes similar to the target independent nodes helps minimize test diffs in that patch. llvm-svn: 346539