bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[DAGCombine] Move load checks on store of loads into candidate	Nirav Dave	2018-05-15	1	-15/+13
\| \| \| \| \| \| \| \| \| \|	search. NFCI. Migrate single-use and non-volatility, non-indexed requirements on stores of immediate store values to candidate collection pass from later stage. llvm-svn: 332392
*	Nios2: Unbreak build.	Peter Collingbourne	2018-05-15	2	-5/+6
\| \| \| \|	llvm-svn: 332391
*	[x86][eflags] Fix PR37431 by teaching the EFLAGS copy lowering to	Chandler Carruth	2018-05-15	1	-3/+142
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	specially handle SETB_C* pseudo instructions. Summary: While the logic here is somewhat similar to the arithmetic lowering, it is different enough that it made sense to have its own function. I actually tried a bunch of different optimizations here and none worked well so I gave up and just always do the arithmetic based lowering. Looking at code from the PR test case, we actually pessimize a bunch of code when generating these. Because SETB_C* pseudo instructions clobber EFLAGS, we end up creating a bunch of copies of EFLAGS to feed multiple SETB_C* pseudos from a single set of EFLAGS. This in turn causes the lowering code to ruin all the clever code generation that SETB_C* was hoping to achieve. None of this is needed. Whenever we're generating multiple SETB_C* instructions from a single set of EFLAGS we should instead generate a single maximally wide one and extract subregs for all the different desired widths. That would result in substantially better code generation. But this patch doesn't attempt to address that. The test case from the PR is included as well as more directed testing of the specific lowering pattern used for these pseudos. Reviewers: craig.topper Subscribers: sanjoy, mcrosier, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D46799 llvm-svn: 332389
*	AMDGPU: Fix v_dot{4, 8}* instruction encoding	Konstantin Zhuravlyov	2018-05-15	2	-8/+13
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D46848 llvm-svn: 332387
*	[MemorySSA] Don't sort IDF blocks.	Michael Zolotukhin	2018-05-15	1	-11/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: After r332167 we started to sort the IDF blocks inside IDF calculation, so there is no need to re-sort them on the user site. The test changes are due to a slightly different order we're using now (originally we used DFSInNumber and now the blocks are sorted by a pair (LevelFromRoot, DFSInNumber)). Reviewers: dberlin, mgrang Subscribers: Prazek, hiraditya, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D46899 llvm-svn: 332385
*	AMDGPU/GlobalISel: Implement select() for G_FCONSTANT	Tom Stellard	2018-05-15	1	-15/+47
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Also clean up G_CONSTANT selection. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D46170 llvm-svn: 332379
*	AMDGPU: Add disasm tests for deep learning instructions + fix v_fmac_f32 disasm	Konstantin Zhuravlyov	2018-05-15	1	-1/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D46853 llvm-svn: 332377
*	[X86] Split WriteCvtF2F into F32->F64 and F64->F32 scheduler classes	Simon Pilgrim	2018-05-15	12	-136/+181
\| \| \| \| \| \| \| \|	BtVer2 - Fixes schedules for (V)CVTPS2PD instructions A lot of the Intel models still have too many InstRW overrides for these new classes - this needs cleaning up but I wanted to get the classes in first llvm-svn: 332376
*	[InstCombine] fix binop-of-shuffles to check uses	Sanjay Patel	2018-05-15	1	-12/+10
\| \| \| \|	llvm-svn: 332375
*	[Hexagon] Remove unused function from subtarget	Krzysztof Parzyszek	2018-05-15	1	-8/+0
\| \| \| \|	llvm-svn: 332369
*	[Hexagon] Remove unused flag from subtarget and (non)corresponding test	Krzysztof Parzyszek	2018-05-15	3	-8/+0
\| \| \| \|	llvm-svn: 332365
*	[mips] Mark select instructions correctly	Simon Dardis	2018-05-15	3	-151/+192
\| \| \| \| \| \| \| \|	Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46702 llvm-svn: 332364
*	[DAG] propagate FMF for all FPMathOperators	Sanjay Patel	2018-05-15	2	-12/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a simple hack based on what's proposed in D37686, but we can extend it if needed in follow-ups. It gets us most of the FMF functionality that we want without adding any state bits to the flags. It also intentionally leaves out non-FMF flags (nsw, etc) to minimize the patch. It should provide a superset of the functionality from D46563 - the extra tests show propagation and codegen diffs for fcmp, vecreduce, and FP libcalls. The PPC log2() test shows the limits of this most basic approach - we only applied 'afn' to the last node created for the call. AFAIK, there aren't any libcall optimizations based on the flags currently, so that shouldn't make any difference. Differential Revision: https://reviews.llvm.org/D46854 llvm-svn: 332358
*	[X86] Split off F16C WriteCvtPH2PS/WriteCvtPS2PH scheduler classes	Simon Pilgrim	2018-05-15	12	-139/+121
\| \| \| \| \| \| \| \| \|	Btver2 - VCVTPH2PSYrm needs to double pump the AGU Broadwell - missing VCVTPS2PH*mr stores extra latency Allows us to remove the WriteCvtF2FSt conversion store class llvm-svn: 332357
*	Reapply "DWARFVerifier: Check "completeness" of .debug_names section"	Pavel Labath	2018-05-15	2	-2/+176
\| \| \| \| \| \| \| \| \|	This is a resubmit of r331868 (D46583), which was reverted due to failures on the PS4 bot. These have been resolved with r332246/D46748. llvm-svn: 332349
*	[MergeFunctions] Fix merging of small weak functions	whitequark	2018-05-15	1	-35/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When two interposable functions are merged, we cannot replace uses and have to emit calls to a common internal function. However, writeThunk() will not actually emit a thunk if the function is too small. This leaves us in a broken state where mergeTwoFunctions already rewired the functions, but writeThunk doesn't do anything. This patch changes the implementation so that: * writeThunk() does just that. * The direct replacement of calls is moved into mergeTwoFunctions() into the non-interposable case only. * isThunkProfitable() is extracted and will be called for the non-iterposable case always, and in the interposable case only if uses are still left after replacement. This issue has been introduced in https://reviews.llvm.org/D34806, where the code for checking thunk profitability has been moved. Differential Revision: https://reviews.llvm.org/D46804 Reviewed By: whitequark llvm-svn: 332342
*	[mips] Fix formatting of floating point conversion patterns	Simon Dardis	2018-05-15	1	-8/+8
\| \| \| \|	llvm-svn: 332341
*	[mips] Add disassembly support for comparison instructions	Simon Dardis	2018-05-15	1	-4/+6
\| \| \| \|	llvm-svn: 332340
*	[mips] Fix predicates of mfc1, mtc1 instructions	Simon Dardis	2018-05-15	2	-28/+22
\| \| \| \| \| \| \| \|	Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46692 llvm-svn: 332339
*	[X86] Improve unsigned saturation downconvert detection.	Artur Gainullin	2018-05-15	1	-19/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: New unsigned saturation downconvert patterns detection was implemented in X86 Codegen: (truncate (smin (smax (x, C1), C2)) to dest_type), where C1 >= 0 and C2 is unsigned max of destination type. (truncate (smax (smin (x, C2), C1)) to dest_type) where C1 >= 0, C2 is unsigned max of destination type and C1 <= C2. These two patterns are equivalent to: (truncate (umin (smax(x, C1), unsigned_max_of_dest_type)) to dest_type) Reviewers: RKSimon Subscribers: llvm-commits, a.elovikov Differential Revision: https://reviews.llvm.org/D45315 llvm-svn: 332336
*	[NFC] pull a function into its own lambda	JF Bastien	2018-05-15	1	-19/+21
\| \| \| \| \| \| \| \| \|	As requested in D46858, pulling this function into its own lambda makes it easier to read that part of the code and reason as to what's going on because the scope it can be called from is extremely limited. We want to keep it as a function because it's called from the two subsequent lines. llvm-svn: 332325
*	[NFC] Update comments	JF Bastien	2018-05-15	2	-31/+30
\| \| \| \| \| \|	Don't prepend function or data name before each comment. Split into its own NFC patch as requested in D46858. llvm-svn: 332323
*	[RISCV] Define FeatureRelax and shouldForceRelocation for RISCV linker ↵	Shiva Chen	2018-05-15	3	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	relaxation 1. Deine FeatureRelax to enable/disable linker relaxation. 2. Define shouldForceRelocation to preserve relocation types even if the fixup can be resolved when linker relaxation enabled. This is necessary for correctness as offsets may change during relaxation. Differential Revision: https://reviews.llvm.org/D46674 llvm-svn: 332318
*	[NFC] Add const to method signature	Max Kazantsev	2018-05-15	1	-1/+1
\| \| \| \|	llvm-svn: 332317
*	[Debugify] Add -debugify-each for testing each pass in a pipeline	Vedant Kumar	2018-05-15	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a -debugify-each mode to opt which, when enabled, wraps each {Module,Function}Pass in a pipeline with logic to add, check, and strip synthetic debug info for testing purposes. This mode can be used to test complex pipelines for debug info bugs, or to collect statistics about the number of debug values & locations lost throughout various stages of a pipeline. Patch by Son Tuan Vu! Differential Revision: https://reviews.llvm.org/D46525 llvm-svn: 332312
*	[CommandLine] Error message for incorrect PositionalEatArgs usage	Keno Fischer	2018-05-14	1	-9/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: bugpoint has several options specified as `PositionalEatArgs` to pass options through to the underlying tool, e.g. `-tool-args`. The `-help` message suggests the usage is: `-tool-args=<string>`. However, this is misleading, because that's not how these arguments work. Rather than taking a value, the option consumes all positional arguments until the next recognized option (or all arguments if `--` is specified at some point). To make this slightly clearer, instead print the help as: ``` -tool-args <string>... - <tool arguments>... ``` Additionally, add an error if the user attempts to use a `PositionalEatArgs` argument with a value, instead of silently ignoring it. Example: ``` ./bin/bugpoint -tool-args=-mpcu=skylake-avx512 bugpoint: for the -tool-args option: This argument does not take a value. Instead, it consumes any positional arguments until the next recognized option. ``` Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D46787 llvm-svn: 332311
*	[WebAssembly] Move toString helpers to BinaryFormat	Sam Clegg	2018-05-14	4	-29/+38
\| \| \| \| \| \| \| \|	Subscribers: dschuff, mgorny, jgravelle-google, aheejin, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D46847 llvm-svn: 332305
*	Revert "[Option] Fix PR37006 prefix choice in findNearest"	Brian Gesiak	2018-05-14	1	-24/+24
\| \| \| \| \| \| \| \| \| \| \|	Summary: This revision causes build failures in PS4 and ppc64le buildbots (for example, http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/29988). I'll revert for now and try to diagnose the issue. Test Plan: check-llvm check-clang llvm-svn: 332304
*	[InstCombine] fix crash due to ignored addrspacecast	Keno Fischer	2018-05-14	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Part of the InstCombine code for simplifying GEPs looks through addrspacecasts. However, this was done by updating a variable also used by the next transformation, for marking GEPs as inbounds. This led to replacing a GEP with a similar instruction in a different addrspace, which caused an assertion failure in RAUW. This caused julia issue https://github.com/JuliaLang/julia/issues/27055 Patch by Jeff Bezanson <jeff@juliacomputing.com> Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D46722 llvm-svn: 332302
*	[Option] Fix PR37006 prefix choice in findNearest	Brian Gesiak	2018-05-14	1	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In https://bugs.llvm.org/show_bug.cgi?id=37006 Nico Weber points out a flaw in `OptTable::findNearest`: if an option "foo"'s prefixes are "--" and "-", then the nearest option for "--fob" will be "-foo". This is incorrect, however, since the function is expected to return "--foo". The bug is due to a naive loop that attempts to predetermines which prefix is best. Instead, compute the edit distance for each prefix/name pair. Test Plan: `check-llvm` Reviewers: thakis Reviewed By: thakis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46776 llvm-svn: 332299
*	[ARM] Back up R4 and LR if calling the stack probe function	Martin Storsjo	2018-05-14	1	-0/+11
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D46777 llvm-svn: 332298
*	[CodeView] Improve debugging of virtual base class member variables	Brock Wyma	2018-05-14	2	-3/+6
\| \| \| \| \| \| \| \|	Initial support for passing the virtual base pointer offset to CodeViewDebug. https://reviews.llvm.org/D46271 llvm-svn: 332296
*	[Hexagon] Add a target feature to control using small data section	Krzysztof Parzyszek	2018-05-14	4	-16/+20
\| \| \| \|	llvm-svn: 332292
*	[Hexagon] Add a target feature for generating new-value stores	Krzysztof Parzyszek	2018-05-14	3	-6/+18
\| \| \| \|	llvm-svn: 332290
*	[DWARF] Factor out a DWARFUnitHeader class. NFC	Paul Robinson	2018-05-14	3	-56/+44
\| \| \| \| \| \| \| \| \| \| \|	Extract information related to a "unit header" from DWARFUnit into a new DWARFUnitHeader class, and add a DWARFUnit member for the header. This is one step in the direction of allowing type units in the .debug_info section for DWARF v5. Differential Revision: https://reviews.llvm.org/D46707 llvm-svn: 332289
*	[Hexagon] Add a target feature for memop generation	Krzysztof Parzyszek	2018-05-14	4	-31/+32
\| \| \| \|	llvm-svn: 332285
*	[X86] Add NT load/store scheduler classes	Simon Pilgrim	2018-05-14	13	-84/+148
\| \| \| \|	llvm-svn: 332274
*	[X86] Remove and autoupgrade avx512.vbroadcast.ss/avx512.vbroadcast.sd ↵	Craig Topper	2018-05-14	2	-6/+3
\| \| \| \| \| \|	intrinsics. llvm-svn: 332271
*	[X86][BtVer2] Fix MMX/YMM integer vector nt store schedules	Simon Pilgrim	2018-05-14	1	-2/+8
\| \| \| \| \| \|	MMX was missing and YMM was tagged as a fp nt store llvm-svn: 332269
*	[BranchFolding] Allow hoisting to block with a single conditional branch.	Geoff Berry	2018-05-14	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The BranchFolding pass is currently missing opportunities to hoist common code if the hoisted-to block contains a single conditional branch that has register uses. This occurs somewhat frequently on AArch64 with CBZ/TBZ opcodes. This change also eliminates some code differences when debug info is present since the presence of e.g. DBG_VALUE instructions in the hoisted-to block can enable hoisting that wouldn't have occurred without them. Reviewers: MatzeB, rnk, kparzysz, twoh, aprantl, javed.absar Subscribers: kristof.beyls, JDevlieghere, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D46324 llvm-svn: 332265
*	[Hexagon] Avoid predicate copies to integer registers from store-locked	Krzysztof Parzyszek	2018-05-14	1	-0/+15
\| \| \| \|	llvm-svn: 332260
*	[mips] Fix the predicates of round, ceiling, floor and trunc.	Simon Dardis	2018-05-14	2	-38/+36
\| \| \| \| \| \| \| \|	Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46691 llvm-svn: 332258
*	[NFC] [Power] Fix instruction format for xsrqpi	Zaara Syeda	2018-05-14	2	-1/+22
\| \| \| \| \| \| \| \| \| \| \| \|	xsrqpi is currently using Z23Form_1. The instruction format is xsrqpi R,VRT,VRB,RMC. Rathar than bits 11-15 being used for FRA, it should have bits 11-14 reserved and bit 15 for R. This patch adds a new class Z23Form_4 to fix the instruction format. Differential Revision: https://reviews.llvm.org/D46761 llvm-svn: 332253
*	[AArch64] Improve single vector lane stores	Evandro Menezes	2018-05-14	1	-20/+57
\| \| \| \| \| \| \| \|	When storing the 0th lane of a vector, use a simpler and usually more efficient scalar store instead. Differential revision: https://reviews.llvm.org/D46655 llvm-svn: 332251
*	[CodeGen/AccelTable]: Handle -dwarf-linkage-names=Abstract correctly	Pavel Labath	2018-05-14	3	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we are not emitting a linkage name in the .debug_info sections, we should not add it into the index either. This makes sure our index is consistent with the actual debug info. I am also explicitly setting the --dwarf-linkage-names=All in the name-collsions test as that one would now fail on targets where this defaults to "Abstract" (in fact, it would have failed already if there wasn't a bug in the DWARF verifier, which I fix as well). Reviewers: probinson, aprantl, JDevlieghere Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46748 llvm-svn: 332246
*	[AggressiveInstCombine] avoid crashing on unsimplified code (PR37446)	Sanjay Patel	2018-05-14	1	-0/+4
\| \| \| \| \| \| \| \| \|	This bug: https://bugs.llvm.org/show_bug.cgi?id=37446 ...raises another question: why do we run aggressive-instcombine before regular instcombine? llvm-svn: 332243
*	Rename DEBUG macro to LLVM_DEBUG.	Nicola Zaghen	2018-05-14	459	-8104/+8548
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The DEBUG() macro is very generic so it might clash with other projects. The renaming was done as follows: - git grep -l 'DEBUG' \| xargs sed -i 's/\bDEBUG\s\?(/LLVM_DEBUG(/g' - git diff -U0 master \| ../clang/tools/clang-format/clang-format-diff.py -i -p1 -style LLVM - Manual change to APInt - Manually chage DOCS as regex doesn't match it. In the transition period the DEBUG() macro is still present and aliased to the LLVM_DEBUG() one. Differential Revision: https://reviews.llvm.org/D43624 llvm-svn: 332240
*	Fix "not all control paths return a value" MSVC warning. NFCI.	Simon Pilgrim	2018-05-14	1	-0/+1
\| \| \| \|	llvm-svn: 332238
*	[AArch64][SVE] Extend parsing of Prefetch operation for SVE.	Sander de Smalen	2018-05-14	7	-10/+99
\| \| \| \| \| \| \| \| \| \|	Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D46681 llvm-svn: 332234
*	[mips] Correct the predicates of indexed floating point stores and loads.	Simon Dardis	2018-05-14	2	-14/+14
\| \| \| \| \| \| \| \| \| \|	Also, fix the register class for microMIPS. Reviewers: atanasyan, abeserminji, smaksimovic Differential Revision: https://reviews.llvm.org/D46689 llvm-svn: 332227