bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[AArch64] Fix some Include What You Use warnings; other minor fixes (NFC).	Eugene Zelenko	2017-02-03	4	-25/+46
\| \| \| \| \| \|	This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294053
*	[ARM] Fix some Include What You Use warnings; other minor fixes (NFC).	Eugene Zelenko	2017-02-03	2	-2/+9
\| \| \| \| \| \|	This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294052
*	[XCore] Fix some Include What You Use warnings; other minor fixes (NFC).	Eugene Zelenko	2017-02-03	2	-3/+9
\| \| \| \| \| \|	This is preparation to reduce MCExpr.h dependencies. llvm-svn: 294051
*	[InstCombine] fix operand-complexity-based canonicalization (PR28296)	Sanjay Patel	2017-02-03	1	-7/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code comments didn't match the code logic, and we didn't actually distinguish the fake unary (not/neg/fneg) operators from arguments. Adding another level to the weighting scheme provides more structure and can help simplify the pattern matching in InstCombine and other places. I fixed regressions that would have shown up from this change in: rL290067 rL290127 But that doesn't mean there are no pattern-matching logic holes left; some combines may just be missing regression tests. Should fix: https://llvm.org/bugs/show_bug.cgi?id=28296 Differential Revision: https://reviews.llvm.org/D27933 llvm-svn: 294049
*	Properly parse the TypeServer2 record.	Zachary Turner	2017-02-03	6	-25/+57
\| \| \| \|	llvm-svn: 294046
*	AMDGPU: AsmParser cleanups	Matt Arsenault	2017-02-03	1	-17/+24
\| \| \| \| \| \|	Use typedef, remove unnecessary enum, line wraps. llvm-svn: 294039
*	[libfuzzer] chromium-related compilation fixes	Mike Aizatsky	2017-02-03	3	-10/+13
\| \| \| \| \| \| \| \|	Reviewers: kcc Differential Revision: https://reviews.llvm.org/D29502 llvm-svn: 294035
*	[AMDGPU] Bump -amdgpu-unroll-threshold-private to 2000	Stanislav Mekhanoshin	2017-02-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This has quite positive performance impact according to measurements. Before previous fixes to limit the optimization that was too high and blowed compile time and scratch usage, but now this is gone and we can bump the threshold. Differential Revision: https://reviews.llvm.org/D29505 llvm-svn: 294032
*	AMDGPU: Set MCAsmInfo::PointerSize	Matt Arsenault	2017-02-03	1	-0/+1
\| \| \| \|	llvm-svn: 294031
*	AMDGPU: Don't unroll for private with dynamic allocas	Matt Arsenault	2017-02-03	1	-1/+1
\| \| \| \| \| \| \|	This won't be elimnated, so this will just bloat code if/when these are ever used/supported. llvm-svn: 294030
*	[SLP] Make sortMemAccesses explicitly return an error. NFC.	Michael Kuperstein	2017-02-03	2	-24/+25
\| \| \| \|	llvm-svn: 294029
*	[TLI] Robustize SDAG LibFunc proto checking by merging it into TLI.	Ahmed Bougacha	2017-02-03	1	-97/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This re-applies commit r292189, reverted in r292191. SelectionDAGBuilder recognizes libfuncs using some homegrown parameter type-checking. Use TLI instead, removing another heap of redundant code. This isn't strictly NFC, as the SDAG code was too lax. Concretely, this means changes are required to a few tests: - calling a non-variadic function via a variadic prototype isn't OK; it just happens to work on x86_64 (but not on, e.g., aarch64). - mempcpy has a size_t parameter; the SDAG code accepts any integer type, which meant using i32 on x86_64 worked. - a handful of SystemZ tests check the SDAG support for lax prototype checking: Ulrich agrees on removing them. I don't think it's worth supporting any of these (IMO) invalid testcases. Instead, fix them to be more meaningful. llvm-svn: 294028
*	[SLP] Use SCEV to sort memory accesses.	Michael Kuperstein	2017-02-03	1	-17/+34
\| \| \| \| \| \| \| \| \| \|	This generalizes memory access sorting to use differences between SCEVs, instead of relying on constant offsets. That allows us to properly do SLP vectorization of non-sequentially ordered loads within loops bodies. Differential Revision: https://reviews.llvm.org/D29425 llvm-svn: 294027
*	GlobalISel: translate dynamic alloca instructions.	Tim Northover	2017-02-03	2	-8/+99
\| \| \| \|	llvm-svn: 294022
*	[X86][SSE] Add support for combining scalar_to_vector(extract_vector_elt) ↵	Simon Pilgrim	2017-02-03	1	-0/+14
\| \| \| \| \| \| \| \|	into a target shuffle. Correctly flagging upper elements as undef. llvm-svn: 294020
*	NFC: [LoopUnroll] More meaningful message in tracing	Anna Thomas	2017-02-03	1	-1/+1
\| \| \| \|	llvm-svn: 294017
*	IRMover: Merge flags LinkModuleInlineAsm and IsPerformingImport.	Peter Collingbourne	2017-02-03	4	-13/+12
\| \| \| \| \| \| \| \| \|	Currently these flags are always the inverse of each other, so there is no need to keep them separate. Differential Revision: https://reviews.llvm.org/D29471 llvm-svn: 294016
*	ModuleLinker: Remove importing support. NFCI.	Peter Collingbourne	2017-02-03	1	-58/+12
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29470 llvm-svn: 294015
*	FunctionImport: Use IRMover directly.	Peter Collingbourne	2017-02-03	3	-16/+20
\| \| \| \| \| \| \| \| \| \| \| \|	The importer was previously using ModuleLinker in a sort of "IRMover mode". Use IRMover directly instead in order to remove a level of indirection. I will remove all importing support from ModuleLinker in a separate change. Differential Revision: https://reviews.llvm.org/D29468 llvm-svn: 294014
*	[mips] Remove absolute size assertion for end directive	Simon Dardis	2017-02-03	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The .end <symbol> directive for MIPS marks the end of a symbol and sets the symbol's size. Previously, the corresponding emitDirective handler asserted that a function's size could be evaluated to an absolute value at that point in time. This cannot be done with when directives like .align have been encountered, instead set the function's size to the corresponding symbolic expression and let ELFObjectWriter resolve the expression to an absolute value. This avoids a redundant call to evaluateAsAbsolute. llvm-svn: 294012
*	[NVPTX] Enable combineRepeatedFPDivisors for NVPTX.	Justin Lebar	2017-02-03	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Reviewers: tra Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D29477 llvm-svn: 294011
*	[AMDGPU][mc] Fix AddressSanitizer leftover issue in gfx7_asm_all test	Artem Tamazov	2017-02-03	3	-9/+11
\| \| \| \| \| \|	Issue occurs when assembling "ds_ordered_count v0, v0 gds". llvm-svn: 294004
*	[SelectionDAG] Fix for PR30775: Assertion `NodeToMatch->getOpcode() !=	Alexey Bataev	2017-02-03	1	-8/+12
\| \| \| \| \| \| \| \| \| \| \| \|	ISD::DELETED_NODE && "NodeToMatch was removed partway through selection"' failed. NodeToMatch can be modified during matching, but code does not handle this situation. Differential Revision: https://reviews.llvm.org/D29292 llvm-svn: 294003
*	[ARM] Change TCReturn to tBL if tailcall optimization fails.	Sanne Wouda	2017-02-03	2	-6/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The tail call optimisation is performed before register allocation, so at that point we don't know if LR is being spilt or not. If LR was spilt to the stack, then we cannot do a tail call optimisation. That would involve popping back into LR which is not possible in Thumb1 code. Reviewers: rengolin, jmolloy, rovka, olista01 Reviewed By: olista01 Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D29020 llvm-svn: 294000
*	[SLP] Fix for PR31690: Allow using of extra values in horizontal reductions.	Alexey Bataev	2017-02-03	1	-12/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently LLVM supports vectorization of horizontal reduction instructions with initial value set to 0. Patch supports vectorization of reduction with non-zero initial values. Also it supports a vectorization of instructions with some extra arguments, like: float f(float x[], int a, int b) { float p = a % b; p += x[0] + 3; for (int i = 1; i < 32; i++) p += x[i]; return p; } Patch allows vectorization of this kind of horizontal reductions. Differential Revision: https://reviews.llvm.org/D28961 llvm-svn: 293994
*	Revert "[ThinLTO] Add an auto-hide feature"	Mehdi Amini	2017-02-03	6	-77/+45
\| \| \| \| \| \| \| \| \|	This reverts commit r293970. After more discussion, this belongs to the linker side and there is no added value to do it at this level. llvm-svn: 293993
*	[AMDGPU] Unroll preferences improvements	Stanislav Mekhanoshin	2017-02-03	1	-1/+28
\| \| \| \| \| \| \| \| \| \| \|	Exit loop analysis early if suitable private access found. Do not account for GEPs which are invariant to loop induction variable. Do not account for Allocas which are too big to fit into register file anyway. Add option for tuning: -amdgpu-unroll-threshold-private. Differential Revision: https://reviews.llvm.org/D29473 llvm-svn: 293991
*	[sanitizer coverage] Fix Instrumentation to work on Windows.	Marcos Pividori	2017-02-03	1	-21/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On Windows, the symbols "___stop___sancov_guards" and "___start___sancov_guards" are not defined automatically. So, we need to take a different approach. We define 3 sections: Section ".SCOV$A" will only hold a variable ___start___sancov_guard. Section ".SCOV$M" will hold the main data. Section ".SCOV$Z" will only hold a variable ___stop___sancov_guards. When linking, they will be merged sorted by the characters after the $, so we can use the pointers of the variables ___[start\|stop]___sancov_guard to know the actual range of addresses of that section. In this diff, I updated instrumentation to include all the guard arrays in section ".SCOV$M". Differential Revision: https://reviews.llvm.org/D28434 llvm-svn: 293987
*	AMDGPU: Fold fneg into fmin/fmax_legacy	Matt Arsenault	2017-02-03	1	-2/+24
\| \| \| \|	llvm-svn: 293972
*	DebugInfo: ensure type and namespace names are included in pubnames/pubtypes ↵	David Blaikie	2017-02-03	5	-14/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	even when they are only present in type units While looking to add support for placing singular types (types that will only be emitted in one place (such as attached to a strong vtable or explicit template instantiation definition)) not in type units (since type units have overhead) I stumbled across that change causing an increase in pubtypes. Turns out we were missing some types from type units if they were only referenced from other type units and not from the debug_info section. This fixes that, following GCC's line of describing the offset of such entities as the CU die (since there's no compile unit-relative offset that would describe such an entity - they aren't in the CU). Also like GCC, this change prefers to describe the type stub within the CU rather than the "just use the CU offset" fallback where possible. This may give the DWARF consumer some opportunity to find the extra info in the type stub - though I'm not sure GDB does anything with this currently. The size of the pubnames/pubtypes sections now match exactly with or without type units enabled. This nearly triples (+189%) the pubtypes section for a clang self-host and grows pubnames by 0.07% (without compression). For a total of 8% increase in debug info sections of the objects of a Split DWARF build when using type units. llvm-svn: 293971
*	[ThinLTO] Add an auto-hide feature	Mehdi Amini	2017-02-03	6	-45/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a symbol is not exported outside of the DSO, it is can be hidden. Usually we try to internalize as much as possible, but it is not always possible, for instance a symbol can be referenced outside of the LTO unit, or there can be cross-module reference in ThinLTO. This is a recommit of r293912 after fixing build failures, and a recommit of r293918 after fixing LLD tests. Differential Revision: https://reviews.llvm.org/D28978 llvm-svn: 293970
*	[X86] Mark 256-bit and 512-bit INSERT_SUBVECTOR operations as legal and ↵	Craig Topper	2017-02-03	1	-27/+6
\| \| \| \| \| \|	remove the custom lowering. llvm-svn: 293969
*	AMDGPU: Fold fneg into fminnum/fmaxnum	Matt Arsenault	2017-02-03	1	-0/+30
\| \| \| \|	llvm-svn: 293968
*	AMDGPU: Check if users of fneg can fold mods	Matt Arsenault	2017-02-02	1	-4/+64
\| \| \| \| \| \|	In multi-use cases this can save a few instructions. llvm-svn: 293962
*	Revert "[ThinLTO] Add an auto-hide feature"	Mehdi Amini	2017-02-02	6	-77/+45
\| \| \| \| \| \|	This reverts commit r293918, one lld test does not pass. llvm-svn: 293961
*	[lto] add getLinkerOpts()	Bob Haarman	2017-02-02	2	-29/+62
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Some compilers, including MSVC and Clang, allow linker options to be specified in source files. In the legacy LTO API, there is a getLinkerOpts() method that returns linker options for the bitcode module being processed. This change adds that method to the new API, so that the COFF linker can get the right linker options when using the new LTO API. Reviewers: pcc, ruiu, mehdi_amini, tejohnson Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D29207 llvm-svn: 293950
*	[X86] Fix some Clang-tidy modernize and Include What You Use warnings; other ↵	Eugene Zelenko	2017-02-02	13	-151/+240
\| \| \| \| \| \|	minor fixes (NFC). llvm-svn: 293949
*	[X86] Avoid sorted order check in release builds	Reid Kleckner	2017-02-02	1	-4/+6
\| \| \| \| \| \| \|	Effectively reverts r290248 and fixes the unused function warning with ifndef NDEBUG. llvm-svn: 293945
*	[X86] Move turning 256-bit INSERT_SUBVECTORS into BLENDI from legalize to ↵	Craig Topper	2017-02-02	1	-44/+39
\| \| \| \| \| \| \| \|	DAG combine. On one test this seems to have given more chance for DAG combine to do other INSERT_SUBVECTOR/EXTRACT_SUBVECTOR combines before the BLENDI was created. Looks like we can still improve more by teaching DAG combine to optimize INSERT_SUBVECTOR/EXTRACT_SUBVECTOR with BLENDI. llvm-svn: 293944
*	[CodeGen] Remove dead call-or-prologue enum from CCState	Reid Kleckner	2017-02-02	2	-32/+10
\| \| \| \| \| \| \|	This enum has been dead since Olivier Stannard re-implemented ARM byval handling in r202985 (2014). llvm-svn: 293943
*	[PGO] internal option cleanups	Xinliang David Li	2017-02-02	4	-24/+59
\| \| \| \| \| \| \| \| \| \|	1. Added comments for options 2. Added missing option cl::desc field 3. Uniified function filter option for graph viewing. Now PGO count/raw-counts share the same filter option: -view-bfi-func-name=. llvm-svn: 293938
*	Change how we handle section symbols on ELF.	Rafael Espindola	2017-02-02	7	-75/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On ELF every section can have a corresponding section symbol. When in an assembly file we have .quad .text the '.text' refers to that symbol. The way we used to handle them is to leave .text an undefined symbol until the very end when the object writer would map them to the actual section symbol. The problem with that is that anything before the end would see an undefined symbol. This could result in bad diagnostics (test/MC/AArch64/label-arithmetic-diags-elf.s), or incorrect results when using the asm streamer (est/MC/Mips/expansion-jal-sym-pic.s). Fixing this will also allow using the section symbol earlier for setting sh_link of SHF_METADATA sections. This patch includes a few hacks to avoid changing our behaviour when handling conflicts between section symbols and other symbols. I reported pr31850 to track that. llvm-svn: 293936
*	[ARM] Classification Improvements to ARM Sched-Model. NFCI.	Javed Absar	2017-02-02	5	-58/+160
\| \| \| \| \| \| \| \| \| \| \| \|	This is the second in the series of patches to enable adding of machine sched-models for ARM processors easier and compact. This patch focuses on integer instructions and adds missing sched definitions. Reviewers: rovka, rengolin Differential Revision: https://reviews.llvm.org/D29127 llvm-svn: 293935
*	[LiveRangeEdit] Don't mess up with LiveInterval when a new vreg is created.	Quentin Colombet	2017-02-02	1	-3/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In r283838, we added the capability of splitting unspillable register. When doing so we had to make sure the split live-ranges were also unspillable and we did that by marking the related live-ranges in the delegate method that is called when a new vreg is created. However, by accessing the live-range there, we also triggered their lazy computation (LiveIntervalAnalysis::getInterval) which is not what we want in general. Indeed, later code in LiveRangeEdit is going to build the live-ranges this lazy computation may mess up that computation resulting in assertion failures. Namely, the createEmptyIntervalFrom method expect that the live-range is going to be empty, not computed. Thanks to Mikael Holmén <mikael.holmen@ericsson.com> for noticing and reporting the problem. llvm-svn: 293934
*	[Hexagon] Adding opExtentBits and opExtentAlign to GPrel instructions	Krzysztof Parzyszek	2017-02-02	4	-12/+62
\| \| \| \| \| \|	Patch by Colin LeMahieu. llvm-svn: 293933
*	[X86] Add costs for non-AVX512 single-source permutation integer shuffles	Michael Kuperstein	2017-02-02	1	-3/+16
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29416 llvm-svn: 293932
*	[Hexagon] Fix relocation kind for extended predicated calls	Krzysztof Parzyszek	2017-02-02	1	-5/+7
\| \| \| \| \| \|	Patch by Sid Manning. llvm-svn: 293931
*	[Hexagon] Remove A4_ext_* pseudo instructions	Krzysztof Parzyszek	2017-02-02	6	-38/+35
\| \| \| \| \| \|	Patch by Colin LeMahieu. llvm-svn: 293929
*	[libFuzzer] reorganize the tracing code to make it easier to experiment with ↵	Kostya Serebryany	2017-02-02	2	-19/+36
\| \| \| \| \| \|	inlined coverage instrumentation. NFC llvm-svn: 293928
*	[Hexagon] Fix insertBranch for loops with multiple ENDLOOP instructions	Krzysztof Parzyszek	2017-02-02	1	-18/+24
\| \| \| \|	llvm-svn: 293925