bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	[LV] Apply sink-after & interleave-groups as VPlan transformations (NFC)	Gil Rapaport	2019-11-05	7	-131/+174
\| \| \| \| \| \| \|	This recommits 2be17087f8c38934b7fc9208ae6cf4e9b4d44f4b (reverted in d3ec06d219788801380af1948c7f7ef9d3c6100b for heap-use-after-free) with a fix in IAI's reset() which was not clearing the set of interleave groups after deleting them.
*	Fix uninitialized variable warning. NFCI.	Simon Pilgrim	2019-11-05	1	-1/+1
\|
*	[MCObjectFileInfo] Fix uninitialized variable warnings. NFCI.	Simon Pilgrim	2019-11-05	1	-88/+89
\|
*	[MachineOutliner] Fix uninitialized variable warnings. NFCI.	Simon Pilgrim	2019-11-05	2	-7/+7
\|
*	[OPENMP][DOCS]Fix coloring of the implemented features status, NFC.	Alexey Bataev	2019-11-05	1	-5/+5
\|
*	[ObjC][ARC] Ignore lifetime markers between *ReturnValue calls	Francis Visoiu Mistrih	2019-11-05	2	-5/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When eliminating a pair of `llvm.objc.autoreleaseReturnValue` followed by `llvm.objc.retainAutoreleasedReturnValue` we need to make sure that the instructions in between are safe to ignore. Other than bitcasts and useless GEPs, it's also safe to ignore lifetime markers for both static allocas (lifetime.start/lifetime.end) and dynamic allocas (stacksave/stackrestore). These get added by the inliner as part of the return sequence and can prevent the transformation from happening in practice. Differential Revision: https://reviews.llvm.org/D69833
*	[NFC][ObjC][ARC] Add tests for OptimizeRetainRVCall	Francis Visoiu Mistrih	2019-11-05	1	-0/+68
\| \| \| \| \|	Add tests for bitcasts + zero GEPs, and pre-commit tests for lifetime markers.
*	[JumpThreading] Factor out common code to update the SSA form (NFC)	Kazu Hirata	2019-11-05	2	-75/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch factors out common code to update the SSA form in JumpThreading.cpp -- partly for readability and partly to facilitate an upcoming patch of my own. Reviewers: wmi Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69811
*	[GVN] Fix uninitialized variable warnings. NFCI.	Simon Pilgrim	2019-11-05	3	-23/+23
\|
*	Add missing GVN =operator. NFCI.	Simon Pilgrim	2019-11-05	2	-0/+2
\| \| \| \|	Fixes PVS Studio warning that the 'ValueTable' class implements a copy constructor, but lacks the '=' operator.
*	[InstCombine] add tests for shift-logic-shift; NFC	Sanjay Patel	2019-11-05	1	-0/+171
\| \| \| \| \| \|	This is based on existing CodeGen test files for x86 and AArch64. The corresponding potential transform is shown in: rL370617
*	[lldb] Fix readline/libedit compat patch for py2	serge-sans-paille	2019-11-05	1	-1/+9
\| \| \| \|	This is a follow-up to https://reviews.llvm.org/D69793
*	[AtomicExpandPass] Silence static analyzer warnings about operator priority. NFCI.	Dávid Bolvanský	2019-11-05	1	-1/+1
\|
*	[MachineScheduler] Enable AA in PostRA Machine scheduler	David Green	2019-11-05	7	-49/+51
\| \| \| \| \| \| \| \| \| \| \| \|	This adds AA to Post-RA Machine Scheduling, allowing the pass more freedom when handling memory operations. My understanding is that this was just never done, not that it is inherently incorrect to do so. The older PostRA List scheduler already makes use of AA, it's just that the MI PostRA Scheduler was never taught to use it. Differential Revision: https://reviews.llvm.org/D69814
*	[Docs] Add LangRef documentation for freeze instruction	Nuno Lopes	2019-11-05	1	-33/+81
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: - Describe the new freeze instruction - Make it explicit that branch on undef/poison is UB Reviewers: chandlerc, majnemer, efriedma, nikic, reames, jdoerfert, lebedev.ri, regehr Subscribers: fhahn, bollu, lebedev.ri, delcypher, spatel, filcab, llvm-commits, aqjune Differential Revision: https://reviews.llvm.org/D29121
*	[Clang FE] Recognize -mnop-mcount CL option (SystemZ only).	Jonas Paulsson	2019-11-05	7	-0/+45
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Recognize -mnop-mcount from the command line and add a function attribute "mnop-mcount"="true" when passed. When this option is used, a nop is added instead of a call to fentry. This is used when building the Linux Kernel. If this option is passed for any other target than SystemZ, an error is generated. Review: Ulrich Weigand https://reviews.llvm.org/D67763
*	Fix PR40644: miscompile indexed FP constant store	Thomas Preud'homme	2019-11-05	2	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Functions replaceStoreOfFPConstant() and OptimizeFloatStore() both unconditionally replace a store of a float with a store of an integer. However, this generates wrong code when the store that is replaced is an indexed or truncating store. This commit solves this issue by adding an early return in these functions when the store being considered is not a normal store. The bug was only observed on out-of-tree targets, hence the lack of a test case in this commit. Reviewers: efriedma Subscribers: hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68420
*	[ARM] Always enable UseAA in the arm backend	David Green	2019-11-05	6	-36/+47
\| \| \| \| \| \| \| \| \| \|	This feature controls whether AA is used in the backend, and was previously turned on for certain subtargets to help create less constrained scheduling graphs. This patch turns it on for all subtargets, so that they can all make use of the extra information to produce better code. Differential Revision: https://reviews.llvm.org/D69796
*	[Scheduling][ARM] Consistently enable PostRA Machine scheduling	David Green	2019-11-05	17	-18/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the ARM backend, for historical reasons we have only some targets using Machine Scheduling. The rest use the old list scheduler as they are using itineraries and the list scheduler seems to produce better code (and not crash running out of registers on v6m codes). So whether to use the MIScheduler or not is checked at runtime from the subtarget features. This is fine, except for post-ra scheduling. Whether to use the old post-ra list scheduler or the post-ra machine scheduler is decided as the pass manager is set up, in ARM's case from a newly constructed subtarget. Under some situations, like LTO, this won't include the correct cpu so can pick the wrong option. This can have a surprising effect on performance. To fix that, this patch overrides targetSchedulesPostRAScheduling and addPreSched2 in the ARM backend, adding _both_ post-ra schedulers and picking at runtime which to execute. To pick between the two I've had to add an enablePostRAMachineScheduler() method that normally returns enableMachineScheduler() && enablePostRAScheduler(), which can be overridden to enable just one of PostRAMachineScheduler vs PostRAScheduler. Thanks to David Penry for identifying this problem. Differential Revision: https://reviews.llvm.org/D69775
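The selection rule described in the message above can be sketched roughly as follows. This is only an illustration: `SubtargetSketch`, the two derived structs, `PostRAChoice` and `pickPostRAScheduler` are hypothetical names, not LLVM's classes, and the real patch adds both passes to the pipeline and lets each decide at runtime whether to run.

```cpp
#include <cassert>

// Hypothetical stand-in for a target subtarget; not LLVM's real class.
struct SubtargetSketch {
    virtual ~SubtargetSketch() = default;
    virtual bool enableMachineScheduler() const { return false; }
    virtual bool enablePostRAScheduler() const { return false; }
    // Default described in the commit message: both conditions must hold.
    virtual bool enablePostRAMachineScheduler() const {
        return enableMachineScheduler() && enablePostRAScheduler();
    }
};

// Assumed example: an itinerary-based CPU that keeps the old list scheduler.
struct ItineraryCpuSketch : SubtargetSketch {
    bool enablePostRAScheduler() const override { return true; }
};

// Assumed example: a CPU that uses the machine scheduler before and after RA.
struct SchedModelCpuSketch : SubtargetSketch {
    bool enableMachineScheduler() const override { return true; }
    bool enablePostRAScheduler() const override { return true; }
};

enum class PostRAChoice { None, ListScheduler, MachineScheduler };

// Runtime decision between the two post-RA schedulers for one subtarget.
PostRAChoice pickPostRAScheduler(const SubtargetSketch &st) {
    if (st.enablePostRAMachineScheduler())
        return PostRAChoice::MachineScheduler;
    if (st.enablePostRAScheduler())
        return PostRAChoice::ListScheduler;
    return PostRAChoice::None;
}

// Usage example for the sketch.
inline void demoPostRAChoice() {
    assert(pickPostRAScheduler(ItineraryCpuSketch{}) == PostRAChoice::ListScheduler);
    assert(pickPostRAScheduler(SchedModelCpuSketch{}) == PostRAChoice::MachineScheduler);
}
```

Because the decision is taken per subtarget at runtime, a pipeline built from a default-constructed subtarget (the LTO situation mentioned above) would no longer fix the choice too early.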
*	lldb/breakpad: add support for the "x86_64h" architecture	Pavel Labath	2019-11-05	2	-2/+2
\|
*	Revert and patch "[Python] Remove readline module"	serge-sans-paille	2019-11-05	4	-0/+124
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix https://bugs.llvm.org/show_bug.cgi?id=43830 while avoiding polluting the global Python namespace. This reverts r357277 to rebundle a version of Python's readline module based on libedit. However, this patch also provides three improvements over the previous implementation: 1. use PyMem_RawMalloc instead of PyMem_Malloc, as expected by PyOS_Readline (prevents a segfault upon exit of an interactive session) 2. patch the readline module upon embedded interpreter loading, instead of patching it globally, which should prevent any side effects on other modules/packages 3. only activate the patched module if libedit is actually linked in lldb Differential Revision: https://reviews.llvm.org/D69793
*	[OpenCL] Group builtin functions by prototype	Sven van Haastregt	2019-11-05	1	-13/+136
\| \| \| \| \| \| \| \| \| \|	The TableGen-generated file containing the function definitions can be reorganized to save some memory in the Clang binary. Functions having the same prototype(s) will point to a shared list of prototype(s). Patch by Pierre Gondois and Sven van Haastregt. Differential Revision: https://reviews.llvm.org/D63557
*	[OpenCL] Add builtin function attribute handling	Sven van Haastregt	2019-11-05	4	-43/+97
\| \| \| \| \| \| \| \| \|	Add handling for the "pure", "const" and "convergent" function attributes for OpenCL builtin functions. Patch by Pierre Gondois and Sven van Haastregt. Differential Revision: https://reviews.llvm.org/D64319
*	lldb/minidump: Add support for the alternate ARM64 constant	Pavel Labath	2019-11-05	2	-1/+2
\|
*	MemoryRegion: Print "don't know" permission values as such	Pavel Labath	2019-11-05	4	-25/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The permissions in a memory region have ternary states (yes, no, don't know), but the memory region command only prints in binary, treating "don't know" as "yes", which is particularly confusing as for instance the unwinder will treat an unknown value as "no". This patch makes it so that we distinguish all three states when printing the values, using "?" to indicate the lack of information. It is implemented via a special argument to the format provider for the OptionalBool enumeration. Reviewers: clayborg, jingham Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D69106
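A minimal sketch of the three-state printing idea from the message above. The names `TriState`, `permissionChar` and `formatPermissions` are invented for illustration, and the use of `-` for "no" is an assumption; the actual patch works through lldb's format provider for `OptionalBool`, which is not reproduced here.

```cpp
#include <cassert>
#include <string>

// Hypothetical three-state value, standing in for lldb's OptionalBool.
enum class TriState { DontKnow = -1, No = 0, Yes = 1 };

// Map one permission to a character: the given letter for "yes",
// '-' for "no" (assumed), and '?' when the value is unknown.
char permissionChar(TriState value, char yesChar) {
    switch (value) {
    case TriState::Yes:
        return yesChar;
    case TriState::No:
        return '-';
    case TriState::DontKnow:
        return '?';
    }
    return '?'; // Unreachable for valid enumerators.
}

// Build an "rwx"-style string that keeps all three states distinct.
std::string formatPermissions(TriState r, TriState w, TriState x) {
    std::string out;
    out += permissionChar(r, 'r');
    out += permissionChar(w, 'w');
    out += permissionChar(x, 'x');
    return out;
}

// Usage example for the sketch.
inline void demoFormatPermissions() {
    assert(formatPermissions(TriState::Yes, TriState::No, TriState::DontKnow) == "r-?");
}
```

The point of the separate `?` is that an unknown permission no longer looks identical to a granted one.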
*	[LoopUnroll] peel-loop-conditions.ll: add some 'is even/odd' peeling tests	Roman Lebedev	2019-11-05	1	-0/+98
\|
*	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): truncation (PR42563)	Roman Lebedev	2019-11-05	12	-110/+141
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: That fold keeps growing and growing :( I think this may be one of the last pieces for it. Since D67677/D67725, the fold knows the general form of the pattern - where some masking is needed: https://rise4fun.com/Alive/F5R https://rise4fun.com/Alive/gslRa But there is one more huge piece missing - if you are extracting some bits, it is not impossible that the origin is wider than the extraction, i.e. there may be a truncation. And we don't deal with that yet. But we can, and the generalization remains fully identical: https://rise4fun.com/Alive/Uar https://rise4fun.com/Alive/5SW After a preparatory cleanup I think the diff looks rather clean. One missing piece is that in some patterns (especially pat. b), `-1` only needs to be `-1` in the final type, but that is for later. https://bugs.llvm.org/show_bug.cgi?id=42563 Reviewers: spatel, nikic Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69125
*	[RISCV] Add InstrInfo areMemAccessesTriviallyDisjoint hook	Luís Marques	2019-11-05	3	-0/+89
\| \| \| \| \| \| \| \| \| \| \|	Summary: Introduces the `InstrInfo::areMemAccessesTriviallyDisjoint` hook. The test could check for instruction reorderings, but to avoid being brittle it just checks instruction dependencies. Reviewers: asb, lenary Reviewed By: lenary Tags: #llvm Differential Revision: https://reviews.llvm.org/D67046
*	DWARFDebugLoclists: Make it possible to read relocated addresses	Pavel Labath	2019-11-05	5	-19/+142
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Handling relocations was not needed when the loclists section was a DWO-only thing. But since DWARF5, it is possible to use it in regular objects too, and the standard permits embedding addresses into the section directly. These addresses need to be relocated in unlinked files. Reviewers: JDevlieghere, dblaikie, probinson Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68271
*	[mips] Set __OCTEON__ macros	Simon Atanasyan	2019-11-05	2	-0/+4
\|
*	[mips] Fix `__mips_isa_rev` macros value for Octeon CPU	Simon Atanasyan	2019-11-05	2	-1/+10
\|
*	Recommit "[HardwareLoops] Optimisation remarks"	Sjoerd Meijer	2019-11-05	3	-26/+107
\| \| \| \| \| \| \| \| \|	With a few things fixed: - initialisation of the optimisation remark pass (this was causing the buildbot failures on PPC), - a test case. Differential Revision: https://reviews.llvm.org/D69660
*	[AArch64] Update test checks on merge-store-dependency.ll. NFC	David Green	2019-11-05	1	-4/+42
\|
*	[lldb][NFC] Give some parameters in CommandInterpreter more descriptive names	Raphael Isemann	2019-11-05	2	-9/+9
\|
*	[IR] Remove switch's default block that causes clang 8 to raise an error	aqjune	2019-11-05	1	-2/+0
\|
*	[X86] Lower the cost of avx512 horizontal bool and/or reductions to 2*log2(bitwidth)+1 for legal types.	Craig Topper	2019-11-04	3	-42/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This better represents the kshift+binop we'd get for each stage before the final extract. It's likely we'll do even better by doing a kmov and a cmp with a GPR, but this is a good start. The default handling was costing a worst case single source permute shuffle of the vector before the binop. This worst case assumes the shuffle might have to be emulated with extracts and inserts. But since we know we're doing a reduction we can assume we'll get kshift lowering. There's still some room for improvement here, but this is much better than it was.
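The cost formula in this commit's subject can be worked through with a small sketch. It assumes the bit width is a power of two and reads the formula as log2(bitwidth) halving stages, each costing a kshift plus a binop, followed by one final extract. `log2PowerOfTwo` and `boolReductionCostSketch` are hypothetical helper names, not functions from the X86 cost model.

```cpp
#include <cassert>

// Integer log2 for a power-of-two input (assumed precondition).
constexpr unsigned log2PowerOfTwo(unsigned n) {
    unsigned result = 0;
    while (n > 1) {
        n >>= 1;
        ++result;
    }
    return result;
}

// 2 * log2(bitWidth) + 1: two operations (kshift + binop) per halving
// stage, plus one for the final extract.
constexpr unsigned boolReductionCostSketch(unsigned bitWidth) {
    return 2 * log2PowerOfTwo(bitWidth) + 1;
}

// Usage example for the sketch.
inline void demoBoolReductionCost() {
    assert(boolReductionCostSketch(16) == 9); // 4 stages * 2 + 1
}
```

Under these assumptions a 16-bit mask reduction costs 9 and a 64-bit one costs 13.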
*	[IR] Add Freeze instruction	aqjune	2019-11-05	25	-135/+242
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: - Define Instruction::Freeze, let it be UnaryOperator - Add support for freeze to LLLexer/LLParser/BitcodeReader/BitcodeWriter The format is `%x = freeze <ty> %v` - Add support for freeze instruction to llvm-c interface. - Add m_Freeze in PatternMatch. - Erase freeze when lowering IR to SelDag. Reviewers: deadalnix, hfinkel, efriedma, lebedev.ri, nlopes, jdoerfert, regehr, filcab, delcypher, whitequark Reviewed By: lebedev.ri, jdoerfert Subscribers: jfb, kristof.beyls, hiraditya, lebedev.ri, steven_wu, dexonsmith, xbolva00, delcypher, spatel, regehr, trentxintong, vsk, filcab, nlopes, mehdi_amini, deadalnix, llvm-commits Differential Revision: https://reviews.llvm.org/D29011
*	[BPF] fix a use after free bug	Yonghong Song	2019-11-04	1	-2/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commit fff2721286e1 ("[BPF] Fix CO-RE bugs with bitfields") fixed CO-RE handling bitfield issues. But the implementation introduced a use after free bug. The "Base" of the intrinsic might be freed so later on accessing the Type of "Base" might access the freed memory. The failed test case, CodeGen/BPF/CORE/offset-reloc-middle-chain.ll is exactly used to test such a case. Similarly to previous attempt to remember Metadata etc, remember "Base" pointee Alignment in advance to avoid such use after free bug.
*	[X86] Teach X86MCInstLower to swap operands of commutable instructions to enable 2-byte VEX encoding.	Craig Topper	2019-11-04	30	-233/+279
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The 2 source operands commutable instructions are encoded in the VEX.VVVV field and the r/m field of the MODRM byte plus the VEX.B field. The VEX.B field is missing from the 2-byte VEX encoding. If the VEX.VVVV source is 0-7 and the other register is 8-15 we can swap them to avoid needing the VEX.B field. This works as long as the VEX.W, VEX.mmmmm, and VEX.X fields are also not needed. Fixes PR36706. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68550
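The swap condition described in the summary above can be sketched as a pair of predicates. This is a simplified model, not the X86MCInstLower code: `VexOperandsSketch` and its fields are hypothetical, registers are plain numbers 0-15, and the W, X and opcode-map requirements are reduced to booleans.

```cpp
#include <cassert>

// Hypothetical, simplified view of a commutable two-source VEX instruction.
struct VexOperandsSketch {
    unsigned vvvvReg;        // Source encoded in VEX.VVVV (0-15).
    unsigned rmReg;          // Source encoded in ModRM.r/m, extended by VEX.B (0-15).
    bool needsW;             // Instruction requires VEX.W.
    bool needsX;             // Instruction requires VEX.X.
    bool needsNonDefaultMap; // Instruction requires a VEX.mmmmm value the 2-byte form cannot express.
};

// The 2-byte form has no B, W, X or mmmmm fields, so none of them may be needed.
constexpr bool fitsTwoByteVex(const VexOperandsSketch &op) {
    return op.rmReg < 8 && !op.needsW && !op.needsX && !op.needsNonDefaultMap;
}

// Swapping helps when the r/m register needs VEX.B but the VVVV register would not.
constexpr bool shouldCommute(const VexOperandsSketch &op) {
    return op.vvvvReg < 8 && op.rmReg >= 8 && !op.needsW && !op.needsX &&
           !op.needsNonDefaultMap;
}

// Swap the two commutable sources.
constexpr VexOperandsSketch commute(const VexOperandsSketch &op) {
    return VexOperandsSketch{op.rmReg, op.vvvvReg, op.needsW, op.needsX,
                             op.needsNonDefaultMap};
}

// Usage example for the sketch.
inline void demoVexCommute() {
    const VexOperandsSketch op{3, 12, false, false, false};
    assert(!fitsTwoByteVex(op));         // r/m register 12 would need VEX.B.
    assert(shouldCommute(op));
    assert(fitsTwoByteVex(commute(op))); // After the swap, r/m register is 3.
}
```

If both sources are in the 8-15 range, swapping cannot help, so `shouldCommute` returns false and the 3-byte form stays.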
*	[analyzer] Require darwin for scan-build tests	Devin Coughlin	2019-11-04	5	-4/+6
\| \| \| \| \|	Let's at least get some coverage from these tests. We can generalize to other platforms later.
*	[analyzer] Fixup scan-build tests for non-Darwin platforms.	Devin Coughlin	2019-11-04	5	-1/+8
\| \| \| \| \|	This is a fix to 0aba69eb1a01c44185009f50cc633e3c648e9950 to address failing bots.
*	Fix clone_constant_impl to correctly deal with null pointers	aqjune	2019-11-05	2	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch resolves llvm-c-test's following error ``` LLVM ERROR: LLVMGetValueKind returned incorrect type ``` which arises when the input bitcode contains a null pointer. Reviewers: jdoerfert, CodaFi, deadalnix Reviewed By: jdoerfert Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68928
*	[analyzer] Add test directory for scan-build.	Devin Coughlin	2019-11-04	10	-1/+143
\| \| \| \| \| \| \| \| \| \|	The static analyzer's scan-build script is critical infrastructure but is not well tested. To start to address this, add a new test directory under tests/Analysis for scan-build lit tests and seed it with several tests. The goal is that future scan-build changes will be accompanied by corresponding tests. Differential Revision: https://reviews.llvm.org/D69781
*	[CUDA][HIP] Disable emitting llvm.linker.options in device compilation	Yaxun (Sam) Liu	2019-11-04	3	-4/+42
\| \| \| \| \| \| \|	The linker options (e.g. pragma detect_mismatch) are intended for host compilation only, therefore disable it for device compilation. Differential Revision: https://reviews.llvm.org/D57829
*	[BPF] Fix CO-RE bugs with bitfields	Yonghong Song	2019-11-04	3	-38/+284
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bitfield handling is not robust with the current implementation. I have seen two issues as described below. Issue 1: struct s { long long f1; char f2; char b1:1; } *p; The current approach will generate an access bit size 56 (from b1 to the end of structure) which will be rejected as it is not a power of 2. Issue 2: struct s { char f1; char b1:3; char b2:5; char b3:6; char b4:2; char f2; }; LLVM will group the 4 bitfields together with 2 bytes. But loading 2 bytes is not correct as it violates the alignment requirement. Note that sometimes LLVM breaks a large bitfield group into multiple groups, but not in this case. To resolve the above two issues, this patch takes a different approach. The alignment for the structure is used to construct the offset of the bitfield access. The bitfield incurred memory access is an aligned memory access with alignment/size equal to the alignment of the structure. This also simplified the code. This may not be the optimal memory access in terms of memory access width. But this should be okay since extracting the bitfield value will have the same amount of work regardless of what kind of memory access width. Differential Revision: https://reviews.llvm.org/D69837
*	Optimize std::midpoint for integers	Jorg Brown	2019-11-04	1	-10/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Same idea as the current algorithm, that is, add (half of the difference between a and b) to a. But we use a different technique for computing the difference: we compute b - a into a pair of integers that are named "sign_bit" and "diff". We have to use a pair because subtracting two 32-bit integers produces a 33-bit result. Computing half of that is a simple matter of shifting diff right by 1, and adding sign_bit shifted left by 31. llvm knows how to do that with one instruction: shld. The only tricky part is that if the difference is odd and negative, then shifting it by one isn't the same as dividing it by two - shifting a negative one produces a negative one, for example. So there's one more adjustment: if the sign bit and the low bit of diff are one, we add one. For a demonstration of the codegen difference, see https://godbolt.org/z/7ar3K9 , which also has a built-in test. Differential Revision: https://reviews.llvm.org/D69459
*	[cmake] Add an option to skip stripping before install	Vedant Kumar	2019-11-04	2	-7/+10
\| \| \| \| \| \|	The swift build system has support for cross-compiling, installing, and generating symbols for lldb. As the swift symbol-generation step occurs after installation, we need to disable stripping during the install.
*	build: explicitly set the linker language for unwind	Saleem Abdulrasool	2019-11-04	1	-0/+2
\| \| \| \| \| \|	The unwinder should not depend on libc++. In fact, we do not end up with a link against libc++ as we do not have a dependency on libc++ at runtime. This ensures that we link with `clang` rather than `clang++`.
*	[CGDebugInfo] Emit subprograms for decls when AT_tail_call is understood	Vedant Kumar	2019-11-04	4	-10/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, clang emits subprograms for declared functions when the target debugger or DWARF standard is known to support entry values (DW_OP_entry_value & the GNU equivalent). Treat DW_AT_tail_call the same way to allow debuggers to follow cross-TU tail calls. Pre-patch debug session with a cross-TU tail call: ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` Post-patch (note that the tail-calling frame, "helper", is visible): ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f80 main`helper [opt] [artificial] frame #2: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` rdar://46577651 Differential Revision: https://reviews.llvm.org/D69743
*	Test commit: adds a . to comment. NFC	Ron Lieberman	2019-11-04	1	-1/+1
\|