bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[NVPTX] Select atomic loads and stores	Jonas Hahnfeld	2018-08-09	1	-0/+88
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	According to PTX ISA .volatile has the same memory synchronization semantics as .relaxed.sys, so it can be used to implement monotonic atomic loads and stores. This is important for OpenMP's atomic construct where - 'read's and 'write's are lowered to atomic loads and stores, and - an update of float or double types are lowered into a cmpxchg loop. (Note that PTX could do better because it has atom.add.f{32,64} but LLVM's atomicrmw instruction only allows integer types.) Higher levels of atomicity (like acquire and release) need additional synchronization properties which were added with PTX ISA 6.0 / sm_70. So using these instructions still results in an error. Differential Revision: https://reviews.llvm.org/D50391 llvm-svn: 339316
*	[RISCV] Add "lla" pseudo-instruction to assembler	Roger Ferrer Ibanez	2018-08-09	2	-0/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This pseudo-instruction is similar to la but uses PC-relative addressing unconditionally. This is, la is only different to lla when using -fPIC. This pseudo-instruction seems often forgotten in several specs but it is definitely mentioned in binutils opcodes/riscv-opc.c. The semantics are defined both in page 37 of the "RISC-V Reader" book but also in function macro found in gas/config/tc-riscv.c. This is a very first step towards adding PIC support for Linux in the RISC-V backend. The lla pseudo-instruction expands to a sequence of auipc + addi with a couple of pc-rel relocations where the second points to the first one. This is described in https://github.com/riscv/riscv-elf-psabi-doc/blob/master/riscv-elf.md#pc-relative-symbol-addresses For now, this patch only introduces support of that pseudo instruction at the assembler parser. Differential Revision: https://reviews.llvm.org/D49661 llvm-svn: 339314
*	[LICM] Add tests for future hoisting of fence instructions [NFC]	Philip Reames	2018-08-09	1	-0/+118
\| \| \| \| \| \|	The main interesting case is a fence in an otherwise dead loop or one containing only arithmetic. This can happen as a result of DSE or other transforms from seemingly reasonable initial IR. llvm-svn: 339310
*	[CMake] Use normalized Windows target triples	Petr Hosek	2018-08-09	33	-37/+37
\| \| \| \| \| \| \| \| \| \| \|	Changes the default Windows target triple returned by GetHostTriple.cmake from the old environment names (which we wanted to move away from) to newer, normalized ones. This also requires updating all tests to use the new systems names in constraints. Differential Revision: https://reviews.llvm.org/D47381 llvm-svn: 339307
*	[DWARF] Verifier now handles .debug_types sections.	Paul Robinson	2018-08-08	1	-2/+6
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50466 llvm-svn: 339302
*	[x86] add test for commuted variant for fsub fold; NFC	Sanjay Patel	2018-08-08	1	-2/+21
\| \| \| \|	llvm-svn: 339300
*	[DAGCombiner] loosen constraints for fsub+fadd fold	Sanjay Patel	2018-08-08	1	-24/+44
\| \| \| \| \| \| \| \| \|	isNegatibleForFree() should not matter here (as the test diffs show) because it's always a win to replace an fsub+fadd with fneg. The problem in D50195 persists because either (1) we are doing these folds in the wrong order or (2) we're missing another fold for fadd. llvm-svn: 339299
*	[ADT] Normalize empty triple components	Petr Hosek	2018-08-08	1	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	LLVM triple normalization is handling "unknown" and empty components differently; for example given "x86_64-unknown-linux-gnu" and "x86_64-linux-gnu" which should be equivalent, triple normalization returns "x86_64-unknown-linux-gnu" and "x86_64--linux-gnu". autoconf's config.sub returns "x86_64-unknown-linux-gnu" for both "x86_64-linux-gnu" and "x86_64-unknown-linux-gnu". This changes the triple normalization to behave the same way, replacing empty triple components with "unknown". This addresses PR37129. Differential Revision: https://reviews.llvm.org/D50219 llvm-svn: 339294
*	[x86] add tests for fsub+fadd with FMF; NFC	Sanjay Patel	2018-08-08	1	-3/+69
\| \| \| \| \| \|	These are related to the block of code under review in D50195. llvm-svn: 339293
*	[DWARF] Unclamp line table version on Darwin for v5 and later.	Jonas Devlieghere	2018-08-08	4	-10/+6
\| \| \| \| \| \| \| \| \|	On Darwin we pin the DWARF line tables to version 2. Stop doing so for DWARF v5 and later. Differential revision: https://reviews.llvm.org/D49381 llvm-svn: 339288
*	[ARM] Avoid spilling lr with Thumb1 tail calls.	Eli Friedman	2018-08-08	1	-30/+137
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Normally, if any registers are spilled, we prefer to spill lr on Thumb1 so we can fold the "bx lr" into the "pop". However, if there are tail calls involved, restoring lr is expensive, so skip the optimization in that case. The spill of r7 in the new test also isn't necessary, but that's mostly orthogonal to this patch. (It's the same code in ARMFrameLowering, but it's not related to tail calls.) Differential Revision: https://reviews.llvm.org/D49459 llvm-svn: 339283
*	revert tests of '[CodeGen] emit inline asm clobber list warnings for reserved'	Ties Stuij	2018-08-08	3	-43/+0
\| \| \| \|	llvm-svn: 339276
*	[MS Demangler] Create a new backref context for template instantiations.	Zachary Turner	2018-08-08	1	-4/+2
\| \| \| \| \| \| \| \|	Template manglings use a fresh back-referencing context, so we need to do the same. This fixes several existing tests which are marked as FIXME, so those are now actually run. llvm-svn: 339275
*	[Hexagon] Diagnose misaligned absolute loads and stores	Krzysztof Parzyszek	2018-08-08	2	-0/+76
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50405 llvm-svn: 339272
*	AMDGPU: Error more gracefully on libcalls	Matt Arsenault	2018-08-08	1	-0/+7
\| \| \| \| \| \| \|	I think this is the only situation where the callsite will have a null instruction. llvm-svn: 339271
*	AMDGPU: Fix shifts for i128	Matt Arsenault	2018-08-08	1	-0/+1047
\| \| \| \|	llvm-svn: 339270
*	[WASM] Fix overflow when reading custom section	Jonas Devlieghere	2018-08-08	2	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	When reading a custom WASM section, it was possible that its name extended beyond the size of the section. This resulted in a bogus value for the section size due to the size overflowing. Fixes heap buffer overflow detected by OSS-fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=8190 Differential revision: https://reviews.llvm.org/D50387 llvm-svn: 339269
*	[DebugInfo] Fine tune emitting flags as part of the producer	Jonas Devlieghere	2018-08-08	1	-3/+7
\| \| \| \| \| \| \| \| \|	When using APPLE extensions, don't duplicate the compiler invocation's flags both in AT_producer and AT_APPLE_flags. Differential revision: https://reviews.llvm.org/D50453 llvm-svn: 339268
*	[InstCombine] fold fadd+fsub with common operand	Sanjay Patel	2018-08-08	1	-10/+8
\| \| \| \| \| \| \|	This is a sibling to the simplify from: https://reviews.llvm.org/rL339174 llvm-svn: 339267
*	[InstCombine] fold fsub+fsub with common operand	Sanjay Patel	2018-08-08	1	-4/+3
\| \| \| \| \| \| \|	This is a sibling to the simplify from: rL339171 llvm-svn: 339266
*	[InstCombine] add tests for fsub folds; NFC	Sanjay Patel	2018-08-08	1	-68/+137
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The scalar cases are handled in instcombine's internal reassociation pass for FP ops, but it misses the vector types. These patterns are similar to what was handled in InstSimplify in: https://reviews.llvm.org/rL339171 https://reviews.llvm.org/rL339174 https://reviews.llvm.org/rL339176 ...but we can't use instsimplify on these because we require negation of the original operand. llvm-svn: 339263
*	[PowerPC] Improve codegen for vector loads using scalar_to_vector	Zaara Syeda	2018-08-08	11	-219/+1444
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch aims to improve the codegen for vector loads involving the scalar_to_vector (load X) sequence. Initially, ld->mv instructions were used for scalar_to_vector (load X), so this patch allows scalar_to_vector (load X) to utilize: LXSD and LXSDX for i64 and f64 LXSIWAX for i32 (sign extension to i64) LXSIWZX for i32 and f64 Committing on behalf of Amy Kwan. Differential Revision: https://reviews.llvm.org/D48950 llvm-svn: 339260
*	[CodeGen] emit inline asm clobber list warnings for reserved	Ties Stuij	2018-08-08	3	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently, in line with GCC, when specifying reserved registers like sp or pc on an inline asm() clobber list, we don't always preserve the original value across the statement. And in general, overwriting reserved registers can have surprising results. For example: ``` extern int bar(int[]); int foo(int i) { int a[i]; // VLA asm volatile( "mov r7, #1" : : : "r7" ); return 1 + bar(a); } ``` Compiled for thumb, this gives: ``` $ clang --target=arm-arm-none-eabi -march=armv7a -c test.c -o - -S -O1 -mthumb ... foo: .fnstart @ %bb.0: @ %entry .save {r4, r5, r6, r7, lr} push {r4, r5, r6, r7, lr} .setfp r7, sp, #12 add r7, sp, #12 .pad #4 sub sp, #4 movs r1, #7 add.w r0, r1, r0, lsl #2 bic r0, r0, #7 sub.w r0, sp, r0 mov sp, r0 @APP mov.w r7, #1 @NO_APP bl bar adds r0, #1 sub.w r4, r7, #12 mov sp, r4 pop {r4, r5, r6, r7, pc} ... ``` r7 is used as the frame pointer for thumb targets, and this function needs to restore the SP from the FP because of the variable-length stack allocation a. r7 is clobbered by the inline assembly (and r7 is included in the clobber list), but LLVM does not preserve the value of the frame pointer across the assembly block. This type of behavior is similar to GCC's and has been discussed on the bugtracker: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=11807 . No consensus seemed to have been reached on the way forward. Clang behavior has briefly been discussed on the CFE mailing (starting here: http://lists.llvm.org/pipermail/cfe-dev/2018-July/058392.html). I've opted for following Eli Friedman's advice to print warnings when there are reserved registers on the clobber list so as not to diverge from GCC behavior for now. The patch uses MachineRegisterInfo's target-specific knowledge of reserved registers, just before we convert the inline asm string in the AsmPrinter. If we find a reserved register, we print a warning: ``` repro.c:6:7: warning: inline asm clobber list contains reserved registers: R7 [-Winline-asm] "mov r7, #1" ^ ``` Reviewers: eli.friedman, olista01, javed.absar, efriedma Reviewed By: efriedma Subscribers: efriedma, eraman, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D49727 llvm-svn: 339257
*	[RISCV] Add mnemonic alias: move, sbreak and scall.	Alex Bradbury	2018-08-08	1	-0/+11
\| \| \| \| \| \| \| \| \|	Further improve compatibility with the GNU assembler. Differential Revision: https://reviews.llvm.org/D50217 Patch by Kito Cheng. llvm-svn: 339255
*	[TargetLowering] BuildUDIV - Add support for divide by one (PR38477)	Simon Pilgrim	2018-08-08	1	-85/+19
\| \| \| \| \| \| \| \|	Provide a pass-through of the numerator for divide by one cases - this is the same approach we take in DAGCombiner::visitSDIVLike. I investigated whether we could achieve this by magic MULHU/SRL values but nothing appeared to work as we don't have a way for MULHU(x,c) -> x llvm-svn: 339254
*	[RISCV] Add InstAlias definitions for add[w], and, xor, or, sll[w], srl[w], ↵	Alex Bradbury	2018-08-08	5	-1/+81
\| \| \| \| \| \| \| \| \| \| \| \|	sra[w], slt and sltu with immediate Match the GNU assembler in supporting immediate operands for these instructions even when the reg-reg mnemonic is used. Differential Revision: https://reviews.llvm.org/D50046 Patch by Kito Cheng. llvm-svn: 339252
*	[ARM][NFC] Replaced tab-characters in test file vtrn.ll	Sjoerd Meijer	2018-08-08	1	-100/+100
\| \| \| \|	llvm-svn: 339251
*	[InstCombine] fold fneg into constant operand of fmul/fdiv	Sanjay Patel	2018-08-08	2	-23/+14
\| \| \| \| \| \| \| \| \| \| \| \|	This accounts for the missing IR fold noted in D50195. We don't need any fast-math to enable the negation transform. FP negation can always be folded into an fmul/fdiv constant to eliminate the fneg. I've limited this to one-use to ensure that we are eliminating an instruction rather than replacing fneg by a potentially expensive fdiv or fmul. Differential Revision: https://reviews.llvm.org/D50417 llvm-svn: 339248
*	[X86][SSE] PR38477 test is more cleanly tested with udiv instead of urem	Simon Pilgrim	2018-08-08	1	-110/+78
\| \| \| \| \| \|	Making the test use urem relies on it calling udiv-like combines, but the real issue is with the udiv so we're better off using that directly. llvm-svn: 339247
*	[InstCombine] De Morgan: sink 'not' into 'xor' (PR38446)	Roman Lebedev	2018-08-08	1	-9/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: https://rise4fun.com/Alive/IT3 Comes up in the [most ugliest] `signed int` -> `signed char` case of `-fsanitize=implicit-conversion` (https://reviews.llvm.org/D50250) Previously, we were stuck with `not`: {F6867736} But now we are able to completely get rid of it: {F6867737} (FIXME: why are we loosing the metadata? that seems wrong/strange.) Here, we only want to do that it we will be able to completely get rid of that 'not'. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: vsk, erichkeane, llvm-commits Differential Revision: https://reviews.llvm.org/D50301 llvm-svn: 339243
*	[ARM] FP16: codegen support for VEXT	Sjoerd Meijer	2018-08-08	1	-12/+18
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50427 llvm-svn: 339241
*	[ARM] FP16: vector vmov and vdup support	Sjoerd Meijer	2018-08-08	1	-52/+72
\| \| \| \| \| \| \| \|	This adds codegen support for the vmov_n_f16 and vdup_n_f16 variants. Differential Revision: https://reviews.llvm.org/D50329 llvm-svn: 339238
*	[ARM] FP16: vector VMUL variants	Sjoerd Meijer	2018-08-08	1	-34/+44
\| \| \| \| \| \| \| \|	This adds codegen support for the vmul_lane_f16 and vmul_n_f16 variants. Differential Revision: https://reviews.llvm.org/D50326 llvm-svn: 339232
*	[X86][SSE] Add divide-by-one exact sdiv vector test	Simon Pilgrim	2018-08-08	1	-0/+34
\| \| \| \| \| \|	Based on PR38477, we need to ensure we're testing for divide-by-one in non-uniform vectors llvm-svn: 339231
*	[TargetLowering] BuildUDIV - Early out for divide by one (PR38477)	Simon Pilgrim	2018-08-08	1	-0/+128
\| \| \| \| \| \|	We're not handling the UDIV by one special case properly - for now just early out. llvm-svn: 339229
*	[ARM] FP16: support vector INT_TO_FP and FP_TO_INT	Sjoerd Meijer	2018-08-08	1	-43/+65
\| \| \| \| \| \| \| \|	This adds codegen support for the different vcvt_f16 variants. Differential Revision: https://reviews.llvm.org/D50393 llvm-svn: 339227
*	Support inline asm with multiple 64bit output in 32bit GPR	Thomas Preud'homme	2018-08-08	2	-122/+307
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extend fix for PR34170 to support inline assembly with multiple output operands that do not naturally go in the register class it is constrained to (eg. double in a 32-bit GPR as in the PR). Reviewers: bogner, t.p.northover, lattner, javed.absar, efriedma Reviewed By: efriedma Subscribers: efriedma, tra, eraman, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D45437 llvm-svn: 339225
*	[NFC][InstCombine] Cleanup demorgan-sink-not-into-xor.ll test	Roman Lebedev	2018-08-08	1	-62/+33
\| \| \| \| \| \|	We are only going to do it if it is free to do. llvm-svn: 339223
*	[ARM] FP16: support the vector vmin and vmax variants	Sjoerd Meijer	2018-08-08	2	-32/+350
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D50238 llvm-svn: 339221
*	[NFC] Add some tests on mustexec	Max Kazantsev	2018-08-08	1	-0/+149
\| \| \| \|	llvm-svn: 339219
*	[MS Demangler] Properly handle backreferencing of special names.	Zachary Turner	2018-08-08	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Function template names are not stored in the backref table, but non-template function names are. The general pattern seems to be that when you are demangling a symbol name, if the name starts with '?' it does not go into the backreference table, otherwise it does. Note that this even handles the general case of operator names (template or otherwise) not going into the back-reference table, anonymous namespaces not going into the backreference table, etc. It's important that we apply this check only for the unqualified portion of a name, and only for symbol names. For example, this does not apply to type names (such as class templates) and we need to make sure that these still do go into the backref table. Differential Revision: https://reviews.llvm.org/D50394 llvm-svn: 339211
*	[InstCombine] add tests for fneg fold including FMF; NFC	Sanjay Patel	2018-08-07	1	-3/+42
\| \| \| \|	llvm-svn: 339203
*	[InstCombine] fix FP constant in test; NFC	Sanjay Patel	2018-08-07	1	-2/+2
\| \| \| \| \| \|	Too many digits... llvm-svn: 339200
*	[NFC] adding tests for Y - (X + Y) --> -X	Michael Berg	2018-08-07	2	-0/+26
\| \| \| \|	llvm-svn: 339197
*	[InstCombine] add tests for fneg of fmul/fdiv with constant; NFC	Sanjay Patel	2018-08-07	1	-0/+128
\| \| \| \|	llvm-svn: 339195
*	AMDGPU: Remove broken i16 ternary patterns	Jan Vesely	2018-08-07	1	-3/+90
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixup test to check for GCN prefix These patterns always zero extend the result even though it might need sign extension. This has been broken since the addition of i16 support. It has popped up in mad_sat(char) test since min(max()) combination is turned into v_med3, resulting in the following (incorrect) sequence: v_mad_i16 v2, v10, v9, v11 v_med3_i32 v2, v2, v8, v7 Fixes mad_sat(char) piglit on VI. Differential Revision: https://reviews.llvm.org/D49836 llvm-svn: 339190
*	[WebAssembly] Update SIMD binary arithmetic	Derek Schuff	2018-08-07	1	-4/+93
\| \| \| \| \| \| \| \| \| \| \| \|	Add missing SIMD types (v2f64) and binary ops. Also adds tablegen support for automatically prepending prefix byte to SIMD opcodes. Differential Revision: https://reviews.llvm.org/D50292 Patch by Thomas Lively llvm-svn: 339186
*	[Hexagon] Allow use of gather intrinsics even with no-packets	Krzysztof Parzyszek	2018-08-07	1	-5/+26
\| \| \| \| \| \| \| \| \|	Vgather requires must be in a packet with a store, which contradicts the no-packets feature. As a consequence, gather/scatter could not be used with no-packets. Relax this, and allow gather packets as exceptions to the no-packets requirements. llvm-svn: 339177
*	[InstSimplify] fold fsub+fadd with common operand	Sanjay Patel	2018-08-07	1	-6/+2
\| \| \| \|	llvm-svn: 339176
*	[InstSimplify] fold fadd+fsub with common operand	Sanjay Patel	2018-08-07	1	-6/+2
\| \| \| \|	llvm-svn: 339174