bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[AMDGPU][MC] Corrected V_QSAD instructions to check that dest register is ↵	Dmitry Preobrazhensky	2017-06-21	3	-5/+84
\| \| \| \| \| \| \| \| \| \| \| \|	different than any of the src See Bug 33279: https://bugs.llvm.org//show_bug.cgi?id=33279 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D34003 llvm-svn: 305915
*	[x86] fix formatting; NFC	Sanjay Patel	2017-06-21	1	-15/+13
\| \| \| \|	llvm-svn: 305914
*	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics.	Christof Douma	2017-06-21	4	-5/+114
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implemented support to AArch64 codegen for ARMv8.1 Large System Extensions atomic instructions. Where supported, these instructions can provide atomic operations with higher performance. Currently supported operations include: fetch_add, fetch_or, fetch_xor, fetch_smin, fetch_min/max (signed and unsigned), swap, and compare_exchange. This implementation implies sequential-consistency ordering, more relaxed ordering is under development. Subtarget->hasLSE is currently supported for Cavium ThunderX2T99. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: I82f6d3d64255622791ceb0715b7ab9f4dc4d4b2c llvm-svn: 305893
*	[AArch64] Add early exit to promoteLoadFromStore.	Florian Hahn	2017-06-21	1	-1/+4
\| \| \| \| \| \| \| \|	There should be at most a single kill flag for the promoted operand between the store/load pair. Discussed in https://reviews.llvm.org/D34402. llvm-svn: 305889
*	[MIPS] Fix for selecting of DINS/INS instruction	Strahinja Petrovic	2017-06-21	1	-0/+5
\| \| \| \| \| \| \| \| \| \|	This patch adds one more condition in selection DINS/INS instruction, which fixes MultiSource/Applications/JM/ldecod/ for mips32r2 (and mips64r2 n32 abi). Differential Revision: https://reviews.llvm.org/D33725 llvm-svn: 305888
*	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions	Sam Kolton	2017-06-21	15	-281/+323
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously there were two separate pseudo instruction for SDWA on VI and on GFX9. Created one pseudo instruction that is union of both of them. Added verifier to check that operands conform either VI or GFX9. Reviewers: dp, arsenm, vpykhtin Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, artem.tamazov Differential Revision: https://reviews.llvm.org/D34026 llvm-svn: 305886
*	[AArch64] Preserve register flags when promoting a load from store.	Florian Hahn	2017-06-21	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch updates promoteLoadFromStore to use the store MachineOperand as the source operand of the of the new instruction instead of creating a new register MachineOperand. This way, the existing register flags are preserved. This fixes PR33468 (https://bugs.llvm.org/show_bug.cgi?id=33468). Reviewers: MatzeB, t.p.northover, junbuml Reviewed By: MatzeB Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34402 llvm-svn: 305885
*	clang-format a region.	Rafael Espindola	2017-06-20	1	-20/+19
\| \| \| \| \| \|	It will make a followup patch easier to read. llvm-svn: 305865
*	AMDGPU: Allow vectorization of packed types	Matt Arsenault	2017-06-20	2	-8/+20
\| \| \| \|	llvm-svn: 305844
*	[AMDGPU] Fix illegal shrink of V_SUBB_U32 and V_ADDC_U32	Stanislav Mekhanoshin	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \| \|	If there is an immediate operand we shall not shrink V_SUBB_U32 and V_ADDC_U32, it does not fit e32 encoding. Differential Revison: https://reviews.llvm.org/D34291 llvm-svn: 305840
*	AMDGPU: Start adding global_* instructions	Matt Arsenault	2017-06-20	6	-6/+106
\| \| \| \|	llvm-svn: 305838
*	AMDGPU: Do operand folding in program order	Matt Arsenault	2017-06-20	1	-5/+3
\| \| \| \| \| \| \| \| \|	Before it was possible to partially fold use instructions before the defs. After the xor is folded into a copy, the same mov can end up in the fold list twice, so on the second attempt it will fail expecting to see a register to fold. llvm-svn: 305821
*	AMDGPU: Preserve undef when folding register operands	Matt Arsenault	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \|	If the source was a copy of an undef register, this would produce a read of an undefined register which is a verifier error. llvm-svn: 305816
*	[AMDGPU] Eliminate SGPR to VGPR copy when possible	Stanislav Mekhanoshin	2017-06-20	1	-0/+30
\| \| \| \| \| \| \| \|	SGPRs are generally cheaper, so try to use them over VGPRs. Differential Revision: https://reviews.llvm.org/D34130 llvm-svn: 305815
*	AMDGPU: Fix crash with undef vreg input operand	Matt Arsenault	2017-06-20	1	-1/+1
\| \| \| \|	llvm-svn: 305814
*	[PowerPC] fix trivial typos in comment, NFC	Hiroshi Inoue	2017-06-20	1	-1/+1
\| \| \| \|	llvm-svn: 305813
*	[x86] enable CGP memcmp() expansion for 2/4/8 byte sizes	Sanjay Patel	2017-06-20	3	-1/+13
\| \| \| \| \| \| \| \| \|	There are a couple of potential improvements as seen in the IR and asm: 1. We're unnecessarily extending to a larger type to compare values. 2. The codegen for (select cond, 1, -1) could avoid a cmov. (or we could change the order of the compares, so we have a select with 0 operand) llvm-svn: 305802
*	[X86][SSE] Relax 0/-1 vector element insertion to work for any vector with ↵	Simon Pilgrim	2017-06-20	1	-1/+2
\| \| \| \| \| \| \| \|	>=16bit elements Shuffle lowering/combining now does a good job for 256/512-bit vectors - we don't need to prevent this llvm-svn: 305801
*	[X86][SSE] Dropped old INSERT_VECTOR_ELT lowering TODO	Simon Pilgrim	2017-06-20	1	-2/+0
\| \| \| \| \| \|	Target shuffle combining now supports the matching of INSERT_VECTOR_ELT/PINSRW/PINSRB for merging multiple insertions into shuffles/bitmasks. llvm-svn: 305788
*	[GlobalISel][X86] fix compilation error ( -Werror=unused-function )	Igor Breger	2017-06-20	1	-2/+2
\| \| \| \|	llvm-svn: 305786
*	[GlobalISel][X86] Get correct RegClass for given RegBank.	Igor Breger	2017-06-20	1	-17/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In some cases RegClass depends on target feature. Hight (16-31) vector registers exist only if AVX512f available. Split from https://reviews.llvm.org/D33665 Reviewers: qcolombet, t.p.northover, zvi, guyblank Reviewed By: t.p.northover, guyblank Subscribers: guyblank, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33952 Conflicts: test/CodeGen/X86/GlobalISel/select-memop-scalar.mir llvm-svn: 305784
*	[ARM] Support constant pools in data when generating execute-only code.	Alexandros Lamprineas	2017-06-20	3	-15/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Resubmission of r305387, which was reverted at r305390. The Address Sanitizer caught a stack-use-after-scope of a Twine variable. This is now fixed by passing the Twine directly as a function parameter. The ARM backend asserts against constant pool lowering when it generates execute-only code in order to prevent the generation of constant pools in the text section. It appears that target independent optimizations might generate DAG nodes that represent constant pools. By lowering such nodes as global addresses we don't violate the semantics of execute-only code and also it is guaranteed that execute-only behaves correct with the position-independent addressing modes that support execute-only code. Differential Revision: https://reviews.llvm.org/D33773 llvm-svn: 305776
*	AMDGPU: Fix scratch wave offset relative FI expansion	Matt Arsenault	2017-06-19	1	-9/+20
\| \| \| \| \| \| \| \|	The offset may not be an inline immediate, so this needs to be materialized into a register. The post-RA run of SIShrinkInstructions is able to fold it later if it can. llvm-svn: 305761
*	[AMDGPU] Add infer address spaces pass before SROA	Stanislav Mekhanoshin	2017-06-19	1	-0/+8
\| \| \| \| \| \| \| \| \|	It adds it for the target after inlining but before SROA where we can get most out of it. Differential Revision: https://reviews.llvm.org/D34366 llvm-svn: 305759
*	[Target] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-06-19	2	-17/+37
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 305757
*	[AArch64][Falkor] Fix MOVZ sched predicate to not assert on non-imm operands ↵	Geoff Berry	2017-06-19	1	-1/+2
\| \| \| \| \| \|	(e.g. blockaddress). llvm-svn: 305752
*	[AArch64][Kryo] Add missing write latency for LDAXP, LDXP second destination.	Geoff Berry	2017-06-19	1	-2/+4
\| \| \| \| \| \|	Fixes PR33491 and PR33512. llvm-svn: 305751
*	[AArch64][Falkor] Refine load/store increment latencies.	Geoff Berry	2017-06-19	1	-164/+242
\| \| \| \| \| \|	Also fix LDXP & LDAXP write latency to avoid similar assert as PR33491 and PR33512. llvm-svn: 305750
*	AMDGPU: Cleanup CreateLiveInRegister	Matt Arsenault	2017-06-19	5	-34/+45
\| \| \| \|	llvm-svn: 305748
*	Revert r305382, it caused PR33513.	Nico Weber	2017-06-19	1	-6/+6
\| \| \| \|	llvm-svn: 305735
*	Revert r304824 "Fix PR23384 (part 3 of 3)"	Hans Wennborg	2017-06-19	2	-13/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This seems to be interacting badly with ASan somehow, causing false reports of heap-buffer overflows: PR33514. > Summary: > The patch makes instruction count the highest priority for > LSR solution for X86 (previously registers had highest priority). > > Reviewers: qcolombet > > Differential Revision: http://reviews.llvm.org/D30562 > > From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 305720
*	[AArch64] Fix order of checks in shouldScheduleAdjacent.	Florian Hahn	2017-06-19	1	-2/+2
\| \| \| \| \| \| \|	We need to check the opcode of FirstMI before accessing the operands. This caused a buildbot failure during bootstrapping on AArch64. llvm-svn: 305694
*	AMDGPU/GlobalISel: Mark G_BITCAST s32 <--> <2 x s16> legal	Tom Stellard	2017-06-19	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34129 llvm-svn: 305692
*	[GlobalISel][X86] Fold FI/G_GEP into LDR/STR instruction addressing mode.	Igor Breger	2017-06-19	1	-4/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Implement some of the simplest addressing modes.It should help to test ABI. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33888 llvm-svn: 305691
*	Recommit rL305677: [CodeGen] Add generic MacroFusion pass	Florian Hahn	2017-06-19	4	-253/+53
\| \| \| \| \| \| \| \| \| \| \| \| \|	Use llvm::make_unique to avoid ambiguity with MSVC. This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305690
*	[ARM] GlobalISel: Support G_ICMP for s8 and s16	Diana Picus	2017-06-19	1	-0/+2
\| \| \| \| \| \|	Widen to s32 (like all other binary ops). llvm-svn: 305683
*	Revert r305677 [CodeGen] Add generic MacroFusion pass.	Florian Hahn	2017-06-19	4	-53/+253
\| \| \| \| \| \|	This causes Windows buildbot failures do an ambiguous call. llvm-svn: 305681
*	[CodeGen] Add generic MacroFusion pass.	Florian Hahn	2017-06-19	4	-253/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Reviewers: craig.topper, evandro, t.p.northover, atrick, MatzeB Reviewed By: MatzeB Subscribers: atrick, aemerson, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305677
*	[ARM] GlobalISel: Support G_ICMP for i32 and pointers	Diana Picus	2017-06-19	3	-0/+119
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support throughout the pipeline: - mark as legal for s32 and pointers - map to GPRs - lower to a sequence of instructions, which moves 0 or 1 into the result register based on the flags set by a CMPrr We have copied from FastISel a helper function which maps CmpInst predicates into ARMCC codes. Ideally, we should be able to move it somewhere that both FastISel and GlobalISel can use. llvm-svn: 305672
*	Rework logic and comment out the default relocation models for PPC.	Eric Christopher	2017-06-17	1	-10/+13
\| \| \| \|	llvm-svn: 305630
*	Turn a large if block into a smaller early return for clarity.	Eric Christopher	2017-06-17	1	-11/+10
\| \| \| \|	llvm-svn: 305629
*	Remove the old and unused PPC32 and PPC64TargetMachine classes.	Eric Christopher	2017-06-17	2	-47/+4
\| \| \| \|	llvm-svn: 305628
*	Remove unused forward declaration.	Eric Christopher	2017-06-17	1	-1/+0
\| \| \| \|	llvm-svn: 305627
*	Tidy up some calls to getRegister for readability.	Eric Christopher	2017-06-17	1	-5/+6
\| \| \| \|	llvm-svn: 305626
*	[PPC] Remove isBarrier from CFENCE8's definition.	Tim Shen	2017-06-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is my misunderstanding on isBarrier. It's not for memory barriers, but for other control flow purposes. lwsync doesn't have it either. This fixes a simple crash with -verify-machineinstrs like below: define void @Foo() { entry: %tmp = load atomic i64, i64* undef acquire, align 8 unreachable } I deliberately don't want to check in the test, since there is little chance to regress on such a mistake. Such a test adds noise to the code base. I plan to check in first, since it fixes a crash, and the fix is obvious. Reviewers: kbarton, echristo Subscribers: sanjoy, nemanjai, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D34314 llvm-svn: 305624
*	[WebAssembly] Use __stack_pointer global when writing wasm binary	Sam Clegg	2017-06-16	7	-65/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This ensures that symbolic relocations are generated for stack pointer manipulations. These relocations are of type R_WEBASSEMBLY_GLOBAL_INDEX_LEB. This change also adds support for reading relocations of this type in WasmObjectFile.cpp. Since its a globally imported symbol this does mean that the get_global/set_global instruction won't be valid until the objects are linked that global used in no longer an imported global. Differential Revision: https://reviews.llvm.org/D34172 llvm-svn: 305616
*	bpf: fix a strict-aliasing issue	Yonghong Song	2017-06-16	1	-11/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Davide Italiano reported the following issue if llvm is compiled with gcc -Wstrict-aliasing -Werror: ..... lib/Target/BPF/CMakeFiles/LLVMBPFCodeGen.dir/BPFISelDAGToDAG.cpp.o ../lib/Target/BPF/BPFISelDAGToDAG.cpp: In member function ‘virtual void {anonymous}::BPFDAGToDAGISel::PreprocessISelDAG()’: ../lib/Target/BPF/BPFISelDAGToDAG.cpp:264:26: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing] val = (uint16_t )new_val; ..... The error is caused by my previous commit (revision 305560). This patch fixed the issue by introducing an union to avoid type casting. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 305608
*	bpf: avoid load from read-only sections	Yonghong Song	2017-06-16	1	-7/+233
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If users tried to have a structure decl/init code like below struct test_t t = { .memeber1 = 45 }; It is very likely that compiler will generate a readonly section to hold up the init values for variable t. Later load of t members, e.g., t.member1 will result in a read from readonly section. BPF program cannot handle relocation. This will force users to write: struct test_t t = {}; t.member1 = 45; This is just inconvenient and unintuitive. This patch addresses this issue by implementing BPF PreprocessISelDAG. For any load from a global constant structure or an global array of constant struct, it attempts to translate it into a constant directly. The traversal of the constant struct and other constant data structures are similar to where the assembler emits read-only sections. Four different unit test cases are also added to cover different scenarios. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 305560
*	bpf: set missing types in insn tablegen file	Yonghong Song	2017-06-16	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \|	o This is discovered during my study of 32-bit subregister support. o This is no impact on current functionality since we only support 64-bit registers. o Searching the web, looks like the issue has been discovered before, so fix it now. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 305559
*	Revert "[mips][microMIPS] Extending size reduction pass with ADDIUSP and ↵	Simon Dardis	2017-06-16	1	-97/+12
\| \| \| \| \| \| \| \| \|	ADDIUR1SP" This reverts commit r305455. This commit was reported as breaking one of the sanitizer buildbots. Reverting until lab.llvm.org comes back online. llvm-svn: 305557