bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[AMDGPU] Combine add and adde, sub and sube	Stanislav Mekhanoshin	2017-06-21	2	-9/+81
\| \| \| \| \| \| \| \| \|	If one of the arguments of adde/sube is zero we can fold another add/sub into it. Differential Revision: https://reviews.llvm.org/D34374 llvm-svn: 305964
*	Mark dump() methods as const. NFC	Sam Clegg	2017-06-21	1	-1/+1
\| \| \| \| \| \| \| \| \|	Add const qualifier to any dump() method where adding one was trivial. Differential Revision: https://reviews.llvm.org/D34481 llvm-svn: 305963
*	[AMDGPU] simplify add x, *ext (setcc) => addc\|subb x, 0, setcc	Stanislav Mekhanoshin	2017-06-21	4	-0/+59
\| \| \| \| \| \| \| \| \|	This simplification allows to avoid generating v_cndmask_b32 to serialize condition code between compare and use. Differential Revision: https://reviews.llvm.org/D34300 llvm-svn: 305962
*	[Hexagon] Use MachineInstrBuilder instead of changing instruction in place	Krzysztof Parzyszek	2017-06-21	1	-45/+9
\| \| \| \|	llvm-svn: 305953
*	[Target] Implement the ".rdata" MIPS assembly directive.	Davide Italiano	2017-06-21	1	-0/+22
\| \| \| \| \| \| \| \|	Patch by John Baldwin < jhb at freebsd dot org >! Differential Revision: https://reviews.llvm.org/D34452 llvm-svn: 305949
*	[Solaris] emit .init_array instead of .ctors on Solaris (Sparc/x86)	Davide Italiano	2017-06-21	5	-0/+21
\| \| \| \| \| \| \| \|	Patch by Fedor Sergeev. Differential Revision: https://reviews.llvm.org/D33868 llvm-svn: 305948
*	[Hexagon] Handle more types of immediate operands in expand-condsets	Krzysztof Parzyszek	2017-06-21	1	-2/+13
\| \| \| \|	llvm-svn: 305943
*	[PowerPC] define target hook isReallyTriviallyReMaterializable()	Lei Huang	2017-06-21	3	-2/+29
\| \| \| \| \| \| \| \| \| \| \|	Define target hook isReallyTriviallyReMaterializable() to explicitly specify PowerPC instructions that are trivially rematerializable. This will allow the MachineLICM pass to accurately identify PPC instructions that should always be hoisted. Differential Revision: https://reviews.llvm.org/D34255 llvm-svn: 305932
*	[AMDGPU][MC][GFX9] Corrected VOP3P relevant code to fix disassembler failures	Dmitry Preobrazhensky	2017-06-21	4	-11/+6
\| \| \| \| \| \| \| \| \| \|	See Bug 33509: https://bugs.llvm.org//show_bug.cgi?id=33509 Reviewers: Sam Kolton, Artem Tamazov, Valery Pykhtin Differential Revision: https://reviews.llvm.org/D34360 llvm-svn: 305923
*	[AMDGPU][MC] Corrected V_QSAD instructions to check that dest register is ↵	Dmitry Preobrazhensky	2017-06-21	3	-5/+84
\| \| \| \| \| \| \| \| \| \| \| \|	different than any of the src See Bug 33279: https://bugs.llvm.org//show_bug.cgi?id=33279 Reviewers: artem.tamazov, vpykhtin Differential Revision: https://reviews.llvm.org/D34003 llvm-svn: 305915
*	[x86] fix formatting; NFC	Sanjay Patel	2017-06-21	1	-15/+13
\| \| \| \|	llvm-svn: 305914
*	[AARCH64][LSE] Preliminary support for ARMv8.1 LSE Atomics.	Christof Douma	2017-06-21	4	-5/+114
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implemented support to AArch64 codegen for ARMv8.1 Large System Extensions atomic instructions. Where supported, these instructions can provide atomic operations with higher performance. Currently supported operations include: fetch_add, fetch_or, fetch_xor, fetch_smin, fetch_min/max (signed and unsigned), swap, and compare_exchange. This implementation implies sequential-consistency ordering, more relaxed ordering is under development. Subtarget->hasLSE is currently supported for Cavium ThunderX2T99. Patch by Ananth Jasty. Differential Revision: https://reviews.llvm.org/D33586 Change-Id: I82f6d3d64255622791ceb0715b7ab9f4dc4d4b2c llvm-svn: 305893
*	[AArch64] Add early exit to promoteLoadFromStore.	Florian Hahn	2017-06-21	1	-1/+4
\| \| \| \| \| \| \| \|	There should be at most a single kill flag for the promoted operand between the store/load pair. Discussed in https://reviews.llvm.org/D34402. llvm-svn: 305889
*	[MIPS] Fix for selecting of DINS/INS instruction	Strahinja Petrovic	2017-06-21	1	-0/+5
\| \| \| \| \| \| \| \| \| \|	This patch adds one more condition in selection DINS/INS instruction, which fixes MultiSource/Applications/JM/ldecod/ for mips32r2 (and mips64r2 n32 abi). Differential Revision: https://reviews.llvm.org/D33725 llvm-svn: 305888
*	[AMDGPU] SDWA: merge VI and GFX9 pseudo instructions	Sam Kolton	2017-06-21	15	-281/+323
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously there were two separate pseudo instruction for SDWA on VI and on GFX9. Created one pseudo instruction that is union of both of them. Added verifier to check that operands conform either VI or GFX9. Reviewers: dp, arsenm, vpykhtin Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, artem.tamazov Differential Revision: https://reviews.llvm.org/D34026 llvm-svn: 305886
*	[AArch64] Preserve register flags when promoting a load from store.	Florian Hahn	2017-06-21	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch updates promoteLoadFromStore to use the store MachineOperand as the source operand of the of the new instruction instead of creating a new register MachineOperand. This way, the existing register flags are preserved. This fixes PR33468 (https://bugs.llvm.org/show_bug.cgi?id=33468). Reviewers: MatzeB, t.p.northover, junbuml Reviewed By: MatzeB Subscribers: aemerson, rengolin, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34402 llvm-svn: 305885
*	clang-format a region.	Rafael Espindola	2017-06-20	1	-20/+19
\| \| \| \| \| \|	It will make a followup patch easier to read. llvm-svn: 305865
*	AMDGPU: Allow vectorization of packed types	Matt Arsenault	2017-06-20	2	-8/+20
\| \| \| \|	llvm-svn: 305844
*	[AMDGPU] Fix illegal shrink of V_SUBB_U32 and V_ADDC_U32	Stanislav Mekhanoshin	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \| \|	If there is an immediate operand we shall not shrink V_SUBB_U32 and V_ADDC_U32, it does not fit e32 encoding. Differential Revison: https://reviews.llvm.org/D34291 llvm-svn: 305840
*	AMDGPU: Start adding global_* instructions	Matt Arsenault	2017-06-20	6	-6/+106
\| \| \| \|	llvm-svn: 305838
*	AMDGPU: Do operand folding in program order	Matt Arsenault	2017-06-20	1	-5/+3
\| \| \| \| \| \| \| \| \|	Before it was possible to partially fold use instructions before the defs. After the xor is folded into a copy, the same mov can end up in the fold list twice, so on the second attempt it will fail expecting to see a register to fold. llvm-svn: 305821
*	AMDGPU: Preserve undef when folding register operands	Matt Arsenault	2017-06-20	1	-0/+2
\| \| \| \| \| \| \| \|	If the source was a copy of an undef register, this would produce a read of an undefined register which is a verifier error. llvm-svn: 305816
*	[AMDGPU] Eliminate SGPR to VGPR copy when possible	Stanislav Mekhanoshin	2017-06-20	1	-0/+30
\| \| \| \| \| \| \| \|	SGPRs are generally cheaper, so try to use them over VGPRs. Differential Revision: https://reviews.llvm.org/D34130 llvm-svn: 305815
*	AMDGPU: Fix crash with undef vreg input operand	Matt Arsenault	2017-06-20	1	-1/+1
\| \| \| \|	llvm-svn: 305814
*	[PowerPC] fix trivial typos in comment, NFC	Hiroshi Inoue	2017-06-20	1	-1/+1
\| \| \| \|	llvm-svn: 305813
*	[x86] enable CGP memcmp() expansion for 2/4/8 byte sizes	Sanjay Patel	2017-06-20	3	-1/+13
\| \| \| \| \| \| \| \| \|	There are a couple of potential improvements as seen in the IR and asm: 1. We're unnecessarily extending to a larger type to compare values. 2. The codegen for (select cond, 1, -1) could avoid a cmov. (or we could change the order of the compares, so we have a select with 0 operand) llvm-svn: 305802
*	[X86][SSE] Relax 0/-1 vector element insertion to work for any vector with ↵	Simon Pilgrim	2017-06-20	1	-1/+2
\| \| \| \| \| \| \| \|	>=16bit elements Shuffle lowering/combining now does a good job for 256/512-bit vectors - we don't need to prevent this llvm-svn: 305801
*	[X86][SSE] Dropped old INSERT_VECTOR_ELT lowering TODO	Simon Pilgrim	2017-06-20	1	-2/+0
\| \| \| \| \| \|	Target shuffle combining now supports the matching of INSERT_VECTOR_ELT/PINSRW/PINSRB for merging multiple insertions into shuffles/bitmasks. llvm-svn: 305788
*	[GlobalISel][X86] fix compilation error ( -Werror=unused-function )	Igor Breger	2017-06-20	1	-2/+2
\| \| \| \|	llvm-svn: 305786
*	[GlobalISel][X86] Get correct RegClass for given RegBank.	Igor Breger	2017-06-20	1	-17/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In some cases RegClass depends on target feature. Hight (16-31) vector registers exist only if AVX512f available. Split from https://reviews.llvm.org/D33665 Reviewers: qcolombet, t.p.northover, zvi, guyblank Reviewed By: t.p.northover, guyblank Subscribers: guyblank, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33952 Conflicts: test/CodeGen/X86/GlobalISel/select-memop-scalar.mir llvm-svn: 305784
*	[ARM] Support constant pools in data when generating execute-only code.	Alexandros Lamprineas	2017-06-20	3	-15/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Resubmission of r305387, which was reverted at r305390. The Address Sanitizer caught a stack-use-after-scope of a Twine variable. This is now fixed by passing the Twine directly as a function parameter. The ARM backend asserts against constant pool lowering when it generates execute-only code in order to prevent the generation of constant pools in the text section. It appears that target independent optimizations might generate DAG nodes that represent constant pools. By lowering such nodes as global addresses we don't violate the semantics of execute-only code and also it is guaranteed that execute-only behaves correct with the position-independent addressing modes that support execute-only code. Differential Revision: https://reviews.llvm.org/D33773 llvm-svn: 305776
*	AMDGPU: Fix scratch wave offset relative FI expansion	Matt Arsenault	2017-06-19	1	-9/+20
\| \| \| \| \| \| \| \|	The offset may not be an inline immediate, so this needs to be materialized into a register. The post-RA run of SIShrinkInstructions is able to fold it later if it can. llvm-svn: 305761
*	[AMDGPU] Add infer address spaces pass before SROA	Stanislav Mekhanoshin	2017-06-19	1	-0/+8
\| \| \| \| \| \| \| \| \|	It adds it for the target after inlining but before SROA where we can get most out of it. Differential Revision: https://reviews.llvm.org/D34366 llvm-svn: 305759
*	[Target] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-06-19	2	-17/+37
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 305757
*	[AArch64][Falkor] Fix MOVZ sched predicate to not assert on non-imm operands ↵	Geoff Berry	2017-06-19	1	-1/+2
\| \| \| \| \| \|	(e.g. blockaddress). llvm-svn: 305752
*	[AArch64][Kryo] Add missing write latency for LDAXP, LDXP second destination.	Geoff Berry	2017-06-19	1	-2/+4
\| \| \| \| \| \|	Fixes PR33491 and PR33512. llvm-svn: 305751
*	[AArch64][Falkor] Refine load/store increment latencies.	Geoff Berry	2017-06-19	1	-164/+242
\| \| \| \| \| \|	Also fix LDXP & LDAXP write latency to avoid similar assert as PR33491 and PR33512. llvm-svn: 305750
*	AMDGPU: Cleanup CreateLiveInRegister	Matt Arsenault	2017-06-19	5	-34/+45
\| \| \| \|	llvm-svn: 305748
*	Revert r305382, it caused PR33513.	Nico Weber	2017-06-19	1	-6/+6
\| \| \| \|	llvm-svn: 305735
*	Revert r304824 "Fix PR23384 (part 3 of 3)"	Hans Wennborg	2017-06-19	2	-13/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This seems to be interacting badly with ASan somehow, causing false reports of heap-buffer overflows: PR33514. > Summary: > The patch makes instruction count the highest priority for > LSR solution for X86 (previously registers had highest priority). > > Reviewers: qcolombet > > Differential Revision: http://reviews.llvm.org/D30562 > > From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 305720
*	[AArch64] Fix order of checks in shouldScheduleAdjacent.	Florian Hahn	2017-06-19	1	-2/+2
\| \| \| \| \| \| \|	We need to check the opcode of FirstMI before accessing the operands. This caused a buildbot failure during bootstrapping on AArch64. llvm-svn: 305694
*	AMDGPU/GlobalISel: Mark G_BITCAST s32 <--> <2 x s16> legal	Tom Stellard	2017-06-19	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, igorb, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D34129 llvm-svn: 305692
*	[GlobalISel][X86] Fold FI/G_GEP into LDR/STR instruction addressing mode.	Igor Breger	2017-06-19	1	-4/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Implement some of the simplest addressing modes.It should help to test ABI. Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33888 llvm-svn: 305691
*	Recommit rL305677: [CodeGen] Add generic MacroFusion pass	Florian Hahn	2017-06-19	4	-253/+53
\| \| \| \| \| \| \| \| \| \| \| \| \|	Use llvm::make_unique to avoid ambiguity with MSVC. This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305690
*	[ARM] GlobalISel: Support G_ICMP for s8 and s16	Diana Picus	2017-06-19	1	-0/+2
\| \| \| \| \| \|	Widen to s32 (like all other binary ops). llvm-svn: 305683
*	Revert r305677 [CodeGen] Add generic MacroFusion pass.	Florian Hahn	2017-06-19	4	-53/+253
\| \| \| \| \| \|	This causes Windows buildbot failures do an ambiguous call. llvm-svn: 305681
*	[CodeGen] Add generic MacroFusion pass.	Florian Hahn	2017-06-19	4	-253/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds a generic MacroFusion pass, that is used on X86 and AArch64, which both define target-specific shouldScheduleAdjacent functions. This generic pass should make it easier for other targets to implement macro fusion and I intend to add macro fusion for ARM shortly. Reviewers: craig.topper, evandro, t.p.northover, atrick, MatzeB Reviewed By: MatzeB Subscribers: atrick, aemerson, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34144 llvm-svn: 305677
*	[ARM] GlobalISel: Support G_ICMP for i32 and pointers	Diana Picus	2017-06-19	3	-0/+119
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support throughout the pipeline: - mark as legal for s32 and pointers - map to GPRs - lower to a sequence of instructions, which moves 0 or 1 into the result register based on the flags set by a CMPrr We have copied from FastISel a helper function which maps CmpInst predicates into ARMCC codes. Ideally, we should be able to move it somewhere that both FastISel and GlobalISel can use. llvm-svn: 305672
*	Rework logic and comment out the default relocation models for PPC.	Eric Christopher	2017-06-17	1	-10/+13
\| \| \| \|	llvm-svn: 305630
*	Turn a large if block into a smaller early return for clarity.	Eric Christopher	2017-06-17	1	-11/+10
\| \| \| \|	llvm-svn: 305629