bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU: Assume spilling will occur at -O0	Matt Arsenault	2016-10-13	1	-1/+5
\| \| \| \| \| \| \| \|	Because everything live is spilled at the end of a block by fast regalloc, assume this will happen and avoid the copies of the resource descriptor. llvm-svn: 284119
*	AMDGPU: Fix truncate to bool warnings	Matt Arsenault	2016-10-13	1	-5/+5
\| \| \| \|	llvm-svn: 284116
*	[mips] Add IAS support for dvp, evp	Simon Dardis	2016-10-13	4	-4/+44
\| \| \| \| \| \| \| \| \| \| \| \| \|	These instructions were only defined for microMIPSR6 previously. Add definitions for MIPSR6, correct definitions for microMIPSR6, flag these instructions as having unmodelled side effects (they disable/enable virtual processors) and add missing disassember tests for microMIPSR6. Reviewers: vkalintiris Differential Review: https://reviews.llvm.org/D24291 llvm-svn: 284115
*	[X86] Basic additions to support RegCall Calling Convention.	Oren Ben Simhon	2016-10-13	3	-0/+223
\| \| \| \| \| \| \| \| \| \|	The Register Calling Convention (RegCall) was introduced by Intel to optimize parameter transfer on function call. This calling convention ensures that as many values as possible are passed or returned in registers. This commit presents the basic additions to LLVM CodeGen in order to support RegCall in X86. Differential Revision: http://reviews.llvm.org/D25022 llvm-svn: 284108
*	Silence unused warning in non-assert builds.	Daniel Jasper	2016-10-13	1	-3/+3
\| \| \| \|	llvm-svn: 284107
*	[AVX-512] Teach shuffle lowering to recognize 512-bit zero extends.	Craig Topper	2016-10-13	1	-2/+27
\| \| \| \|	llvm-svn: 284105
*	[X86] Simplify the lowering code for extracting and inserting subvectors.	Craig Topper	2016-10-13	1	-24/+21
\| \| \| \| \| \| \|	We don't need to check if AVX is enabled. It's implied by the operation action being set to Custom. We don't need to check both the input and output type widths. We only need to check the type that's being inserted or extracted. The other type is known to be a legal type and we can assume its a different width. llvm-svn: 284102
*	[AArch64][RegisterBankInfo] Provide alternative mappings for 64-bit load	Quentin Colombet	2016-10-13	1	-1/+30
\| \| \| \| \| \| \| \|	This allows RegBankSelect in greedy mode to get rid some of the cross register bank copies when loads are involved in the chain of computation. llvm-svn: 284097
*	Correct PrivateLinkage for COFF	Reid Kleckner	2016-10-13	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	- Use storage class C_STAT for 'PrivateLinkage' The storage class for PrivateLinkage should equal to the Internal Linkage. - Set 'PrivateGlobalPrefix' from "L" to ".L" for MM_WinCOFF (includes x86_64) MM_WinCOFF has empty GlobalPrefix '\0' so PrivateGlobalPrefix "L" may conflict to the normal symbol name starting with 'L'. Based on a patch by Han Sangjin! Manually updated test cases. llvm-svn: 284096
*	[AArch64][RegisterBankInfo] Provide alternative mappings for G_BITCASTs.	Quentin Colombet	2016-10-13	1	-8/+45
\| \| \| \| \| \| \|	Thanks to this patch, RegBankSelect is able to get rid of some register bank copies as demonstrated in the test case. llvm-svn: 284094
*	[AArch64][RegisterBankInfo] Describe cross regbank copies statically.	Quentin Colombet	2016-10-13	2	-2/+68
\| \| \| \| \| \|	NFC. llvm-svn: 284091
*	[AArch64][RegisterBankInfo] Use static mapping for same bank G_BITCAST.	Quentin Colombet	2016-10-13	1	-0/+8
\| \| \| \| \| \|	NFC. llvm-svn: 284090
*	[AArch64][MachineLegalizer] Mark more G_BITCAST as legal.	Quentin Colombet	2016-10-13	1	-1/+8
\| \| \| \| \| \| \|	Basically any vector types that fits in a 32-bit register is also valid as far as copies are concerned. llvm-svn: 284089
*	[AArch64][RegisterBankInfo] Bump the cost of vector loads.	Quentin Colombet	2016-10-13	1	-0/+10
\| \| \| \| \| \| \|	This does not change anything yet, because we do not offer any alternative mapping. llvm-svn: 284088
*	[AArch64][RegisterBankInfo] Use a proper cost for cross regbank G_BITCASTs.	Quentin Colombet	2016-10-13	1	-2/+11
\| \| \| \| \| \| \|	This does not change anything yet, because we do not offer any alternative mapping. llvm-svn: 284087
*	[AArch64][RegisterBankInfo] Provide more realistic copy costs.	Quentin Colombet	2016-10-13	1	-1/+10
\| \| \| \|	llvm-svn: 284086
*	GlobalISel: support G_TRUNC selection on AArch64.	Tim Northover	2016-10-12	1	-0/+80
\| \| \| \| \| \|	Ahmed's patch again. llvm-svn: 284075
*	GlobalISel: support int <-> float conversions on AArch64.	Tim Northover	2016-10-12	1	-1/+95
\| \| \| \| \| \|	More of Ahmed's work. llvm-svn: 284074
*	GlobalISel: select G_FCMP instructions on AArch64.	Tim Northover	2016-10-12	1	-0/+116
\| \| \| \| \| \|	Another of Ahmed's patches. llvm-svn: 284073
*	GlobalISel: support selection of G_ICMP on AArch64.	Tim Northover	2016-10-12	1	-0/+71
\| \| \| \| \| \|	Patch from Ahmed Bougaca again. llvm-svn: 284072
*	GlobalISel: select G_BRCOND instructions on AArch64.	Tim Northover	2016-10-12	1	-0/+22
\| \| \| \|	llvm-svn: 284071
*	GlobalISel: mark G_BRCOND on s1 as legal.	Tim Northover	2016-10-12	1	-3/+2
\| \| \| \| \| \|	It's going to be a TBNZ (at -O0) anyway, so the high bits don't matter. llvm-svn: 284070
*	Create llvm.addressofreturnaddress intrinsic	Albert Gutowski	2016-10-12	2	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: We need a new LLVM intrinsic to implement MS _AddressOfReturnAddress builtin on 64-bit Windows. Reviewers: majnemer, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25293 llvm-svn: 284061
*	AMDGPU: Initial implementation of VGPR indexing mode	Matt Arsenault	2016-10-12	3	-43/+194
\| \| \| \| \| \| \| \| \| \| \|	This is the most basic handling of the indirect access pseudos using GPR indexing mode. This currently only enables the mode for a single v_mov_b32 and then disables it. This is much more complicated to use than the movrel instructions, so a new optimization pass is probably needed to fold the access into the uses and keep the mode enabled for them. llvm-svn: 284031
*	AMDGPU: Add instruction definitions for VGPR indexing	Matt Arsenault	2016-10-12	10	-8/+126
\| \| \| \| \| \| \|	VI added a second method of indexing into VGPRs besides using v_movrel* llvm-svn: 284027
*	AMDGPU/SI: Change mimg intrinsic signatures	Tom Stellard	2016-10-12	1	-18/+23
\| \| \| \| \| \| \| \|	This makes more fields overridable and removes redundant bits. Patch by: Changpeng Fang llvm-svn: 284024
*	NFC: The Cost Model specialization, by Andrey Tischenko	Alexey Bataev	2016-10-12	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current Cost Model implementation is very inaccurate and has to be updated, improved, re-implemented to be able to take into account the concrete CPU models and the concrete targets where this Cost Model is being used. For example, the Latency Cost Model should be differ from Code Size Cost Model, etc. This patch is the first step to launch the developing and implementation of a new Cost Model generation. Differential Revision: https://reviews.llvm.org/D25186 llvm-svn: 284012
*	[AArch64][InstrustionSelector] Teach the selector about G_BITCAST.	Quentin Colombet	2016-10-12	1	-59/+2
\| \| \| \|	llvm-svn: 283973
*	[AArch64][InstructionSelector] Refactor the handling of copies.	Quentin Colombet	2016-10-12	1	-26/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Although Copies are not specific to preISel, we still have to assign them a proper register class. However, given they are not constrained to anything we do not have to handle the source register at the copy. It will be properly mapped when reaching the related definition. In the process, the handlong of G_ANYEXT is slightly modified as those end up being selected as copy. The difference is that when register size do not match on both sides, we need to insert SUBREG_TO_REG operation, otherwise the post RA copy expansion will not be happy! llvm-svn: 283972
*	[AArch64][MachineLegalizer] Mark more bitcasts as legal.	Quentin Colombet	2016-10-12	1	-0/+3
\| \| \| \| \| \|	Those are copies, we do not have to do any legalization action for them. llvm-svn: 283970
*	[PPCMIPeephole] Fix splat elimination	Tim Shen	2016-10-12	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In PPCMIPeephole, when we see two splat instructions, we can't simply do the following transformation: B = Splat A C = Splat B => C = Splat A because B may still be used between these two instructions. Instead, we should make the second Splat a PPC::COPY and let later passes decide whether to remove it or not: B = Splat A C = Splat B => B = Splat A C = COPY B Fixes PR30663. Reviewers: echristo, iteratee, kbarton, nemanjai Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D25493 llvm-svn: 283961
*	GlobalISel: support same-size casts on AArch64.	Tim Northover	2016-10-11	2	-0/+75
\| \| \| \| \| \| \|	Mostly Ahmed's work again, I'm just sprucing things up slightly before committing. llvm-svn: 283952
*	Re-land "[Thumb] Save/restore high registers in Thumb1 pro/epilogues"	Reid Kleckner	2016-10-11	3	-24/+375
\| \| \| \| \| \| \| \| \|	Reverts r283938 to reinstate r283867 with a fix. The original change had an ArrayRef referring to a destroyed temporary initializer list. Use plain C arrays instead. llvm-svn: 283942
*	Revert "[Thumb] Save/restore high registers in Thumb1 pro/epilogues"	Reid Kleckner	2016-10-11	3	-369/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts r283867. This appears to be an infinite loop: while (HiRegToSave != AllHighRegs.end() && CopyReg != AllCopyRegs.end()) { if (HiRegsToSave.count(*HiRegToSave)) { ... CopyReg = findNextOrderedReg(++CopyReg, CopyRegs, AllCopyRegs.end()); HiRegToSave = findNextOrderedReg(++HiRegToSave, HiRegsToSave, AllHighRegs.end()); } } llvm-svn: 283938
*	GlobalISel: support selection of extend operations.	Tim Northover	2016-10-11	1	-0/+99
\| \| \| \| \| \|	Patch mostly by Ahmed Bougaca. llvm-svn: 283937
*	[AMDGPU] Refactor waitcnt encoding	Konstantin Zhuravlyov	2016-10-11	5	-66/+171
\| \| \| \| \| \| \| \| \| \| \| \| \|	- Refactor bit packing/unpacking - Calculate bit mask given bit shift and bit width - Introduce function for decoding bits of waitcnt - Introduce function for encoding bits of waitcnt - Introduce function for getting waitcnt mask (instead of using bare numbers) - Introduce function fot getting max waitcnt(s) (instead of using bare numbers) Differential Revision: https://reviews.llvm.org/D25298 llvm-svn: 283919
*	Fix "static initialization order fiasco" for the XCore Target.	Mehdi Amini	2016-10-11	6	-15/+19
\| \| \| \| \| \| \| \|	I fixed all the other Targets in r283702, and interestingly the sanitizers are only now "sometimes" catching this bug on the only one I missed. llvm-svn: 283914
*	ARMMachineFunctionInfo.cpp: Add an initializer of ↵	NAKAMURA Takumi	2016-10-11	1	-2/+2
\| \| \| \| \| \| \| \|	ARMFunctionInfo::ReturnRegsCount in the explicit ctor. It caused crash since r283867. llvm-svn: 283909
*	Reformat.	NAKAMURA Takumi	2016-10-11	1	-4/+4
\| \| \| \|	llvm-svn: 283908
*	Silence unused warning in non-assert builds.	Daniel Jasper	2016-10-11	1	-0/+1
\| \| \| \|	llvm-svn: 283899
*	AMDGPU/SI: Update ISA version numbers for Tonga and Polaris10/11.	Changpeng Fang	2016-10-11	4	-3/+8
\| \| \| \| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D25454 Reviewers: tstellarAMD llvm-svn: 283893
*	[Thumb] Save/restore high registers in Thumb1 pro/epilogues	Oliver Stannard	2016-10-11	3	-24/+368
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The high registers are not allocatable in Thumb1 functions, but they could still be used by inline assembly, so we need to save and restore the callee-saved high registers (r8-r11) in the prologue and epilogue. This is complicated by the fact that the Thumb1 push and pop instructions cannot access these registers. Therefore, we have to move them down into low registers before pushing, and move them back after popping into low registers. In most functions, we will have low registers that are also being pushed/popped, which we can use as the temporary registers for saving/restoring the high registers. However, this is not guaranteed, so we may need to push some extra low registers to ensure that the high registers can be saved/restored. For correctness, it would be sufficient to use just one low register, but if we have enough low registers available then we only need one push/pop instruction, rather than one per high register. We can also use the argument/return registers when they are not live, and the link register when saving (but not restoring), reducing the number of extra registers we need to push. There are still a few extreme edge cases where we need two push/pop instructions, because not enough low registers can be made live in the prologue or epilogue. In addition to the regression tests included here, I've also tested this using a script to generate functions which clobber different combinations of registers, have different numbers of argument and return registers (including variadic arguments), allocate different fixed sized objects on the stack, and do or don't use variable sized allocas and the __builtin_return_address intrinsic (all of which affect the available registers in the prologue and epilogue). I ran these functions in a test harness which verifies that all of the callee-saved registers are correctly preserved. Differential Revision: https://reviews.llvm.org/D24228 llvm-svn: 283867
*	[ARM] Fix registers clobbered by SjLj EH on soft-float targets	Oliver Stannard	2016-10-11	4	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, the Int_eh_sjlj_dispatchsetup intrinsic is marked as clobbering all registers, including floating-point registers that may not be present on the target. This is technically true, as we could get linked against code that does use the FP registers, but that will not actually work, as the soft-float code cannot save and restore the FP registers. SjLj exception handling can only work correctly if either all or none of the code is built for a target with FP registers. Therefore, we can assume that, when Int_eh_sjlj_dispatchsetup is compiled for a soft-float target, it is only going to be linked against other soft-float code, and so only clobbers the general-purpose registers. This allows us to check that no non-savable registers are clobbered when generating the prologue/epilogue. Differential Revision: https://reviews.llvm.org/D25180 llvm-svn: 283866
*	[AArch64] Allow label arithmetic with add/sub/cmp	Diana Picus	2016-10-11	3	-26/+44
\| \| \| \| \| \| \| \| \| \| \| \| \|	Allow instructions such as 'cmp w0, #(end - start)' by folding the expression into a constant. For ELF, we fold only if the symbols are in the same section. For MachO, we fold if the expression contains only symbols that are not linker visible. Fixes https://llvm.org/bugs/show_bug.cgi?id=18920 Differential Revision: https://reviews.llvm.org/D23834 llvm-svn: 283862
*	[AArch64][InstructionSelector] Teach how to select FP load/store.	Quentin Colombet	2016-10-11	1	-0/+7
\| \| \| \| \| \|	This patch allows to select 32 and 64-bit FP load and store. llvm-svn: 283832
*	[AArch64][InstructionSelector] Teach the selector how to handle vector OR.	Quentin Colombet	2016-10-11	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	This only adds the support for 64-bit vector OR. Adding more sizes is not difficult, but it requires a bigger refactoring because ORs work on any size, not necessarly the ones that match the width of the register width. Right now, this is not expressed in the legalization, so don't bother pushing the refactoring yet. llvm-svn: 283831
*	[AArch64][MachineLegalizer] Mark v2s32 G_LOAD as legal.	Quentin Colombet	2016-10-11	1	-1/+1
\| \| \| \| \| \| \|	Actually every 64-bit loads are legal, but right now the API does not offer a simple way to express that. llvm-svn: 283829
*	Revert r283690, "MC: Remove unused entities."	Peter Collingbourne	2016-10-10	12	-14/+22
\| \| \| \|	llvm-svn: 283814
*	GlobalISel: select G_GLOBAL_VALUE uses on AArch64.	Tim Northover	2016-10-10	3	-4/+30
\| \| \| \|	llvm-svn: 283809
*	GlobalISel: allow G_GLOBAL_VALUEs in AArch64 legalization.	Tim Northover	2016-10-10	1	-0/+1
\| \| \| \|	llvm-svn: 283808