bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	[GISel]: Don't assert when constraining RegisterOperands which are uses.	Aditya Nandakumar	2018-02-26	1	-1/+1
	Currently we assert that only non-target-specific opcodes can have missing RegisterClass constraints in the MCDesc. The backend can have instructions with register operands that don't have RegisterClass constraints (say using unknown_class), in which case the instruction defining the register will constrain it. Change the assert to only fire if a def has no regclass. https://reviews.llvm.org/D43409 llvm-svn: 326142
*	[MachineOperand][Target] MachineOperand::isRenamable semantics changes	Geoff Berry	2018-02-23	4	-9/+1
	Summary: Add a target option AllowRegisterRenaming that is used to opt in to post-register-allocation renaming of registers. This is set to 0 by default, which causes the hasExtraSrcRegAllocReq/hasExtraDstRegAllocReq fields of all opcodes to be set to 1, causing MachineOperand::isRenamable to always return false. Set the AllowRegisterRenaming flag to 1 for all in-tree targets that have lit tests that were affected by enabling COPY forwarding in MachineCopyPropagation (AArch64, AMDGPU, ARM, Hexagon, Mips, PowerPC, RISCV, Sparc, SystemZ and X86). Add some more comments describing the semantics of the MachineOperand::isRenamable function and how it is set and maintained. Change isRenamable to check the operand's opcode hasExtraSrcRegAllocReq/hasExtraDstRegAllocReq bit directly instead of relying on it being consistently reflected in the IsRenamable bit setting. Clear the IsRenamable bit when changing an operand's register value. Remove target code that was clearing the IsRenamable bit when changing registers/opcodes now that this is done conservatively by default. Change setting of hasExtraSrcRegAllocReq in AMDGPU target to be done in one place covering all opcodes that have constant pipe read limit restrictions. Reviewers: qcolombet, MatzeB Subscribers: aemerson, arsenm, jyknight, mcrosier, sdardis, nhaehnle, javed.absar, tpr, arichardson, kristof.beyls, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, jordy.potman.lists, apazos, sabuasal, niosHD, escha, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D43042 llvm-svn: 325931
*	Support for the mno-stack-arg-probe flag	Hans Wennborg	2018-02-23	1	-1/+2
	Adds support for this flag. There is also another piece for clang (separate review). More info: https://bugs.llvm.org/show_bug.cgi?id=36221 By Ruslan Nikolaev! Differential Revision: https://reviews.llvm.org/D43107 llvm-svn: 325900
*	Recommit: [ARM] f16 constant pool fix	Sjoerd Meijer	2018-02-22	1	-4/+2
	This recommits r325754; the modified and failing test case actually didn't need any modifications. llvm-svn: 325765
*	[ARM] Fix issue with large xor constants.	David Green	2018-02-22	1	-5/+2
	Fixup to rL325573 for large xor constants. Thanks to Eli Friedman for the catch. Differential revision: https://reviews.llvm.org/D43549 llvm-svn: 325761
*	Revert r325754 and r325755 (f16 literal pool) because buildbots were unhappy.	Sjoerd Meijer	2018-02-22	1	-2/+4
	llvm-svn: 325756
*	[ARM] f16 constant pool fix	Sjoerd Meijer	2018-02-22	1	-4/+2
	This is a follow up of r325012, that allowed half types in constant pools. Proper alignment was enforced when a big basic block was split up, but not when a CPE was placed before/after a block; the successor block had the wrong alignment. Differential Revision: https://reviews.llvm.org/D43580 llvm-svn: 325754
*	[NFC] fix trivial typos in comments	Hiroshi Inoue	2018-02-22	1	-1/+1
	"a a" -> "a" llvm-svn: 325752
*	[ARM] Lower BR_CC for f16	Sjoerd Meijer	2018-02-20	1	-2/+1
	This case wasn't handled yet. Differential Revision: https://reviews.llvm.org/D43508 llvm-svn: 325616
*	[ARM] Mark -1 as cheap in xor's for thumb1	David Green	2018-02-20	1	-0/+7
	We can always convert xor %a, -1 into MVN, even in thumb 1 where the -1 would not otherwise be considered a cheap constant. This prevents the -1's from being pulled out into constants and potentially hoisted. Differential Revision: https://reviews.llvm.org/D43451 llvm-svn: 325573
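The identity behind the change above can be checked in plain C. This is only a sketch of why an xor with -1 can always be expressed as a single bitwise NOT (which is what MVN computes); the function names are hypothetical and nothing here is taken from the patch itself.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: xor with the all-ones constant (-1 as a signed value). */
uint32_t xor_all_ones(uint32_t a) { return a ^ UINT32_MAX; }

/* Hypothetical helper: plain bitwise NOT, the operation MVN performs. */
uint32_t bitwise_not(uint32_t a) { return ~a; }

/* Spot-check that both forms agree on a few representative inputs. */
void check_xor_is_not(void) {
    const uint32_t samples[] = {0u, 1u, 0x12345678u, 0x80000000u, UINT32_MAX};
    for (unsigned i = 0; i < sizeof samples / sizeof samples[0]; ++i)
        assert(xor_all_ones(samples[i]) == bitwise_not(samples[i]));
}
```

Because the two forms are equal for every input, the -1 never needs to live in a register or a literal pool, which is the sense in which the commit treats it as cheap.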
*	[ARM] Return true in enableMultipleCopyHints().	Jonas Paulsson	2018-02-16	1	-0/+1
	Enable multiple COPY hints to eliminate more COPYs during register allocation. Note that this is something all targets should do, see https://reviews.llvm.org/D38128. Review: Eli Friedman llvm-svn: 325327
*	[ARM] Materialise some boolean values to avoid a branch	Roger Ferrer Ibanez	2018-02-16	1	-10/+89
	This patch combines some cases of ARMISD::CMOV for integers that arise in comparisons of the form a != b ? x : 0 and a == b ? 0 : x, and that currently (e.g. in Thumb1) are emitted as branches. Differential Revision: https://reviews.llvm.org/D34515 llvm-svn: 325323
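To illustrate what "materialising" the boolean means for a != b ? x : 0, here is a minimal branch-free sketch in C. It is an assumption-laden illustration of the general idea, not the DAG combine from the patch: the helper name and the particular bit trick are chosen for this example only.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: computes (a != b ? x : 0) without a branch.
 * diff is zero exactly when a == b; for any non-zero diff, either diff or
 * its two's-complement negation has the top bit set, so the shifted value
 * is 1 when a != b and 0 otherwise. */
uint32_t select_if_ne(uint32_t a, uint32_t b, uint32_t x) {
    uint32_t diff = a ^ b;
    uint32_t is_ne = (diff | (0u - diff)) >> 31; /* 1 if a != b, else 0 */
    return x & (0u - is_ne);                     /* all-ones or zero mask */
}

/* Compare the branch-free form with the ternary expression it replaces. */
void check_select_if_ne(void) {
    assert(select_if_ne(1u, 2u, 7u) == (1u != 2u ? 7u : 0u));
    assert(select_if_ne(5u, 5u, 7u) == (5u != 5u ? 7u : 0u));
}
```

The mirrored form a == b ? 0 : x is the same expression, so a single helper covers both patterns named in the commit message.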
*	[ARM] Allow 64- and 128-bit types with 't' inline asm constraint	Pablo Barrio	2018-02-15	1	-0/+6
	Summary: In LLVM, 't' selects a floating-point/SIMD register and only supports 32-bit values. This is appropriately documented in the LLVM Language Reference Manual. However, this behaviour diverges from that of GCC, where 't' selects the s0-s31 registers and its qX and dX variants depending on additional operand modifiers (q/P). For example, the following C code: #include <arm_neon.h> float32x4_t a, b, x; asm("vadd.f32 %0, %1, %2" : "=t" (x) : "t" (a), "t" (b)) results in the following assembly if compiled with GCC: vadd.f32 s0, s0, s1 whereas LLVM will show "error: couldn't allocate output register for constraint 't'", since a, b, x are 128-bit variables, not 32-bit. This patch extends the use of 't' to mean that of GCC, thus allowing selection of the lower Q vector regs and their D/S variants. For example, the earlier code will now compile as: vadd.f32 q0, q0, q1 This behaviour still differs from that of GCC but I think it is actually more correct, since LLVM picks up the right register type based on the datatype of x, while GCC would need an extra operand modifier to achieve the same result, as follows: asm("vadd.f32 %q0, %q1, %q2" : "=t" (x) : "t" (a), "t" (b)) Since this is only an extension of functionality, existing code should not be affected by this change. Note that operand modifiers q/P are already supported by LLVM, so this patch should suffice to support inline assembly with constraint 't' originally built for GCC. Reviewers: grosbach, rengolin Reviewed By: rengolin Subscribers: rogfer01, efriedma, olista01, aemerson, javed.absar, eraman, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42962 llvm-svn: 325244
*	[ARM] f16 vcmp fixes	Sjoerd Meijer	2018-02-15	1	-4/+4
	This adds f16 VCMP match rules and fixes the test cases. Differential Revision: https://reviews.llvm.org/D43291 llvm-svn: 325228
*	[ARM] f16 stack spill/reloads	Sjoerd Meijer	2018-02-14	1	-1/+21
	This adds support for handling f16 stack spills/reloads. Differential Revision: https://reviews.llvm.org/D43280 llvm-svn: 325130
*	[ARM] Allow half types in ConstantPool	Sjoerd Meijer	2018-02-13	1	-1/+9
	Change ARMConstantIslandPass to: - accept f16 literals as litpool entries, - if the litpool needs to be inserted in the middle of a big block, then we need to 4-byte align the next instruction in ARM mode. Differential Revision: https://reviews.llvm.org/D42784 llvm-svn: 325012
*	[ARM] Don't print "Requires NEON" error message for M-profile	Andre Vieira	2018-02-13	1	-0/+2
	Differential Revision: https://reviews.llvm.org/D43125 llvm-svn: 325000
*	[Thumb] Handle addressing mode AddrMode5FP16	Sjoerd Meijer	2018-02-13	1	-0/+14
	This addressing mode wasn't checked, so we were running into an assert. Differential Revision: https://reviews.llvm.org/D43179 llvm-svn: 324996
*	[ARMFastISel] Replace deprecated calls to MemoryIntrinsic::getAlignment() (NFCI)	Daniel Neilson	2018-02-09	1	-1/+2
	This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes ARMFastISel to cease using the old getAlignment() API of MemoryIntrinsic in favour of getting source & dest specific alignments through the new API. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use MemIntrinsicInst::[get|set]Alignment() to use [get|set]DestAlignment() and [get|set]SourceAlignment() instead. ( rL323886, rL323891, rL324148, rL324273, rL324278, rL324384, rL324395, rL324402, rL324626, rL324642, rL324653, rL324654, rL324773, rL324774 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 324781
*	[ARM] Re-commit r324600 with fixed LLVMBuild.txt	Oliver Stannard	2018-02-08	2	-10/+3
	ARMDisassembler now depends on the banked register tables in ARMUtils, so the LLVMBuild.txt needed updating to reflect this. Original commit message: [ARM] Fix disassembly of invalid banked register moves When disassembling banked register move instructions, we don't have an assembly syntax for the unallocated register numbers, so we have to return Fail rather than SoftFail. Previously we were returning SoftFail, then crashing in the InstPrinter as we have no way to represent these encodings in an assembly string. This also switches the decoder to use the table-generated list of banked registers, removing the duplicated list of encodings. Differential revision: https://reviews.llvm.org/D43066 llvm-svn: 324606
*	Revert r324600 as it breaks a buildbot	Oliver Stannard	2018-02-08	1	-2/+9
	The broken bot (clang-ppc64le-linux-multistage) is doing a shared-object build, so I guess using lookupBankedRegByEncoding in the disassembler is a layering violation? llvm-svn: 324604
*	[ARM] Fix disassembly of invalid banked register moves	Oliver Stannard	2018-02-08	1	-9/+2
	When disassembling banked register move instructions, we don't have an assembly syntax for the unallocated register numbers, so we have to return Fail rather than SoftFail. Previously we were returning SoftFail, then crashing in the InstPrinter as we have no way to represent these encodings in an assembly string. This also switches the decoder to use the table-generated list of banked registers, removing the duplicated list of encodings. Differential revision: https://reviews.llvm.org/D43066 llvm-svn: 324600
*	ARM: Remove dead code. NFCI.	Peter Collingbourne	2018-02-08	2	-6/+0
	llvm-svn: 324565
*	[ARM] FP16 mov imm pattern	Sjoerd Meijer	2018-02-07	1	-3/+4
	This is a follow up of r324321, adding a match pattern for mov with a FP16 immediate (also fixing operand vfp_f16imm that wasn't even compiling). Differential Revision: https://reviews.llvm.org/D42973 llvm-svn: 324456
*	[ARM] f16 conversions	Sjoerd Meijer	2018-02-06	1	-16/+23
	This is a follow up of r324321, adding f16 <-> f32 and f16 <-> f64 conversion match patterns. Differential Revision: https://reviews.llvm.org/D42954 llvm-svn: 324360
*	[ARM][AArch64] Add CSDB speculation barrier instruction	Oliver Stannard	2018-02-06	3	-11/+17
	This adds the CSDB instruction, which is a new barrier instruction described by the whitepaper at [1]. This is in encoding space which was previously executed as a NOP, so it is available for all targets that have the relevant NOP encoding space. This matches the binutils behaviour for these instructions [2][3]. [1] https://developer.arm.com/support/security-update [2] https://sourceware.org/ml/binutils/2018-01/msg00116.html [3] https://sourceware.org/ml/binutils/2018-01/msg00120.html llvm-svn: 324324
*	[ARM] Armv8.2-A FP16 code generation (part 3/3)	Sjoerd Meijer	2018-02-06	3	-34/+107
	This adds most of the FP16 codegen support, but these areas need further work: - FP16 literals and immediates are not properly supported yet (e.g. literal pool needs work), - Instructions that are generated from intrinsics (e.g. vabs) haven't been added. This will be addressed in follow-up patches. Differential Revision: https://reviews.llvm.org/D42849 llvm-svn: 324321
*	[ARM] FullFP16 LowerReturn Fix	Sjoerd Meijer	2018-02-01	1	-2/+2
	Commit r323512 introduced an optimisation in LowerReturn for half-precision return values. A missing check caused a crash when the return value is "undef" (i.e. a node that has no operands). Differential Revision: https://reviews.llvm.org/D42743 llvm-svn: 323968
*	[ARM] Add support for unpredictable MVN instructions.	Yvan Roux	2018-02-01	1	-2/+8
	This fixes bugzilla 33011 https://bugs.llvm.org/show_bug.cgi?id=33011 Defines bits {19-16} as zero or unpredictable as specified by the ARM ARM in sections A8.8.116 and A8.8.117. It also fixes the usage of the PC register as destination register for the MVN register-shifted register version as specified in A8.8.117. Differential Revision: https://reviews.llvm.org/D41905 llvm-svn: 323954
*	Test commit: Fix a comment.	Yvan Roux	2018-02-01	1	-1/+1
	llvm-svn: 323947
*	Revert "[ARM] Lower lower saturate to 0 and lower saturate to -1 using bit-operations"	Evgeniy Stepanov	2018-01-31	1	-20/+0
	Miscompiles code. Testcase pending. This reverts commit r323869. llvm-svn: 323929
*	Fix formatting for r323876. NFC	Diana Picus	2018-01-31	1	-5/+5
	llvm-svn: 323878
*	[ARM GlobalISel] Modernize LegalizerInfo. NFCI	Diana Picus	2018-01-31	1	-126/+68
	Start using the new LegalizerInfo API introduced in r323681. Keep the old API for opcodes that need Lowering in some circumstances (G_FNEG and G_UREM/G_SREM). llvm-svn: 323876
*	[ARM] Lower lower saturate to 0 and lower saturate to -1 using bit-operations	Pablo Barrio	2018-01-31	1	-0/+20
	Summary: Expressions of the form x < 0 ? 0 : x; and x < -1 ? -1 : x can be lowered using bit-operations instead of branching or conditional moves. In thumb-mode this results in a two-instruction sequence, a shift followed by a bic or an or, while in ARM/thumb2 mode, which has a flexible second operand, the shift can be folded into a single bic/or instruction. In most cases this results in smaller code and possibly fewer branches, and in no case larger than before. Patch by Marten Svanfeldt. Reviewers: fhahn, pbarrio Reviewed By: pbarrio Subscribers: efriedma, rogfer01, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42574 llvm-svn: 323869
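The two scalar identities named in this commit message can be sketched in C as a shift followed by a bit-clear or an or. This is an illustration only: the function names are hypothetical, and it assumes that right-shifting a negative int32_t is an arithmetic shift, which is implementation-defined in C although GCC and Clang behave that way. It says nothing about the backend lowering itself, which this log shows was reverted in r323929 for miscompiling code.

```c
#include <assert.h>
#include <stdint.h>

/* x < 0 ? 0 : x
 * (x >> 31) is all-ones for negative x and zero otherwise (assumed arithmetic
 * shift), so clearing those bits zeroes negative inputs. Mirrors shift + bic. */
int32_t clamp_low_to_zero(int32_t x) { return x & ~(x >> 31); }

/* x < -1 ? -1 : x
 * Or-ing in the sign mask turns every negative input into -1 and leaves
 * non-negative inputs unchanged. Mirrors shift + or. */
int32_t clamp_low_to_minus_one(int32_t x) { return x | (x >> 31); }

/* Compare both helpers with the ternary expressions they stand in for. */
void check_clamps(void) {
    const int32_t samples[] = {INT32_MIN, -5, -1, 0, 1, 42, INT32_MAX};
    for (unsigned i = 0; i < sizeof samples / sizeof samples[0]; ++i) {
        int32_t x = samples[i];
        assert(clamp_low_to_zero(x) == (x < 0 ? 0 : x));
        assert(clamp_low_to_minus_one(x) == (x < -1 ? -1 : x));
    }
}
```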
*	[ARM] Armv8.2-A FP16 code generation (part 2/3)	Sjoerd Meijer	2018-01-31	3	-35/+86
	Half-precision arguments and return values are passed as if they were an int or float for ARM. This results in truncates and bitcasts to/from i16 and f16 values, which are legalized very early to stack stores/loads. When FullFP16 is enabled, we want to avoid codegen for these bitcasts as it is unnecessary and inefficient. Differential Revision: https://reviews.llvm.org/D42580 llvm-svn: 323861
*	[ARM] Allow the scheduler to clone a node with glue to avoid a copy CPSR ↔ GPR.	Roger Ferrer Ibanez	2018-01-31	2	-0/+14
	In Thumb 1, with the new ADDCARRY / SUBCARRY the scheduler may need to do copies CPSR ↔ GPR but not all Thumb1 targets implement them. The scheduler can attempt, before attempting a copy, to clone the instructions but it does not currently do that for nodes with input glue. In this patch we introduce a target-hook to let the hook decide if a glued machinenode is still eligible for copying. In this case these are ARM::tADCS and ARM::tSBCS. As a follow-up of this change we should actually implement the copies for the Thumb1 targets that do implement them and restrict the hook to the targets that can't really do such copy as these clones are not ideal. This change fixes PR35836. Differential Revision: https://reviews.llvm.org/D42051 llvm-svn: 323857
*	[ARM GlobalISel] Map G_SITOFP and G_UITOFP	Diana Picus	2018-01-30	1	-0/+14
	Straightforward mapping (integer operand to GPR, floating point operand to FPR). llvm-svn: 323731
*	[ARM GlobalISel] Legalize G_SITOFP and G_UITOFP	Diana Picus	2018-01-30	1	-0/+11
	Legal if we have hardware support, libcall otherwise. Also add supporting code to the legalizer helper for libcalls. llvm-svn: 323730
*	[ARM GlobalISel] Map G_FPTOSI and G_FPTOUI	Diana Picus	2018-01-30	1	-0/+14
	Straightforward mapping (integer operand goes to GPR, floating point operand goes to FPR). llvm-svn: 323727
*	[ARM GlobalISel] Legalize G_FPTOSI and G_FPTOUI	Diana Picus	2018-01-30	1	-3/+12
	Legal if we have hardware support for floating point, libcalls otherwise. Also add the necessary support for libcalls in the legalizer helper. llvm-svn: 323726
*	[ARM][GISel] PR35965 Constrain RegClasses of nested instructions built from Dst Pattern	Daniel Sanders	2018-01-29	1	-0/+3
	Summary: Apparently, we missed constraining register classes of VReg-operands of all the instructions built from a destination pattern but the root (top-level) one. The issue exposed itself while selecting G_FPTOSI for armv7: the corresponding pattern generates VTOSIZS wrapped into COPY_TO_REGCLASS, so top-level COPY_TO_REGCLASS gets properly constrained, while nested VTOSIZS (or rather its destination virtual register to be exact) does not. Fixing this by issuing GIR_ConstrainSelectedInstOperands for every nested GIR_BuildMI. https://bugs.llvm.org/show_bug.cgi?id=35965 rdar://problem/36886530 Patch by Roman Tereshin Reviewers: dsanders, qcolombet, rovka, bogner, aditya_nandakumar, volkan Reviewed By: dsanders, qcolombet, rovka Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42565 llvm-svn: 323692
*	[globalisel] Make LegalizerInfo::LegalizeAction available outside of LegalizerInfo. NFC	Daniel Sanders	2018-01-29	1	-10/+12
	Summary: The improvements to the LegalizerInfo discussed in D42244 require that LegalizerInfo::LegalizeAction be available for use in other classes. As such, it needs to be moved out of LegalizerInfo. This has been done separately to the next patch to minimize the noise in that patch. llvm-svn: 323669
*	[ARM] FP16Pat and FullFP16Pat patterns. NFC.	Sjoerd Meijer	2018-01-29	2	-10/+16
	Create and use FP16Pat and FullFP16Pat helper patterns to make the difference explicit. Differential Revision: https://reviews.llvm.org/D42634 llvm-svn: 323640
*	[ARM] Accept a subset of Thumb GPR register class when emitting an SP-relative load instruction	Momchil Velikov	2018-01-26	1	-2/+2
	The function `Thumb1InstrInfo::loadRegFromStackSlot` accepts only the `tGPR` register class. The function serves to emit a `tLDRspi` instruction and certainly any subset of the `tGPR` register class is a valid destination of the load. Differential revision: https://reviews.llvm.org/D42535 llvm-svn: 323514
*	[ARM] Armv8.2-A FP16 code generation (part 1/3)	Sjoerd Meijer	2018-01-26	9	-28/+166
	This is the groundwork for Armv8.2-A FP16 code generation. Clang passes and returns _Float16 values as floats, together with the required bitconverts and truncs etc. to implement correct AAPCS behaviour, see D42318. We will implement half-precision argument passing/returning lowering in the ARM backend soon, but for now this means that this: _Float16 sub(_Float16 a, _Float16 b) { return a + b; } gets lowered to this: define float @sub(float %a.coerce, float %b.coerce) { entry: %0 = bitcast float %a.coerce to i32 %tmp.0.extract.trunc = trunc i32 %0 to i16 %1 = bitcast i16 %tmp.0.extract.trunc to half <SNIP> %add = fadd half %1, %3 <SNIP> } When FullFP16 is not supported, we don't make f16 a legal type, and we get legalization for "free", i.e. nothing changes and everything works as before. And also f16 argument passing/returning is handled. When FullFP16 is supported, we do make f16 a legal type, and have 2 places that we need to patch up: f16 argument passing and returning, which involves minor tweaks to avoid unnecessary code generation for some bitcasts. As a "demonstrator" that this works for the different FP16, FullFP16, softfp modes, etc., I've added match rules to the VSUB instruction description showing that we can codegen this instruction from IR, but more importantly, also to some conversion instructions. These conversions were causing issues before in the FP16 and FullFP16 cases. I've also added match rules to the VLDRH and VSTRH descriptions, so that we can actually compile the entire half-precision sub code example above. This showed that these loads and stores had the wrong addressing mode specified: AddrMode5 instead of AddrMode5FP16, which turned out not to be implemented at all, so that has also been added. This is the minimal patch that shows all the different moving parts. In patch 2/3 I will add some efficient lowering of bitcasts, and in 3/3 I will add the remaining Armv8.2-A FP16 instruction descriptions. Thanks to Sam Parker and Oliver Stannard for their help and reviews! Differential Revision: https://reviews.llvm.org/D38315 llvm-svn: 323512
*	[ARM] Expand long shifts for Thumb1 to __aeabi_ calls	Weiming Zhao	2018-01-24	1	-0/+7
	Summary: For long shifts, the inlined version takes about 20 instructions on Thumb1. To avoid the code bloat, expand to __aeabi_ calls if target is Thumb1. Reviewers: samparker Reviewed By: samparker Subscribers: samparker, aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42401 llvm-svn: 323354
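To give a feel for why an inlined 64-bit shift is bulky on a 32-bit core, the sketch below composes a left shift from two 32-bit halves in C. It is a hypothetical illustration of the general technique, not the code the backend emits and not the implementation of any __aeabi_ helper; the function name and the masking of the shift amount to 0..63 are choices made for this example.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: 64-bit left shift built from 32-bit operations only.
 * Three cases are needed because shifting a 32-bit value by 32 or more is
 * undefined in C, and bits must be carried from the low half to the high half. */
uint64_t shl64_by_halves(uint32_t lo, uint32_t hi, unsigned n) {
    uint32_t new_lo, new_hi;
    n &= 63u; /* this sketch only handles shift amounts 0..63 */
    if (n == 0) {
        new_lo = lo;
        new_hi = hi;
    } else if (n < 32) {
        new_hi = (hi << n) | (lo >> (32 - n)); /* carry bits out of the low half */
        new_lo = lo << n;
    } else {
        new_hi = lo << (n - 32); /* the low half moves entirely into the high half */
        new_lo = 0;
    }
    return ((uint64_t)new_hi << 32) | new_lo;
}

/* Compare against the native 64-bit shift for every valid shift amount. */
void check_shl64(void) {
    const uint64_t v = 0x0123456789ABCDEFull;
    for (unsigned n = 0; n < 64; ++n)
        assert(shl64_by_halves((uint32_t)v, (uint32_t)(v >> 32), n) == (v << n));
}
```

Each of the three cases costs several instructions plus the compares and branches that select between them, which is consistent with the commit's preference for a single library call on Thumb1.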
*	[ARM] Call __chkstk for dynamic stack allocation in all windows environments	Martin Storsjo	2018-01-24	1	-2/+2
	This matches what MSVC does for alloca() function calls on ARM. Even if MSVC doesn't support VLAs at the language level, it does support the alloca function. On the clang level, both the _alloca() (when emulating MSVC, which is what the alloca() function expands to) and __builtin_alloca() builtin functions, and VLAs, map to the same LLVM IR "alloca" function - so within LLVM they're not distinguishable from each other. Differential Revision: https://reviews.llvm.org/D42292 llvm-svn: 323308
*	[ARM] Cleanup part of ARMBaseInstrInfo::optimizeCompareInstr (NFCI).	Joel Galenson	2018-01-22	1	-12/+8
	As noted in another review, this loop is confusing. This commit cleans it up somewhat. Differential Revision: https://reviews.llvm.org/D42312 llvm-svn: 323136
*	Separate LoopTraversal, ReachingDefAnalysis and BreakFalseDeps into their own files.	Marina Yatsina	2018-01-22	1	-1/+1
	This is one of multiple patches that fix bugzilla https://bugs.llvm.org/show_bug.cgi?id=33869 Most of the patches are intended to refactor the existing code. Additional relevant reviews: https://reviews.llvm.org/D40330 https://reviews.llvm.org/D40331 https://reviews.llvm.org/D40332 https://reviews.llvm.org/D40334 Differential Revision: https://reviews.llvm.org/D40333 Change-Id: Ie5f8eb34d98cfdfae23a3072eb69b5794f0e2d56 llvm-svn: 323095
*	Rename ExecutionDepsFix files to ExecutionDomainFix	Marina Yatsina	2018-01-22	1	-1/+1
	This is one of multiple patches that fix bugzilla https://bugs.llvm.org/show_bug.cgi?id=33869 Most of the patches are intended to refactor the existing code. Additional relevant reviews: https://reviews.llvm.org/D40330 https://reviews.llvm.org/D40331 https://reviews.llvm.org/D40333 https://reviews.llvm.org/D40334 Differential Revision: https://reviews.llvm.org/D40332 Change-Id: I6a048cca7fdafbfc42fb1bac94343e483befded8 llvm-svn: 323094