bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[AArch64][GlobalISel] Legalize G_INTRINSIC_ROUND	Jessica Paquette	2019-04-23	1	-2/+2
\| \| \| \| \| \| \|	Add it to the same rule as G_FCEIL etc. Add a legalizer test, and add a missing switch case to AArch64LegalizerInfo.cpp. llvm-svn: 359033
*	[AArch64][GlobalISel] Actually select G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	1	-1/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Apparently FileCheck wasn't actually matching the fallback check lines in arm64-vfloatintrinsics.ll properly. So, there were selection fallbacks for G_INTRINSIC_TRUNC there. Actually hook it up into AArch64InstructionSelector.cpp and write a proper selection test. I guess I'll figure out the FileCheck magic to make the fallback checks work properly in arm64-vfloatintrinsics.ll. llvm-svn: 359030
*	[AArch64][GlobalISel] Teach regbankselect about G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	1	-0/+1
\| \| \| \| \| \| \| \|	Add it to isPreISelGenericFloatingPointOpcode, and add a regbankselect test. Update arm64-vfloatintrinsics.ll now that we can select it. llvm-svn: 359022
*	[AArch64][GlobalISel] Legalize G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	1	-1/+1
\| \| \| \| \| \| \| \| \|	Same patch as G_FCEIL etc. Add the missing switch case in widenScalar, add G_INTRINSIC_TRUNC to the correct rule in AArch64LegalizerInfo.cpp, and add a test. llvm-svn: 359021
*	[AMDGPU] Fixed addReg() in SIOptimizeExecMaskingPreRA.cpp	Stanislav Mekhanoshin	2019-04-23	1	-1/+1
\| \| \| \| \| \| \| \|	The second argument is flags, not subreg. Differential Revision: https://reviews.llvm.org/D61031 llvm-svn: 359017
*	[AArch64][GlobalISel] Legalize G_FMA for more vector types	Jessica Paquette	2019-04-23	1	-2/+3
\| \| \| \| \| \| \| \| \|	Same as G_FCEIL, G_FABS, etc. Just move it into that rule. Add a legalizer test for G_FMA, which we didn't have before and update arm64-vfloatintrinsics.ll. llvm-svn: 359015
*	[AArch64][GlobalISel] Add G_FMA to isPreISelGenericFloatingPointOpcode	Jessica Paquette	2019-04-23	1	-0/+1
\| \| \| \| \| \| \| \|	Noticed an unnecessary fallback in arm64-vmul caused by this. Also add a regbankselect test for G_FMA. llvm-svn: 359013
*	[x86] use psubus for more vsetcc lowering (PR39859)	Sanjay Patel	2019-04-23	1	-8/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Circling back to a leftover bit from PR39859: https://bugs.llvm.org/show_bug.cgi?id=39859#c1 ...we have this counter-intuitive (based on the test diffs) opportunity to use 'psubus'. This appears to be the better perf option for both Haswell and Jaguar based on llvm-mca. We already do this transform for the SETULT predicate, so this makes the code more symmetrical too. If we have pminub/pminuw, we prefer those, so this should not affect anything but pre-SSE4.1 subtargets. $ cat before.s movdqa -16(%rip), %xmm2 ## xmm2 = [32768,32768,32768,32768,32768,32768,32768,32768] pxor %xmm0, %xmm2 pcmpgtw -32(%rip), %xmm2 ## xmm2 = [255,255,255,255,255,255,255,255] pand %xmm2, %xmm0 pandn %xmm1, %xmm2 por %xmm2, %xmm0 $ cat after.s movdqa -16(%rip), %xmm2 ## xmm2 = [256,256,256,256,256,256,256,256] psubusw %xmm0, %xmm2 pxor %xmm3, %xmm3 pcmpeqw %xmm2, %xmm3 pand %xmm3, %xmm0 pandn %xmm1, %xmm3 por %xmm3, %xmm0 $ llvm-mca before.s -mcpu=haswell Iterations: 100 Instructions: 600 Total Cycles: 909 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 0.77 IPC: 0.66 Block RThroughput: 1.8 $ llvm-mca after.s -mcpu=haswell Iterations: 100 Instructions: 700 Total Cycles: 409 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.71 IPC: 1.71 Block RThroughput: 1.8 Differential Revision: https://reviews.llvm.org/D60838 llvm-svn: 358999
*	[SPARC] Use the correct register set for the "r" asm constraint.	Joerg Sonnenberger	2019-04-23	1	-0/+2
\| \| \| \| \| \| \| \|	64bit mode must use 64bit registers, otherwise assumptions about the top half of the registers are made. Problem found by Takeshi Nakayama in NetBSD. llvm-svn: 358998
*	Use llvm::stable_sort	Fangrui Song	2019-04-23	1	-2/+1
\| \| \| \| \| \|	While touching the code, simplify if feasible. llvm-svn: 358996
*	[RISCV] Support assembling %tls_{ie,gd}_pcrel_hi modifiers	Lewis Revill	2019-04-23	8	-4/+48
\| \| \| \| \| \| \| \| \|	This patch adds support for parsing and assembling the %tls_ie_pcrel_hi and %tls_gd_pcrel_hi modifiers. Differential Revision: https://reviews.llvm.org/D55342 llvm-svn: 358994
*	[AMDGPU] Fix hidden argument metadata duplication for V3	Scott Linder	2019-04-23	1	-30/+0
\| \| \| \| \| \| \| \| \| \|	Essentially complete a proper rebase of the V3 metadata change over https://reviews.llvm.org/D49096. Minimize the diff between the V2 and V3 variants of the relevant lit tests, and clean up some trailing whitespace. llvm-svn: 358992
*	[X86] Pull out collectConcatOps helper. NFCI.	Simon Pilgrim	2019-04-23	1	-21/+51
\| \| \| \| \| \|	Create collectConcatOps helper that returns all the subvector ops for CONCAT_VECTORS or a INSERT_SUBVECTOR series. llvm-svn: 358989
*	ARM: disallow add/sub to sp unless Rn is also sp.	Tim Northover	2019-04-23	2	-1/+27
\| \| \| \| \| \| \| \|	The manual says that Thumb2 add/sub instructions are only allowed to modify sp if the first source is also sp. This is slightly different from the usual rGPR restriction since it's context-sensitive, so implement it in C++. llvm-svn: 358987
*	AMDGPU: Fix LCSSA phi lowering in SILowerI1Copies	Nicolai Haehnle	2019-04-23	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When an LCSSA phi survives through instruction selection, the pass ends up removing that phi entirely because it is dominated by the logic that does the lanemask merging. This then used to trigger an assertion when processing a dependent phi instruction. Change-Id: Id4949719f8298062fe476a25718acccc109113b6 Reviewers: llvm-commits Subscribers: kzhuravl, jvesely, wdng, yaxunl, t-tye, tpr, dstuttard, rtaylor, arsenm Tags: #llvm Differential Revision: https://reviews.llvm.org/D60999 llvm-svn: 358983
*	[CallSite removal] move InlineCost to CallBase usage	Fedor Sergeev	2019-04-23	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \|	Converting InlineCost interface and its internals into CallBase usage. Inliners themselves are still not converted. Reviewed By: reames Tags: #llvm Differential Revision: https://reviews.llvm.org/D60636 llvm-svn: 358982
*	[ARM] Update check for CBZ in Ifcvt	David Green	2019-04-23	3	-43/+59
\| \| \| \| \| \| \| \| \| \| \|	The check for creating CBZ in constant island pass recently obtained the ability to search backwards to find a Cmp instruction. The code in IfCvt should mirror this to allow more conversions to the smaller form. The common code has been pulled out into a separate function to be shared between the two places. Differential Revision: https://reviews.llvm.org/D60090 llvm-svn: 358977
*	[ARM] Don't replicate instructions in Ifcvt at minsize	David Green	2019-04-23	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	Ifcvt can replicate instructions as it converts them to be predicated. This stops that from happening on thumb2 targets at minsize where an extra IT instruction is likely needed. Differential Revision: https://reviews.llvm.org/D60089 llvm-svn: 358974
*	Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFCI.	Simon Pilgrim	2019-04-23	1	-2/+2
\| \| \| \|	llvm-svn: 358969
*	[AArch64] Add support for MTE intrinsics	Javed Absar	2019-04-23	4	-22/+78
\| \| \| \| \| \| \| \| \| \| \|	This patch provides intrinsics support for Memory Tagging Extension (MTE), which was introduced with the Armv8.5-a architecture. The intrinsics are described in detail in the latest ACLE Q1 2019 documentation: https://developer.arm.com/docs/101028/latest Reviewed by: David Spickett Differential Revision: https://reviews.llvm.org/D60486 llvm-svn: 358963
*	[ARM][FIX] Add missing f16.lane.vldN/vstN lowering	Diogo N. Sampaio	2019-04-23	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add missing D and Q lane VLDSTLane lowering for fp16 elements. Reviewers: efriedma, kosarev, SjoerdMeijer, ostannard Reviewed By: efriedma Subscribers: javed.absar, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60874 llvm-svn: 358962
*	[WebAssembly] Bail out of fastisel earlier when computing PIC addresses	Sam Clegg	2019-04-23	1	-11/+6
\| \| \| \| \| \| \| \| \| \| \|	This change partially reverts https://reviews.llvm.org/D54647 in favor of bailing out during computeAddress instead. This catches the condition earlier and handles more cases. Differential Revision: https://reviews.llvm.org/D60986 llvm-svn: 358948
*	[SelectionDAG] move splat util functions up from x86 lowering	Sanjay Patel	2019-04-22	1	-56/+1
\| \| \| \| \| \| \| \| \| \|	This was supposed to be NFC, but the change in SDLoc definitions causes instruction scheduling changes. There's nothing x86-specific in this code, and it can likely be used from DAGCombiner's simplifyVBinOp(). llvm-svn: 358930
*	[AMDGPU] Fix an issue in `op_sel_hi` skipping.	Michael Liao	2019-04-22	1	-7/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: - Only apply packed literal `op_sel_hi` skipping on operands requiring packed literals. Even an instruction is `packed`, it may have operand requiring non-packed literal, such as `v_dot2_f32_f16`. Reviewers: rampitec, arsenm, kzhuravl Subscribers: jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60978 llvm-svn: 358922
*	AMDGPU: Skip debug instructions in assert	Matt Arsenault	2019-04-22	1	-2/+7
\| \| \| \| \| \| \| \| \| \|	These are inserted after branch relaxation, and for some reason it's decided to put them in the long branch expansion block. It's probably not great to rely on the source block address, so this should probably be switched to being PC relative instead of relying on the block address llvm-svn: 358909
*	AMDGPU/GlobalISel: Fix non-power-of-2 G_EXTRACT sources	Matt Arsenault	2019-04-22	1	-1/+3
\| \| \| \|	llvm-svn: 358894
*	AMDGPU: Fix not checking for copy when looking at copy src	Matt Arsenault	2019-04-22	1	-1/+6
\| \| \| \| \| \| \|	Effectively reverts r356956. The check for isFullCopy was excessive, but there still needs to be a check that this is a copy. llvm-svn: 358890
*	[AMDGPU][MC] Corrected parsing of SP3 'neg' modifier	Dmitry Preobrazhensky	2019-04-22	1	-24/+58
\| \| \| \| \| \| \| \| \| \|	See bug 41156: https://bugs.llvm.org/show_bug.cgi?id=41156 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D60624 llvm-svn: 358888
*	[TargetLowering][AMDGPU][X86] Improve SimplifyDemandedBits bitcast handling	Simon Pilgrim	2019-04-22	1	-11/+25
\| \| \| \| \| \| \| \| \| \| \| \|	This patch adds support for BigBitWidth -> SmallBitWidth bitcasts, splitting the DemandedBits/Elts accordingly. The AMDGPU backend needed an extra (srl (and x, c1 << c2), c2) -> (and (srl(x, c2), c1) combine to encourage BFE creation, I investigated putting this in DAGCombine but it caused a lot of noise on other targets - some improvements, some regressions. The X86 changes are all definite wins. Differential Revision: https://reviews.llvm.org/D60462 llvm-svn: 358887
*	[X86] Reject 512-bit types in getRegForInlineAsmConstraint when AVX512 is ↵	Craig Topper	2019-04-22	1	-2/+5
\| \| \| \| \| \|	not enabled. Same for 256 bit and AVX. llvm-svn: 358872
*	[ARM] Rewrite isLegalT2AddressImmediate	David Green	2019-04-21	1	-29/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This does two main things, firstly adding some at least basic addressing modes for i64 types, and secondly treats floats and doubles sensibly when there is no fpu. The floating point change can help codesize in some cases, especially with D60294. Most backends seems to not consider the exact VT in isLegalAddressingMode, instead switching on type size. That is now what this does when the target does not have an fpu (as the float data will be loaded using LDR's). i64's currently use the address range of an LDRD (even though they may be legalised and loaded with an LDR). This is at least better than marking them all as illegal addressing modes. I have not attempted to do much with vectors yet. That will need changing once MVE is added. Differential Revision: https://reviews.llvm.org/D60677 llvm-svn: 358845
*	[X86] Add the rounding control operand to the printing for some scalar FMA ↵	Craig Topper	2019-04-21	1	-1/+1
\| \| \| \| \| \|	instructions. llvm-svn: 358844
*	[X86] Don't form masked vfpclass instruction from and+vfpclass unless the ↵	Craig Topper	2019-04-21	1	-28/+36
\| \| \| \| \| \|	fpclass only has a single use. llvm-svn: 358841
*	Revert r358800. Breaks Obsequi from the test suite.	Amara Emerson	2019-04-20	1	-5/+8
\| \| \| \| \| \| \|	The last attempt fixed gcc and consumer-typeset, but Obsequi seems to fail with a different issue. llvm-svn: 358829
*	[X86] Disable argument copy elision for arguments passed via pointers	Craig Topper	2019-04-20	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If you pass two 1024 bit vectors in IR with AVX2 on Windows 64. Both vectors will be split in four 256 bit pieces. The four pieces of the first argument will be passed indirectly using 4 gprs. The second argument will get passed via pointers in memory. The PartOffsets stored for the second argument are all in terms of its original 1024 bit size. So the PartOffsets for each piece are 32 bytes apart. So if we consider it for copy elision we'll only load an 8 byte pointer, but we'll move the address 32 bytes. The stack object size we create for the first part is probably wrong too. This issue was encountered by ISPC. I'm working on getting a reduce test case, but wanted to go ahead and get feedback on the fix. Reviewers: rnk Reviewed By: rnk Subscribers: dbabokin, llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D60801 llvm-svn: 358817
*	[X86] Fix stack probing on x32 (PR41477)	Nikita Popov	2019-04-20	2	-8/+15
\| \| \| \| \| \| \| \| \| \|	Fix for https://bugs.llvm.org/show_bug.cgi?id=41477. On the x32 ABI with stack probing a dynamic alloca will result in a WIN_ALLOCA_32 with a 32-bit size. The current implementation tries to copy it into RAX, resulting in a physreg copy error. Fix this by copying to EAX instead. Also fix incorrect opcodes or registers used in subs. llvm-svn: 358807
*	[X86] Don't turn (and (shl X, C1), C2) into (shl (and X, (C1 >> C2), C2) if ↵	Craig Topper	2019-04-20	1	-18/+37
\| \| \| \| \| \| \| \| \| \|	the original AND can represented by MOVZX. The MOVZX doesn't require an immediate to be encoded at all. Though it does use a 2 byte opcode so its the same size as a 1 byte immediate. But it has a separate source and dest register so can help avoid copies. llvm-svn: 358805
*	[X86] Turn (and (anyextend (shl X, C1), C2)) into (shl (and (anyextend X), ↵	Craig Topper	2019-04-20	1	-9/+29
\| \| \| \| \| \| \| \| \|	(C1 >> C2), C2) if the AND could match a movzx. There's one slight regression in here because we don't check that the immediate already allowed movzx before the shift. I'll fix that next. llvm-svn: 358804
*	Revert "Revert "[GlobalISel] Add legalization support for non-power-2 loads ↵	Amara Emerson	2019-04-19	1	-8/+5
\| \| \| \| \| \| \| \| \|	and stores"" We were shifting the wrong component of a split load when trying to combine them back into a single value. llvm-svn: 358800
*	[GlobalISel][AArch64] Legalize + select G_FRINT	Jessica Paquette	2019-04-19	2	-1/+2
\| \| \| \| \| \| \| \| \| \|	Exactly the same as G_FCEIL, G_FABS, etc. Add tests for the fp16/nofp16 behaviour, update arm64-vfloatintrinsics, etc. Differential Revision: https://reviews.llvm.org/D60895 llvm-svn: 358799
*	[WebAssembly] FastISel: Don't fallback to SelectionDAG after BuildMI in ↵	Sam Clegg	2019-04-19	1	-6/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	selectCall My understanding is that once BuildMI has been called we can't fallback to SelectionDAG. This change moves the fallback for when getRegForValue() fails for that target of an indirect call. This was failing in -fPIC mode when the callee is GlobalValue. Add a test case that tickles this. Differential Revision: https://reviews.llvm.org/D60908 llvm-svn: 358793
*	[AArch64] Fix checks for AArch64MCExpr::VK_SABS flag.	Eli Friedman	2019-04-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	VK_SABS is part of the SymLoc bitfield in the variant kind which should be compared for equality, not by checking the VK_SABS bit. As far as I know, the existing code happened to produce the correct results in all cases, so this is just a cleanup. Patch by Stephen Crane. Differential Revision: https://reviews.llvm.org/D60596 llvm-svn: 358788
*	Revert "[GlobalISel] Add legalization support for non-power-2 loads and stores"	Amara Emerson	2019-04-19	1	-5/+8
\| \| \| \| \| \|	This introduces some runtime failures which I'll need to investigate further. llvm-svn: 358771
*	[GlobalISel][AArch64] Legalize vector G_FPOW	Jessica Paquette	2019-04-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	This instruction is legalized in the same way as G_FSIN, G_FCOS, G_FLOG10, etc. Update legalize-pow.mir and arm64-vfloatintrinsics.ll to reflect the change. Differential Revision: https://reviews.llvm.org/D60218 llvm-svn: 358764
*	[CodeGen] Add "const" to MachineInstr::mayAlias	Bjorn Pettersson	2019-04-19	14	-54/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The basic idea here is to make it possible to use MachineInstr::mayAlias also when the MachineInstr is const (or the "Other" MachineInstr is const). The addition of const in MachineInstr::mayAlias then rippled down to the need for adding const in several other places, such as TargetTransformInfo::getMemOperandWithOffset. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, MatzeB, arsenm, jvesely, nhaehnle, hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60856 llvm-svn: 358744
*	[AMDGPU] Ignore non-SUnits edges	Piotr Sobczak	2019-04-19	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Ignore edges to non-SUnits (e.g. ExitSU) when checking for low latency instructions. When calling the function isLowLatencyInstruction(), an ExitSU could be on the list of successors, not necessarily a regular SU. In other places in the code there is a check "Succ->NodeNum >= DAGSize" to prevent further processing of ExitSU as "Succ->getInstr()" is NULL in such a case. Also, 8 out of 9 cases of "SUnit *Succ = SuccDep.getSUnit())" has the guard, so it is clearly an omission here. Change-Id: Ica86f0327c7b2e6bcb56958e804ea6c71084663b Reviewers: nhaehnle Reviewed By: nhaehnle Subscribers: MatzeB, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60864 llvm-svn: 358740
*	[X86] Turn (and (shl X, C1), C2) into (shl (and X, (C1 >> C2), C2) if the ↵	Craig Topper	2019-04-19	1	-0/+3
\| \| \| \| \| \| \| \|	AND could match a movzx. Could get further improvements by recognizing (i64 and (anyext (i32 shl))). llvm-svn: 358737
*	[X86] Make sure we copy the HandleSDNode back to N before executing the ↵	Craig Topper	2019-04-19	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	default code after the switch in matchAddressRecursively Summary: There are two places where we create a HandleSDNode in address matching in order to handle the case where N is changed by CSE. But if we end up not matching, we fall back to code at the bottom of the switch that really would like N to point to something that wasn't CSEd away. So we should make sure we copy the handle back to N on any paths that can reach that code. This appears to be the true reason we needed to check DELETED_NODE in the negation matching. In pr32329.ll we had two subtracts back to back. We recursed through the first subtract, and onto the second subtract. The second subtract called matchAddressRecursively on its LHS which caused that subtract to CSE. We ultimately failed the match and ended up in the default code. But N was pointing at the old node that had been deleted, but the default code didn't know that and took it as the base register. Then we unwound back to the first subtract and tried to access this bogus base reg requiring the check for deleted node. With this patch we now use the CSE result as the base reg instead. matchAdd has been broken since sometime in 2015 when it was pulled out of the switch into a helper function. The assignment to N at the end was still there, but N was passed by value and not by reference so the update didn't go anywhere. Reviewers: niravd, spatel, RKSimon, bkramer Reviewed By: niravd Subscribers: llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D60843 llvm-svn: 358735
*	[GlobalISel][AArch64] Legalize/select G_(S/Z/ANY)_EXT for v8s8s	Jessica Paquette	2019-04-18	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds legalization for G_SEXT, G_ZEXT, and G_ANYEXT for v8s8s. We were falling back on G_ZEXT in arm64-vabs.ll before, preventing us from selecting the @llvm.aarch64.neon.sabd.v8i8 intrinsic. This adds legalizer support for those 3, which gives us selection via the importer. Update the relevant tests (legalize-ext.mir, select-int-ext.mir) and add a GISel line to arm64-vabs.ll. Differential Revision: https://reviews.llvm.org/D60881 llvm-svn: 358715
*	[GlobalISel][AArch64] Legalize v8s8 loads	Jessica Paquette	2019-04-18	1	-0/+1
\| \| \| \| \| \| \| \|	Add legalizer support for loads of v8s8 and update legalize-load-store.mir. Differential Revision: https://reviews.llvm.org/D60877 llvm-svn: 358714