| Commit message | Author | Age | Files | Lines |
| |
Custom lower this to a target instruction with the merge operands. I
think it might be better to directly select this and emit a
REG_SEQUENCE, but this would be more work since it would require
splitting the tablegen patterns for these cases from the other
atomics.
|
| |
resource descriptor
Summary:
In loadSRsrcFromVGPR, if MBB is the same as Succ, Remainder is not the immediate dominator of Succ.
Reviewer: arsenm
Differential Revision: https://reviews.llvm.org/D69358
|
| |
Summary:
This patch is part of a series to introduce an Alignment type.
See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html
See this patch for the introduction of the type: https://reviews.llvm.org/D64790
Reviewers: courbet
Subscribers: nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69307
|
| |
Differential Revision: https://reviews.llvm.org/D69413
|
| |
(when Src2 is required)
Differential revision: https://reviews.llvm.org/D69430
|
| |
Without this, we can create a PSADBW node that isn't legal.
|
| |
Reviewers: arsenm
Reviewed By: arsenm
Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, volkan, Petar.Avramovic, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69347
|
| |
Complete fp16 support by ensuring that load extension / truncate store
operations are properly expanded.
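A minimal C sketch (not from the patch, and assuming _Float16 support in the frontend for this target) of source that exercises both operations: reading an f16 value as float needs an extending load, and assigning a float into an f16 object needs a truncating store.
  _Float16 load_extend(const _Float16 *p) {
      float f = *p;                 /* f16 load extended to f32 */
      return (_Float16)(f + 1.0f);  /* result truncated back to f16 */
  }
  void trunc_store(_Float16 *p, float f) {
      *p = (_Float16)f;             /* truncate f32 to f16, then store */
  }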
Reviewers: asb, lenary
Reviewed By: lenary
Differential Revision: https://reviews.llvm.org/D69246
|
| |
selectImpl is able to select G_FSQRT when we set the bank for vector
operands to fprb. Add detailed tests.
Note: G_FSQRT is generated from the LLVM IR intrinsic llvm.sqrt.*,
and at the moment MIPS is not able to generate this intrinsic for
vector types (some targets generate vector llvm.sqrt.* from calls
to a builtin function).
__builtin_msa_fsqrt_<format> will be transformed into G_FSQRT
in legalizeIntrinsic and selected in the same way.
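For reference, a small C sketch (not from the patch) of the builtin route mentioned above, assuming MSA is enabled (e.g. -mmsa):
  typedef float v4f32 __attribute__((vector_size(16)));
  /* Element-wise square root of a 128-bit vector of four floats. */
  v4f32 vec_sqrt(v4f32 v) { return __builtin_msa_fsqrt_w(v); }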
Differential Revision: https://reviews.llvm.org/D69376
|
| |
renamable $x6 = ADDI8 $x1, -80 ;;; 0 is replaced with -80
renamable $x6 = ADD8 killed renamable $x6, renamable $x5
STW killed renamable $r3, 4, killed renamable $x6 :: (store 4 into %ir.14, !tbaa !2)
After PEI there is a peephole optimization opportunity to combine the -80 in the ADDI8 above with the 4 in the STW, eliminating the unnecessary ADD8.
Expected result:
renamable $x6 = ADDI8 $x1, -76
STWX killed renamable $r3, renamable $x5, killed renamable $x6 :: (store 4 into %ir.6, !tbaa !2)
Reviewed by: stefanp
Differential Revision: https://reviews.llvm.org/D66329
|
| |
An SUnit can be neither an instruction nor an SDNode; both are
null if it represents a nop. Fixed a crash when using SU->getInstr().
Differential Revision: https://reviews.llvm.org/D69395
|
| |
Reviewers: rampitec
Reviewed By: rampitec
Subscribers: arsenm, jvesely, nhaehnle, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69375
|
| |
The VST2 and VST4 instructions take two or four vector registers as
input, and store part of each register to memory in an interleaved
pattern. They come in variants indicating which part of each register
they store (VST20 and VST21; VST40 to VST43 inclusive); the intention
is that issuing each of those variants in turn has the combined effect
of loading or storing the whole set of registers to a memory block of
equal size. The corresponding VLD2 and VLD4 instructions load from
memory in the same interleaved format: each one overwrites only part
of its output register set, and again, the idea is that if you use
VLD4{0,1,2,3} or VLD2{0,1} together, you end up having written to the
whole of each register.
I've implemented the stores and loads quite differently. The loads
were easiest to implement as a single intrinsic that expands to all
four VLD4x instructions or both VLD2x, delivering four complete output
registers. (Implementing each individual load as a separate
instruction taking four input registers to partially overwrite is
possible in theory, but pointless, and when I tried it, I found it
would need extra work to get the register allocation not to be
horrible.) Since that intrinsic delivers multiple outputs, it has to
be instruction-selected in custom C++.
But the store instructions are easier to model individually, because
they don't overwrite any register at all and you can write a DAG Isel
pattern in Tablegen for each one.
Hence, my new intrinsic `int_arm_mve_vld4q` expands to four load
instructions, delivers four full output vectors, and is handled by C++
code, whereas `int_arm_mve_vst4q` expands to just one store
instruction, takes four input vectors and a constant indicating which
lanes to store, and is handled entirely in Tablegen. (And similarly
for vld2q/vst2q.) This is asymmetric, but it was the easiest way to do
each one.
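At the source level, the corresponding ACLE intrinsics (the names below are assumptions here, since this patch only adds the IR-level int_arm_mve_vld4q/int_arm_mve_vst4q) would be used roughly like this C sketch, when compiling for MVE:
  #include <arm_mve.h>
  /* Load 16 uint32s deinterleaved across four vectors, then store them
     back in the interleaved layout. */
  void round_trip(uint32_t *p) {
      uint32x4x4_t v = vld4q_u32(p);
      vst4q_u32(p, v);
  }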
Reviewers: dmgreen, miyuki, ostannard
Subscribers: kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D68700
|
| |
This adds some initial example IR intrinsics for MVE instructions that
deliver multiple output values, and hence, have to be instruction-
selected by custom C++ code instead of Tablegen patterns.
I've added the writeback gather load instructions (taking a vector of
base addresses and a single common offset, returning a vector of
loaded values and an updated vector of base addresses); one example
from the long shift family (taking and returning a 64-bit value in two
GPRs); and the VADC instruction (which propagates a carry bit from
each vector-lane addition to the next, taking an input carry flag in
FPSCR and outputting the final one in FPSCR as well).
To support the VPT-predicated forms of these instructions, I've
written some helper functions to add the cluster of MVE predicate
operands to the end of a MachineInstr. `AddMVEPredicateToOps` is used
when the instruction actually is predicated (so it takes a predicate
mask argument), and `AddEmptyMVEPredicateToOps` is for when the
instruction is unpredicated (so it fills in $noreg for the mask). Each
one comes in a form suitable for `vpred_n`, and one for `vpred_r`
which takes the extra 'inactive' parameter.
For VADC, the representation of the carry flag in the IR intrinsic is
a word intended to be moved directly to and from `FPSCR_nzcvqc`, i.e.
with the carry flag in bit 29 of the word. (The user-facing ACLE
intrinsic will want it to be in bit 0, but I'll do that on the clang
side.)
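A tiny C sketch (illustration only, not code from the patch) of the bit-29 convention described above:
  /* The IR-level carry word mirrors FPSCR: the carry lives in bit 29.
     The ACLE-level view keeps it in bit 0, so the clang side converts. */
  unsigned carry_to_fpscr_word(unsigned carry) { return (carry & 1u) << 29; }
  unsigned carry_from_fpscr_word(unsigned word) { return (word >> 29) & 1u; }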
Reviewers: dmgreen, miyuki, ostannard
Subscribers: kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D68699
|
| |
This commit, together with the next few, will add a representative
sample of the kind of IR intrinsics that we'll need in order to
implement the user-facing ACLE intrinsics for MVE. Supporting all of
them will take more work; the intention of this initial series of
commits is to implement an intrinsic or two from lots of different
categories, as examples and proofs of concept.
This initial commit introduces a small number of IR intrinsics for
instructions simple enough that they can use Tablegen ISel patterns:
the predicated versions of the VADD and VSUB instructions (both
integer and FP), VMIN and VMAX, and the float->half VCVT instruction
(predicated and unpredicated).
When using VPT-predicated instructions in automatic code generation,
it will be convenient to specify the predicate value as a vector of
the appropriate number of i1. To make it easy to specify all sizes of
an instruction in one go and give each one the matching predicate
vector type, I've added a system of Tablegen informational records
describing MVE's vector types: each one gives the underlying LLVM IR
ValueType (which may not be the same if the MVE vector is of
explicitly signed or unsigned integers) and an appropriate vNi1 to use
as the predicate vector.
(Also, those info records include the usual encoding for the types, so
that as we add associations between each instruction encoding and one
of the new `MVEVectorVTInfo` records, we can remove some of the
existing template parameters and replace them with references to the
vector type info's fields.)
The user-facing ACLE intrinsics will receive a predicate mask as a
16-bit integer, so I've also provided a pair of intrinsics i2v and
v2i, to convert between an integer and a vector of i1 by just changing
the register class.
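As a rough C sketch of the user-facing shape this is building towards (the ACLE names below are assumptions and not part of this patch, which only adds IR-level intrinsics):
  #include <arm_mve.h>
  /* p is the 16-bit predicate mask; inactive supplies the values for
     lanes whose predicate bits are clear. */
  int32x4_t masked_add(int32x4_t inactive, int32x4_t a, int32x4_t b,
                       mve_pred16_t p) {
      return vaddq_m_s32(inactive, a, b, p);
  }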
Reviewers: dmgreen, miyuki, ostannard
Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D67158
|
| |
Reviewers: rampitec, arsenm
Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69355
|
| |
selectImpl is able to select G_FABS when we set the bank for vector
operands to fprb. Add detailed tests.
Note: G_FABS is generated from the LLVM IR intrinsic llvm.fabs.*,
and at the moment MIPS is not able to generate this intrinsic for
vector types (some targets generate vector llvm.fabs.* from calls
to a builtin function).
We can handle fabs using __builtin_msa_fmax_a_<format> and passing
the same vector as both arguments. __builtin_msa_fmax_a_<format> will
be directly selected into FMAX_A_<format> in legalizeIntrinsic.
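A small C sketch (not from the patch) of the fmax_a trick for 4 x f32, assuming MSA is enabled:
  typedef float v4f32 __attribute__((vector_size(16)));
  /* FMAX_A returns the operand with the larger magnitude, so comparing a
     vector against itself yields its element-wise absolute value. */
  v4f32 vec_fabs(v4f32 v) { return __builtin_msa_fmax_a_w(v, v); }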
Differential Revision: https://reviews.llvm.org/D69346
|
| |
Select vector G_FADD, G_FSUB, G_FMUL and G_FDIV for MIPS32 with MSA. We
have to set bank for vector operands to fprb and selectImpl will do the
rest. __builtin_msa_fadd_<format>, __builtin_msa_fsub_<format>,
__builtin_msa_fmul_<format> and __builtin_msa_fdiv_<format> will be
transformed into G_FADD, G_FSUB, G_FMUL and G_FDIV in legalizeIntrinsic
respectively and selected in the same way.
Differential Revision: https://reviews.llvm.org/D69340
|
| |
Select vector G_SDIV, G_SREM, G_UDIV and G_UREM for MIPS32 with MSA. We
have to set bank for vector operands to fprb and selectImpl will do the
rest. __builtin_msa_div_s_<format>, __builtin_msa_mod_s_<format>,
__builtin_msa_div_u_<format> and __builtin_msa_mod_u_<format> will be
transformed into G_SDIV, G_SREM, G_UDIV and G_UREM in legalizeIntrinsic
respectively and selected in the same way.
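A short C sketch (not from the patch) of the signed builtins named above, assuming MSA is enabled:
  typedef int v4i32 __attribute__((vector_size(16)));
  /* Element-wise signed division and remainder on 4 x i32. */
  v4i32 vec_sdiv(v4i32 a, v4i32 b) { return __builtin_msa_div_s_w(a, b); }
  v4i32 vec_srem(v4i32 a, v4i32 b) { return __builtin_msa_mod_s_w(a, b); }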
Differential Revision: https://reviews.llvm.org/D69333
|
| |
Potentially an SGPR-to-SGPR copy should also be possible.
That is, however, trickier because we may end up with a
wrong register class at the use because of xm0/xexec permutations.
Differential Revision: https://reviews.llvm.org/D69280
|
| |
Testing git push access.
|
| |
- Reduce code duplication
- Add partial support for JAL expansion for XGOT.
|
| |
| |
MipsMCAsmInfo was using the '$' prefix for Mips32 and '.L' for Mips64
regardless of the -target-abi option. By passing MCTargetOptions to MCAsmInfo
we can find out the Mips ABI and pick the appropriate prefix.
Tags: #llvm, #clang, #lldb
Differential Revision: https://reviews.llvm.org/D66795
|
| |
Summary:
The default implementation of the describeLoadedValue() hook uses the
MoveImm property to determine if an instruction moves an immediate. If
an instruction has that property the function returns the second
operand, assuming that that is the immediate value the instruction
moves. As far as I can tell, the MoveImm property does not imply that
the second operand is the immediate value, nor that any other operand
necessarily holds the immediate value; it just means that the
instruction moves some immediate value.
One example where the second operand is not the immediate is SystemZ's
LZER instruction, which moves a zero immediate implicitly: $f0S = LZER.
That case triggered an out-of-bound assertion when getting the operand.
I have added a test case for that instruction.
Another example is ARM's MVN instruction, which holds the logical
bitwise NOT'd value of the immediate that is moved. For the following
reproducer:
extern void foo(int);
int main() { foo(-11); }
an incorrect call site value would be emitted:
$ clang --target=arm foo.c -O1 -g -Xclang -femit-debug-entry-values \
-c -o - | ./build/bin/llvm-dwarfdump - | \
grep -A2 call_site_parameter
0x00000058: DW_TAG_GNU_call_site_parameter
DW_AT_location (DW_OP_reg0 R0)
DW_AT_GNU_call_site_value (DW_OP_lit10)
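To see why DW_OP_lit10 is wrong here: MVN materializes -11 as the bitwise NOT of the encoded immediate 10, and in two's complement ~10 == -11, so describing the raw immediate operand drops the NOT. A one-line C check (added for illustration, not part of the reproducer):
  #include <assert.h>
  int main(void) { assert(~10 == -11); return 0; }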
Another example is the A2_combineii instruction on Hexagon which moves
two immediates to a super-register: $d0 = A2_combineii 20, 10.
Perhaps these are rare exceptions, and most MoveImm instructions hold
the immediate in the second operand, but in my opinion the default
implementation of the hook should only describe values that it can, by
some contract, guarantee are safe to describe, rather than leaving it up
to the targets to override the exceptions, as that can silently result
in incorrect call site values.
This patch adds X86's relevant move immediate instructions to the
target's hook implementation, so this commit should be an NFC for that
target. We need to do the same for ARM and AArch64.
Reviewers: djtodoro, NikolaPrica, aprantl, vsk
Reviewed By: vsk
Subscribers: kristof.beyls, hiraditya, llvm-commits
Tags: #debug-info, #llvm
Differential Revision: https://reviews.llvm.org/D69109
|
| |
Select vector G_MUL for MIPS32 with MSA. We have to set bank
for vector operands to fprb and selectImpl will do the rest.
Manual selection of G_MUL is now done for gprb only.
__builtin_msa_mulv_<format> will be transformed into G_MUL
in legalizeIntrinsic and selected in the same way.
Differential Revision: https://reviews.llvm.org/D69310
|
| |
Select vector G_SUB for MIPS32 with MSA. We have to set bank
for vector operands to fprb and selectImpl will do the rest.
__builtin_msa_subv_<format> will be transformed into G_SUB
in legalizeIntrinsic and selected in the same way.
__builtin_msa_subvi_<format> will be directly selected into
SUBVI_<format> in legalizeIntrinsic.
Differential Revision: https://reviews.llvm.org/D69306
|
| |
This adds support for reserving GPRs so that the compiler will not
choose those registers for register allocation. The implementation follows
the same design as for AArch64; each reserved register becomes a target
feature and is used for getting the reserved registers for a given
MachineFunction. The backend checks that it does not need to write to
any reserved register; if it does, a relevant error is generated.
Differential Revision: https://reviews.llvm.org/D67185
|
| |
Turns out it makes sense, contrary to what the comment said.
Differential Revision: https://reviews.llvm.org/D69287
|
| |
Select vector G_ADD for MIPS32 with MSA. We have to set bank
for vector operands to fprb and selectImpl will do the rest.
__builtin_msa_addv_<format> will be transformed into G_ADD
in legalizeIntrinsic and selected in the same way.
__builtin_msa_addvi_<format> will be directly selected into
ADDVI_<format> in legalizeIntrinsic. MIR tests for it have
unnecessary additional copies. Capture current state of tests
with run-pass=legalizer with a test in test/CodeGen/MIR/Mips.
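A short C sketch (not from the patch) of the two builtin forms mentioned above, assuming MSA is enabled:
  typedef int v4i32 __attribute__((vector_size(16)));
  /* Register-register form: goes through G_ADD. */
  v4i32 vec_add(v4i32 a, v4i32 b) { return __builtin_msa_addv_w(a, b); }
  /* Immediate form: selected directly to ADDVI_W in legalizeIntrinsic. */
  v4i32 vec_add5(v4i32 a) { return __builtin_msa_addvi_w(a, 5); }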
Differential Revision: https://reviews.llvm.org/D68984
llvm-svn: 375501
|
| |
This re-commits r375152 which was pulled in r375233 because it broke
the EXPENSIVE_CHECKS bot on Windows.
The reason for the failure was a bug in the pass that the commit turned
on by default. This patch fixes that bug and turns the pass back on.
This patch has been verified on the buildbot that originally failed
thanks to Simon Pilgrim.
Differential revision: https://reviews.llvm.org/D52431
llvm-svn: 375497
|
| |
Summary:
This patch is part of a series to introduce an Alignment type.
See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html
See this patch for the introduction of the type: https://reviews.llvm.org/D64790
Reviewers: courbet
Subscribers: jholewinski, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69278
llvm-svn: 375495
|
| |
Stop hardwiring classes
llvm-svn: 375470
|
| |
reduction combine
llvm-svn: 375463
|
| |
Reviewers: arsenm
Reviewed By: arsenm
Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, Petar.Avramovic, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D69231
llvm-svn: 375460
|
| |
llvm-svn: 375457
|
| |
Teach the CombinerHelper how to turn shuffle_vectors that concatenate
vectors into concat_vectors, and add this combine to the AArch64
pre-legalizer combiner.
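A small C sketch (illustration only, using clang's vector extensions) of the kind of shuffle this combine recognizes, one whose mask simply concatenates its two inputs:
  typedef int v4si __attribute__((vector_size(16)));
  typedef int v8si __attribute__((vector_size(32)));
  /* Mask 0..7 across two 4-element inputs is just a concatenation. */
  v8si concat(v4si a, v4si b) {
      return __builtin_shufflevector(a, b, 0, 1, 2, 3, 4, 5, 6, 7);
  }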
Differential Revision: https://reviews.llvm.org/D69149
llvm-svn: 375452
|
| |
This doesn't use the default value, so it doesn't benefit from the hack
to help optimize it.
llvm-svn: 375450
|
| |
Only handle simple inter-block redefs of m0 to the same value. This
avoids interference from redefs of m0 in SILoadStoreOptimizer. I was
initially teaching that pass to ignore redefs of m0, but having them
not exist beforehand is much simpler.
This is in preparation for deleting the current special m0 handling in
SIFixSGPRCopies to allow the register coalescer to handle the
difficult cases.
llvm-svn: 375449
|
| |
r375293 removed the SGPR spilling with scalar stores path, so this is
no longer necessary. This also always had the defect of adding the def
even when this path wasn't in use.
llvm-svn: 375448
|
| |
This will allow using another operation to produce the glue in a
future change.
llvm-svn: 375447
|
| |
If a PHI defines an AGPR, legalize its operands to AGPR.
At the moment we can get an AGPR PHI with VGPR operands.
I am not aware of any problems, as it seems to be handled
gracefully in RA, but this is not right anyway.
It also slightly decreases VGPR pressure in some cases
because we do not have to copy via a VGPR.
Differential Revision: https://reviews.llvm.org/D69206
llvm-svn: 375446
|
| |
This doesn't need to be just for bitops, but the ops do need to be fully associative.
llvm-svn: 375445
|
| |
llvm-svn: 375444
|
| |
dyn_cast<> null dereference warning. NFCI.
The static analyzer is warning about a potential null dereference, but we should be able to use cast<> directly, and if not the assert will fire for us.
llvm-svn: 375430
|
| |
Commit message from D66935:
This patch fixes a bug exposed by D65653 where a subsequent invocation
of `determineCalleeSaves` ends up with a different size for the callee
save area, leading to different frame-offsets in debug information.
In the invocation by PEI, `determineCalleeSaves` tries to determine
whether it needs to spill an extra callee-saved register to get an
emergency spill slot. To do this, it calls 'estimateStackSize' and
manually adds the size of the callee-saves to this. PEI then allocates
the spill objects for the callee saves and the remaining frame layout
is calculated accordingly.
A second invocation in LiveDebugValues causes estimateStackSize to return
the size of the stack frame including the callee-saves. Given that the
size of the callee-saves is added to this, these callee-saves are counted
twice, which leads `determineCalleeSaves` to believe the stack has
become big enough to require spilling an extra callee-save as an
emergency spill slot. It then updates CalleeSavedStackSize with a larger value.
Since CalleeSavedStackSize is used in the calculation of the frame
offset in getFrameIndexReference, this leads to incorrect offsets for
variables/locals when this information is recalculated after PEI.
This patch fixes the lldb unit tests in `functionalities/thread/concurrent_events/*`
Changes after D66935:
Ensures AArch64FunctionInfo::getCalleeSavedStackSize does not return
the uninitialized CalleeSavedStackSize when running `llc` on a specific
pass where the MIR code has already been expected to have gone through PEI.
Instead, getCalleeSavedStackSize (when passed the MachineFrameInfo) will try
to recalculate the CalleeSavedStackSize from the CalleeSavedInfo. In debug
mode, the compiler will assert the recalculated size equals the cached
size as calculated through a call to determineCalleeSaves.
This fixes two tests:
test/DebugInfo/AArch64/asan-stack-vars.mir
test/DebugInfo/AArch64/compiler-gen-bbs-livedebugvalues.mir
that otherwise fail when compiled using msan.
Reviewed By: omjavaid, efriedma
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D68783
llvm-svn: 375425
|
| |
Clean up PPCAsmPrinter with IsPPC64 and IsDarwin.
Differential Revision: https://reviews.llvm.org/D69259
llvm-svn: 375420
|