bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[Codegen] Revert rL354676/rL354677 and followups - introduced PR43446 miscompile	Roman Lebedev	2020-02-26	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	This reverts https://reviews.llvm.org/D58468 (rL354676, 44037d7a6377ec8e5542cced73583283334b516b), and all and any follow-ups to that code block. https://bugs.llvm.org/show_bug.cgi?id=43446 (cherry picked from commit d20907d1de89bf63b589fadd8c096d4895e47fba)
*	Revert "[DebugInfo] Remove some users of DBG_VALUEs IsIndirect field"	Jeremy Morse	2020-02-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit ed29dbaafa49bb8c9039a35f768244c394411fea. I'm backing out D68945, which as the discussion for D73526 shows, doesn't seem to handle the -O0 path through the codegen backend correctly. I'll reland the patch when a fix is worked out, apologies for all the churn. The two parent commits are part of this revert too. Conflicts: llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp llvm/test/DebugInfo/X86/dbg-addr-dse.ll SelectionDAGBuilder conflict is due to a nearby change in e39e2b4a79c6 that's technically unrelated. dbg-addr-dse.ll conflicted because 41206b61e30c (legitimately) changes the order of two lines. There are further modifications to dbg-value-func-arg.ll: it landed after the patch being reverted, and I've converted indirection to be represented by the isIndirect field rather than DW_OP_deref. (cherry picked from commit 6531a78ac4b5b229bce272706593a0bc873877d7)
*	[StackColoring] Remap FixedStackPseudoSourceValue frame index referenced by ↵	Fangrui Song	2020-01-21	1	-0/+171
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MachineMemOperand StackColoring::remapInstructions() remaps MachineOperand frame index (e.g. %stack.1 -> %stack.0) but does not remap FixedStackPseudoSourceValue frame index (e.g. store 4 into %stack.1.ap2.i.i) referenced by MachineMemoryOperand. This can cause an assertion failure when LiveDebugValues references a dead stack object. It is difficult to craft a test case. -g, va_copy and stack-coloring are required. I can only reproduce it on ppc32. (cherry picked from commit eaab1bf21e1d6803fd217fe6052537fc33b06837) (cherry picked from commit 854f7be20a0cb1a95671a16d6cc8200107ee25f4) (cherry picked from commit 7a8b0b1595e7dc878b48cf9bbaa652087a6895db)
*	[AIX][XCOFF] Supporting the ReadOnlyWithRel SectionKind	diggerlin	2020-01-14	1	-0/+19
\| \| \| \| \| \| \| \| \| \|	SUMMARY: In this patch we put the global variable in a Csect which's SectionKind is "ReadOnlyWithRel" into Data Section. Reviewers: hubert.reinterpretcast,jasonliu,Xiangling_L Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D72461
*	[AIX] ExternalSymbolSDNode lowering	Xiangling Liao	2020-01-14	2	-0/+186
\| \| \| \| \| \| \| \|	For memcpy/memset/memmove etc., replace ExternalSymbolSDNode with a MCSymbolSDNode, which have a prefix dot before function name as entry point symbol. Differential Revision: https://reviews.llvm.org/D70718
*	[PowerPC] [NFC] set instruction number as 1st priority of lsr cost model.	Zheng Chen	2020-01-14	1	-0/+66
\|
*	[FPEnv] Fix chain handling for fpexcept.strict nodes	Ulrich Weigand	2020-01-13	1	-9/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We need to ensure that fpexcept.strict nodes are not optimized away even if the result is unused. To do that, we need to chain them into the block's terminator nodes, like already done for PendingExcepts. This patch adds two new lists of pending chains, PendingConstrainedFP and PendingConstrainedFPStrict to hold constrained FP intrinsic nodes without and with fpexcept.strict markers. This allows not only to solve the above problem, but also to relax chains a bit further by no longer flushing all FP nodes before a store or other memory access. (They are still flushed before nodes with other side effects.) Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D72341
*	[PowerPC] Delete PPCDarwinAsmPrinter and PPCMCAsmInfoDarwin	Fangrui Song	2020-01-12	1	-140/+0
\| \| \| \| \| \| \| \|	Darwin support has been removed. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D72063
*	[NFC] [PowerPC] Update mi-peephole-splat test	Qiu Chaofan	2020-01-11	1	-23/+115
\| \| \| \| \|	Use script to re-generate the test case, for easier comparison with future patches.
*	[AIX] Allow vararg calls when all arguments reside in registers	jasonliu	2020-01-10	2	-2/+291
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch pushes the AIX vararg unimplemented error diagnostic later and allows vararg calls so long as all the arguments can be passed in registers. This patch extends the AIX calling convention implementation to initialize GPR(s) for vararg float arguments. On AIX, both GPR(s) and FPR are allocated for floating point arguments. The GPR(s) are only initialized for vararg calls, otherwise the callee is expected to retrieve the float argument in the FPR. f64 in AIX PPC32 requires special handling in order to allocate and initialize 2 GPRs. This is performed with bitcast, SRL, truncation to initialize one GPR for the MSW and bitcast, truncation to initialize the other GPR for the LSW. A future patch will follow to add support for arguments passed on the stack. Patch provided by: cebowleratibm Reviewers: sfertile, ZarkoCA, hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D71013
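The f64 handling described above can be illustrated outside the compiler. The following is a minimal Python sketch, not the SelectionDAG code from the patch: the helper name `split_f64_for_gprs` is hypothetical, and the only thing modeled is the arithmetic (bitcast to a 64-bit integer, shift right by 32 and truncate for the most significant word, truncate for the least significant word).

```python
import struct


def split_f64_for_gprs(value: float) -> tuple[int, int]:
    """Return (msw, lsw), the two 32-bit halves of an IEEE-754 double.

    Hypothetical helper for illustration only; it mirrors the
    bitcast / SRL / truncate steps named in the commit message.
    """
    # "bitcast": reinterpret the 8 bytes of the double as an unsigned 64-bit integer.
    bits = struct.unpack(">Q", struct.pack(">d", value))[0]
    # Logical shift right by 32, then truncate to 32 bits: most significant word.
    msw = (bits >> 32) & 0xFFFFFFFF
    # Truncate to 32 bits: least significant word.
    lsw = bits & 0xFFFFFFFF
    return msw, lsw
```

For example, `split_f64_for_gprs(1.0)` yields the halves of the bit pattern `0x3FF0000000000000`, which is what the two GPRs would hold under this model.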
*	[PowerPC] Handle constant zero bits in BitPermutationSelector	Nemanja Ivanovic	2020-01-10	1	-0/+141
\| \| \| \| \| \| \| \| \| \|	We currently crash when analyzing an AssertZExt node that has some bits that are constant zeros (i.e. as a result of an and with a constant). This issue was reported in https://bugs.llvm.org/show_bug.cgi?id=41088 and this patch fixes that. Differential revision: https://reviews.llvm.org/D72038
*	[DAGCombine] Fold the (fma -x, y, -z) to -(fma x, y, z)	QingShan Zhang	2020-01-09	5	-37/+28
\| \| \| \| \| \| \|	This is a positive combination as long as the NEG is NOT free, as we are reducing the number of NEG from two to one. Differential Revision: https://reviews.llvm.org/D72312
*	[PowerPC] when folding rlwinm+rlwinm. to andi., we should use first rlwinm	Zheng Chen	2020-01-08	1	-3/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	input reg. %2:gprc = RLWINM %1:gprc, 27, 5, 10 %3:gprc = RLWINM_rec %2:gprc, 8, 5, 10, implicit-def $cr0 ==> %3:gprc = ANDI_rec %1, 0, implicit-def $cr0 we should use %1 instead of %2 as ANDI_rec input. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D71885
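To see why the folded `ANDI_rec` must read `%1`, it helps to simulate the two `RLWINM` instructions from the example. The sketch below is a plain Python model of the 32-bit rotate-and-mask semantics (IBM bit numbering, bit 0 is the most significant bit); the function names are illustrative and do not come from the LLVM sources.

```python
def rotl32(x: int, n: int) -> int:
    """Rotate a 32-bit value left by n bits."""
    n %= 32
    x &= 0xFFFFFFFF
    return ((x << n) | (x >> (32 - n))) & 0xFFFFFFFF


def mask32(mb: int, me: int) -> int:
    """Build the rlwinm mask for bits MB..ME in IBM numbering (0 = MSB).

    When MB > ME the mask wraps around: bits 0..ME and MB..31 are set.
    """
    bits = range(mb, me + 1) if mb <= me else [*range(0, me + 1), *range(mb, 32)]
    return sum(1 << (31 - i) for i in bits)


def rlwinm(rs: int, sh: int, mb: int, me: int) -> int:
    """Model of the 32-bit rlwinm: rotate left, then AND with the mask."""
    return rotl32(rs, sh) & mask32(mb, me)


def example_pair(x: int) -> int:
    """The instruction pair from the commit message, applied to %1 = x."""
    tmp = rlwinm(x, 27, 5, 10)      # %2 = RLWINM %1, 27, 5, 10
    return rlwinm(tmp, 8, 5, 10)    # %3 = RLWINM_rec %2, 8, 5, 10
```

In this model the first mask, rotated left by 8, no longer overlaps the second mask, so the pair always produces zero. That matches an AND of the original input `%1` with the immediate 0.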
*	[NFC][Test] Add the option -enable-no-signed-zeros-fp-math for test	QingShan Zhang	2020-01-08	1	-2/+4
\| \| \| \|	fma-combine.ll
*	[PowerPC][Triple] Use elfv2 on freebsd>=13 and linux-musl	Fangrui Song	2020-01-07	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Every powerpc64le platform uses elfv2. For powerpc64, the environments "elfv1" and "elfv2" were added for FreeBSD ELFv1->ELFv2 migration in D61950. FreeBSD developers have decided to use OS versions to select ABI, and no one is relying on the environments. Also use elfv2 on powerpc64-linux-musl. Users can always use -mabi=elfv1 and -mabi=elfv2 to override the default ABI. Reviewed By: adalava Differential Revision: https://reviews.llvm.org/D72352
*	[AIX][XCOFF]Implement mergeable const	diggerlin	2020-01-07	1	-0/+166
\| \| \| \| \| \| \| \| \| \|	SUMMARY: In this patch, we map mergeable const objects to the read-only section in the same manner as const objects that are not mergeable. Reviewers: hubert.reinterpretcast,jasonliu Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D71551
*	[NFC][Test] Add a test to verify the DAGCombine of fma	QingShan Zhang	2020-01-07	1	-0/+143
\|
*	[PowerPC][NFC] Rename record instructions to use _rec suffix instead of o	Jinsong Ji	2020-01-06	9	-166/+166
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We use the 'o' suffix to indicate record form instructions (as it is similar to dot '.' in mne?). This was fine before, as we did not support XO-form. However, with https://reviews.llvm.org/D66902, we now have XO-form support. It becomes confusing now to still use 'o' for record form, and it is weird to have something like 'Oo'. This patch renames all 'o' instructions to use '_rec' instead. Also rename `isDot` to `isRecordForm`. Reviewed By: #powerpc, hfinkel, nemanjai, steven.zhang, lkail Differential Revision: https://reviews.llvm.org/D70758
*	[AIX] Use csect reference for function address constants	diggerlin	2020-01-06	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SUMMARY: We currently emit a reference for function address constants as labels; for example: foo_ptr: .long foo however, there may be no such label in the case where the function is undefined. Although the label exists when the function is defined, we will (to be consistent) also use a csect reference in that case. Reviewers: daltenty,hubert.reinterpretcast,jasonliu,Xiangling_L Subscribers: cebowleratibm, wuzish, nemanjai Differential Revision: https://reviews.llvm.org/D71144
*	[DAGCombine] Don't check the legality of type when combine the SIGN_EXTEND_INREG	QingShan Zhang	2020-01-06	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the DAG node for SIGN_EXTEND_INREG : t21: v4i32 = sign_extend_inreg t18, ValueType:ch:v4i16 It has two operands. The first one is the value it want to extend, and the second one is the type to specify how to extend the value. For this example, it means that, it is signed extend the t18(v4i32) from v4i16 to v4i32. That is the semantics of c code: vector int foo(vector int m) { return m << 16 >> 16; } And it could be any vector type that hardware support the operation, though the type 'v4i16' is NOT legal for the target. When we are trying to combine the srl + sra, what we did now is calling the TLI.isOperationLegal(), which will also check the legality of the type. That doesn't make sense. Differential Revision: https://reviews.llvm.org/D70230
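The equivalence between `sign_extend_inreg` and the shift pair in the C example can be checked per lane. This is a minimal Python sketch on a single 32-bit lane; both function names are made up for the illustration and are not LLVM APIs.

```python
def sign_extend_inreg(value: int, from_bits: int, width: int = 32) -> int:
    """Sign-extend the low `from_bits` bits of `value` to `width` bits."""
    full = (1 << width) - 1
    low = value & ((1 << from_bits) - 1)
    sign = 1 << (from_bits - 1)
    # Standard xor/subtract trick: flips the sign bit, then re-biases.
    return ((low ^ sign) - sign) & full


def shl_then_ashr(value: int, amount: int, width: int = 32) -> int:
    """Compute (value << amount) >> amount with an arithmetic right shift."""
    full = (1 << width) - 1
    shifted = (value << amount) & full
    # Reinterpret as a signed integer so Python's >> is arithmetic.
    if shifted & (1 << (width - 1)):
        shifted -= 1 << width
    return (shifted >> amount) & full
```

With `from_bits=16` and `amount=16` the two functions agree on every 32-bit lane value, which is the `m << 16 >> 16` case from the commit message.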
*	[PowerPC]: Fix predicate handling with SPE	Justin Hibbits	2020-01-02	1	-202/+510
\| \| \| \| \|	SPE floating-point compare instructions only update the GT bit in the CR field. All predicates must therefore be reduced to GT/LE.
*	Run update_llc_test_checks against SPE tests.	Justin Hibbits	2020-01-02	1	-175/+670
\| \| \| \| \|	This is in preparation for further tests which are better generated with the script. No functional change.
*	[PowerPC][AIX] Enable sret arguments.	Sean Fertile	2020-01-02	1	-7/+90
\| \| \| \| \| \|	Removes the fatal error for sret arguments and adds lit testing. Differential Revision: https://reviews.llvm.org/D71504
*	[PowerPC] Only legalize FNEARBYINT with unsafe fp math	Nemanja Ivanovic	2020-01-02	2	-79/+253
\| \| \| \| \| \| \|	Commit 0f0330a78709 legalized these nodes on PPC without consideration of unsafe math which means that we get inexact exceptions raised for nearbyint. Since this doesn't conform to the standard, switch this legalization to depend on unsafe fp math.
*	[NFC] Add comments in unit test aix-xcoff-toc.ll to clarify the intent	jasonliu	2019-12-31	1	-0/+10
\| \| \| \| \|	Address David's post review comment in https://reviews.llvm.org/D71667. Add comments to clarify what we are testing in that file.
*	[PowerPC] Legalize rounding nodes	Nemanja Ivanovic	2019-12-30	4	-278/+665
\| \| \| \| \| \| \| \|	VSX provides a full complement of rounding instructions yet we somehow ended up with some of them legal and others not. This just legalizes all of the FP rounding nodes and the FP -> int rounding nodes with unsafe math. Differential revision: https://reviews.llvm.org/D69949
*	[PowerPC] Exploit the rlwinm instructions for "and" with constant	QingShan Zhang	2019-12-30	3	-8/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For now, PowerPC will use several instructions to get the constant and "and" it in the following case: define i32 @test1(i32 %a) { %and = and i32 %a, -2 ret i32 %and } However, we could exploit the rotate mask instructions for it. MB ME +----------------------+ \|xxxxxxxxxxx00011111000\| +----------------------+ 0 32 64 Notice that we can only do it if the MB is larger than 32 and MB <= ME, as RLWINM will replace the content of [0 - 32) with [32 - 64) even if we didn't rotate it. Differential Revision: https://reviews.llvm.org/D71829
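As a rough illustration of the idea, the sketch below checks whether a 32-bit AND constant is a single non-wrapping run of ones and, if so, derives the `MB`/`ME` pair for an `rlwinm` with a rotate amount of 0. It is a Python model under stated assumptions: bit positions use 32-bit IBM numbering (0 is the most significant bit, so the 64-bit positions in the diagram are these plus 32), and `run_of_ones` is a hypothetical helper, not the selection code in the patch.

```python
from typing import Optional, Tuple


def run_of_ones(constant: int) -> Optional[Tuple[int, int]]:
    """Return (MB, ME) if `constant` is one contiguous, non-wrapping run of 1s."""
    c = constant & 0xFFFFFFFF
    if c == 0:
        return None
    low = (c & -c).bit_length() - 1   # index of lowest set bit (LSB = 0)
    high = c.bit_length() - 1         # index of highest set bit (LSB = 0)
    if c != (1 << (high + 1)) - (1 << low):
        return None                   # holes in the run, or a wrapping mask
    return 31 - high, 31 - low        # convert to IBM numbering


def rlwinm_no_rotate(rs: int, mb: int, me: int) -> int:
    """Model of rlwinm with SH = 0 and MB <= ME: just AND with the mask."""
    mask = sum(1 << (31 - i) for i in range(mb, me + 1))
    return rs & 0xFFFFFFFF & mask
```

For the constant `-2` from the IR example (`0xFFFFFFFE`), this model gives `MB = 0`, `ME = 30`, and the masked result equals `a & 0xFFFFFFFE`.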
*	[PowerPC] Change default for unaligned FP access for older subtargets	Nemanja Ivanovic	2019-12-28	2	-1/+45
\| \| \| \| \| \| \| \| \| \| \|	This is a fix for https://bugs.llvm.org/show_bug.cgi?id=40554 Some CPU's trap to the kernel on unaligned floating point access and there are kernels that do not handle the interrupt. The program then fails with a SIGBUS according to the PR. This just switches the default for unaligned access to only allow it on recent server CPUs that are known to allow this. Differential revision: https://reviews.llvm.org/D71954
*	[PowerPC] Modify the hasSideEffects of some VSX instructions from 1 to 0	Kang Zhang	2019-12-28	1	-30/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we didn't set the value for hasSideEffects bit in our td file, `llvm-tblgen` will set it as true for those instructions which has no match pattern. Below 6 instructions don't set the hasSideEffects flag and don't have match pattern, so their hasSideEffects flag will be set true by llvm-tblgen. But in fact below instructions don't modify any special register and don't have other SideEffects, they shouldn't have SideEffects. This patch is to modify the hasSideEffects of below instructions from 1 to 0. ``` VEXTUHLX VEXTUHRX VEXTUWLX VEXTUWRX VSPLTBs VSPLTHs ``` Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D71391
*	[PowerPC] stop folding if result rlwinm mask is wrap while original rlwinm ↵	czhengsz	2019-12-25	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	is not. %1:g8rc = RLWINM8 %0:g8rc, 0, 16, 9 %2:g8rc = RLWINM8 killed %1:g8rc, 0, 0, 31 -> %2:g8rc = RLWINM8 %0:g8rc, 0, 16, 9 The above folding is wrong. Before transformation, %2:g8rc is 32 bit value. After transformation, %2:g8rc becomes a 64 bit value. This patch fixes above issue. Reviewed by: steven.zhang Differential Revision: https://reviews.llvm.org/D71833
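The 32-bit versus 64-bit difference can be reproduced with a small model of `rlwinm` as it behaves on a 64-bit register: the low word is rotated, replicated into both halves, and ANDed with a 64-bit mask running from `MB+32` to `ME+32`. The Python below is a sketch under that reading of the instruction semantics; the function names are illustrative only.

```python
def mask64(mb: int, me: int) -> int:
    """64-bit mask for IBM bit positions MB..ME (0 = MSB), wrapping if MB > ME."""
    bits = range(mb, me + 1) if mb <= me else [*range(0, me + 1), *range(mb, 64)]
    return sum(1 << (63 - i) for i in bits)


def rlwinm8(rs: int, sh: int, mb: int, me: int) -> int:
    """Model of rlwinm on a 64-bit register."""
    lo = rs & 0xFFFFFFFF
    rot = ((lo << sh) | (lo >> (32 - sh))) & 0xFFFFFFFF
    doubled = (rot << 32) | rot          # rotated low word copied into both halves
    return doubled & mask64(mb + 32, me + 32)


def original_sequence(x: int) -> int:
    first = rlwinm8(x, 0, 16, 9)         # %1 = RLWINM8 %0, 0, 16, 9 (wrapping mask)
    return rlwinm8(first, 0, 0, 31)      # %2 = RLWINM8 killed %1, 0, 0, 31


def wrong_fold(x: int) -> int:
    return rlwinm8(x, 0, 16, 9)          # %2 = RLWINM8 %0, 0, 16, 9
```

In this model the wrapping mask keeps the copy of the low word in the upper half, while the original two-instruction sequence clears it, so the two results differ whenever the low word is nonzero.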
*	[PowerPC] Modify the hasSideEffects of MTLR and MFLR from 1 to 0	Kang Zhang	2019-12-26	13	-52/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we didn't set the value for hasSideEffects bit in our td file, `llvm-tblgen` will set it as true for those instructions which has no match pattern. The instructions `MTLR` and `MFLR` don't set the hasSideEffects flag and don't have match pattern, so their hasSideEffects flag will be set true by `llvm-tblgen`. But in fact, we can use `[LR]` to model the two instructions, so they should not have SideEffects. This patch is to modify the hasSideEffects of MTLR and MFLR from 1 to 0. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D71390
*	Migrate function attribute "no-frame-pointer-elim"="false" to ↵	Fangrui Song	2019-12-24	14	-25/+25
\| \| \| \|	"frame-pointer"="none" as cleanups after D56351
*	Migrate function attribute "no-frame-pointer-elim-non-leaf" to ↵	Fangrui Song	2019-12-24	7	-8/+8
\| \| \| \|	"frame-pointer"="non-leaf" as cleanups after D56351
*	Migrate function attribute "no-frame-pointer-elim" to "frame-pointer"="all" ↵	Fangrui Song	2019-12-24	28	-36/+36
\| \| \| \|	as cleanups after D56351
*	[PowerPC] NFC - fix the testcase bug of folding rlwinm	czhengsz	2019-12-23	1	-7/+7
\|
*	[Power9] Remove the PPCISD::XXREVERSE as it has completely the same ↵	QingShan Zhang	2019-12-23	1	-3/+2
\| \| \| \| \| \| \| \| \|	semantics of ISD::BSWAP The custom node PPCISD::XXREVERSE has completely the same semantics of generic node ISD::BSWAP. We need to clean up it as we have the combine rules for bswap in the base class, while nothing for xxreverse. Differential Revision: https://reviews.llvm.org/D70657
*	[NFC][Test][PowerPC] Add more tests for 'and mask'	QingShan Zhang	2019-12-23	2	-22/+91
\|
*	[PowerPC] Exploit `vrl(b\|h\|w\|d)` to perform vector rotation	Kai Luo	2019-12-23	2	-19/+141
\| \| \| \| \| \| \| \| \|	Summary: Currently, we set legalization action of `ISD::ROTL` vectors as `Expand` in `PPCISelLowering`. However, we can exploit `vrl(b\|h\|w\|d)` to lower `ISD::ROTL` directly. Differential Revision: https://reviews.llvm.org/D71324
*	[PPC32] Emit R_PPC_PLTREL24 for calls to dso_local ifunc	Fangrui Song	2019-12-20	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	static void *ifunc(void) __attribute__((ifunc("resolver"))); void foo() { ifunc(); } The relocation produced by the ifunc() call: 1. gcc -msecure-plt -fPIC => R_PPC_PLTREL24 r_addend=0x8000 2. gcc -msecure-plt -PIE => R_PPC_PLTREL24 r_addend=0x8000 3. clang -msecure-plt -fPIC => R_PPC_PLTREL24 r_addend=0x8000 4. clang -msecure-plt -fPIE => R_PPC_REL24 4 is incorrect. The R_PPC_REL24 needs a call stub due to ifunc. If this relocation is mixed with other R_PPC_PLTREL24(r_addend=0x8000) in a function, both GNU ld and lld (after D71621 fix) may produce a wrong result. This patch fixes 4 to use R_PPC_PLTREL24, which matches GCC. Both GNU ld and lld (after D71621) will be happy. Reviewed By: sfertile Differential Revision: https://reviews.llvm.org/D71649
*	[XCOFF][AIX] Fix for missing of undefined symbols from symbol table	jasonliu	2019-12-19	1	-22/+100
\| \| \| \| \| \| \| \| \|	Summary: When we use undefined symbol with its qualname, we are not able to generate that symbol because of the logic of early "continue" that skip the qualname symbol. This patch fixes it. Differential revision: https://reviews.llvm.org/D71667
*	[PowerPC] Only use PLT annotations if using PIC relocation model	Justin Hibbits	2019-12-19	7	-13/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The default static (non-PIC, non-PIE) model for 32-bit powerpc does not use @PLT annotations and relocations in GCC. LLVM shouldn't use @PLT annotations either, because it breaks secure-PLT linking with (some versions of?) GNU LD. Update the available-externally.ll test to reflect that default mode should be the same as the static relocation, by using the same check prefix. Reviewed by: sfertile Differential Revision: https://reviews.llvm.org/D70570
*	[PowerPC] make lwa as a valid ds candidate in ppcloopinstrformprep pass	czhengsz	2019-12-18	1	-13/+11
\| \| \| \| \| \| \| \|	Fix a FIXME in ppcloopinstrformprep pass. Reviewed by: nemanjai Differential Revision: https://reviews.llvm.org/D71346
*	[PowerPC] Add missing legalization for vector BSWAP	Nemanja Ivanovic	2019-12-17	1	-0/+115
\| \| \| \| \| \| \| \|	We somehow missed doing this when we were working on Power9 exploitation. This just adds the missing legalization and cost for producing the vector intrinsics. Differential revision: https://reviews.llvm.org/D70436
*	[AIX] Avoid unset csect assert for functions defined after their use in TOC	David Tenty	2019-12-17	1	-8/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If a function is defined after it appears in a TOC expression, we may try to access an unset containing csect when returning a symbol for the expression. Reviewers: Xiangling_L, DiggerLin, jasonliu, hubert.reinterpretcast Reviewed By: hubert.reinterpretcast Subscribers: hubert.reinterpretcast, wuzish, nemanjai, hiraditya, kbarton, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71125
*	[FPEnv] Remove unnecessary rounding mode argument for constrained intrinsics	Ulrich Weigand	2019-12-17	2	-68/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The following intrinsics currently carry a rounding mode metadata argument: llvm.experimental.constrained.minnum llvm.experimental.constrained.maxnum llvm.experimental.constrained.ceil llvm.experimental.constrained.floor llvm.experimental.constrained.round llvm.experimental.constrained.trunc This is not useful since the semantics of those intrinsics do not in any way depend on the rounding mode. In similar cases, other constrained intrinsics do not have the rounding mode argument. Remove it here as well. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D71218
*	[NFC][Test][PowerPC] Add the test to verify the mask with constant	QingShan Zhang	2019-12-17	1	-0/+22
\|
*	[PowerPC] Fix %llvm.ppc.altivec.vc* lowering	Jim Lin	2019-12-16	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: r372285 changed LLVM to use a `TargetConstant` for parameters of intrinsics that are required to be immediates. Since that commit, use of `%llvm.ppc.altivec.vc{fsx,fux,tsxs,tuxs}` intrinsics has not worked, and resulted in a `LLVM ERROR: Cannot select: intrinsic %llvm.ppc.altivec.vc*` error. The intrinsics' TableGen definitions matched on `imm` instead of `timm`. This commit updates those definitions to use `timm`. Fixes: https://llvm.org/PR44239 Reviewers: hfinkel, nemanjai, #powerpc, Jim Reviewed By: Jim Subscribers: qiucf, wuzish, Jim, hiraditya, kbarton, jsji, shchenz, llvm-commits Tags: #llvm Patched by vddvss (Colin Samples). Differential Revision: https://reviews.llvm.org/D71138
*	[PowerPC] Add Support for indirect calls on AIX.	Sean Fertile	2019-12-13	2	-0/+158
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Extends the descriptor-based indirect call support for 32-bit codegen, and enables indirect calls for AIX. In-depth Description: In a function descriptor based ABI, a function pointer points at a descriptor structure as opposed to the function's entry point. The descriptor takes the form of 3 pointers: 1 for the function's entry point, 1 for the TOC anchor of the module containing the function definition, and 1 for the environment pointer: struct FunctionDescriptor { void *EntryPoint; void *TOCAnchor; void *EnvironmentPointer; }; An indirect call has several steps of loading the information from the descriptor into the proper registers for setting up the call. Namely it has to: 1) Save the caller's TOC pointer into the TOC save slot in the linkage area, and then load the callee's TOC pointer into the TOC register (GPR 2 on AIX). 2) Load the function descriptor's entry point into the count register. 3) Load the environment pointer into the environment pointer register (GPR 11 on AIX). 4) Perform the call by branching on count register. 5) Restore the caller's TOC pointer after returning from the indirect call. A couple important caveats to the above: - There is no way to directly load a value from memory into the count register. Instead we populate the count register by loading the entry point address into a gpr and then moving the gpr to the count register. - The TOC restore has to come immediately after the branch on count register instruction (i.e., the 1st instruction executed after we return from the call). This is an implementation limitation. We could, in theory, schedule the restore elsewhere as long as no uses of the TOC pointer fall in between the call and the restore; however, to keep it simple, we insert a pseudo instruction that represents both the indirect branch instruction and the load instruction that restores the caller's TOC from the linkage area. As they flow through the compiler as a single pseudo instruction, nothing can be inserted between them and the caller's TOC is then valid at any use. Differential Revision: https://reviews.llvm.org/D70724
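The five steps of a descriptor-based indirect call can be walked through with a toy model. The Python below is only a sketch: registers are dictionary entries, the "entry point" is a Python callable, and names such as `gpr_tmp`, `toc_save`, and `indirect_call` are invented for the illustration. It does not represent the generated machine code.

```python
from typing import Callable, Dict, NamedTuple


class FunctionDescriptor(NamedTuple):
    """Toy stand-in for the three-pointer descriptor."""
    entry_point: Callable[[Dict[str, object]], int]
    toc_anchor: int
    environment_pointer: int


def indirect_call(regs: Dict[str, object],
                  linkage_area: Dict[str, object],
                  desc: FunctionDescriptor) -> int:
    # 1) Save the caller's TOC pointer, then load the callee's TOC into r2.
    linkage_area["toc_save"] = regs["r2"]
    regs["r2"] = desc.toc_anchor
    # 2) Entry point goes through a GPR first; the count register cannot
    #    be loaded from memory directly.
    regs["gpr_tmp"] = desc.entry_point
    regs["ctr"] = regs["gpr_tmp"]
    # 3) Load the environment pointer into r11.
    regs["r11"] = desc.environment_pointer
    # 4) Branch on the count register.
    result = regs["ctr"](regs)
    # 5) Restore the caller's TOC immediately after the call returns.
    regs["r2"] = linkage_area["toc_save"]
    return result


def callee(regs: Dict[str, object]) -> int:
    """Example callee: reports the TOC and environment pointer it observed."""
    return regs["r2"] + regs["r11"]
```

Calling `indirect_call` with a caller TOC in `regs["r2"]` shows the callee running with its own TOC while the caller sees its original value again once the call returns.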
*	[DAGCombiner] fold shift-trunc-shift to shift-mask-trunc (2nd try)	Sanjay Patel	2019-12-13	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The initial attempt (rG89633320) botched the logic by reversing the source/dest types. Added x86 tests for additional coverage. The vector tests show a potential improvement (fold vector load instead of broadcasting), but that's a known/existing problem. This fold is done in IR by instcombine, and we have a special form of it already here in DAGCombiner, but we want the more general transform too: https://rise4fun.com/Alive/3jZm Name: general Pre: (C1 + zext(C2) < 64) %s = lshr i64 %x, C1 %t = trunc i64 %s to i16 %r = lshr i16 %t, C2 => %s2 = lshr i64 %x, C1 + zext(C2) %a = and i64 %s2, zext((1 << (16 - C2)) - 1) %r = trunc %a to i16 Name: special Pre: C1 == 48 %s = lshr i64 %x, C1 %t = trunc i64 %s to i16 %r = lshr i16 %t, C2 => %s2 = lshr i64 %x, C1 + zext(C2) %r = trunc %s2 to i16 ...because D58017 exposes a regression without this fold.
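The "general" and "special" rewrites quoted above can be spot-checked numerically. This Python sketch mirrors the Alive patterns with plain integer arithmetic on i64 and i16 values; the function names are illustrative, and the loop in the usage note samples inputs rather than proving the fold.

```python
M64 = (1 << 64) - 1
M16 = (1 << 16) - 1


def shift_trunc_shift(x: int, c1: int, c2: int) -> int:
    """Original pattern: lshr i64, trunc to i16, lshr i16."""
    s = (x & M64) >> c1
    t = s & M16
    return t >> c2


def shift_mask_trunc(x: int, c1: int, c2: int) -> int:
    """General fold: one wide shift, mask off the high bits, then truncate."""
    s2 = (x & M64) >> (c1 + c2)
    a = s2 & ((1 << (16 - c2)) - 1)
    return a & M16


def shift_trunc_special(x: int, c2: int) -> int:
    """Special case C1 == 48: the mask is not needed."""
    s2 = (x & M64) >> (48 + c2)
    return s2 & M16
```

A quick check is to compare `shift_trunc_shift` and `shift_mask_trunc` for a handful of 64-bit inputs with `c2` in `range(16)` and `c1 + c2 < 64`, and to compare the special form against the original with `c1 = 48`.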
*	Revert "[DAGCombiner] fold shift-trunc-shift to shift-mask-trunc"	Sanjay Patel	2019-12-12	1	-1/+2
\| \| \| \| \|	This reverts commit 8963332c3327daa652ba3e26d35f9109b6991985. There was a logic bug typo in this code, but it wasn't visible in the asm for the tests.