bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[MBP] Disable aggressive loop rotate in plain mode	Guozhi Wei	2019-08-22	5	-173/+197
\| \| \| \| \| \| \| \| \| \|	Patch https://reviews.llvm.org/D43256 introduced more aggressive loop layout optimization which depends on profile information. If profile information is not available, the statically estimated profile information(generated by BranchProbabilityInfo.cpp) is used. If user program doesn't behave as BranchProbabilityInfo.cpp expected, the layout may be worse. To be conservative this patch restores the original layout algorithm in plain mode. But user can still try the aggressive layout optimization with -force-precise-rotation-cost=true. Differential Revision: https://reviews.llvm.org/D65673 llvm-svn: 369664
*	[DAGCombiner] Remove explicit call to AddToWorklist in sqrt and reciprocal ↵	Amaury Sechet	2019-08-22	1	-24/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	computations Summary: These nodes end up being processed regardless due to DAGCombiner ensuring arguments are processed. This changes the order in which nodes are processed, which fixes an issue on PowerPC. Reviewers: craig.topper, efriedma, RKSimon, lebedev.ri, mcberg2017, stefanp, hfinkel Subscribers: nemanjai, MaskRay, jsji, steven.zhang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66548 llvm-svn: 369662
*	[PowerPC] Regenerate reciprocal tests, as discussed on D66548	Simon Pilgrim	2019-08-22	1	-107/+294
\| \| \| \|	llvm-svn: 369659
*	Adds support for writing the .bss section for XCOFF object files.	Sean Fertile	2019-08-20	1	-2/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adds Wrapper classes for MCSymbol and MCSection into the XCOFF target object writer. Also adds a class to represent the top-level sections, which we materialize in the ObjectWriter. executePostLayoutBinding will map all csects into the appropriate container depending on its storage mapping class, and map all symbols into their containing csect. Once all symbols have been processed we - Assign addresses and symbol table indices. - Calaculte section sizes. - Build the section header table. - Assign the sections raw-pointer value for non-virtual sections. Since the .bss section is virtual, writing the header table is enough to add support. Writing of a sections raw data, or of any relocations is not included in this patch. Testing is done by dumping the section header table, but it needs to be extended to include dumping the symbol table once readobj support for dumping auxiallary entries lands. Differential Revision: https://reviews.llvm.org/D65159 llvm-svn: 369454
*	[PeepholeOptimizer] Don't assume bitcast def always has input	Jinsong Ji	2019-08-19	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we have a MI marked with bitcast bits, but without input operands, PeepholeOptimizer might crash with assert. eg: If we apply the changes in PPCInstrVSX.td as in this patch: [(set v4i32:$XT, (bitconvert (v16i8 immAllOnesV)))]>; We will get assert in PeepholeOptimizer. ``` llvm-lit llvm-project/llvm/test/CodeGen/PowerPC/build-vector-tests.ll -v llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:417: const llvm::MachineOperand &llvm::MachineInstr::getOperand(unsigned int) const: Assertion `i < getNumOperands() && "getOperand() out of range!"' failed. ``` The fix is to abort if we found out of bound access. Reviewers: qcolombet, MatzeB, hfinkel, arsenm Reviewed By: qcolombet Subscribers: wdng, arsenm, steven.zhang, wuzish, nemanjai, hiraditya, kbarton, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65542 llvm-svn: 369261
*	[CodeGen] Do the Simple Early Return in block-placement pass to optimize the ↵	Kang Zhang	2019-08-17	1	-8/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	blocks Summary: Fix a bug of preducessors. In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. But the `early-ret` pass is before `block-placement`, we don't want to run it again. This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D63972 llvm-svn: 369191
*	Revert [CodeGen] Do the Simple Early Return in block-placement pass to ↵	Florian Hahn	2019-08-16	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	optimize the blocks This reverts r368997 (git commit 2a903c0b679bae1919f9fc01f78e4bc6cff2add0) It looks like this commit adds invalid predecessors to MBBs. The example below fails the verifier after MachineBlockPlacement (run llc -verify-machineinstrs): @global.4 = external constant i8* declare i32 @zot(...) define i16* @snork.67() personality i8* bitcast (i32 (...)* @zot to i8) { bb: invoke void undef() to label %bb5 unwind label %bb4 bb4: ; preds = %bb %tmp = landingpad { i8, i32 } catch i8* null unreachable bb5: ; preds = %bb %tmp6 = load i32, i32* null, align 4 %tmp7 = icmp eq i32 %tmp6, 0 br i1 %tmp7, label %bb14, label %bb8 bb8: ; preds = %bb11, %bb5 invoke void undef() to label %bb9 unwind label %bb11 bb9: ; preds = %bb8 %tmp10 = invoke i16* undef() to label %bb14 unwind label %bb11 bb11: ; preds = %bb9, %bb8 %tmp12 = landingpad { i8, i32 } cleanup catch i8 bitcast (i8** @global.4 to i8) %tmp13 = icmp ult i64 undef, undef br i1 %tmp13, label %bb8, label %bb14 bb14: ; preds = %bb11, %bb9, %bb5 %tmp15 = phi i16 [ null, %bb5 ], [ null, %bb11 ], [ %tmp10, %bb9 ] ret i16* %tmp15 } llvm-svn: 369104
*	[PowerPC] add testcases for folding frame offset - NFC	Chen Zheng	2019-08-16	1	-0/+114
\| \| \| \|	llvm-svn: 369077
*	[PowerPC] Use xxleqv to set all one vector IMM(-1).	Jinsong Ji	2019-08-15	7	-55/+55
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: xxspltib/vspltisb are 3 cycle PM instructions, xxleqv is 2 cycle ALU instruction. We should use xxleqv to set all one vectors. Reviewers: hfinkel, nemanjai, steven.zhang Subscribers: hiraditya, kbarton, MaskRay, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65529 llvm-svn: 369006
*	[CodeGen] Do the Simple Early Return in block-placement pass to optimize the ↵	Kang Zhang	2019-08-15	1	-8/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	blocks Summary: This patch has trigger a bug of r368339, and the r368339 has been reverted, So upstream this patch again. In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. But the `early-ret` pass is before `block-placement`, we don't want to run it again. This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D63972 llvm-svn: 368997
*	[PowerPC][NFC] Remove duplicate tests in build-vector-test.ll	Jinsong Ji	2019-08-14	1	-341/+221
\| \| \| \| \| \|	AllOnes has been split into build-vector-allones.ll. llvm-svn: 368900
*	[PowerPC][NFC] Add test for build all one vector with different types.	Jinsong Ji	2019-08-14	1	-0/+109
\| \| \| \| \| \| \|	build-vector-tests.ll is far too big, split such type tests for single buildvector into new file. llvm-svn: 368859
*	[AIX] Add call lowering for parameters that could pass onto FPRs	Jason Liu	2019-08-14	1	-0/+150
\| \| \| \| \| \| \| \| \| \|	Summary: This patch adds call lowering functionality to enable passing parameters onto floating point registers when needed. Differential Revision: https://reviews.llvm.org/D63654 llvm-svn: 368855
*	[AIX]Lowering global address for 32/64bit small/large code models	Xiangling Liao	2019-08-13	2	-0/+76
\| \| \| \| \| \| \| \| \| \| \| \|	This patch implements global address lowering for 32/64 bit with small/large code models. 1.For 32bit large code model on AIX, there are newly added pseudo opcode LWZtocL & ADDIStocHA32, the support of which on MC layer will be provided by future patches. 2.The default code model on AIX should be small code model. 3.Since AIX does not have medium code model, "report_fatal_error" when users specify it. Differential Revision: https://reviews.llvm.org/D63547 llvm-svn: 368744
*	Reland r368691: "[AIX] Implement LR prolog/epilog save/restore"	Hubert Tong	2019-08-13	2	-2/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Trying again with the code changes (and not just the new test). Summary: This patch fixes the offsets of fields in the stack frame linkage save area for AIX. Reviewers: sfertile, hubert.reinterpretcast, jasonliu, Xiangling_L, xingxue, ZarkoCA, daltenty Reviewed By: hubert.reinterpretcast Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64424 Patch by Chris Bowler! llvm-svn: 368721
*	Revert r368691; test checked in without changes by accident	Hubert Tong	2019-08-13	1	-32/+0
\| \| \| \|	llvm-svn: 368699
*	[AIX] Implement LR prolog/epilog save/restore	Hubert Tong	2019-08-13	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch fixes the offsets of fields in the stack frame linkage save area for AIX. Reviewers: sfertile, hubert.reinterpretcast, jasonliu, Xiangling_L, xingxue, ZarkoCA, daltenty Reviewed By: hubert.reinterpretcast Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64424 Patch by Chris Bowler! llvm-svn: 368691
*	[PowerPC] Fix ICE when truncating some vectors	Qiu Chaofan	2019-08-13	1	-0/+123
\| \| \| \| \| \| \| \| \| \| \| \|	The legalizer would hit an assertion on PowerPC platform when truncating a vector whose size is not power of 2. This patch is to add a check to prevent vectors with such odd-size elements from being custom lowered. Reviewed By: Hal Finkel Differential Revision: https://reviews.llvm.org/D65261 llvm-svn: 368654
*	[NFC][PowerPC] Add the test case shrink-wrap.mir and shrink-wrap.ll for PPC	Kang Zhang	2019-08-12	2	-0/+184
\| \| \| \|	llvm-svn: 368597
*	Revert r368339 "[MBP] Disable aggressive loop rotate in plain mode"	Hans Wennborg	2019-08-12	5	-197/+173
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It caused assertions to fire when building Chromium: lib/CodeGen/LiveDebugValues.cpp:331: bool {anonymous}::LiveDebugValues::OpenRangesSet::empty() const: Assertion `Vars.empty() == VarLocs.empty() && "open ranges are inconsistent"' failed. See https://crbug.com/992871#c3 for how to reproduce. > Patch https://reviews.llvm.org/D43256 introduced more aggressive loop layout optimization which depends on profile information. If profile information is not available, the statically estimated profile information(generated by BranchProbabilityInfo.cpp) is used. If user program doesn't behave as BranchProbabilityInfo.cpp expected, the layout may be worse. > > To be conservative this patch restores the original layout algorithm in plain mode. But user can still try the aggressive layout optimization with -force-precise-rotation-cost=true. > > Differential Revision: https://reviews.llvm.org/D65673 llvm-svn: 368579
*	Revert r368565: [CodeGen] Do the Simple Early Return in block-placement pass ↵	Kang Zhang	2019-08-12	1	-4/+8
\| \| \| \| \| \|	to optimize the blocks llvm-svn: 368574
*	[CodeGen] Do the Simple Early Return in block-placement pass to optimize the ↵	Kang Zhang	2019-08-12	1	-8/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	blocks Summary: In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. But the `early-ret` pass is before `block-placement`, we don't want to run it again. This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D63972 llvm-svn: 368565
*	Revert r368509 "[CodeGen] Do the Simple Early Return in block-placement pass ↵	Hans Wennborg	2019-08-12	2	-6/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to optimize the blocks" > In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. > But the `early-ret` pass is before `block-placement`, we don't want to run it again. > This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. > > Reviewed By: efriedma > > Differential Revision: https://reviews.llvm.org/D63972 This also revertes follow-ups r368514 and r368532. llvm-svn: 368560
*	[CodeGen] Do the Simple Early Return in block-placement pass to optimize the ↵	Kang Zhang	2019-08-10	2	-16/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	blocks Summary: In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. But the `early-ret` pass is before `block-placement`, we don't want to run it again. This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D63972 llvm-svn: 368509
*	[MachinePipeliner] Avoid indeterminate order in FuncUnitSorter	Jinsong Ji	2019-08-09	1	-0/+116
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is exposed by adding a new testcase in PowerPC in https://reviews.llvm.org/rL367732 The testcase got different output on different platform, hence breaking buildbots. The problem is that we get differnt FuncUnitOrder when calculateResMII. The root cause is: 1. Two MachineInstr might get SAME priority(MFUsx) from minFuncUnits. 2. Current comparison operator() will return `MFUs1 > MFUs2`. 3. We use iterators for MachineInstr, so the input to FuncUnitSorter might be different on differnt platform due to the iterator nature. So for two MI with same MFU, their order is actually depends on the iterator order, which is platform (implemtation) dependent. This is risky, and may cause cross-compiling problems. The fix is to check make sure we assign a determine order when they are equal. Reviewers: bcahoon, hfinkel, jmolloy Subscribers: nemanjai, hiraditya, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D65992 llvm-svn: 368441
*	[MBP] Disable aggressive loop rotate in plain mode	Guozhi Wei	2019-08-08	5	-173/+197
\| \| \| \| \| \| \| \| \| \|	Patch https://reviews.llvm.org/D43256 introduced more aggressive loop layout optimization which depends on profile information. If profile information is not available, the statically estimated profile information(generated by BranchProbabilityInfo.cpp) is used. If user program doesn't behave as BranchProbabilityInfo.cpp expected, the layout may be worse. To be conservative this patch restores the original layout algorithm in plain mode. But user can still try the aggressive layout optimization with -force-precise-rotation-cost=true. Differential Revision: https://reviews.llvm.org/D65673 llvm-svn: 368339
*	Re-commit "[PowerPC][NFC][MachinePipeliner] Add some regression testcases""	Jinsong Ji	2019-08-08	4	-0/+281
\| \| \| \| \| \|	Remove sms-cpy1.ll first while I investigate the problem. llvm-svn: 368318
*	Enable assembly output of local commons for AIX	David Tenty	2019-08-08	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch enable assembly output of local commons for AIX using .lcomm directives. Adds a EmitXCOFFLocalCommonSymbol to MCStreamer so we can emit the AIX version of .lcomm assembly directives which include a csect name. Handle the case of BSS locals in PPCAIXAsmPrinter by using EmitXCOFFLocalCommonSymbol. Adds a test for generating .lcomm on AIX Targets. Reviewers: cebowleratibm, hubert.reinterpretcast, Xiangling_L, jasonliu, sfertile Reviewed By: sfertile Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64825 llvm-svn: 368306
*	[Strict FP] Allow custom operation actions	Ulrich Weigand	2019-08-06	1	-173/+553
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch changes the DAG legalizer to respect the operation actions set by the target for strict floating-point operations. (Currently, the legalizer will usually fall back to mutate to the non-strict action (which is assumed to be legal), and only skip mutation if the strict operation is marked legal.) With this patch, if whenever a strict operation is marked as Legal or Custom, it is passed to the target as usual. Only if it is marked as Expand will the legalizer attempt to mutate to the non-strict operation. Note that this will now fail if the non-strict operation is itself marked as Custom -- the target will have to provide a Custom definition for the strict operation then as well. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D65226 llvm-svn: 368012
*	Temporarily Revert "[PowerPC][NFC][MachinePipeliner] Add some regression ↵	Eric Christopher	2019-08-03	5	-386/+0
\| \| \| \| \| \| \| \| \| \| \| \|	testcases" It's breaking a number of bots, e.g.: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/13893/steps/check-llvm%20msan/logs/stdio This reverts commit r367732. llvm-svn: 367741
*	[PowerPC][NFC][MachinePipeliner] Add some regression testcases	Jinsong Ji	2019-08-02	5	-0/+386
\| \| \| \| \| \|	Exposed by refactoring in https://reviews.llvm.org/D64665. llvm-svn: 367732
*	recommit:[PowerPC] Eliminate loads/swap feeding swap/store for vector type ↵	Zi Xuan Wu	2019-08-01	3	-116/+43
\| \| \| \| \| \| \| \| \| \| \| \|	by using big-endian load/store In PowerPC, there is instruction to load vector in big endian element order when it's in little endian target. So we can combine vector load + reverse into big endian load to eliminate the swap instruction. Also combine vector reverse + store into big endian store. Differential Revision: https://reviews.llvm.org/D65063 llvm-svn: 367516
*	Migrate some more fadd and fsub cases away from UnsafeFPMath control to ↵	Michael Berg	2019-07-31	4	-216/+294
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	utilize NoSignedZerosFPMath options control Summary: Honoring no signed zeroes is also available as a user control through clang separately regardless of fastmath or UnsafeFPMath context, DAG guards should reflect this context. Reviewers: spatel, arsenm, hfinkel, wristow, craig.topper Reviewed By: spatel Subscribers: rampitec, foad, nhaehnle, wuzish, nemanjai, jvesely, wdng, javed.absar, MaskRay, jsji Differential Revision: https://reviews.llvm.org/D65170 llvm-svn: 367486
*	revert r367382 because buildbot failure	Zi Xuan Wu	2019-07-31	3	-53/+168
\| \| \| \|	llvm-svn: 367388
*	[PowerPC] Eliminate loads/swap feeding swap/store for vector type by using ↵	Zi Xuan Wu	2019-07-31	3	-168/+53
\| \| \| \| \| \| \| \| \| \|	big-endian load/store In PowerPC, there is instruction to load vector in big endian element order when it's in little endian target. So we can combine vector load + reverse into big endian load to eliminate the swap instruction. Also combine vector reverse + store into big endian store. llvm-svn: 367382
*	Address post commit review comments on revision 366727.	Sean Fertile	2019-07-30	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	Addresses number of comment made on D64652 after commiting: - Reorders function decls in the TargetLoweringObjectFileXCOFF class. - Fix comment in MCSectionXCOFF to include description of external reference csects. - Convert several llvm_unreachables to report_fatal_error - Convert several dyn_casts to casts as they are expected not to fail. - Avoid copying DataLayout object. llvm-svn: 367324
*	[NFC][PowerPC] Add test case for D65063	Zi Xuan Wu	2019-07-30	1	-0/+851
\| \| \| \|	llvm-svn: 367283
*	[PowerPC][AIX]Add lowering of MCSymbol MachineOperand.	Sean Fertile	2019-07-26	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \|	Adds machine operand lowering for MCSymbolSDNodes to the PowerPC backend. This is needed to produce call instructions in assembly for AIX because the callee operand is a MCSymbolSDNode. The test is XFAIL'ed for asserts due to a (valid) assertion in PEI that the AIX ABI isn't supported yet. Differential Revision: https://reviews.llvm.org/D63738 llvm-svn: 367133
*	Some case eror for: detected memory leaks	Kang Zhang	2019-07-26	2	-6/+16
\| \| \| \|	llvm-svn: 367083
*	[PowerPC] Do the Simple Early Return in block-placement pass to optimize the ↵	Kang Zhang	2019-07-26	2	-16/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	blocks Summary: In `block-placement` pass, it will create some patterns for unconditional we can do the simple early retrun. But the `early-ret` pass is before `block-placement`, we don't want to run it again. This patch is to do the simple early return to optimize the blocks at the last of `block-placement`. Below is an example ``` BB: \| BB: XOR 3, 3, 4 \| XOR 3, 3, 4 B TBB \| B ChainBB ... \| ... ChainBB: \| ChainBB: B TBB \| ADD 3, 3, 4 ... \| BLR TBB: \| ADD 3, 3, 4 \| BLR \| ``` Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D63972 llvm-svn: 367080
*	[CodeGen] Don't resolve the stack protector frame accesses until PEI	Francis Visoiu Mistrih	2019-07-25	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, stack protector loads and stores are resolved during LocalStackSlotAllocation (if the pass needs to run). When this is the case, the base register assigned to the frame access is going to be one of the vregs created during LocalStackSlotAllocation. This means that we are keeping a pointer to the stack protector slot, and we're using this pointer to load and store to it. In case register pressure goes up, we may end up spilling this pointer to the stack, which can be a security concern. Instead, leave it to PEI to resolve the frame accesses. In order to do that, we make all stack protector accesses go through frame index operands, then PEI will resolve this using an offset from sp/fp/bp. Differential Revision: https://reviews.llvm.org/D64759 llvm-svn: 367068
*	[PowerPC] exclude more icmps in LSR which is converted in later hardware ↵	Chen Zheng	2019-07-25	2	-8/+0
\| \| \| \| \| \| \| \|	loop pass Differential Revision: https://reviews.llvm.org/D64795 llvm-svn: 366976
*	[Codegen] (X & (C l>>/<< Y)) ==/!= 0 --> ((X <</l>> Y) & C) ==/!= 0 fold	Roman Lebedev	2019-07-24	1	-10/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This was originally reported in D62818. https://rise4fun.com/Alive/oPH InstCombine does the opposite fold, in hope that `C l>>/<< Y` expression will be hoisted out of a loop if `Y` is invariant and `X` is not. But as it is seen from the diffs here, if it didn't get hoisted, the produced assembly is almost universally worse. Much like with my recent "hoist add/sub by/from const" patches, we should get almost universal win if we hoist constant, there is almost always an "and/test by imm" instruction, but "shift of imm" not so much, so we may avoid having to materialize the immediate, and thus need one less register. And since we now shift not by constant, but by something else, the live-range of that something else may reduce. Special care needs to be applied not to disturb x86 `BT` / hexagon `tstbit` instruction pattern. And to not get into endless combine loop. Reviewers: RKSimon, efriedma, t.p.northover, craig.topper, spatel, arsenm Reviewed By: spatel Subscribers: hiraditya, MaskRay, wuzish, xbolva00, nikic, nemanjai, jvesely, wdng, nhaehnle, javed.absar, tpr, kristof.beyls, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62871 llvm-svn: 366955
*	[PowerPC] Remove redundant load immediate instructions	Yi-Hong Lyu	2019-07-23	2	-0/+403
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently PowerPC backend emits code like this: r3 = li 0 std r3, 264(r1) r3 = li 0 std r3, 272(r1) This patch fixes that and other cases where a register already contains a value that is loaded so we will get: r3 = li 0 std r3, 264(r1) std r3, 272(r1) Differential Revision: https://reviews.llvm.org/D64220 llvm-svn: 366840
*	[PowerPC] Replace float load/store pair with integer load/store pair when ↵	Zi Xuan Wu	2019-07-23	4	-23/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	it's only used in load/store Replace float load/store pair with integer load/store pair when it's only used in load/store, because float load/store instructions cost more cycles then integer load/store. A typical scenario is when there is a call with more than 13 float arguments passing, we need pass them by stack. So we need a load/store pair to do such memory operation if the variable is global variable. Differential Revision: https://reviews.llvm.org/D64195 llvm-svn: 366775
*	[NFC][PowerPC]Change ADDIStocHA to ADDIStocHA8 to follow 64-bit naming ↵	Jason Liu	2019-07-22	5	-14/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	convention Summary: Since we are planning to add ADDIStocHA for 32bit in later patch, we decided to change 64bit one first to follow naming convention with 8 behind opcode. Patch by: Xiangling_L Differential Revision: https://reviews.llvm.org/D64814 llvm-svn: 366731
*	Stubs out TLOF for AIX and add support for common vars in assembly output.	Sean Fertile	2019-07-22	1	-0/+24
\| \| \| \| \| \| \| \| \|	Stubs out a TargetLoweringObjectFileXCOFF class, implementing only SelectSectionForGlobal for common symbols. Also adds an override of EmitGlobalVariable in PPCAIXAsmPrinter which adds a number of defensive errors and adds support for emitting common globals. llvm-svn: 366727
*	[PowerPC][NFC] Precomit test case for upcoming patch	Nemanja Ivanovic	2019-07-21	1	-0/+125
\| \| \| \| \| \| \|	Just committing a test case for an upcoming patch so that the review can show only the codegen differences. llvm-svn: 366661
*	[PowerPC][NFC] Regenerate test using script	Nemanja Ivanovic	2019-07-21	1	-28/+161
\| \| \| \| \| \| \|	This test case ended up as a hybrid of generated checks and manually inserted checks. Regenerate using script to make it consistent. llvm-svn: 366659
*	[MachineCSE][MachinePRE] Avoid hoisting code from code regions into hot BBs.	Kai Luo	2019-07-19	1	-40/+42
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Current PRE hoists common computations into CMBB = DT->findNearestCommonDominator(MBB, MBB1). However, if CMBB is in a hot loop body, we might get performance degradation. Differential Revision: https://reviews.llvm.org/D64394 llvm-svn: 366570