bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Rename ExpandISelPseudo->FinalizeISel, delay register reservation	Matt Arsenault	2019-06-19	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	This allows targets to make more decisions about reserved registers after isel. For example, now it should be certain there are calls or stack objects in the frame or not, which could have been introduced by legalization. Patch by Matthias Braun llvm-svn: 363757
*	[GlobalISel][Localizer] Rewrite localizer to run in 2 phases, inter & intra ↵	Amara Emerson	2019-06-17	2	-0/+149
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	block. Inter-block localization is the same as what currently happens, except now it only runs on the entry block because that's where the problematic constants with long live ranges come from. The second phase is a new intra-block localization phase which attempts to re-sink the already localized instructions further right before one of the multiple uses. One additional change is to also localize G_GLOBAL_VALUE as they're constants too. However, on some targets like arm64 it takes multiple instructions to materialize the value, so some additional heuristics with a TTI hook have been introduced attempt to prevent code size regressions when localizing these. Overall, these changes improve CTMark code size on arm64 by 1.2%. Full code size results: Program baseline new diff ------------------------------------------------------------------------------ test-suite...-typeset/consumer-typeset.test 1249984 1217216 -2.6% test-suite...:: CTMark/ClamAV/clamscan.test 1264928 1232152 -2.6% test-suite :: CTMark/SPASS/SPASS.test 1394092 1361316 -2.4% test-suite...Mark/mafft/pairlocalalign.test 731320 714928 -2.2% test-suite :: CTMark/lencod/lencod.test 1340592 1324200 -1.2% test-suite :: CTMark/kimwitu++/kc.test 3853512 3820420 -0.9% test-suite :: CTMark/Bullet/bullet.test 3406036 3389652 -0.5% test-suite...ark/tramp3d-v4/tramp3d-v4.test 8017000 8016992 -0.0% test-suite...TMark/7zip/7zip-benchmark.test 2856588 2856588 0.0% test-suite...:: CTMark/sqlite3/sqlite3.test 765704 765704 0.0% Geomean difference -1.2% Differential Revision: https://reviews.llvm.org/D63303 llvm-svn: 363632
*	Propagate fmf in IRTranslate for fneg	Michael Berg	2019-06-17	1	-0/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This case is related to D63405 in that we need to be propagating FMF on negates. Reviewers: volkan, spatel, arsenm Reviewed By: arsenm Subscribers: wdng, javed.absar Differential Revision: https://reviews.llvm.org/D63458 llvm-svn: 363631
*	[test][AArch64] Relax the check line for G_BRJT in legalizer-info-validation.mir	Volkan Keles	2019-06-17	1	-1/+1
\| \| \| \| \| \|	Replace the specific number with a pattern to relax the test. llvm-svn: 363621
*	[globalisel] Fix iterator invalidation in the extload combines	Daniel Sanders	2019-06-17	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Change the way we deal with iterator invalidation in the extload combines as it was still possible to neglect to visit a use. Even worse, it happened in the in-tree test cases and the checks weren't good enough to detect it. We now take a cheap copy of the use list before iterating over it. This prevents iterator invalidation from occurring and has the nice side effect of making the existing schedule-for-erase/schedule-for-insert mechanism moot. Reviewers: aditya_nandakumar Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, javed.absar, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61813 llvm-svn: 363616
*	[GlobalISel][AArch64] Fold G_SUB into G_ICMP when it's safe to do so	Jessica Paquette	2019-06-17	1	-0/+291
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Basically porting over the behaviour in AArch64ISelLowering to GISel. See emitComparison for reference. When we have something like this: ``` lhs = G_SUB 0, y ... G_ICMP lhs, rhs ``` We can fold away the G_SUB and produce a cmn instead, given that we produce the same value in NZCV. Add a test showing that the transformation works, and also showing that we don't perform the transformation when it's unsafe. Also factor out the CSet emission into emitCSetForICMP. Differential Revision: https://reviews.llvm.org/D63163 llvm-svn: 363596
*	Describe stack-id as an enum	Sander de Smalen	2019-06-17	6	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch changes MIR stack-id from an integer to an enum, and adds printing/parsing support for this in MIR files. The default stack-id '0' is now renamed to 'default'. This should make MIR tests that have stack objects with different stack-ids more descriptive. It also clarifies code operating on StackID. Reviewers: arsenm, thegameg, qcolombet Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D60137 llvm-svn: 363533
*	[GlobalISel] Add a G_BRJT opcode.	Amara Emerson	2019-06-14	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	This is a branch opcode that takes a jump table pointer, jump table index and an index into the table to do an indirect branch. We pass both the table pointer and JTI to allow targets like ARM64 to more easily use the existing jump table compression optimization without having to walk up the block to find a paired G_JUMP_TABLE. Differential Revision: https://reviews.llvm.org/D63159 llvm-svn: 363434
*	[GlobalISel][IRTranslator] Add debug loc with line 0 to constants emitted ↵	Amara Emerson	2019-06-13	1	-0/+74
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	into the entry block. Constants, including G_GLOBAL_VALUE, are all emitted into the entry block which lets us use the vreg def assuming it dominates all other users. However, it can cause jumpy debug behaviour since the DebugLoc attached to these MIs are from a user instruction that could be in a different block. Fixes PR40887. Differential Revision: https://reviews.llvm.org/D63286 llvm-svn: 363331
*	[GlobalISel] Translate memset/memmove/memcpy from undef ptrs into nops	Jessica Paquette	2019-06-10	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the source is undef, then just don't do anything. This matches SelectionDAG's behaviour in SelectionDAG.cpp. Also add a test showing that we do the right thing here. (irtranslator-memfunc-undef.ll) Differential Revision: https://reviews.llvm.org/D63095 llvm-svn: 362989
*	[AArch64][GlobalISel] Select immediate forms of cmp instructions.	Amara Emerson	2019-06-09	2	-3/+74
\| \| \| \| \| \| \| \|	A simple re-use of the immediate operand matcher and renderer functions. rdar://43795178 llvm-svn: 362896
*	[GlobalISel] IRTranslator: Translate the intrinsics ignored by CodeGen	Volkan Keles	2019-06-07	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Translate `llvm.assume`, `llvm.var.annotation` and `llvm.sideeffect` to nothing as they have no effect on CodeGen. Reviewers: qcolombet, aditya_nandakumar, dsanders, paquette, aemerson, arsenm Reviewed By: arsenm Subscribers: hiraditya, wdng, rovka, kristof.beyls, javed.absar, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63022 llvm-svn: 362834
*	[AArch64][GlobalISel] Add manual selection support for G_ZEXTLOADs to s64.	Amara Emerson	2019-06-06	1	-5/+134
\| \| \| \| \| \| \| \| \| \| \|	We already get support for G_ZEXTLOAD to s32 from the importer, but it can't deal with the SUBREG_TO_REG in the pattern. Tweaking the existing manual selection code for G_LOAD to handle an additional SUBREG_TO_REG when dealing with G_ZEXTLOAD isn't much work. Also add tests to check the imported pattern selections to s32 work. llvm-svn: 362681
*	[AArch64][GlobalISel] Add the new changes to fix PR42129 that were supposed ↵	Amara Emerson	2019-06-06	1	-0/+32
\| \| \| \| \| \| \| \|	to go into r362666. The changes weren't staged so ended up just re-commiting the unmodified reverted change. llvm-svn: 362677
*	Revert "Revert "[AArch64][GlobalISel] Optimize G_FCMP + G_SELECT pairs when ↵	Amara Emerson	2019-06-05	1	-0/+351
\| \| \| \| \| \| \| \| \| \| \| \|	G_SELECT is fp"" When looking through copies, make sure to not try to find the vreg def of a physreg. Normally getVRegDef will return nullptr in this case, but if there happens to be multiple defs then it will assert. This fixes PR42129. llvm-svn: 362666
*	Revert "[AArch64][GlobalISel] Optimize G_FCMP + G_SELECT pairs when G_SELECT ↵	Petr Hosek	2019-06-05	1	-351/+0
\| \| \| \| \| \| \| \|	is fp" This reverts commit r362435 as this triggers ICE, see PR42129 for details. llvm-svn: 362662
*	[AArch64][GlobalISel] Make extloads to i64 legal.	Amara Emerson	2019-06-04	1	-0/+148
\| \| \| \| \| \| \| \|	Although we had the support in the prelegalizer combiner to generate the G_SEXTLOAD or G_ZEXTLOAD ops, the legalizer definitions for arm64 had them as lowering back to separate ops. llvm-svn: 362553
*	[AArch64][GlobalISel] Optimize G_FCMP + G_SELECT pairs when G_SELECT is fp	Jessica Paquette	2019-06-03	1	-0/+351
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of emitting all of the test stuff for a compare when it's only used by a select, instead, just emit the compare + select. The select will use the value of NZCV correctly, so we don't need to emit all of the test instructions etc. For now, only support fp selects which use G_FCMP. Also only support condition codes which will only require one select to represent. Also add a test. Differential Revision: https://reviews.llvm.org/D62695 llvm-svn: 362446
*	[AArch64][GlobalISel] Select FCMPSri/FCMPDri when comparing against 0.0	Jessica Paquette	2019-05-28	1	-0/+56
\| \| \| \| \| \| \| \| \| \| \|	Add support for selecting FCMPSri and FCMPDri when comparing against 0.0, and factor out opcode selection for G_FCMP into its own function. Add a test to show that we don't do this with other immediates. Differential Revision: https://reviews.llvm.org/D62539 llvm-svn: 361888
*	[GlobalISel][AArch64] Make FP constraint checks consider possible use/def banks	Jessica Paquette	2019-05-24	1	-0/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In a few places in getInstrMapping, we check if use/def instructions for the instruction we're mapping have floating point constraints. We can improve this check and reduce the number of copies in GISel-compiled code if we make a couple observations: - For a def instruction, it only matters if the def instruction must always output a value stored on a FPR - For a use instruction, it only matters if the use instruction must always only take in values stored in FPRs This adds two new functions: - onlyUsesFP - onlyDefinesFP Then we can use those when we're checking the uses/defs instead. Without this patch, the load, unmerge, store, and select in the added test would have unnecessary copies. Differential Revision: https://reviews.llvm.org/D62426 llvm-svn: 361679
*	[GlobalISel][AArch64] Improve register bank mappings for G_SELECT	Jessica Paquette	2019-05-24	1	-0/+130
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The fcsel and csel instructions differ in only the register banks they work on. So, they're entirely interchangeable otherwise. With this in mind, this does two things: - Teach AArch64RegisterBankInfo to consider the inputs to G_SELECT as well as the outputs. - Teach it to choose the best register bank mapping based off the constraints of the inputs and outputs. The "best" in this case means the one that requires the smallest number of copies to properly emit a fcsel/csel. For example, if the inputs are all already going to be on FPRs, we should emit a fcsel, even if the output is a GPR. This costs one copy to produce the result, but saves us from copying the inputs into GPRs. Also update the regbank-select.mir to check that we end up with the right select instruction. Differential Revision: https://reviews.llvm.org/D62267 llvm-svn: 361665
*	GlobalISel: support swifterror attribute on AArch64.	Tim Northover	2019-05-24	2	-18/+521
\| \| \| \| \| \| \| \|	swifterror marks an argument as a register pretending to be a pointer, so we need a guaranteed mem2reg-like analysis of its uses. Fortunately most of the infrastructure can be reused from the DAG world. llvm-svn: 361608
*	GlobalISel: Define integer min/max instructions	Matt Arsenault	2019-05-17	1	-0/+12
\| \| \| \| \| \| \|	Doesn't attempt to emit them for anything yet, but some legalizations I want to port use them. llvm-svn: 361061
*	GlobalISel: Add G_FCOPYSIGN	Matt Arsenault	2019-05-16	2	-0/+15
\| \| \| \|	llvm-svn: 360850
*	[GlobalISel] Handle <1 x T> vector return types properly.	Amara Emerson	2019-05-06	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \|	After support for dealing with types that need to be extended in some way was added in r358032 we didn't correctly handle <1 x T> return types. These types don't have a GISel direct representation, instead we just see them as scalars. When we need to pad them into <2 x T> types however we need to use a G_BUILD_VECTOR instead of trying to do a G_CONCAT_VECTOR. This fixes PR41738. llvm-svn: 360068
*	[AArch64][GlobalISel] Use fcsel instead of csel for G_SELECT on FPRs	Jessica Paquette	2019-05-03	2	-0/+126
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This saves us some unnecessary copies. If the inputs to a G_SELECT are floating point, we should use fcsel rather than csel. Changes here are... - Teach selectCopy about s1-to-s1 copies across register banks. - AArch64RegisterBankInfo about G_SELECT in general. - Teach the instruction selector about the FCSEL instructions. Also add two tests: - select-select.mir to show that we get the expected FCSEL - regbank-select.mir (unfortunately named) to show the register banks on G_SELECT are properly preserved And update fast-isel-select.ll to show that we do the same thing as other instruction selectors in these cases. llvm-svn: 359940
*	[IRTranslator] Use the alloc size instead of the store size when translating ↵	Quentin Colombet	2019-05-03	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	allocas We use to incorrectly use the store size instead of the alloc size when creating the stack slot for allocas. On aarch64 this can be demonstrated by allocating weirdly sized types. For instance, in the added test case, we use an alloca for i19. We used to allocate a slot of size 24-bit (19 rounded up to the next byte), whereas we really want to use a full 32-bit slot for this type. llvm-svn: 359856
*	[GlobalISel][AArch64] Use fmov for G_FCONSTANT when possible	Jessica Paquette	2019-05-01	1	-6/+4
\| \| \| \| \| \| \| \| \| \|	This adds support for using fmov rather than a standard mov to materialize G_FCONSTANT when it's safe to do so. Update arm64-fast-isel-materialize.ll and select-constant.mir to show that the selection is correct. llvm-svn: 359734
*	[GlobalISel][AArch64] Use getConstantVRegValWithLookThrough for extracts	Jessica Paquette	2019-04-26	1	-0/+71
\| \| \| \| \| \| \| \| \| \| \| \|	getConstantVRegValWithLookThrough does the same thing as the getConstantValueForReg function, and has more visibility across GISel. Plus, it supports looking through G_TRUNC, G_SEXT, and G_ZEXT. So, we get better code reuse and more functionality for free by using it. Add some test cases to select-extract-vector-elt.mir to show that we can now look through those instructions. llvm-svn: 359351
*	[AArch64][GlobalISel] Select G_BSWAP for vectors of s32 and s64	Jessica Paquette	2019-04-26	1	-6/+70
\| \| \| \| \| \| \| \| \|	There are instructions for these, so mark them as legal. Select the correct instruction in AArch64InstructionSelector.cpp. Update select-bswap.mir and arm64-rev.ll to reflect the changes. llvm-svn: 359331
*	Fix alignment in AArch64InstructionSelector::emitConstantPoolEntry()	Hans Wennborg	2019-04-26	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code was using the alignment of a pointer to the value, not the alignment of the constant itself. Maybe we got away with it so far because the pointer alignment is fairly high, but we did end up under-aligning <16 x i8> vectors, which was caught in the Chromium build after lld stopped over-aligning the .rodata.cst16 section in r356428. (See crbug.com/953815) Differential revision: https://reviews.llvm.org/D61124 llvm-svn: 359287
*	[GlobalISel][AArch64] Make G_EXTRACT_VECTOR_ELT legal for v8s16s	Jessica Paquette	2019-04-25	1	-0/+23
\| \| \| \| \| \| \| \|	This case was missing before, so we couldn't legalize it. Add it to AArch64LegalizerInfo.cpp and update select-extract-vector-elt.mir. llvm-svn: 359231
*	[GlobalISel][AArch64] Add generic legalization rule for extends	Jessica Paquette	2019-04-25	3	-3/+303
\| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a legalization rule for G_ZEXT, G_ANYEXT, and G_SEXT which allows extends whenever the types will fit in registers (or the source is an s1). Update tests. Add GISel checks throughout all of arm64-vabs.ll, where we now select a good portion of the code. Add GISel checks to arm64-subvector-extend.ll, which has a good number of vector extends in it. Differential Revision: https://reviews.llvm.org/D60889 llvm-svn: 359222
*	[GlobalISel][AArch64] Legalize G_FNEARBYINT	Jessica Paquette	2019-04-25	4	-1/+558
\| \| \| \| \| \| \| \| \|	Add legalizer support for G_FNEARBYINT. It's the same as G_FCEIL etc. Since the importer allows us to automatically select this after legalization, also add tests for selection etc. Also update arm64-vfloatintrinsics.ll. llvm-svn: 359204
*	[GlobalISel] Add IRTranslator support for G_FNEARBYINT	Jessica Paquette	2019-04-25	1	-0/+8
\| \| \| \| \| \| \| \| \|	Translate llvm.nearbyint into G_FNEARBYINT as a simple intrinsic. Update arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D60922 llvm-svn: 359203
*	[GlobalISel] Add a G_FNEARBYINT opcode	Jessica Paquette	2019-04-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	For eventually selecting llvm.nearbyint. Equivalent to the SelectionDAG nearbyint node. Update legalizer-info-validation.mir. Differential Revision: https://reviews.llvm.org/D60921 llvm-svn: 359201
*	[AArch64][GlobalISel] Select G_INTRINSIC_ROUND	Jessica Paquette	2019-04-23	1	-0/+198
\| \| \| \| \| \| \|	Add selection support for G_INTRINSIC_ROUND, add a selection test, and add check lines to arm64-vfloatintrinsics.ll and f16-instructions.ll. llvm-svn: 359046
*	[AArch64][GlobalISel] Mark G_INTRINSIC_ROUND as a pre-isel floating point opcode	Jessica Paquette	2019-04-23	1	-0/+196
\| \| \| \| \| \| \| \| \|	Add G_INTRINSIC_ROUND to isPreISelGenericFloatingPointOpcode to ensure that its input and output are assigned the correct register bank. Add a regbankselect test to verify that we get what we expect here. llvm-svn: 359044
*	[AArch64][GlobalISel] Legalize G_INTRINSIC_ROUND	Jessica Paquette	2019-04-23	2	-1/+273
\| \| \| \| \| \| \|	Add it to the same rule as G_FCEIL etc. Add a legalizer test, and add a missing switch case to AArch64LegalizerInfo.cpp. llvm-svn: 359033
*	[AArch64][GlobalISel] Actually select G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	1	-0/+198
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Apparently FileCheck wasn't actually matching the fallback check lines in arm64-vfloatintrinsics.ll properly. So, there were selection fallbacks for G_INTRINSIC_TRUNC there. Actually hook it up into AArch64InstructionSelector.cpp and write a proper selection test. I guess I'll figure out the FileCheck magic to make the fallback checks work properly in arm64-vfloatintrinsics.ll. llvm-svn: 359030
*	[AArch64][GlobalISel] Teach regbankselect about G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	1	-0/+50
\| \| \| \| \| \| \| \|	Add it to isPreISelGenericFloatingPointOpcode, and add a regbankselect test. Update arm64-vfloatintrinsics.ll now that we can select it. llvm-svn: 359022
*	[AArch64][GlobalISel] Legalize G_INTRINSIC_TRUNC	Jessica Paquette	2019-04-23	2	-1/+204
\| \| \| \| \| \| \| \| \|	Same patch as G_FCEIL etc. Add the missing switch case in widenScalar, add G_INTRINSIC_TRUNC to the correct rule in AArch64LegalizerInfo.cpp, and add a test. llvm-svn: 359021
*	[AArch64][GlobalISel] Legalize G_FMA for more vector types	Jessica Paquette	2019-04-23	2	-1/+234
\| \| \| \| \| \| \| \| \|	Same as G_FCEIL, G_FABS, etc. Just move it into that rule. Add a legalizer test for G_FMA, which we didn't have before and update arm64-vfloatintrinsics.ll. llvm-svn: 359015
*	[AArch64][GlobalISel] Add G_FMA to isPreISelGenericFloatingPointOpcode	Jessica Paquette	2019-04-23	1	-0/+57
\| \| \| \| \| \| \| \|	Noticed an unnecessary fallback in arm64-vmul caused by this. Also add a regbankselect test for G_FMA. llvm-svn: 359013
*	Relax test to check for a valid number instead of a specific number.	Douglas Yung	2019-04-22	1	-1/+1
\| \| \| \|	llvm-svn: 358926
*	Revert r358800. Breaks Obsequi from the test suite.	Amara Emerson	2019-04-20	2	-49/+20
\| \| \| \| \| \| \|	The last attempt fixed gcc and consumer-typeset, but Obsequi seems to fail with a different issue. llvm-svn: 358829
*	Revert "Revert "[GlobalISel] Add legalization support for non-power-2 loads ↵	Amara Emerson	2019-04-19	2	-20/+49
\| \| \| \| \| \| \| \| \|	and stores"" We were shifting the wrong component of a split load when trying to combine them back into a single value. llvm-svn: 358800
*	[GlobalISel][AArch64] Legalize + select G_FRINT	Jessica Paquette	2019-04-19	4	-1/+634
\| \| \| \| \| \| \| \| \| \|	Exactly the same as G_FCEIL, G_FABS, etc. Add tests for the fp16/nofp16 behaviour, update arm64-vfloatintrinsics, etc. Differential Revision: https://reviews.llvm.org/D60895 llvm-svn: 358799
*	[GlobalISel] Add IRTranslator support for G_FRINT	Jessica Paquette	2019-04-19	1	-0/+8
\| \| \| \| \| \| \| \|	Add it as a simple intrinsic, update arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D60893 llvm-svn: 358787
*	[GlobalISel] Add a G_FRINT opcode	Jessica Paquette	2019-04-19	1	-0/+3
\| \| \| \| \| \| \| \|	Equivalent to SelectionDAG's frint node. Differential Revision: https://reviews.llvm.org/D60891 llvm-svn: 358785