bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[TargetLowering] SimplifyMultipleUseDemandedBits - add VECTOR_SHUFFLE support.	Simon Pilgrim	2019-07-23	1	-0/+23
\| \| \| \| \| \| \| \|	If all the demanded elts are from one operand and are inline, then we can use the operand directly. The changes are mainly from SSE41 targets which has blendvpd but not cmpgtq, allowing the v2i64 comparison to be simplified as we only need the signbit from alternate v4i32 elements. llvm-svn: 366817
*	[TargetLowering] Add SimplifyMultipleUseDemandedBits	Simon Pilgrim	2019-07-23	1	-1/+128
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces the DAG version of SimplifyMultipleUseDemandedBits, which attempts to peek through ops (mainly and/or/xor so far) that don't contribute to the demandedbits/elts of a node - which means we can do this even in cases where we have multiple uses of an op, which normally requires us to demanded all bits/elts. The intention is to remove a similar instruction - SelectionDAG::GetDemandedBits - once SimplifyMultipleUseDemandedBits has matured. The InstCombine version of SimplifyMultipleUseDemandedBits can constant fold which I haven't added here yet, and so far I've only wired this up to some basic binops (and/or/xor/add/sub/mul) to demonstrate its use. We do see a couple of regressions that need to be addressed: AMDGPU unsigned dot product codegen retains an AND mask (for ZERO_EXTEND) that it previously removed (but otherwise the dotproduct codegen is a lot better). X86/AVX2 has poor handling of vector ANY_EXTEND/ANY_EXTEND_VECTOR_INREG - it prematurely gets converted to ZERO_EXTEND_VECTOR_INREG. The code owners have confirmed its ok for these cases to fixed up in future patches. Differential Revision: https://reviews.llvm.org/D63281 llvm-svn: 366799
*	[DAGCombiner] Make ShrinkLoadReplaceStoreWithStore return an SDValue instead ↵	Craig Topper	2019-07-23	1	-9/+8
\| \| \| \| \| \| \| \| \| \|	of an SDNode*. NFCI The function was calling getNode() on an SDValue to return and the caller turned the result back into a SDValue. So just return the original SDValue to avoid this. llvm-svn: 366779
*	[DAGCombiner] Use SDNode::isOperandOf to simplify some code. NFCI	Craig Topper	2019-07-23	1	-7/+1
\| \| \| \|	llvm-svn: 366778
*	Move variable out from debug only section.	Richard Trieu	2019-07-23	1	-2/+0
\| \| \| \| \| \| \|	MFI is no longer just needed for an assert. Move it out of the debug only section to allow non-assert builds to be able to find it. llvm-svn: 366773
*	[Statepoints] Fix a bug in statepoint lowering for functions w/no-realign-stack	Philip Reames	2019-07-22	1	-1/+8
\| \| \| \| \| \| \| \| \| \|	We were silently using the ABI alignment for all of the stores generated for deopt and gc values. We'd gotten the alignment of the stack slot itself properly reduced (via MachineFrameInfo's clamping), but having the MMO on the store incorrect was enough for us to generate an aligned store to a unaligned location. The simplest fix would have been to just pass the alignment to the helper function, but once we do that, the helper function doesn't really help. So, inline it and directly call the MMO version of DAG.getStore with a properly constructed MMO. Note that there's a separate performance possibility here. Even if we can realign stacks, we probably don't want to if all of the stores are in slowpaths. But that's a later patch, if at all. :) llvm-svn: 366765
*	Stubs out TLOF for AIX and add support for common vars in assembly output.	Sean Fertile	2019-07-22	1	-0/+55
\| \| \| \| \| \| \| \| \|	Stubs out a TargetLoweringObjectFileXCOFF class, implementing only SelectSectionForGlobal for common symbols. Also adds an override of EmitGlobalVariable in PPCAIXAsmPrinter which adds a number of defensive errors and adds support for emitting common globals. llvm-svn: 366727
*	TableGen: Support physical register inputs > 255	Matt Arsenault	2019-07-22	1	-1/+4
\| \| \| \| \| \| \|	This was truncating register value that didn't fit in unsigned char. Switch AMDGPU sendmsg intrinsics to using a tablegen pattern. llvm-svn: 366695
*	Added address-space mangling for stack related intrinsics	Christudasan Devadasan	2019-07-22	2	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \|	Modified the following 3 intrinsics: int_addressofreturnaddress, int_frameaddress & int_sponentry. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D64561 llvm-svn: 366679
*	[IPRA][ARM] Make use of the "returned" parameter attribute	Oliver Stannard	2019-07-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	ARM has code to recognise uses of the "returned" function parameter attribute which guarantee that the value passed to the function in r0 will be returned in r0 unmodified. IPRA replaces the regmask on call instructions, so needs to be told about this to avoid reverting the optimisation. Differential revision: https://reviews.llvm.org/D64986 llvm-svn: 366669
*	[GISel]: Attach missing range metadata while translating G_LOADs	Aditya Nandakumar	2019-07-21	1	-2/+3
\| \| \| \| \| \| \| \| \| \|	https://reviews.llvm.org/D65048 Attach range information to G_LOAD when only defining one register. reviewed by: arsenm llvm-svn: 366656
*	[Codegen][SelectionDAG] X u% C == 0 fold: non-splat vector improvements	Roman Lebedev	2019-07-20	1	-35/+132
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Four things here: 1. Generalize the fold to handle non-splat divisors. Reasonably trivial. 2. Unban power-of-two divisors. I don't see any reason why they should be illegal. * There is no ban in Hacker's Delight * I think the ban came from the same bug that caused the miscompile in the base patch - in `floor((2^W - 1) / D)` we were dividing by `D0` instead of `D`, and we were ensuring that `D0` is not `1`, which made sense. 3. Unban `1` divisors. I no longer believe Hacker's Delight actually says that the fold is invalid for `D = 0`. Further considerations: * We know that * `(X u% 1) == 0` can be constant-folded to `1`, * `(X u% 1) != 0` can be constant-folded to `0`, * Also, we know that * `X u<= -1` can be constant-folded to `1`, * `X u> -1` can be constant-folded to `0`, * https://godbolt.org/z/7jnZJX https://rise4fun.com/Alive/oF6p * We know will end up with the following: `(setule/setugt (rotr (mul N, P), K), Q)` * Therefore, for given new DAG nodes and comparison predicates (`ule`/`ugt`), we will still produce the correct answer if: `Q` is a all-ones constant; and both `P` and `K` are anything other than `undef`. * The fold will indeed produce `Q = all-ones`. 4. Try to re-splat the `P` and `K` vectors - we don't care about their values for the lanes where divisor was `1`. Reviewers: RKSimon, hermord, craig.topper, spatel, xbolva00 Reviewed By: RKSimon Subscribers: hiraditya, javed.absar, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63963 llvm-svn: 366637
*	LiveIntervals: Fix handleMove asserting on BUNDLE	Matt Arsenault	2019-07-19	1	-1/+4
\| \| \| \| \| \| \| \| \|	The top-level BUNDLE instruction should behave as an ordinary instruction. It is supposed to have all relevant registers as implicit operands. Moving it should work as any other instruction. I believe the assert intended to avoid moving instructions inside bundles. llvm-svn: 366605
*	Revert "Use the MachineBasicBlock symbol for a callbr target"	Nick Desaulniers	2019-07-19	1	-7/+2
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit r366523/ccbffefccaff42b0d094c9ef0f49fc3e8c8456ea. Two regressions were immediately reported: - https://github.com/ClangBuiltLinux/linux/issues/614 - https://github.com/ClangBuiltLinux/linux/issues/615 Reported-by: nathanchance llvm-svn: 366600
*	DAG: Handle dbg_value for arguments split into multiple subregs	Matt Arsenault	2019-07-19	1	-23/+52
\| \| \| \| \| \| \| \|	This was handled previously for arguments split due to not fitting in an MVT. This was dropping the register for argument registers split due to TLI::getRegisterTypeForCallingConv. llvm-svn: 366574
*	[MachineCSE][MachinePRE] Avoid hoisting code from code regions into hot BBs.	Kai Luo	2019-07-19	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Current PRE hoists common computations into CMBB = DT->findNearestCommonDominator(MBB, MBB1). However, if CMBB is in a hot loop body, we might get performance degradation. Differential Revision: https://reviews.llvm.org/D64394 llvm-svn: 366570
*	[IPRA] Don't rely on non-exact function definitions	Oliver Stannard	2019-07-19	1	-1/+5
\| \| \| \| \| \| \| \| \|	If a function definition is not exact, then the linker could select a differently-compiled version of it, which could use different registers. https://reviews.llvm.org/D64909 llvm-svn: 366557
*	Use the MachineBasicBlock symbol for a callbr target	Bill Wendling	2019-07-19	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Inline asm doesn't use labels when compiled as an object file. Therefore, we shouldn't create one for the (potential) callbr destination. Instead, use the symbol for the MachineBasicBlock. Reviewers: nickdesaulniers, craig.topper Reviewed By: nickdesaulniers Subscribers: xbolva00, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64888 llvm-svn: 366523
*	[GlobalISel] Translate calls to memcpy et al to G_INTRINSIC_W_SIDE_EFFECTs ↵	Amara Emerson	2019-07-19	2	-42/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	and legalize later. I plan on adding memcpy optimizations in the GlobalISel pipeline, but we can't do that unless we delay lowering to actual function calls. This patch changes the translator to generate G_INTRINSIC_W_SIDE_EFFECTS for these functions, and then have each target specify that using the new custom legalizer for intrinsics hook that they want it expanded it a libcall. Differential Revision: https://reviews.llvm.org/D64895 llvm-svn: 366516
*	CodeGen: Allow !associated metadata to point to aliases.	Peter Collingbourne	2019-07-18	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	This is a small extension of !associated, mostly useful for the implementation convenience of instrumentation passes that RAUW globals with aliases, such as LowerTypeTests. Differential Revision: https://reviews.llvm.org/D64951 llvm-svn: 366502
*	[COFF] Change a variable type to be const in the HeapAllocSite map.	Amy Huang	2019-07-18	4	-5/+7
\| \| \| \|	llvm-svn: 366479
*	[DAGCombine] Pull getSubVectorSrc helper out of ↵	Simon Pilgrim	2019-07-18	1	-22/+22
\| \| \| \| \| \| \| \|	narrowInsertExtractVectorBinOp. NFCI. NFC step towards reusing this in other EXTRACT_SUBVECTOR combines. llvm-svn: 366435
*	Changes to display code view debug info type records in hex format	Nilanjana Basu	2019-07-17	1	-1/+1
\| \| \| \|	llvm-svn: 366390
*	Adding inline comments to code view type record directives for better ↵	Nilanjana Basu	2019-07-17	1	-2/+15
\| \| \| \| \| \|	readability llvm-svn: 366372
*	[PEI] Don't re-allocate a pre-allocated stack protector slot	Francis Visoiu Mistrih	2019-07-17	2	-2/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The LocalStackSlotPass pre-allocates a stack protector and makes sure that it comes before the local variables on the stack. We need to make sure that later during PEI we don't re-allocate a new stack protector slot. If that happens, the new stack protector slot will end up being after the local variables that it should be protecting. Therefore, we would have two slots assigned for two different stack protectors, one at the top of the stack, and one at the bottom. Since PEI will overwrite the assigned slot for the stack protector, the load that is used to compare the value of the stack protector will use the slot assigned by PEI, which is wrong. For this, we need to check if the object is pre-allocated, and re-use that pre-allocated slot. Differential Revision: https://reviews.llvm.org/D64757 llvm-svn: 366371
*	[CodeGen][NFC] Simplify checks for stack protector index checking	Francis Visoiu Mistrih	2019-07-17	2	-13/+11
\| \| \| \| \| \| \|	Use `hasStackProtectorIndex()` instead of `getStackProtectorIndex() >= 0`. llvm-svn: 366369
*	GlobalISel: Handle widenScalar of arbitrary G_MERGE_VALUES sources	Matt Arsenault	2019-07-17	2	-48/+87
\| \| \| \| \| \| \| \| \| \| \|	Extract the sources to the GCD of the original size and target size, padding with implicit_def as necessary. Also fix the case where the requested source type is wider than the original result type. This was ignoring the type, and just using the destination. Do the operation in the requested type and truncate back. llvm-svn: 366367
*	GlobalISel: Handle more cases for widenScalar of G_MERGE_VALUES	Matt Arsenault	2019-07-17	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \|	Use an anyext to the requested type for the leftover operand to produce a slightly wider type, and then truncate the final merge. I have another implementation almost ready which handles arbitrary widens, but I think it produces worse code in this example (which I think is 90% due to not folding redundant copies or folding out implicit_def users), so I wanted to add this as a baseline first. llvm-svn: 366366
*	Basic codegen for MTE stack tagging.	Evgeniy Stepanov	2019-07-17	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \|	Implement IR intrinsics for stack tagging. Generated code is very unoptimized for now. Two special intrinsics, llvm.aarch64.irg.sp and llvm.aarch64.tagp are used to implement a tagged stack frame pointer in a virtual register. Differential Revision: https://reviews.llvm.org/D64172 llvm-svn: 366360
*	[AsmPrinter] Make the encoding of call sites in .gcc_except_table ↵	Alex Bradbury	2019-07-17	3	-6/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	configurable and use for RISC-V The original behavior was to always emit the offsets to each call site in the call site table as uleb128 values, however on some architectures (eg RISCV) these uleb128 offsets into the code cannot always be resolved until link time (because relaxation will invalidate any calculated offsets), and there are no appropriate relocations for uleb128 values. As a consequence it needs to be possible to specify an alternative. This also switches RISCV to use DW_EH_PE_udata4 for call side encodings in .gcc_except_table Differential Revision: https://reviews.llvm.org/D63415 Patch by Edward Jones. llvm-svn: 366329
*	[RISCV] Set correct encodings for DWARF exception handling	Alex Bradbury	2019-07-17	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	This patch sets correct encodings for DWARF exception handling for RISC-V (other than call site encoding, which must be udata4 rather than uleb128 and is handled by D63415). This has the same intend as D63409, except this version matches GCC/binutils behaviour which uses the same encodings regardless of PIC/non-PIC and medlow/medany code model. llvm-svn: 366327
*	[MIPS GlobalISel] ClampScalar and select pointer G_ICMP	Petar Avramovic	2019-07-17	1	-0/+36
\| \| \| \| \| \| \| \| \| \| \|	Add narrowScalar to half of original size for G_ICMP. ClampScalar G_ICMP's operands 2 and 3 to to s32. Select G_ICMP for pointers for MIPS32. Pointer compare is same as for integers, it is enough to declare them as legal type. Differential Revision: https://reviews.llvm.org/D64856 llvm-svn: 366317
*	GlobalISel: Add overload of handleAssignments with CCState	Matt Arsenault	2019-07-16	1	-2/+11
\| \| \| \| \| \| \| \| \| \| \|	AMDGPU needs to allocate special argument registers separately from the user function argument list, so needs direct control over the CCState. The ArgLocs argument is only really necessary because CCState doesn't allow access to it. llvm-svn: 366279
*	DWARF: Skip zero column for inline call sites	David Blaikie	2019-07-16	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	D64033 <https://reviews.llvm.org/D64033> added DW_AT_call_column for inline sites. However, that change wasn't aware of "-gno-column-info". To avoid adding column info when "-gno-column-info" is used, now DW_AT_call_column is only added when we have non-zero column (when "-gno-column-info" is used, column will be zero). Patch by Wenlei He! Differential Revision: https://reviews.llvm.org/D64784 llvm-svn: 366264
*	[Strict FP] Allow more relaxed scheduling	Ulrich Weigand	2019-07-16	1	-10/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reimplement scheduling constraints for strict FP instructions in ScheduleDAGInstrs::buildSchedGraph to allow for more relaxed scheduling. Specifially, allow one strict FP instruction to be scheduled across another, as long as it is not moved across any global barrier. Differential Revision: https://reviews.llvm.org/D64412 Reviewed By: cameron.mcinally llvm-svn: 366222
*	[Remarks][NFC] Combine ParserFormat and SerializerFormat	Francis Visoiu Mistrih	2019-07-16	1	-0/+1
\| \| \| \| \| \|	It's useless to have both. llvm-svn: 366216
*	[DAGCombiner] fold (addcarry (xor a, -1), b, c) -> (subcarry b, a, !c) and ↵	Amaury Sechet	2019-07-16	1	-16/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	flip carry. Summary: As per title. DAGCombiner only mathes the special case where b = 0, this patches extends the pattern to match any value of b. Depends on D57302 Reviewers: hfinkel, RKSimon, craig.topper Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59208 llvm-svn: 366214
*	Fix parameter name comments using clang-tidy. NFC.	Rui Ueyama	2019-07-16	9	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch applies clang-tidy's bugprone-argument-comment tool to LLVM, clang and lld source trees. Here is how I created this patch: $ git clone https://github.com/llvm/llvm-project.git $ cd llvm-project $ mkdir build $ cd build $ cmake -GNinja -DCMAKE_BUILD_TYPE=Debug \ -DLLVM_ENABLE_PROJECTS='clang;lld;clang-tools-extra' \ -DCMAKE_EXPORT_COMPILE_COMMANDS=On -DLLVM_ENABLE_LLD=On \ -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ ../llvm $ ninja $ parallel clang-tidy -checks='-,bugprone-argument-comment' \ -config='{CheckOptions: [{key: StrictMode, value: 1}]}' -fix \ ::: ../llvm/lib//.{cpp,h} ../clang/lib/*/.{cpp,h} ../lld/*/.{cpp,h} llvm-svn: 366177
*	[WebAssembly] Rename except_ref type to exnref	Heejin Ahn	2019-07-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We agreed to rename `except_ref` to `exnref` for consistency with other reference types in https://github.com/WebAssembly/exception-handling/issues/79. This also renames WebAssemblyInstrExceptRef.td to WebAssemblyInstrRef.td in order to use the file for other reference types in future. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64703 llvm-svn: 366145
*	GlobalISel: Implement narrowScalar for vector extract/insert indexes	Matt Arsenault	2019-07-15	1	-0/+11
\| \| \| \|	llvm-svn: 366113
*	[PowerPC] Support fp128 libcalls	Fangrui Song	2019-07-15	1	-0/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	On PowerPC, IEEE 754 quadruple-precision libcall names use "kf" instead of "tf". In libgcc, libgcc/config/rs6000/float128-sed converts TF names to KF names. This patch implements its 24 substitution rules. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D64282 llvm-svn: 366039
*	[DebugInfo] Add column info for inline sites	Jonas Devlieghere	2019-07-12	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The column field is missing for all inline sites, currently it's always zero. This changes populates DW_AT_call_column field for inline sites. Test case modified to cover this change. Patch by: Wenlei He Differential revision: https://reviews.llvm.org/D64033 llvm-svn: 365945
*	Delete dead stores	Fangrui Song	2019-07-12	4	-15/+3
\| \| \| \|	llvm-svn: 365903
*	[DAGCombine] narrowExtractedVectorBinOp - wrap subvector extraction in ↵	Simon Pilgrim	2019-07-12	1	-9/+11
\| \| \| \| \| \| \| \|	helper. NFCI. First step towards supporting 'free' subvector extractions other than concat_vectors. llvm-svn: 365896
*	Revert "[DwarfDebug] Dump call site debug info"	Djordje Todorovic	2019-07-12	10	-393/+43
\| \| \| \| \| \| \| \|	A build failure was found on the SystemZ platform. This reverts commit 9e7e73578e54cd22b3c7af4b54274d743b6607cc. llvm-svn: 365886
*	[MachinePipeliner] Fix order for nodes with Anti dependence in same cycle	Jinsong Ji	2019-07-12	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Problem exposed in PowerPC functional testing. We did not consider Anti dependence for nodes in same cycle, so we may end up generating bad machine code. eg: the reduced test won't verify. * Bad machine code: Using an undefined physical register * - function: lame_encode_buffer_interleaved - basic block: %bb.4 (0x4bde4e12928) - instruction: %29:gprc = ADDZE %27:gprc, implicit-def dead $carry, implicit $carry - operand 3: implicit $carry Reviewers: bcahoon, kparzysz, hfinkel Subscribers: MaskRay, wuzish, nemanjai, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64192 llvm-svn: 365859
*	[DAGCombine] narrowInsertExtractVectorBinOp - add CONCAT_VECTORS support	Simon Pilgrim	2019-07-11	1	-4/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We already split extract_subvector(binop(insert_subvector(v,x),insert_subvector(w,y))) -> binop(x,y). This patch adds support for extract_subvector(binop(concat_vectors(),concat_vectors())) cases as well. In particular this means we don't have to wait for X86 lowering to convert concat_vectors to insert_subvector chains, which helps avoid some cases where demandedelts/combine calls occur too late to split large vector ops. The fast-isel-store.ll load folding regression is annoying but I don't think is that critical. Differential Revision: https://reviews.llvm.org/D63653 llvm-svn: 365785
*	RegUsageInfoCollector: Skip calling conventions I missed before	Matt Arsenault	2019-07-11	1	-0/+3
\| \| \| \|	llvm-svn: 365784
*	GlobalISel: Use Register	Matt Arsenault	2019-07-11	1	-5/+5
\| \| \| \|	llvm-svn: 365780
*	OpaquePtr: switch to GlobalValue::getValueType in a few places. NFC.	Tim Northover	2019-07-11	1	-2/+2
\| \| \| \|	llvm-svn: 365770