bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	MISched: Don't schedule regions with 0 instructions	Matt Arsenault	2019-03-25	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I think this is correct, but may not necessarily be the correct fix for the assertion I'm really trying to solve. If a scheduling region was found that only has dbg_value instructions, the RegPressure tracker would end up in an inconsistent state because it would skip over any debug instructions and point to an instruction outside of the scheduling region. It may still be possible for this to happen if there are some real schedulable instructions between dbg_values, but I haven't managed to break this. The testcase is extremely sensitive and I'm not sure how to make it more resistent to future scheduler changes that would avoid stressing this situation. llvm-svn: 356926
*	[LegalizeDAG] Expand i16 bswap directly to a rotate by 8 instead of relying ↵	Craig Topper	2019-03-24	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	on DAG combine. An i16 bswap can be implemented with an i16 rotate by 8. We previously emitted a shift and OR sequence that DAG combine should be able to turn back into rotate. But we might as well go there directly. If rotate isn't legal, LegalizeDAG should further legalize it to either the opposite rotate, or the shift and OR pattern. I don't know of any way to get the existing DAG combine reliance to fail. So I don't know any way to add new tests for this that wouldn't have worked previously. llvm-svn: 356860
*	[CGP] Make several static functions member functions (NFC)	Teresa Johnson	2019-03-24	1	-19/+25
\| \| \| \| \| \| \|	This is extracted from D59696 as suggested in the review. It is preparation for making the DominatorTree a member variable. llvm-svn: 356857
*	[TargetLowering] SimplifyDemandedBits trunc(srl(x, C1)) - early out for out ↵	Simon Pilgrim	2019-03-22	1	-19/+19
\| \| \| \| \| \|	of range C1. NFCI. llvm-svn: 356810
*	GlobalISel: Fix RegBankSelect for REG_SEQUENCE	Matt Arsenault	2019-03-21	1	-4/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	The AArch64 test was broken since the result register already had a set register class, so this test was a no-op. The mapping verify call would fail because the result size is not the same as the inputs like in a copy or phi. The AMDGPU testcases are half broken and introduce illegal VGPR->SGPR copies which need much more work to handle correctly (same for phis), but add them as a baseline. llvm-svn: 356713
*	[ScalarizeMaskedMemIntrin] Add support for scalarizing expandload and ↵	Craig Topper	2019-03-21	1	-0/+158
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	compressstore intrinsics. This adds support for scalarizing these intrinsics as well the X86TargetTransformInfo support to avoid scalarizing them in the cases X86 can handle. I've omitted handling special cases for constant masks for this first pass. Though CodeGenPrepare can constant fold the branch conditions and remove some of the control flow anyway. Fixes PR40994 and is covers most of PR3666. Might want to implement constant masks to close that. Differential Revision: https://reviews.llvm.org/D59180 llvm-svn: 356687
*	[DAGCombiner] Use getTokenFactor in a few more cases.	Florian Hahn	2019-03-21	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SDNodes can only have 64k operands and for some inputs (e.g. large number of stores), we can reach this limit when creating TokenFactor nodes. This patch is a follow up to D56740 and updates a few more places that potentially can create TokenFactors with too many operands. Reviewers: efriedma, craig.topper, aemerson, RKSimon Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D59156 llvm-svn: 356668
*	[DAGCombine] SimplifySelectCC - call FoldSetCC with the setcc result type	Simon Pilgrim	2019-03-21	1	-2/+3
\| \| \| \| \| \| \| \|	We were calling FoldSetCC with the compare operand type instead of the result type. Found by OSS-Fuzz #13838 (https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=13838) llvm-svn: 356667
*	[CodeGenPrepare] limit formation of overflow intrinsics (PR41129)	Sanjay Patel	2019-03-21	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is probably a bigger limitation than necessary, but since we don't have any evidence yet that this transform led to real-world perf improvements rather than regressions, I'm making a quick, blunt fix. In the motivating x86 example from: https://bugs.llvm.org/show_bug.cgi?id=41129 ...and shown in the regression test, we want to avoid an extra instruction in the dominating block because that could be costly. The x86 LSR test diff is reversing the changes from D57789. There's no evidence that 1 version is any better than the other yet. Differential Revision: https://reviews.llvm.org/D59602 llvm-svn: 356665
*	[SelectionDAG] Add scalarization of ABS node (PR41149)	Simon Pilgrim	2019-03-21	1	-0/+1
\| \| \| \| \| \| \| \|	Patch by: @ikulagin (Ivan Kulagin) Differential Revision: https://reviews.llvm.org/D59577 llvm-svn: 356656
*	[ScalarizeMaskedMemIntrinsics] Reverse some if conditions to reduce ↵	Craig Topper	2019-03-21	1	-20/+16
\| \| \| \| \| \| \| \|	indentations to remove curly braces. Pre-commit for D59180 llvm-svn: 356646
*	Allow machine dce to remove uses in the same instruction	Stanislav Mekhanoshin	2019-03-20	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Machine DCE cannot remove a dead definition if there are non-dbg uses. A use however can be in the same instruction: dead %0 = INST %0 Such instructions sometimes created by Detect dead lanes pass. Allow this instruction to be deleted despite the use if the only use belongs to the same instruction. Differential Revision: https://reviews.llvm.org/D59565 llvm-svn: 356619
*	[CGP] fix formatting; NFC	Sanjay Patel	2019-03-20	1	-3/+4
\| \| \| \|	llvm-svn: 356572
*	[CGP] convert chain of 'if' to 'switch'; NFC	Sanjay Patel	2019-03-20	1	-14/+13
\| \| \| \| \| \| \| \|	This should be extended, but CGP does some strange things, so I'm intentionally not changing the potential order of any transforms yet. llvm-svn: 356566
*	Remove out of date comment. NFCI.	Simon Pilgrim	2019-03-20	1	-1/+0
\| \| \| \| \| \|	DAGCombiner::convertBuildVecZextToZext just requires the extractions to be sequential, they don't have to start from 0'th index. llvm-svn: 356552
*	[ExpandMemCmp] Trigger on bcmp too.	Clement Courbet	2019-03-20	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes 41150. Reviewers: gchatelet Subscribers: hiraditya, llvm-commits, ckennelly, sbenza, jyknight Tags: #llvm Differential Revision: https://reviews.llvm.org/D59593 llvm-svn: 356550
*	[DwarfDebug] Skip entries to big for 16 bit size field in Dwarf < 5.	Florian Hahn	2019-03-19	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Nothing prevents entries from being bigger than the 16 bit size field in Dwarf < 5. For entries that are too big, just emit an empty entry instead of crashing. This fixes PR41038. Reviewers: probinson, aprantl, davide Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D59518 llvm-svn: 356514
*	CodeGen: Refactor regallocator command line and target selection	Matt Arsenault	2019-03-19	1	-27/+34
\| \| \| \| \| \| \| \| \| \|	This will allow targets more flexibility to replace the register allocator core passes. In a future commit, AMDGPU will run the core register assignment passes twice, and will also want to disallow using the standard -regalloc option. llvm-svn: 356506
*	RegAllocFast: Do not allocate registers for undef uses	Matt Arsenault	2019-03-19	1	-0/+48
\| \| \| \| \| \| \| \| \| \|	Do not actually allocate a register for an undef use. Previously we we would create unnecessary reload instruction for undef uses where the register wasn't live. Patch by Matthias Braun llvm-svn: 356501
*	RegAllocFast: Remove early selection loop, the spill calculation will report ↵	Matt Arsenault	2019-03-19	1	-9/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	cost 0 anyway for free regs The 2nd loop calculates spill costs but reports free registers as cost 0 anyway, so there is little benefit from having a separate early loop. Surprisingly this is not NFC, as many register are marked regDisabled so the first loop often picks up later registers unnecessarily instead of the first one available in the allocation order... Patch by Matthias Braun llvm-svn: 356499
*	Fix for ABS legalization on PPC buildbot.	Simon Pilgrim	2019-03-19	1	-0/+1
\| \| \| \|	llvm-svn: 356498
*	Allow unordered loads to be considered invariant in CodeGen	Philip Reames	2019-03-19	2	-3/+10
\| \| \| \| \| \| \| \| \| \|	The actual code change is fairly straight forward, but exercising it isn't. First, it turned out we weren't adding the appropriate flags in SelectionDAG. Second, it turned out that we've got some optimization gaps, so obvious test cases don't work. My first attempt (in atomic-unordered.ll) points out a deficiency in our peephole-opt folding logic which I plan to fix separately. Instead, I'm exercising this through MachineLICM. Differential Revision: https://reviews.llvm.org/D59375 llvm-svn: 356494
*	[AtomicExpand] Fix a crash bug when lowering unordered loads to cmpxchg	Philip Reames	2019-03-19	1	-0/+3
\| \| \| \| \| \|	Add tests for wider atomic loads and stores. In the process, fix a crasher where we appearently handled unorder stores, but not loads, when lowering to cmpxchg idioms. llvm-svn: 356482
*	[DAGCombine] Fix a miscompile when reducing BUILD_VECTORs to a shuffle	Justin Bogner	2019-03-19	1	-11/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In r311255 we added a case where we split vectors whose elements are all derived from the same input vector so that we could shuffle it more efficiently. In doing so, createBuildVecShuffle was taught to adjust for the fact that all indices would be based off of the first vector when this happens, but it's possible for the code that checked that to fire incorrectly if we happen to have a BUILD_VECTOR of extracts from subvectors and don't hit this new optimization. Instead of trying to detect if we've split the vector by checking if we have extracts from the same base vector, we can just pass that information into createBuildVecShuffle, avoiding the miscompile. Differential Revision: https://reviews.llvm.org/D59507 llvm-svn: 356476
*	[SelectionDAG] Handle unary SelectPatternFlavor for ABS case in ↵	Simon Pilgrim	2019-03-19	5	-38/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SelectionDAGBuilder::visitSelect These changes are related to PR37743 and include: SelectionDAGBuilder::visitSelect handles the unary SelectPatternFlavor::SPF_ABS case to build ABS node. Delete the redundant recognizer of the integer ABS pattern from the DAGCombiner. Add promoting the integer ABS node in the LegalizeIntegerType. Expand-based legalization of integer result for the ABS nodes. Expand-based legalization of ABS vector operations. Add some integer abs testcases for different typesizes for Thumb arch Add the custom ABS expanding and change the SAD pattern recognizer for X86 arch: The i64 result of the ABS is expanded to: tmp = (SRA, Hi, 31) Lo = (UADDO tmp, Lo) Hi = (XOR tmp, (ADDCARRY tmp, hi, Lo:1)) Lo = (XOR tmp, Lo) The "detectZextAbsDiff" function is changed for the recognition of pattern with the ABS node. Given a ABS node, detect the following pattern: (ABS (SUB (ZERO_EXTEND a), (ZERO_EXTEND b))). Change integer abs testcases for codegen with the ABS node support for AArch64. Indicate that the ABS is legal for the i64 type when the NEON is supported. Change the integer abs testcases to show changing of codegen. Add combine and legalization of ABS nodes for Thumb arch. Extend 'matchSelectPattern' to recognize the ABS patterns with ICMP_SGE condition. For discussion, see https://bugs.llvm.org/show_bug.cgi?id=37743 Patch by: @ikulagin (Ivan Kulagin) Differential Revision: https://reviews.llvm.org/D49837 llvm-svn: 356468
*	[DebugInfo] Introduce DW_OP_LLVM_convert	Markus Lavin	2019-03-19	14	-37/+230
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce a DW_OP_LLVM_convert Dwarf expression pseudo op that allows for a convenient way to perform type conversions on the Dwarf expression stack. As an additional bonus it paves the way for using other Dwarf v5 ops that need to reference a base_type. The new DW_OP_LLVM_convert is used from lib/Transforms/Utils/Local.cpp to perform sext/zext on debug values but mainly the patch is about preparing terrain for adding other Dwarf v5 ops that need to reference a base_type. For Dwarf v5 the op maps to DW_OP_convert and for earlier versions a complex shift & mask pattern is generated to emulate sext/zext. This is a recommit of r356442 with trivial fixes for the failing tests. Differential Revision: https://reviews.llvm.org/D56587 llvm-svn: 356451
*	Revert "[DebugInfo] Introduce DW_OP_LLVM_convert"	Markus Lavin	2019-03-19	14	-229/+37
\| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 1cf4b593a7ebd666fc6775f3bd38196e8e65fafe. Build bots found failing tests not detected locally. Failing Tests (3): LLVM :: DebugInfo/Generic/convert-debugloc.ll LLVM :: DebugInfo/Generic/convert-inlined.ll LLVM :: DebugInfo/Generic/convert-linked.ll llvm-svn: 356444
*	[DebugInfo] Introduce DW_OP_LLVM_convert	Markus Lavin	2019-03-19	14	-37/+229
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce a DW_OP_LLVM_convert Dwarf expression pseudo op that allows for a convenient way to perform type conversions on the Dwarf expression stack. As an additional bonus it paves the way for using other Dwarf v5 ops that need to reference a base_type. The new DW_OP_LLVM_convert is used from lib/Transforms/Utils/Local.cpp to perform sext/zext on debug values but mainly the patch is about preparing terrain for adding other Dwarf v5 ops that need to reference a base_type. For Dwarf v5 the op maps to DW_OP_convert and for earlier versions a complex shift & mask pattern is generated to emulate sext/zext. Differential Revision: https://reviews.llvm.org/D56587 llvm-svn: 356442
*	[GlobalISel] Include missing change from r356396	Amara Emerson	2019-03-18	1	-4/+2
\| \| \| \| \| \|	Forgot to add a change to relax some asserts in r356396. llvm-svn: 356411
*	Revert r356304: remove subreg parameter from MachineIRBuilder::buildCopy()	Amara Emerson	2019-03-18	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	After review comments, it was preferred to not teach MachineIRBuilder about non-generic instructions beyond using buildInstr(). For AArch64 I've changed the buildCopy() calls to buildInstr() + a separate addReg() call. This also relaxes the MachineIRBuilder's COPY checking more because it may not always have a SrcOp given to it. llvm-svn: 356396
*	[TargetLowering] Add code size information on isFPImmLegal. NFC	Adhemerval Zanella	2019-03-18	2	-39/+63
\| \| \| \| \| \| \| \| \| \| \|	This allows better code size for aarch64 floating point materialization in a future patch. Reviewers: evandro Differential Revision: https://reviews.llvm.org/D58690 llvm-svn: 356389
*	[DAG] Cleanup unused node in SimplifySelectCC.	Nirav Dave	2019-03-18	1	-8/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Delete temporarily constructed node uses for analysis after it's use, holding onto original input nodes. Ideally this would be rewritten without making nodes, but this appears relatively complex. Reviewers: spatel, RKSimon, craig.topper Subscribers: jdoerfert, hiraditya, deadalnix, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57921 llvm-svn: 356382
*	[DebugInfo] Ignore bitcasts when lowering stack arg dbg.values	David Stenberg	2019-03-18	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Look past bitcasts when looking for parameter debug values that are described by frame-index loads in `EmitFuncArgumentDbgValue()`. In the attached test case we would be left with an undef `DBG_VALUE` for the parameter without this patch. A similar fix was done for parameters passed in registers in D13005. This fixes PR40777. Reviewers: aprantl, vsk, jmorse Reviewed By: aprantl Subscribers: bjope, javed.absar, jdoerfert, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D58831 llvm-svn: 356363
*	[CodeGen] Defined MVTs v3i32, v3f32, v5i32, v5f32	Tim Renouf	2019-03-17	1	-0/+8
\| \| \| \| \| \| \| \| \|	AMDGPU would like to use these MVTs. Differential Revision: https://reviews.llvm.org/D58901 Change-Id: I6125fea810d7cc62a4b4de3d9904255a1233ae4e llvm-svn: 356351
*	[CodeGen] Prepare for introduction of v3 and v5 MVTs	Tim Renouf	2019-03-17	3	-2/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AMDGPU would like to have MVTs for v3i32, v3f32, v5i32, v5f32. This commit does not add them, but makes preparatory changes: * Exclude non-legal non-power-of-2 vector types from ComputeRegisterProp mechanism in TargetLoweringBase::getTypeConversion. * Cope with SETCC and VSELECT for odd-width i1 vector when the other vectors are legal type. Some of this patch is from Matt Arsenault, also of AMD. Differential Revision: https://reviews.llvm.org/D58899 Change-Id: Ib5f23377dbef511be3a936211a0b9f94e46331f8 llvm-svn: 356350
*	RegAllocFast: Add hint to debug printing	Matt Arsenault	2019-03-17	1	-1/+2
\| \| \| \|	llvm-svn: 356348
*	[DAGCombine] Fold (x & ~y) \| y patterns	Nikita Popov	2019-03-17	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fold (x & ~y) \| y and it's four commuted variants to x \| y. This pattern can in particular appear when a vselect c, x, -1 is expanded to (x & ~c) \| (-1 & c) and combined to (x & ~c) \| c. This change has some overlap with D59066, which avoids creating a vselect of this form in the first place during uaddsat expansion. Differential Revision: https://reviews.llvm.org/D59174 llvm-svn: 356333
*	[TargetLowering] improve the default expansion of uaddsat/usubsat	Sanjay Patel	2019-03-17	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a subset of what was proposed in: D59006 ...and may overlap with test changes from: D59174 ...but it seems like a good general optimization to turn selects into bitwise-logic when possible because we never know exactly what can happen at this stage of DAG combining depending on how the target has defined things. Differential Revision: https://reviews.llvm.org/D59066 llvm-svn: 356332
*	[DAGCombine] combineShuffleOfScalars - handle non-zero SCALAR_TO_VECTOR ↵	Simon Pilgrim	2019-03-16	1	-2/+2
\| \| \| \| \| \| \| \|	indices (PR41097) rL356292 reduces the size of scalar_to_vector if we know the upper bits are undef - which means that shuffles may find they are suddenly referencing scalar_to_vector elements other than zero - so make sure we handle this as undef. llvm-svn: 356327
*	[WebAssembly] Make rethrow take an except_ref type argument	Heejin Ahn	2019-03-16	2	-31/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In the new wasm EH proposal, `rethrow` takes an `except_ref` argument. This change was missing in r352598. This patch adds `llvm.wasm.rethrow.in.catch` intrinsic. This is an intrinsic that's gonna eventually be lowered to wasm `rethrow` instruction, but this intrinsic can appear only within a catchpad or a cleanuppad scope. Also this intrinsic needs to be invokable - otherwise EH pad successor for it will not be correctly generated in clang. This also adds lowering logic for this intrinsic in `SelectionDAGBuilder::visitInvoke`. This routine is basically a specialized and simplified version of `SelectionDAGBuilder::visitTargetIntrinsic`, but we can't use it because if is only for `CallInst`s. This deletes the previous `llvm.wasm.rethrow` intrinsic and related tests, which was meant to be used within a `__cxa_rethrow` library function. Turned out this needs some more logic, so the intrinsic for this purpose will be added later. LateEHPrepare takes a result value of `catch` and inserts it into matching `rethrow` as an argument. `RETHROW_IN_CATCH` is a pseudo instruction that serves as a link between `llvm.wasm.rethrow.in.catch` and the real wasm `rethrow` instruction. To generate a `rethrow` instruction, we need an `except_ref` argument, which is generated from `catch` instruction. But `catch` instrutions are added in LateEHPrepare pass, so we use `RETHROW_IN_CATCH`, which takes no argument, until we are able to correctly lower it to `rethrow` in LateEHPrepare. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59352 llvm-svn: 356316
*	[GlobalISel] Make isel verification checks of vregs run under NDEBUG only.	Amara Emerson	2019-03-16	1	-4/+4
\| \| \| \|	llvm-svn: 356309
*	[GlobalISel] Allow MachineIRBuilder to build subregister copies.	Amara Emerson	2019-03-15	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \|	This relaxes some asserts about sizes, and adds an optional subreg parameter to buildCopy(). Also update AArch64 instruction selector to use this in places where we previously used MachineInstrBuilder manually. Differential Revision: https://reviews.llvm.org/D59434 llvm-svn: 356304
*	[SelectionDAG] Add SimplifyDemandedBits handling for ISD::SCALAR_TO_VECTOR	Simon Pilgrim	2019-03-15	1	-0/+13
\| \| \| \| \| \|	Fixes a lot of constant folding mismatches between i686 and x86_64 llvm-svn: 356273
*	[CodeGenPrepare] avoid crashing from replacing a phi twice	Mikael Holmen	2019-03-15	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is a fix to bug 41052: https://bugs.llvm.org/show_bug.cgi?id=41052 While trying to optimize a memory instruction in a dead basic block, we end up registering the same phi for replacement twice. This patch avoids registering more than the first replacement candidate for a phi. Patch by: JesperAntonsson Reviewers: skatkov, aprantl Reviewed By: aprantl Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59358 llvm-svn: 356260
*	[CGP] add another bailout for degenerate code (PR41064)	Sanjay Patel	2019-03-14	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \|	This is almost the same as: rL355345 ...and should prevent any potential crashing from examples like: https://bugs.llvm.org/show_bug.cgi?id=41064 ...although the bug was masked by: rL355823 ...and I'm not sure how to repro the problem after that change. llvm-svn: 356218
*	MIR: Allow targets to serialize MachineFunctionInfo	Matt Arsenault	2019-03-14	4	-235/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This has been a very painful missing feature that has made producing reduced testcases difficult. In particular the various registers determined for stack access during function lowering were necessary to avoid undefined register errors in a large percentage of cases. Implement a subset of the important fields that need to be preserved for AMDGPU. Most of the changes are to support targets parsing register fields and properly reporting errors. The biggest sort-of bug remaining is for fields that can be initialized from the IR section will be overwritten by a default initialized machineFunctionInfo section. Another remaining bug is the machineFunctionInfo section is still printed even if empty. llvm-svn: 356215
*	Allow code motion (and thus folding) for atomic (but unordered) memory operands	Philip Reames	2019-03-14	1	-3/+1
\| \| \| \| \| \| \| \| \| \|	Building on the work done in D57601, now that we can distinguish between atomic and volatile memory accesses, go ahead and allow code motion of unordered atomics. As seen in the diffs, this allows much better folding of memory operations into using instructions. (Mostly done by the PeepholeOpt pass.) Note: I have not reviewed all callers of hasOrderedMemoryRef since one of them - isSafeToMove - is very widely used. I'm relying on the documented semantics of each method to judge correctness. Differential Revision: https://reviews.llvm.org/D59345 llvm-svn: 356170
*	Add IR debug info support for Elemental, Pure, and Recursive Procedures.	Adrian Prantl	2019-03-14	1	-0/+6
\| \| \| \| \| \| \| \|	Patch by Eric Schweitz! Differential Revision: https://reviews.llvm.org/D54043 llvm-svn: 356163
*	GlobalISel: Use multiple returns for intrinsic structs	Matt Arsenault	2019-03-14	2	-16/+9
\| \| \| \| \| \| \| \| \| \| \|	This is consistent with what SelectionDAG does and is much easier to work with than the extract sequence with an artificial wide register. For the AMDGPU control flow intrinsics, this was producing an s128 for the i64, i1 tuple return. Any legalization that should apply to a real s128 value would badly obscure the direct values that need to be seen. llvm-svn: 356147
*	[GlobalISel][Utils] Add a getConstantVRegVal variant that looks through instrs	Quentin Colombet	2019-03-14	2	-10/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	getConstantVRegVal used to only look for G_CONSTANT when looking at unboxing the value of a vreg. However, constants are sometimes not directly used and are hidden behind trunc, s\|zext or copy chain of computation. In particular this may be introduced by the legalization process that doesn't want to simplify these patterns because it can lead to infine loop when legalizing a constant. To circumvent that problem, add a new variant of getConstantVRegVal, named getConstantVRegValWithLookThrough, that allow to look through extensions. Differential Revision: https://reviews.llvm.org/D59227 llvm-svn: 356116