bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[X86] Resync Host.cpp with compiler-rt's cpu_model.c to enable 0x55 to be ↵	Craig Topper	2019-05-31	1	-5/+10
	identified as cascadelake when avx512vnni is detected. Some other formatting changes. llvm-svn: 362256
*	Revert "[CVP] Simplify non-overflowing saturating add/sub"	Nikita Popov	2019-05-31	1	-24/+1
	This reverts commit 1e692d1777ae34dcb93524b5798651a29defae09. Causes assertion failure in builtins-wasm.c clang test. llvm-svn: 362254
*	[MIR-Canon] Don't do vreg skip for independent instructions if there are none.	Puyan Lotfi	2019-05-31	1	-1/+2
	We don't want to create vregs if there is nothing to use them for. That causes verifier errors. Differential Revision: https://reviews.llvm.org/D62740 llvm-svn: 362247
*	[CVP] Simplify non-overflowing saturating add/sub	Nikita Popov	2019-05-31	1	-1/+24
	If we can determine that a saturating add/sub will not overflow based on range analysis, convert it into a simple binary operation. This is a sibling transform to the existing with.overflow handling. Differential Revision: https://reviews.llvm.org/D62703 llvm-svn: 362242
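The simplification described above rests on a range check: if the operand ranges guarantee that the exact result fits in the type, saturation can never trigger, so the plain operation yields the same value. A minimal C++ sketch of that reasoning for unsigned 8-bit addition follows. The names `uaddSat8` and `neverSaturates` are hypothetical helpers for illustration, not LLVM APIs, and the actual pass reasons over LLVM's value ranges rather than plain upper bounds.

```cpp
#include <cstdint>

// Unsigned 8-bit saturating add: clamps to 255 instead of wrapping.
inline uint8_t uaddSat8(uint8_t A, uint8_t B) {
  unsigned Sum = unsigned(A) + unsigned(B);
  return Sum > 255u ? uint8_t(255) : uint8_t(Sum);
}

// Hypothetical range check: given upper bounds on both operands, report
// whether the exact sum always fits in 8 bits. When it does, the
// saturating add can be replaced by a plain add without changing results.
inline bool neverSaturates(uint8_t MaxA, uint8_t MaxB) {
  return unsigned(MaxA) + unsigned(MaxB) <= 255u;
}
```

With both operands bounded by 100, `neverSaturates(100, 100)` holds and `uaddSat8` agrees with ordinary addition on every input in that range.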
*	Revert revert of r362112 with minor SystemZ test file corrections.	Kevin P. Neal	2019-05-31	2	-1/+63
	[FPEnv] Added a special UnrollVectorOp method to deal with the chain on StrictFP opcodes This change creates UnrollVectorOp_StrictFP. The purpose of this is to address a failure that consistently occurs when calling StrictFP functions on vectors whose number of elements is 3 + 2n on most platforms, such as PowerPC or SystemZ. The old UnrollVectorOp method does not expect that the vector that it will unroll will have a chain, so it has an assert that prevents it from running if this is the case. This new StrictFP version of the method deals with the chain while unrolling the vector. With this new function in place during vector widening, llc can run vector-constrained-fp-intrinsics.ll for SystemZ successfully. Submitted by: Drew Wock <drew.wock@sas.com> Reviewed by: Cameron McInally, Kevin P. Neal Approved by: Cameron McInally Differential Revision: https://reviews.llvm.org/D62546 llvm-svn: 362241
*	[AMDGPU] Use InliningThresholdMultiplier for inline hint	Stanislav Mekhanoshin	2019-05-31	1	-1/+2
	AMDGPU uses multiplier 9 for the inline cost. It is taken into account everywhere except for inline hint threshold. As a result we are penalizing functions with the inline hint making them less probable to be inlined than those without the hint. Defaults are 225 for a normal function and 325 for a function with an inline hint. Currently we have effective threshold 225 * 9 = 2025 for normal functions and just 325 for those with the hint. That is fixed by this patch. Differential Revision: https://reviews.llvm.org/D62707 llvm-svn: 362239
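The arithmetic in the message above can be written out as a short C++ sketch. The constant names are hypothetical, and the "after" value of 325 * 9 is an inference from the patch title (applying the multiplier to the hint threshold as well), not a figure quoted in the message.

```cpp
// Values quoted in the commit message.
constexpr unsigned DefaultThreshold = 225;   // normal function
constexpr unsigned HintThreshold = 325;      // function with an inline hint
constexpr unsigned ThresholdMultiplier = 9;  // AMDGPU inline cost multiplier

// Before the patch: the multiplier reached only the normal threshold.
constexpr unsigned EffectiveNormal = DefaultThreshold * ThresholdMultiplier;
constexpr unsigned EffectiveHintBefore = HintThreshold;

// After the patch (assumed): the multiplier also scales the hint threshold.
constexpr unsigned EffectiveHintAfter = HintThreshold * ThresholdMultiplier;

static_assert(EffectiveNormal == 2025, "matches the message: 225 * 9");
static_assert(EffectiveHintBefore < EffectiveNormal,
              "hinted functions were penalized before the patch");
static_assert(EffectiveHintAfter > EffectiveNormal,
              "hinted functions are favored again after the patch");
```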
*	[PPC] Correctly adjust branch probability in PPCReduceCRLogicals	Guozhi Wei	2019-05-31	1	-6/+35
	In PPCReduceCRLogicals after splitting the original MBB into 2, the 2 impacted branches still use the original branch probability. This is unreasonable. Suppose we have the following code, and the probability of each successor is 50%. condc = conda || condb br condc, label %target, label %fallthrough It can be transformed to the following, br conda, label %target, label %newbb newbb: br condb, label %target, label %fallthrough Since each branch has a probability of 50% to each successor, the total probability to %fallthrough is 25% now, and the total probability to %target is 75%. This actually changed the original profiling data. A more reasonable probability can be set to 70% to the false side for each branch instruction, so the total probability to %fallthrough is close to 50%. This patch assumes that the two incoming edges of a branch target have the same edge frequency, computes a new probability for each target, and keeps the total probability to the original targets unchanged. Differential Revision: https://reviews.llvm.org/D62430 llvm-svn: 362237
*	[MachinePipeliner][NFC] Add some debug log and statistics	Jinsong Ji	2019-05-31	1	-12/+71
	This is to add some log and statistics for debugging Differential Revision: https://reviews.llvm.org/D62165 llvm-svn: 362233
*	ftime-trace: Trace loop passes	Russell Gallop	2019-05-31	1	-0/+3
	These can take a significant amount of time in some builds. Suggested by Andrea Di Biagio. Differential Revision: https://reviews.llvm.org/D62666 llvm-svn: 362219
*	[InstCombine] 'C-(C2-X) --> X+(C-C2)' constant-fold	Roman Lebedev	2019-05-31	1	-1/+6
	It looks like this fold was already partially happening, indirectly via some other folds, but with one-use limitation. No other fold here has that restriction. https://rise4fun.com/Alive/ftR llvm-svn: 362217
*	[InstCombine] 'add (sub C1, X), C2 --> sub (add C1, C2), X' constant-fold	Roman Lebedev	2019-05-31	1	-1/+8
	https://rise4fun.com/Alive/qJQ llvm-svn: 362216
*	[AArch64][SVE2] Asm: support WHILE instructions	Cullen Rhodes	2019-05-31	2	-0/+41
	Summary: Patch adds support for the following instructions: * WHILEGE, WHILEGT, WHILEHS, WHILEHI, WHILEWR, WHILERW The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62601 llvm-svn: 362215
*	[AArch64][SVE2] Asm: support TBL/TBX instructions	Cullen Rhodes	2019-05-31	2	-7/+44
	Summary: A three sources variant of the TBL instruction is added to the existing SVE instruction in SVE2. This is implemented with minor changes to the existing TableGen class. TBX is a new instruction with its own definition. The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62600 llvm-svn: 362214
*	[AArch64][SVE2] Asm: support SVE2 store instructions	Cullen Rhodes	2019-05-31	2	-0/+47
	Summary: Patch adds support for the following instructions: * STNT1B, STNT1H, STNT1S, STNT1D The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62599 llvm-svn: 362213
*	[MIPS GlobalISel] Handle position independent code	Petar Avramovic	2019-05-31	5	-19/+79
	Handle position independent code for MIPS32. When callee is global address, lower call will emit callee as G_GLOBAL_VALUE and add target flag if needed. Support $gp in getRegBankFromRegClass(). Select G_GLOBAL_VALUE, specially handle case when there are target flags attached by lowerCall. Differential Revision: https://reviews.llvm.org/D62589 llvm-svn: 362210
*	[mips] Move initGlobalBaseReg to MipsFunctionInfo. NFC	Petar Avramovic	2019-05-31	4	-94/+93
	Move initGlobalBaseReg from MipsSEDAGToDAGISel to MipsFunctionInfo. This way functions used for handling position independent code during instruction selection, getGlobalBaseReg and initGlobalBaseReg, end up in same class. Differential Revision: https://reviews.llvm.org/D62586 llvm-svn: 362206
*	[InstructionSimplify] Add missing implementation of llvm::SimplifyUnOp. NFC	Craig Topper	2019-05-31	1	-0/+4
	There are no callers currently, but the function is declared so we should at least implement it. llvm-svn: 362205
*	[MIPS GlobalISel] Lower call for callee that is register	Petar Avramovic	2019-05-31	1	-6/+8
	Lower call for callee that is register for MIPS32. Register should contain callee function address. Differential Revision: https://reviews.llvm.org/D62585 llvm-svn: 362204
*	[X86] Remove patterns for X86VSintToFP/X86VUintToFP+loadv4f32 to v2f64.	Craig Topper	2019-05-31	2	-57/+20
	These patterns can incorrectly narrow a volatile load from 128-bits to 64-bits. Similar to PR42079. Switch to using (v4i32 (bitcast (v2i64 (scalar_to_vector (loadi64))))) as the load pattern used in the instructions. This probably still has issues in 32-bit mode where loadi64 isn't legal. Maybe we should use VZMOVL for widened loads even when we don't need the upper bits as zeroes? llvm-svn: 362203
*	[X86] Remove avx512 isel patterns for fpextend+load. Prefer to only match fp ↵	Craig Topper	2019-05-31	1	-11/+57
	extloads instead. DAG combine will usually fold fpextend+load to an fp extload anyway. So the 256 and 512 patterns were probably unnecessary. The 128 bit pattern was special in that it looked for a v4f32 load, but then used it in an instruction that only loads 64-bits. This is bad if the load happens to be volatile. We could probably make the patterns volatile aware, but that's more work for something that's probably rare. The peephole pass might kick in and save us anyway. We might also be able to fix this with some additional DAG combines. This also adds patterns for vselect+extload to enable masked vcvtps2pd to be used. Previously we looked for the unlikely vselect+fpextend+load. llvm-svn: 362199
*	[MIR-Canon] Skip the first N vreg names lazily.	Puyan Lotfi	2019-05-31	1	-9/+7
	This consolidates the vreg skip code into one function (SkipVRegs()). SkipVRegs() now knows if it should skip as if it is the first initialization or subsequent skips. The first skip is also done the first time createVirtualRegister is called by the cursor instead of by the cursor's constructor. This prevents verifier errors on machine functions that have no vregs (where the verifier will complain that there are vregs when the function uses none). Differential Revision: https://reviews.llvm.org/D62717 llvm-svn: 362195
*	[X86] Correct the ins operand order for MASKPAIR16STORE to match other store ↵	Craig Topper	2019-05-31	1	-4/+4
	instructions. This makes the 5 address operands come first. And the data operand comes last. This matches the operand order the instruction is created with. It's also the expected order in X86MCInstLower. So everything appeared to work, but the operands didn't match their declared type. Fixes a -verify-machineinstrs failure. Also remove the isel patterns from these instructions since they should only be used for stack spills and reloads. I'm not even sure what types the patterns were looking for to match. llvm-svn: 362193
*	[MIR-Canon] Hardening propagateLocalCopies.	Puyan Lotfi	2019-05-31	1	-4/+12
	This is an almost NFC, it does the following: - If there is no register class for a COPY's src or dst, bail. - Fixes uses iterator invalidation bug. Differential Revision: https://reviews.llvm.org/D62713 llvm-svn: 362191
*	[X86] Add VP2INTERSECT instructions	Pengfei Wang	2019-05-31	16	-0/+301
	Support Intel AVX512 VP2INTERSECT instructions in llvm Patch by Xiang Zhang (xiangzhangllvm) Differential Revision: https://reviews.llvm.org/D62366 llvm-svn: 362188
*	Fix -DBUILD_SHARED_LIBS=ON build after rL362160	Sam Clegg	2019-05-31	1	-0/+1
	Differential Revision: https://reviews.llvm.org/D62709 llvm-svn: 362180
*	[X86] Remove result type constraints from the ↵	Craig Topper	2019-05-30	1	-3/+3
	extloadv2f32/extloadv4f32/extloadv8f32 PatFrags. NFC The result types aren't mentioned in the pattern name so really shouldn't be in the PatFrags. The users of these either have their own type constraint or rely on the type constraint system to realize the only legal extend would be to f64. llvm-svn: 362175
*	MISched: Fix -misched-regpressure=0 if subreg liveness enabled	Matt Arsenault	2019-05-30	1	-1/+3
	Test is waiting on fixing several more crashes in the AMDGPU scheduler implementation with this. llvm-svn: 362174
*	[X86] Remove code that unnecessarily sets EXTLOAD with src type of ↵	Craig Topper	2019-05-30	1	-9/+0
	v2f32/v4f32/v8f32 as Legal for SSE2/AVX/AVX512 respectively. NFC The LoadExt table defaults to all combinations being Legal. For vector types, only src VTs with an i1 element type were ever changed. So we don't need to mark them legal manually. llvm-svn: 362170
*	[Remarks] Fix usage of enum class	Francis Visoiu Mistrih	2019-05-30	1	-1/+1
	Breaks the build on some compilers: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/9720/steps/build%20stage%201/logs/stdio llvm-svn: 362165
*	[Remarks][NFC] Move the serialization to lib/Remarks	Francis Visoiu Mistrih	2019-05-30	6	-155/+258
	Separate the remark serialization to YAML from the LLVM Diagnostics. This adds a new serialization abstraction: remarks::Serializer. It's completely independent from lib/IR and it provides an easy way to replace YAML by providing a new remarks::Serializer. Differential Revision: https://reviews.llvm.org/D62632 llvm-svn: 362160
*	[MIR-Canon] Fixing case where MachineFunction is empty.	Puyan Lotfi	2019-05-30	1	-0/+2
	In cases where the machine function is empty: bail on the RPO traversal. Differential Revision: https://reviews.llvm.org/D62617 llvm-svn: 362158
*	[DAGCombine] Limit 'hoist add/sub binop w/ constant op' to non-opaque consts	Roman Lebedev	2019-05-30	1	-6/+8
	I don't have a test case for these, but there is a test case for D62266 where, even after all the constant-folding patches, we still end up with endless combine loop. Which makes sense, since we don't constant fold for opaque constants. llvm-svn: 362156
*	[CVP] Generalize willNotOverflow(); NFC	Nikita Popov	2019-05-30	1	-5/+5
	Change argument from WithOverflowInst to BinaryOpIntrinsic, so this function can also be used for saturating math intrinsics. llvm-svn: 362152
*	[RuntimeDyld] Update reserveAllocationSpace to account for stub padding.	Lang Hames	2019-05-30	1	-1/+8
	This should fix the buildbot failures caused by r362139. llvm-svn: 362151
*	[InstCombine] Avoid use after free in DenseMap, when built with GCC	Martin Storsjo	2019-05-30	1	-1/+4
	Previously, this used a statement like this: Map[A] = Map[B]; This is equivalent to the following: const auto &Src = Map[B]; auto &Dest = Map[A]; Dest = Src; The second statement, "auto &Dest = Map[A];" can insert a new element into the DenseMap, which can potentially grow and reallocate the DenseMap's internal storage, which will invalidate the existing reference to the source. When doing the actual assignment, the Src reference is dereferenced, accessing memory that was freed when the DenseMap grew. This issue hasn't shown up when LLVM was built with Clang, because the right hand side ended up dereferenced before evaluating the left hand side. (If the value type is a larger data type, Clang doesn't do this but behaves like GCC.) With GCC, a cast to Value* isn't enough to make it dereference the right hand side reference before invoking operator[] (while that is enough to make Clang/LLVM do the right thing for larger types), but storing it in an intermediate variable in a separate statement works. This fixes PR42065. Differential Revision: https://reviews.llvm.org/D62624 llvm-svn: 362150
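The fix described above, reading the source value into a local before touching the destination slot, can be illustrated with a small C++ sketch. `TinyMap` is a hypothetical stand-in for any map whose `operator[]` may reallocate its storage on insertion; it is not `llvm::DenseMap`, and `copyEntry` is an illustrative helper rather than the code from the patch.

```cpp
#include <utility>
#include <vector>

// Hypothetical map with contiguous, reallocating storage.
struct TinyMap {
  std::vector<std::pair<int, int>> Slots;

  // Returns a reference to the value for Key, inserting 0 if absent.
  // The insertion may reallocate Slots and invalidate earlier references.
  int &operator[](int Key) {
    for (auto &S : Slots)
      if (S.first == Key)
        return S.second;
    Slots.emplace_back(Key, 0);
    return Slots.back().second;
  }
};

// Safe form of "Map[To] = Map[From]": the source is copied by value in its
// own statement, so no reference into the storage is live while the
// destination lookup possibly grows the map.
inline int copyEntry(TinyMap &Map, int From, int To) {
  int Tmp = Map[From];
  Map[To] = Tmp;
  return Tmp;
}
```

Writing `Map[To] = Map[From];` directly on such a type would leave the order of the two lookups to the compiler, which is the hazard the commit message describes.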
*	[DAGCombiner][X86][AArch64] (x - C) + y -> (x + y) - C fold. Try 2	Roman Lebedev	2019-05-30	1	-0/+8
	Summary: Only vector tests are being affected here, since subtraction by scalar constant is rewritten as addition by negated constant. No surprising test changes. https://rise4fun.com/Alive/pbT This is a recommit, originally committed in rL361852, but reverted to investigate test-suite compile-time hangs. Reviewers: RKSimon, craig.topper, spatel Reviewed By: RKSimon Subscribers: javed.absar, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62257 llvm-svn: 362146
*	[DAGCombine] (x - C) - y -> (x - y) - C fold. Try 3	Roman Lebedev	2019-05-30	1	-0/+7
	Summary: Again only vectors affected. Frustrating. Let me take a look into that.. https://rise4fun.com/Alive/AAq This is a recommit, originally committed in rL361852, but reverted to investigate test-suite compile-time hangs, and then reverted in rL362109 to fix missing constant folds that were causing endless combine loops. Reviewers: RKSimon, craig.topper, spatel Reviewed By: RKSimon Subscribers: javed.absar, JDevlieghere, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62294 llvm-svn: 362145
*	[DAGCombine][X86][AArch64][AMDGPU] (x - y) + -1 -> add (xor y, -1), x ↵	Roman Lebedev	2019-05-30	1	-0/+14
	fold. Try 3 Summary: This prevents regressions in next patch, and somewhat recovers from the regression to AMDGPU test in D62223. It is indeed not great that we leave vector decrement, don't transform it into vector add all-ones.. https://rise4fun.com/Alive/ZRl This is a recommit, originally committed in rL361852, but reverted to investigate test-suite compile-time hangs, and then reverted in rL362109 to fix missing constant folds that were causing endless combine loops. Reviewers: RKSimon, craig.topper, spatel, arsenm Reviewed By: RKSimon, arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, javed.absar, dstuttard, tpr, t-tye, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62263 llvm-svn: 362144
*	[DAGCombiner][X86][AArch64][SPARC][SystemZ] y - (x + C) -> (y - x) - C ↵	Roman Lebedev	2019-05-30	1	-0/+6
	fold. Try 3 Summary: Direct sibling of D62223 patch. While i don't have a direct motivational pattern for this, it would seem to make sense to handle both patterns (or none), for symmetry? The aarch64 changes look neutral; sparc and systemz look like improvement (one less instruction each); x86 changes - 32bit case improves, 64bit case shows that LEA no longer gets constructed, which may be because that whole test is `-mattr=+slow-lea,+slow-3ops-lea` https://rise4fun.com/Alive/ffh This is a recommit, originally committed in rL361852, but reverted to investigate test-suite compile-time hangs, and then reverted in rL362109 to fix missing constant folds that were causing endless combine loops. Reviewers: RKSimon, craig.topper, spatel, t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, jyknight, javed.absar, kristof.beyls, fedor.sergeev, jrtc27, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62252 llvm-svn: 362143
*	[DAGCombiner][X86][AArch64][AMDGPU] (x + C) - y -> (x - y) + C fold. Try 3	Roman Lebedev	2019-05-30	1	-0/+7
	Summary: The main motivation is shown by all these `neg` instructions that are now created. In particular, the `@reg32_lshr_by_negated_unfolded_sub_b` test. AArch64 test changes all look good (`neg` created), or neutral. X86 changes look neutral (vectors), or good (`neg` / `xor eax, eax` created). I'm not sure about `X86/ragreedy-hoist-spill.ll`, it looks like the spill is now hoisted into preheader (which should still be good?), 2 4-byte reloads become 1 8-byte reload, and are elsewhere, but i'm not sure how that affects that loop. I'm unable to interpret AMDGPU change, looks neutral-ish? This is hopefully a step towards solving [[ https://bugs.llvm.org/show_bug.cgi?id=41952 | PR41952 ]]. https://rise4fun.com/Alive/pkdq (we are missing more patterns, i'll submit them later) This is a recommit, originally committed in rL361852, but reverted to investigate test-suite compile-time hangs, and then reverted in rL362109 to fix missing constant folds that were causing endless combine loops. Reviewers: craig.topper, RKSimon, spatel, arsenm Reviewed By: RKSimon Subscribers: bjope, qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, javed.absar, dstuttard, tpr, t-tye, kristof.beyls, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62223 llvm-svn: 362142
*	[RuntimeDyld] Apply padding and alignment bumps to all sections with stubs, and	Lang Hames	2019-05-30	2	-7/+6
	increase the MachO/x86-64 stub alignment to 8. Stub alignment should be guaranteed for any section containing RuntimeDyld stubs/GOT-entries. To do this we should pad and align all sections containing stubs, not just code sections. This commit also bumps the MachO/x86-64 stub alignment to 8, so that GOT entries will be aligned. llvm-svn: 362139
*	AMDGPU/GlobalISel: Add wave scratch offset argument	Matt Arsenault	2019-05-30	1	-0/+42
	Avoids crashing in PEI in a future change. llvm-svn: 362136
*	[DAGCombine] ((c1-A)-c2) -> ((c1-c2)-A) constant-fold	Roman Lebedev	2019-05-30	1	-0/+10
	Summary: https://rise4fun.com/Alive/B0A Reviewers: t.p.northover, RKSimon, spatel, craig.topper Reviewed By: RKSimon Subscribers: javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62691 llvm-svn: 362135
*	[DAGCombine] (A-C1)-C2 -> A-(C1+C2) constant-fold	Roman Lebedev	2019-05-30	1	-0/+10
	Summary: https://rise4fun.com/Alive/Mb1M Reviewers: RKSimon, craig.topper, spatel, t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62689 llvm-svn: 362134
*	[DAGCombine] (A+C1)-C2 -> A+(C1-C2) constant-fold	Roman Lebedev	2019-05-30	1	-0/+10
	Summary: Direct sibling of D62662, the root cause of the endless combine loop in D62257 https://rise4fun.com/Alive/d3W Reviewers: RKSimon, craig.topper, spatel, t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62664 llvm-svn: 362133
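The three constant folds listed above, ((c1-A)-c2), (A-C1)-C2 and (A+C1)-C2, are plain algebraic identities under wraparound arithmetic. A minimal C++ sketch that checks them on 32-bit unsigned values follows; the function names are hypothetical, and this is only a sanity check of the algebra, not the DAGCombiner code or a substitute for the linked Alive proofs.

```cpp
#include <cstdint>

// Each function returns true when the folded form equals the original
// expression under 32-bit modular (wraparound) arithmetic.

// (A + C1) - C2  ->  A + (C1 - C2)
inline bool foldAddThenSub(uint32_t A, uint32_t C1, uint32_t C2) {
  return (A + C1) - C2 == A + (C1 - C2);
}

// (A - C1) - C2  ->  A - (C1 + C2)
inline bool foldSubThenSub(uint32_t A, uint32_t C1, uint32_t C2) {
  return (A - C1) - C2 == A - (C1 + C2);
}

// (C1 - A) - C2  ->  (C1 - C2) - A
inline bool foldConstMinusThenSub(uint32_t A, uint32_t C1, uint32_t C2) {
  return (C1 - A) - C2 == (C1 - C2) - A;
}
```

Because unsigned arithmetic wraps modulo 2^32, the identities hold even when an intermediate constant such as `C1 - C2` underflows.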
*	[DAGCombine] Use FoldConstantArithmetic() to perform C2-(A+C1) -> (C2-C1)-A fold	Roman Lebedev	2019-05-30	1	-1/+3
	Summary: No tests change, and i'm not sure how to test this, but it's better safe than sorry. Reviewers: spatel, RKSimon, craig.topper, t.p.northover Reviewed By: craig.topper Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62663 llvm-svn: 362132
*	[DAGCombine] ((A-c1)+c2) -> (A+(c2-c1)) constant-fold	Roman Lebedev	2019-05-30	1	-0/+9
	Summary: This was the root cause of the endless combine loop in D62257 https://rise4fun.com/Alive/d3W Reviewers: RKSimon, spatel, craig.topper, t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62662 llvm-svn: 362131
*	[DAGCombine] Use FoldConstantArithmetic() to perform ((c1-A)+c2) -> ↵	Roman Lebedev	2019-05-30	1	-4/+4
	(c1+c2)-A fold Summary: No tests change, and i'm not sure how to test this, but it's better safe than sorry. Reviewers: spatel, RKSimon, craig.topper, t.p.northover Reviewed By: craig.topper Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62661 llvm-svn: 362130
*	Reapply: IR: add optional type to 'byval' function parameters	Tim Northover	2019-05-30	15	-27/+294
	When we switch to opaque pointer types we will need some way to describe how many bytes a 'byval' parameter should occupy on the stack. This adds a (for now) optional extra type parameter. If present, the type must match the pointee type of the argument. The original commit did not remap byval types when linking modules, which broke LTO. This version fixes that. Note to front-end maintainers: if this causes test failures, it's probably because the "byval" attribute is printed after attributes without any parameter after this change. llvm-svn: 362128
*	[AMDGPU] Added target-specific attribute amdgpu-max-memory-clause	Tim Renouf	2019-05-30	1	-1/+3
	With LLPC, previous investigation has suggested that si-scheduler interacts badly with SIFormMemoryClauses on an XNACK target in some games. That needs further investigation in the future. In the meantime, this commit adds a target-specific attribute to allow us to disable SIFormMemoryClauses by setting it to 1 on a per-function basis for LLPC to use. Differential Revision: https://reviews.llvm.org/D62572 Change-Id: Ia0ca12ce79093cbbe86caded723ffb13384ede92 llvm-svn: 362127