bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix for lld buildbot	Sam Parker	2019-06-07	1	-2/+1
\| \| \| \| \| \|	Removed unused (in non-debug builds) variable. llvm-svn: 362775
*	[CodeGen] Generic Hardware Loop Support	Sam Parker	2019-06-07	11	-580/+805
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Patch which introduces a target-independent framework for generating hardware loops at the IR level. Most of the code has been taken from PowerPC CTRLoops and PowerPC has been ported over to use this generic pass. The target dependent parts have been moved into TargetTransformInfo, via isHardwareLoopProfitable, with HardwareLoopInfo introduced to transfer information from the backend. Three generic intrinsics have been introduced: - void @llvm.set_loop_iterations Takes, as a single operand, the number of iterations to be executed. - i1 @llvm.loop_decrement(anyint) Takes the maximum number of elements processed in an iteration of the loop body and subtracts this from the total count. Returns false when the loop should exit. - anyint @llvm.loop_decrement_reg(anyint, anyint) Takes the number of elements remaining to be processed as well as the maximum number of elements processed in an iteration of the loop body. Returns the updated number of elements remaining. llvm-svn: 362774
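A minimal sketch of how the three intrinsics described in r362774 fit together, modelled in Python rather than IR. The `HardwareLoopCounter` class, `run_loop`, and the function names are hypothetical stand-ins for the intrinsics, and the exit test (keep looping while the remaining count is non-zero) is an assumption that the commit message does not spell out.

```python
class HardwareLoopCounter:
    """Toy model of a hardware loop counter (hypothetical, for illustration)."""

    def __init__(self):
        self.remaining = 0

    def set_loop_iterations(self, count):
        # Models @llvm.set_loop_iterations: load the total iteration count.
        self.remaining = count

    def loop_decrement(self, step):
        # Models @llvm.loop_decrement: subtract `step` from the counter and
        # report whether the loop should keep running.
        # Assumption: "keep running" means the remaining count is non-zero.
        self.remaining -= step
        return self.remaining != 0


def loop_decrement_reg(remaining, step):
    # Models @llvm.loop_decrement_reg: return the updated remaining count.
    return remaining - step


def run_loop(trip_count):
    # Executes a do-while style loop body `trip_count` times (trip_count >= 1).
    counter = HardwareLoopCounter()
    counter.set_loop_iterations(trip_count)
    executed = 0
    while True:
        executed += 1  # stands in for the loop body
        if not counter.loop_decrement(1):
            break
    return executed
```

The register form is the variant to reach for when the remaining count has to be visible as a value, for example when each iteration processes several elements at once.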
*	[AVR] Expand 16-bit rotations during the legalization stage	Dylan McKay	2019-06-07	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In r356860, the legalization logic for BSWAP was modified to use ISD::ROTL, rather than the old ISD::{SHL, SRL, OR} nodes. This works fine on AVR for 8-bit rotations, but 16-bit rotations are currently unimplemented - they always trigger an assertion error in the AVRExpandPseudoInsts pass ("RORW unimplemented"). This patch instructs the legalizer to expand 16-bit rotations into the SHL, SRL, OR pattern it used previously. This fixes the 'issue-cannot-select-bswap.ll' test. Interestingly, this test failure seems flaky - it passes successfully on the avr-build-01 buildbot, but fails locally on my Arch Linux install. llvm-svn: 362773
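To make the r362773 expansion concrete, here is a small Python sketch showing that a 16-bit byte swap written as a rotate by 8 gives the same result as the shift-and-or pattern. The function names are illustrative only and do not correspond to anything in the AVR backend.

```python
def rotl16(value, amount):
    # Rotate a 16-bit value left by `amount` bits.
    value &= 0xFFFF
    amount %= 16
    return ((value << amount) | (value >> (16 - amount))) & 0xFFFF


def bswap16_shifts(value):
    # Byte swap expressed with the SHL, SRL, OR pattern.
    value &= 0xFFFF
    return ((value << 8) | (value >> 8)) & 0xFFFF


def expansions_agree():
    # Exhaustively compare both forms over every 16-bit input.
    return all(rotl16(v, 8) == bswap16_shifts(v) for v in range(0x10000))
```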
*	[MC][ELF] Don't create relocations with section symbols for STB_LOCAL ifunc	Fangrui Song	2019-06-07	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \|	We should keep the symbol type (STT_GNU_IFUNC) for a local ifunc because it may result in an IRELATIVE reloc that the dynamic loader will use to resolve the address at startup time. There is another problem that is not fixed by this patch: a PC relative relocation should also create a relocation with the ifunc symbol. llvm-svn: 362767
*	[LV] Fix -Wunused-function after r362736	Fangrui Song	2019-06-07	1	-0/+2
\| \| \| \|	llvm-svn: 362762
*	AMDGPU: Don't count mask branch pseudo towards skip threshold	Matt Arsenault	2019-06-07	1	-10/+8
\| \| \| \|	llvm-svn: 362761
*	AMDGPU: Insert skips for blocks with FLAT	Matt Arsenault	2019-06-07	1	-1/+2
\| \| \| \| \| \| \|	This already forced a skip for VMEM, so it should also be done for flat. I'm somewhat skeptical about the benefit of this though. llvm-svn: 362760
*	[PowerPC] Exploit the vector min/max instructions	Nemanja Ivanovic	2019-06-06	3	-0/+65
\| \| \| \| \| \| \| \| \| \|	Use the PPC vector min/max instructions for computing the corresponding operation as these should be faster than the compare/select sequences we currently emit. Differential revision: https://reviews.llvm.org/D47332 llvm-svn: 362759
*	AMDGPU: Insert skip branches over return blocks	Matt Arsenault	2019-06-06	2	-3/+4
\| \| \| \| \| \| \| \| \| \|	SIInsertSkips really doesn't understand the control flow, and makes very stupid assumptions about the block layout. This was able to get away with not skipping return blocks, since usually after structurization there is only one placed at the end of the function. Tail duplication can break this assumption. llvm-svn: 362754
*	[DebugInfo] Incorrect debug info record generated for loop counter.	Alexey Lapshin	2019-06-06	1	-19/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	An incorrect debug variable range was calculated during the "COMPUTING LIVE DEBUG VARIABLES" stage. The range for the debug variable "i" is computed according to the current state of the instructions inside the basic block, but the register allocator creates new instructions which were not taken into account when the live debug variables were computed. As a result, the DBG_VALUE instruction for the "i" variable was put after these newly inserted instructions. This is incorrect: the debug value for the loop counter should be inserted before any loop instruction. Differential Revision: https://reviews.llvm.org/D62650 llvm-svn: 362750
*	[AMDGPU] Partial revert for the ba447bae7448435c9986eece0811da1423972fdd	Alexander Timofeev	2019-06-06	4	-163/+107
\| \| \| \| \| \| \| \| \| \| \| \|	"Divergence driven ISel. Assign register class for cross block values according to the divergence." That commit exposed a design flaw leading to several issues that need to be solved first. This change reverts the AMDGPU-specific changes and leaves the common part unaffected. llvm-svn: 362749
*	[X86] Make a bunch of merge masked binops commutable for load folding.	Craig Topper	2019-06-06	1	-8/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	This primarily affects add/fadd/mul/fmul/and/or/xor/pmuludq/pmuldq/max/min/fmaxc/fminc/pmaddwd/pavg. We already commuted the unmasked and zero masked versions. I've added 512-bit stack folding tests for most of the instructions affected. I've tested needing commuting and not commuting across unmasked, merged masked, and zero masked. The 128/256 bit instructions should behave similarly. llvm-svn: 362746
*	[CFLGraph] Add support for unary fneg instruction.	Craig Topper	2019-06-06	1	-0/+10
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D62791 llvm-svn: 362737
*	[LV] Wrap LV illegality reporting in a function. NFC.	Renato Golin	2019-06-06	1	-100/+120
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A function for loop vectorization illegality reporting has been introduced: void LoopVectorizationLegality::reportVectorizationFailure( const StringRef DebugMsg, const StringRef OREMsg, const StringRef ORETag, Instruction * const I) const; The function prints a debug message when the debug for the compilation unit is enabled as well as invokes the optimization report emitter to generate a message with a specified tag. The function doesn't cover any complicated logic when a custom lambda should be passed to the emitter, only generating a message with a tag is supported. The function always prints the instruction `I` after the debug message whenever the instruction is specified, otherwise the debug message ends with a dot: 'LV: Not vectorizing: Disabled/already vectorized.' Patch by Pavel Samolysov <samolisov@gmail.com> llvm-svn: 362736
*	[AIX] Implement function descriptor on SDAG	Jason Liu	2019-06-06	7	-20/+82
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: (1) Function descriptor on AIX On AIX, a called routine may have 2 distinct symbols associated with it: * A function descriptor (Name) * A function entry point (.Name) The descriptor structure on AIX is the same as that in the ELF V1 ABI: * The address of the entry point of the function. * The TOC base address for the function. * The environment pointer. The descriptor symbol uses the same name as the source level function in C. The function entry point is analogous to the symbol we would generate for a function in a non-descriptor-based ABI, except that it is renamed by prepending a ".". Which symbol gets referenced depends on the context: * Taking the address of the function references the descriptor symbol. * Calling the function references the entry point symbol. (2) As for the implementation on AIX, for a direct function call target we create the proper MCSymbol SDNode (e.g. ".foo") while constructing the SDAG, replacing the original TargetGlobalAddress SDNode. Further down the path, we can take advantage of this MCSymbol. Patch by: Xiangling_L Reviewed by: sfertile, hubert.reinterpretcast, jasonliu, syzaara Differential Revision: https://reviews.llvm.org/D62532 llvm-svn: 362735
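The symbol selection rule in r362735 can be sketched in a few lines of Python. `FunctionDescriptor` and `referenced_symbol` are hypothetical names chosen for this illustration; only the three descriptor fields and the "." prefix rule come from the commit message.

```python
from dataclasses import dataclass


@dataclass
class FunctionDescriptor:
    # The three words of the descriptor, in the order the message lists them.
    entry_point: int   # address of the function's entry point
    toc_base: int      # TOC base address for the function
    environment: int   # environment pointer


def referenced_symbol(name, is_call):
    # Calls reference the entry point symbol (".Name");
    # taking the address references the descriptor symbol ("Name").
    return "." + name if is_call else name
```

For a C function `foo`, a direct call would therefore reference `.foo`, while `&foo` would reference `foo`.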
*	[InlineCost] Add support for unary fneg.	Craig Topper	2019-06-06	1	-0/+23
\| \| \| \| \| \| \| \| \| \|	This adds support for unary fneg based on the implementation of BinaryOperator without the soft float FP cost. Previously we would just delegate to visitUnaryInstruction. I think the only real change is that we will pass the FastMath flags to SimplifyFNeg now. Differential Revision: https://reviews.llvm.org/D62699 llvm-svn: 362732
*	[LoopPred] Fix a bug in unconditional latch bailout introduced in r362284	Philip Reames	2019-06-06	1	-2/+2
\| \| \| \| \| \|	This is a really silly bug that even a simple test w/an unconditional latch would have caught. I tried to guard against the case, but put it in the wrong if check. Oops. llvm-svn: 362727
*	[DAGCombine] MergeConsecutiveStores - improve non-temporal load\store ↵	Simon Pilgrim	2019-06-06	1	-7/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	handling (PR42123) This patch is the first step towards ensuring MergeConsecutiveStores correctly handles non-temporal loads\stores: 1 - When merging load\stores we must ensure that they all have the same non-temporal flag. This is unlikely to occur, but can in strange cases where we're storing at the end of one page and the beginning of another. 2 - The merged load\store node must retain the non-temporal flag. Differential Revision: https://reviews.llvm.org/D62910 llvm-svn: 362723
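The two rules from r362723 can be illustrated with a toy model in Python. `MemAccess` and `merge_consecutive` are hypothetical and much simpler than the DAGCombine logic; the sketch only shows that accesses with mixed non-temporal flags are rejected and that a merged access keeps the shared flag.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MemAccess:
    offset: int         # byte offset from a common base
    size: int           # access width in bytes
    non_temporal: bool  # non-temporal hint


def merge_consecutive(accesses):
    # Return one merged access, or None when merging is not allowed.
    if not accesses:
        return None
    # Rule 1: every access must carry the same non-temporal flag.
    if len({a.non_temporal for a in accesses}) != 1:
        return None
    ordered = sorted(accesses, key=lambda a: a.offset)
    # The accesses must be back to back in memory.
    for prev, cur in zip(ordered, ordered[1:]):
        if prev.offset + prev.size != cur.offset:
            return None
    # Rule 2: the merged access retains the non-temporal flag.
    total = sum(a.size for a in ordered)
    return MemAccess(ordered[0].offset, total, ordered[0].non_temporal)
```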
*	Remove unused PPC.h includes under llvm/lib/Target/PowerPC.	Dmitri Gribenko	2019-06-06	3	-4/+1
\| \| \| \|	llvm-svn: 362718
*	[X86] Make masked floating point equality/ordered compares commutable for ↵	Craig Topper	2019-06-06	2	-7/+17
\| \| \| \| \| \| \| \|	load folding purposes. Same as what is supported for the unmasked form. llvm-svn: 362717
*	[DA] Add an option to control delinearization validity checks	Whitney Tsang	2019-06-06	1	-10/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Dependence Analysis performs static checks to confirm validity of delinearization. These checks often fail for 64-bit targets due to type conversions and integer wrapping that prevent simplification of the SCEV expressions. These checks would also fail at compile-time if the lower bound of the loops are compile-time unknown. For example: void foo(int n, int m, int a[][m]) { for (int i = 0; i < n; ++i) for (int j = 0; j < m; ++j) { a[i][j] = a[i+1][j-2]; } } opt -mem2reg -instcombine -indvars -loop-simplify -loop-rotate -inline -pass-remarks=.* -debug-pass=Arguments -da-permissive-validity-checks=false k3.ll -analyze -da will produce the following by default: da analyze - anti [* *\|<]! but will produce the following expected dependence vector if the validity checks are disabled: da analyze - consistent anti [1 -2]! This revision will introduce a debug option that will leave the validity checks in place by default, but allow them to be turned off. New tests are added for cases where it cannot be proven at compile-time that the individual subscripts stay in-bound with respect to a particular dimension of an array. These tests enable the option to provide user guarantee that the subscripts do not over/under-flow into other dimensions, thereby producing more accurate dependence vectors. For prior discussion on this topic, leading to this change, please see the following thread: http://lists.llvm.org/pipermail/llvm-dev/2019-May/132372.html Reviewers: Meinersbur, jdoerfert, kbarton, dmgreen, fhahn Reviewed By: Meinersbur, jdoerfert, dmgreen Subscribers: fhahn, hiraditya, javed.absar, llvm-commits, Whitney, etiotto Tag: LLVM Differential Revision: https://reviews.llvm.org/D62610 llvm-svn: 362711
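A brute-force Python sketch of why the validity checks in r362711 matter for the `a[i][j] = a[i+1][j-2]` example. It is not how Dependence Analysis works; `dependence_distances` is a hypothetical helper that simply enumerates iterations. With subscripts assumed to stay inside their own dimension, the only distance is (1, -2); with linearized addresses, `j-2` can underflow into the previous row and a second distance appears.

```python
def dependence_distances(n, m, linearized):
    """Collect (di, dj) distances from each read a[i+1][j-2] to the
    iteration that writes the same element through a[i][j]."""

    def key(row, col):
        # Linearized view: row * m + col, so col may spill into other rows.
        # Delinearized view: (row, col) pairs, assumed to stay in bounds.
        return row * m + col if linearized else (row, col)

    writer = {key(i, j): (i, j) for i in range(n) for j in range(m)}
    distances = set()
    for i in range(n):
        for j in range(m):
            target = writer.get(key(i + 1, j - 2))
            if target is not None:
                distances.add((target[0] - i, target[1] - j))
    return distances
```

For `n = 4, m = 5` the delinearized view yields only the expected vector, while the linearized view also reports (0, m - 2), which is the kind of case the default checks guard against.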
*	[AIX] Implement call lowering with parameters could pass onto GPRs	Jason Liu	2019-06-06	2	-15/+83
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch implements SDAG call lowering on AIX for functions which only have parameters that could fit into GPRs. Reviewers: hubert.reinterpretcast, syzaara Differential Revision: https://reviews.llvm.org/D62823 llvm-svn: 362708
*	FileCheck [6/12]: Introduce numeric variable definition	Thomas Preud'homme	2019-06-06	1	-124/+283
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch is part of a patch series to add support for FileCheck numeric expressions. This specific patch introduces support for defining a numeric variable in a CHECK directive. This commit introduces support for defining a numeric variable from a literal value in the input text. Numeric expressions can then use the variable provided it is on a later line. Copyright: - Linaro (changes up to diff 183612 of revision D55940) - GraphCore (changes in later versions of revision D55940 and in new revision created off D55940) Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield Tags: #llvm Differential Revision: https://reviews.llvm.org/D60386 llvm-svn: 362705
*	[AArch64] Handle ISD::LRINT and ISD::LLRINT for float16	Adhemerval Zanella	2019-06-06	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \|	This patch is a follow up for D62018 to add lrint/llrint support for float16. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D62863 llvm-svn: 362700
*	Revert "[SCEV] Use wrap flags in InsertBinop"	Benjamin Kramer	2019-06-06	1	-31/+18
\| \| \| \| \| \|	This reverts commit r362687. Miscompiles llvm-profdata during selfhost. llvm-svn: 362699
*	[AArch64] Handle ISD::LROUND and ISD::LLROUND for float16	Adhemerval Zanella	2019-06-06	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \|	This patch is a follow up for D61391 to add lround/llround support for float16. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D62861 llvm-svn: 362698
*	Include what you use in LanaiAsmParser.cpp	Dmitri Gribenko	2019-06-06	1	-1/+0
\| \| \| \|	llvm-svn: 362696
*	[DAGCombine] Cleanup isNegatibleForFree/GetNegatedExpression. NFCI.	Simon Pilgrim	2019-06-06	1	-20/+21
\| \| \| \| \| \|	Prep work for PR42105 - clang-format, use auto for cast and merge nested if()s llvm-svn: 362695
*	[MIPS GlobalISel] Select sqrt	Petar Avramovic	2019-06-06	2	-2/+3
\| \| \| \| \| \| \| \|	Select G_FSQRT for MIPS32. Differential Revision: https://reviews.llvm.org/D62905 llvm-svn: 362692
*	[MIPS GlobalISel] Select fabs	Petar Avramovic	2019-06-06	3	-2/+13
\| \| \| \| \| \| \| \|	Select G_FABS for MIPS32. Differential Revision: https://reviews.llvm.org/D62903 llvm-svn: 362690
*	[MIPS GlobalISel] Select fpext and fptrunc	Petar Avramovic	2019-06-06	2	-0/+14
\| \| \| \| \| \| \| \|	Select G_FPEXT and G_FPTRUNC for MIPS32. Differential Revision: https://reviews.llvm.org/D62902 llvm-svn: 362689
*	[MIPS GlobalISel] Select floor and ceil	Petar Avramovic	2019-06-06	2	-1/+12
\| \| \| \| \| \| \| \|	Select G_FFLOOR and G_FCEIL for MIPS32. Differential Revision: https://reviews.llvm.org/D62901 llvm-svn: 362688
*	[SCEV] Use wrap flags in InsertBinop	Sam Parker	2019-06-06	1	-18/+31
\| \| \| \| \| \| \| \| \| \|	If the given SCEVExpr has no (un)signed flags attached to it, transfer these to the resulting instruction or use them to find an existing instruction. Differential Revision: https://reviews.llvm.org/D61934 llvm-svn: 362687
*	[AArch64][GlobalISel] Add manual selection support for G_ZEXTLOADs to s64.	Amara Emerson	2019-06-06	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \|	We already get support for G_ZEXTLOAD to s32 from the importer, but it can't deal with the SUBREG_TO_REG in the pattern. Tweaking the existing manual selection code for G_LOAD to handle an additional SUBREG_TO_REG when dealing with G_ZEXTLOAD isn't much work. Also add tests to check the imported pattern selections to s32 work. llvm-svn: 362681
*	[AArch64][GlobalISel] Add the new changes to fix PR42129 that were supposed ↵	Amara Emerson	2019-06-06	1	-0/+5
\| \| \| \| \| \| \| \|	to go into r362666. The changes weren't staged so ended up just re-committing the unmodified reverted change. llvm-svn: 362677
*	[X86] Don't turn avx masked.load with constant mask into masked.load+vselect ↵	Craig Topper	2019-06-06	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	when passthru value is all zeroes. This is intended to enable the use of an immediate blend or more optimal instruction. But if the passthru is zero we don't need any additional instructions. llvm-svn: 362675
*	Revert "Revert "[AArch64][GlobalISel] Optimize G_FCMP + G_SELECT pairs when ↵	Amara Emerson	2019-06-05	1	-8/+96
\| \| \| \| \| \| \| \| \| \| \| \|	G_SELECT is fp"" When looking through copies, make sure to not try to find the vreg def of a physreg. Normally getVRegDef will return nullptr in this case, but if there happens to be multiple defs then it will assert. This fixes PR42129. llvm-svn: 362666
*	AMDGPU: Don't fix emergency stack slot at offset 0	Matt Arsenault	2019-06-05	2	-26/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This forced the caller to be aware of this, which is an ugly ABI feature. Partially reverts r295877. The original reasons for doing this are mostly fixed. Alloca is now in a non-0 address space, so it should be OK to have 0 as a valid pointer. Since we treat the absolute address as the pointer value, this part only really needed to apply to kernels. Since r357093, we avoid the need to increment/decrement the offset register in more cases, and since r354816 the scavenger can fail without spilling, so it's less critical that we try to avoid an offset that fits in the MUBUF offset. Restrict to callable functions for now to split this into 2 steps to limit the number of test updates and in case anything breaks. llvm-svn: 362665
*	[MSAN] Add unary FNeg visitor to the MemorySanitizer	Cameron McInally	2019-06-05	1	-0/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D62909 llvm-svn: 362664
*	Allow target to handle STRICT floating-point nodes	Ulrich Weigand	2019-06-05	20	-207/+332
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The ISD::STRICT_ nodes used to implement the constrained floating-point intrinsics are currently never passed to the target back-end, which makes it impossible to handle them correctly (e.g. mark instructions as depending on a floating-point status and control register, or mark instructions as possibly trapping). This patch allows the target to use setOperationAction to switch the action on ISD::STRICT_ nodes to Legal. If this is done, the SelectionDAG common code will stop converting the STRICT nodes to regular floating-point nodes, but instead pass the STRICT nodes to the target using normal SelectionDAG matching rules. To avoid having the back-end duplicate all the floating-point instruction patterns to handle both strict and non-strict variants, we make the MI codegen explicitly aware of the floating-point exceptions by introducing two new concepts: - A new MCID flag "mayRaiseFPException" that the target should set on any instruction that possibly can raise FP exception according to the architecture definition. - A new MI flag FPExcept that CodeGen/SelectionDAG will set on any MI instruction resulting from expansion of any constrained FP intrinsic. Any MI instruction that is both marked as mayRaiseFPException and FPExcept then needs to be considered as raising exceptions by MI-level codegen (e.g. scheduling). Setting those two new flags is straightforward. The mayRaiseFPException flag is simply set via TableGen by marking all relevant instruction patterns in the .td files. The FPExcept flag is set in SDNodeFlags when creating the STRICT_ nodes in the SelectionDAG, and gets inherited in the MachineSDNode nodes created from it during instruction selection. The flag is then transferred to an MIFlag when creating the MI from the MachineSDNode. This is handled just like fast-math flags like no-nans are handled today. 
This patch includes both common code changes required to implement the new features, and the SystemZ implementation. Reviewed By: andrew.w.kaylor Differential Revision: https://reviews.llvm.org/D55506 llvm-svn: 362663
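The two-flag rule from r362663 reduces to a single conjunction, sketched here in Python. The `MachineInstr` dataclass, its field names, and the opcode strings are hypothetical simplifications for illustration and are not LLVM's actual classes.

```python
from dataclasses import dataclass


@dataclass
class MachineInstr:
    opcode: str
    may_raise_fp_exception: bool  # per-opcode property (the MCID flag)
    fp_except: bool               # per-instruction property (the MI flag)


def raises_fp_exception(mi):
    # MI-level codegen treats an instruction as raising FP exceptions only
    # when the opcode can raise one AND this instance came from a
    # constrained FP intrinsic.
    return mi.may_raise_fp_exception and mi.fp_except
```

Splitting the decision this way is what lets a single instruction pattern serve both strict and non-strict code: the opcode-level flag is static, and only the per-instruction flag varies.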
*	Revert "[AArch64][GlobalISel] Optimize G_FCMP + G_SELECT pairs when G_SELECT ↵	Petr Hosek	2019-06-05	1	-96/+8
\| \| \| \| \| \| \| \|	is fp" This reverts commit r362435 as this triggers ICE, see PR42129 for details. llvm-svn: 362662
*	AMDGPU: Invert frame index offset interpretation	Matt Arsenault	2019-06-05	9	-209/+218
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since the beginning, the offset of a frame index has been consistently interpreted backwards. It was treating it as an offset from the scratch wave offset register as a frame register. The correct interpretation is the offset from the SP on entry to the function, before the prolog. Frame index elimination then should select either SP or another register as an FP. Treat the scratch wave offset on kernel entry as the pre-incremented SP. Rely more heavily on the standard hasFP and frame pointer elimination logic, and clean up the private reservation code. This saves a copy in most callee functions. The kernel prolog emission code is still kind of a mess relying on checking the uses of physical registers, which I would prefer to eliminate. Currently selection directly emits MUBUF instructions, which require using a reference to some register. Use the register chosen for SP, and then ignore this later. This should probably be cleaned up to use pseudos that don't refer to any specific base register until frame index elimination. Add a workaround for shaders using large numbers of SGPRs. I'm not sure these cases were ever working correctly, since as far as I can tell the logic for figuring out which SGPR is the scratch wave offset doesn't match up with the shader input initialization in the shader programming guide. llvm-svn: 362661
*	[CallSite removal] Refactoring llvm::InlineFunction APIs	Mircea Trofin	2019-06-05	1	-8/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change only unifies the previous API pair accepting CallInst and InvokeInst, thus making it easier to refactor inliner pass code to CallBase. The implementation of the unified API still relies on the CallSite implementation. Reviewers: eraman, chandlerc, jdoerfert Reviewed By: jdoerfert Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62283 llvm-svn: 362656
*	[InstCombine] simplify code for bitcast of insertelement; NFC	Sanjay Patel	2019-06-05	1	-5/+4
\| \| \| \|	llvm-svn: 362655
*	NewGVN: Handle addrspacecast	Matt Arsenault	2019-06-05	1	-2/+3
\| \| \| \| \| \| \| \| \| \|	The AllConstant check needs to be moved out of the if/else if chain to avoid a test regression. The "there is no SimplifyZExt" comment puzzles me, since there is SimplifyCastInst. Additionally, the Simplify* calls seem to not see the operand as constant, so this needs to be tried if the simplify failed. llvm-svn: 362653
*	[X86] Fix mistake that marked ↵	Craig Topper	2019-06-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	VADDSSrrb_Int/VADDSDrrb_Int/VMULSSrrb_Int/VMULSDrrb_Int as commutable. One of the sources controls the pass through value for the upper bits of the result so we can't really commute it. In practice this problem isn't a functional issue because we would only try to commute this instruction in order to fold a load. But we can't do embedded rounding and fold a load at the same time. So the load fold would never succeed so I don't think we would ever commute or at least keep the version after commuting. llvm-svn: 362647
*	[LOOPINFO] Extend Loop object to add utilities to get the loop bounds,	Whitney Tsang	2019-06-05	1	-0/+214
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	step, and loop induction variable. Summary: This PR extends the loop object with more utilities to get loop bounds, step, and loop induction variable. There already exist passes which try to obtain the loop induction variable in their own pass, e.g. loop interchange. It would be useful to have a common area to get this information. /// Example: /// for (int i = lb; i < ub; i+=step) /// <loop body> /// --- pseudo LLVMIR --- /// beforeloop: /// guardcmp = (lb < ub) /// if (guardcmp) goto preheader; else goto afterloop /// preheader: /// loop: /// i1 = phi[{lb, preheader}, {i2, latch}] /// <loop body> /// i2 = i1 + step /// latch: /// cmp = (i2 < ub) /// if (cmp) goto loop /// exit: /// afterloop: /// /// getBounds /// getInitialIVValue --> lb /// getStepInst --> i2 = i1 + step /// getStepValue --> step /// getFinalIVValue --> ub /// getCanonicalPredicate --> '<' /// getDirection --> Increasing /// getInductionVariable --> i1 /// getAuxiliaryInductionVariable --> {i1} /// isCanonical --> false Reviewers: kbarton, hfinkel, dmgreen, Meinersbur, jdoerfert, syzaara, fhahn Reviewed By: kbarton Subscribers: tvvikram, bmahjour, etiotto, fhahn, jsji, hiraditya, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D60565 llvm-svn: 362644
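As a rough illustration of the queries listed in r362644, the Python sketch below models a loop's bounds with concrete integers. `LoopBounds` here is a hypothetical stand-in, not the C++ class added by the patch, and both the direction rule (sign of the step) and the canonical test (start at 0, step by 1) are assumptions about what those queries mean.

```python
from dataclasses import dataclass


@dataclass
class LoopBounds:
    initial: int  # value of the induction variable on loop entry (lb)
    step: int     # amount added on every iteration
    final: int    # bound compared against in the latch (ub)

    def direction(self):
        # Assumed rule: the sign of the step decides the direction.
        if self.step > 0:
            return "Increasing"
        if self.step < 0:
            return "Decreasing"
        return "Unknown"

    def is_canonical(self):
        # Assumed rule: canonical means counting up from 0 in steps of 1.
        return self.initial == 0 and self.step == 1
```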
*	InstCombine: correctly change byval type attribute alongside call args.	Tim Northover	2019-06-05	1	-4/+20
\| \| \| \| \| \| \| \|	When the byval attribute has a type, it must match the pointee type of any parameter; but InstCombine was not updating the attribute when folding casts of various kinds away. llvm-svn: 362643
*	IR: make getParamByValType Just Work. NFC.	Tim Northover	2019-06-05	5	-5/+11
\| \| \| \| \| \| \| \| \| \| \|	Most parts of LLVM don't care whether the byval type is derived from an explicit Attribute or from the parameter's pointee type, so it makes sense for the main access function to just return the right value. The very few users who do care (only BitcodeReader so far) can find out how it's specified by accessing the Attribute directly. llvm-svn: 362642
*	AMDGPU: Remove amdgpu-max-work-group-size attribute	Matt Arsenault	2019-06-05	1	-10/+1
\| \| \| \| \| \| \|	This has been deprecated for a long time, and mesa recently switched to amdgpu-flat-work-group-size. llvm-svn: 362641