bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[NativePDB] Reconstruct function declarations from debug info.	Zachary Turner	2018-12-07	1	-1/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Previously we would create an lldb::Function object for each function parsed, but we would not add these to the clang AST. This is a first step towards getting local variable support working, as we first need an AST decl so that when we create local variable entries, they have the proper DeclContext. Differential Revision: https://reviews.llvm.org/D55384 llvm-svn: 348631
*	[llvm-tapi] Don't try to override SequenceTraits for std::string	Sam Clegg	2018-12-07	1	-17/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For some reason this doesn't seem to work with the LLVM_LINK_LLVM_DYLIB build. See https://logs.chromium.org/logs/chromium/bb/client.wasm.llvm/linux/37764/+/recipes/steps/LLVM_regression_tests/0/stdout Moreover, overriding these traits for core types (including std::string) does not seem to be supported/recommended by YAMLTraits.h. See line 1918, which has the assertion: "only use LLVM_YAML_IS_SEQUENCE_VECTOR for types you control" Differential Revision: https://reviews.llvm.org/D55381 llvm-svn: 348630
*	[DAGCombiner] split trunc from extend in hoistLogicOpWithSameOpcodeHands; NFC	Sanjay Patel	2018-12-07	1	-33/+48
\| \| \| \| \| \| \|	This duplicates several shared checks, but we need to split this up to fix underlying bugs in smaller steps. llvm-svn: 348627
*	[X86] Replace instregex with instrs list. NFCI.	Simon Pilgrim	2018-12-07	3	-3/+3
\| \| \| \|	llvm-svn: 348626
*	AMDGPU: Allow f32 types for llvm.amdgcn.s.buffer.load	Matt Arsenault	2018-12-07	2	-5/+12
\| \| \| \|	llvm-svn: 348625
*	[X86] Initialize and Register X86CondBrFoldingPass	Craig Topper	2018-12-07	3	-4/+8
\| \| \| \| \| \| \| \| \| \|	Make X86CondBrFoldingPass runnable with the --run-pass option, so that a test can exercise an incorrect assertion in the analyzeCompare function for SUB32ri when its operand is not an immediate. Patch by Jianping Chen Differential Revision: https://reviews.llvm.org/D55412 llvm-svn: 348620
*	AMDGPU: Remove llvm.SI.tbuffer.store	Matt Arsenault	2018-12-07	2	-67/+0
\| \| \| \|	llvm-svn: 348619
*	[X86] Improve pfm counter coverage for llvm-exegesis	Simon Pilgrim	2018-12-07	1	-0/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch attempts to improve pfm perf counter coverage for all the x86 CPUs that libpfm4 supports. Intel/AMD CPU families tend to share names for cycle/uops counters so even if they don't have a scheduler model yet they can at least use the default values (checked against the libpfm4 source code). The remaining CPUs (where their port/pipe resource counters are known) I've tried to add to the existing model mappings. These are untested but don't represent a regression to current llvm-exegesis behaviour for these CPUs. Differential Revision: https://reviews.llvm.org/D55432 llvm-svn: 348617
*	AMDGPU: Remove llvm.SI.buffer.load.dword	Matt Arsenault	2018-12-07	2	-62/+0
\| \| \| \|	llvm-svn: 348616
*	AMDGPU: Remove llvm.AMDGPU.kill	Matt Arsenault	2018-12-07	5	-35/+5
\| \| \| \| \| \|	This is the last of the old AMDGPU intrinsics. llvm-svn: 348615
*	[DAGCombiner] disable truncation of binops by default	Sanjay Patel	2018-12-07	1	-1/+7
\| \| \| \| \| \| \| \| \| \|	As discussed in the post-commit thread of r347917, this transform is fighting with an existing transform causing an infinite loop or out-of-memory, so this is effectively reverting r347917 and its follow-up r348195 while we investigate the bug. llvm-svn: 348604
*	Reapply "[DemandedBits][BDCE] Support vectors of integers"	Nikita Popov	2018-12-07	2	-27/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DemandedBits and BDCE currently only support scalar integers. This patch extends them to also handle vector integer operations. In this case bits are not tracked for individual vector elements; instead, a bit is demanded if it is demanded for any of the elements. This matches the behavior of computeKnownBits in ValueTracking and SimplifyDemandedBits in InstCombine. Unlike the previous iteration of this patch, getDemandedBits() can now again be called on arbitrary (sized) instructions, even if they don't have integer or vector of integer type. (For vector types the size of the returned mask will now be the scalar size in bits though.) The added LoopVectorize test case shows a case which triggered an assertion failure with the previous attempt, because getDemandedBits() was called on a pointer-typed instruction. Differential Revision: https://reviews.llvm.org/D55297 llvm-svn: 348602
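A minimal Python sketch of the rule described above, assuming each vector element comes with its own demanded-bit mask: the masks are merged with OR, and the result has the scalar width. The function name `demanded_bits_for_vector` and the example lanes are hypothetical and are not part of the LLVM API.

```python
def demanded_bits_for_vector(per_element_demands, scalar_bits):
    """Merge per-element demanded masks into one mask of scalar width.

    A bit is demanded for the vector if it is demanded for any element.
    """
    mask = 0
    for demand in per_element_demands:
        mask |= demand
    # The result is sized by the scalar type, not by the whole vector.
    return mask & ((1 << scalar_bits) - 1)


# Hypothetical <4 x i8> value: lanes 0-2 only need the low nibble,
# lane 3 only needs the top bit.
lanes = [0x0F, 0x0F, 0x0F, 0x80]
combined = demanded_bits_for_vector(lanes, scalar_bits=8)
```

The price of this representation is precision: bit 7 is treated as demanded in every lane even though only lane 3 uses it.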
*	[AMDGPU] Shrink scalar AND, OR, XOR instructions	Graham Sellers	2018-12-07	1	-0/+84
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change attempts to shrink scalar AND, OR and XOR instructions which take an immediate that isn't inlineable. It performs:
	  AND s0, s0, ~(1 << n) -> BITSET0 s0, n
	  OR s0, s0, (1 << n) -> BITSET1 s0, n
	  AND s0, s1, x -> ANDN2 s0, s1, ~x
	  OR s0, s1, x -> ORN2 s0, s1, ~x
	  XOR s0, s1, x -> XNOR s0, s1, ~x
	In particular, this catches setting and clearing the sign bit for fabs (and x, 0x7fffffff -> bitset0 x, 31 and or x, 0x80000000 -> bitset1 x, 31). llvm-svn: 348601
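The rewrites listed above are plain 32-bit identities, which can be checked with a short Python sketch. The helper names mirror the instruction mnemonics but are hypothetical models written for this illustration, not AMDGPU backend code; the assumption is that ANDN2, ORN2 and XNOR compute `a & ~b`, `a | ~b` and `~(a ^ b)`.

```python
MASK32 = 0xFFFFFFFF


def not32(v):
    return ~v & MASK32


def bitset0(s, n):
    # Clear bit n.
    return s & not32(1 << n)


def bitset1(s, n):
    # Set bit n.
    return (s | (1 << n)) & MASK32


def andn2(a, b):
    return a & not32(b)


def orn2(a, b):
    return (a | not32(b)) & MASK32


def xnor(a, b):
    return not32(a ^ b)


def rewrites_agree(s1, x, n):
    """Check every rewrite from the commit message for one set of inputs."""
    return (
        (s1 & not32(1 << n)) == bitset0(s1, n)
        and ((s1 | (1 << n)) & MASK32) == bitset1(s1, n)
        and (s1 & x) == andn2(s1, not32(x))
        and (s1 | x) == orn2(s1, not32(x))
        and (s1 ^ x) == xnor(s1, not32(x))
    )


# fabs-style sign-bit handling on the bit pattern of -1.0f (0xBF800000).
cleared = bitset0(0xBF800000, 31)
restored = bitset1(cleared, 31)
```

Presumably the benefit is that `~x` or the bit index `n` can be encoded inline when `x` itself cannot, but that encoding detail is an assumption here rather than something the commit message spells out.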
*	[DAGCombiner] remove explicit calls to AddToWorkList; NFCI	Sanjay Patel	2018-12-07	1	-6/+0
\| \| \| \| \| \| \| \|	As noted in the post-commit thread for rL347917: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181203/608936.html ...we don't need to repeat these calls because the combiner does it automatically. llvm-svn: 348597
*	Introduce llvm.experimental.widenable_condition intrinsic	Max Kazantsev	2018-12-07	5	-0/+124
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces a new intrinsic `@llvm.experimental.widenable.condition` that allows explicit representation for guards. It is an alternative to the `@llvm.experimental.guard` intrinsic that, unlike it, does not contain implicit control flow.

We keep finding places where `@llvm.experimental.guard` is not supported or is treated too conservatively, and there are 2 reasons for that:
- `@llvm.experimental.guard` has a memory write side effect to model implicit control flow, and this sometimes confuses passes and analyses that work with memory;
- Not all passes and analyses are aware of the semantics of guards. These passes treat them as regular throwing calls and have no idea that the condition of a guard may be used to prove something. One well-known place which has caused us trouble in the past is explicit loop iteration count calculation in SCEV. Another example is the new loop unswitching, which is not aware of guards. Whenever a new pass appears, we potentially have this problem there.

Rather than go and fix all these places (and commit to keeping track of them and adding support in future), it seems more reasonable to leverage the existing optimizer's logic as much as possible. The only significant difference between guards and regular explicit branches is that a guard's condition can be widened. It means that a guard contains (explicitly or implicitly) a `deopt` block successor, and it is always legal to go there no matter what the guard condition is. The other successor is a guarded block, and it is only legal to go there if the condition is true.

This patch introduces a new explicit form of guards as an alternative to the `@llvm.experimental.guard` intrinsic. Now a widenable guard can be represented in the CFG explicitly like this:

	  %widenable_condition = call i1 @llvm.experimental.widenable.condition()
	  %new_condition = and i1 %cond, %widenable_condition
	  br i1 %new_condition, label %guarded, label %deopt

	guarded:
	  ; Guarded instructions

	deopt:
	  call type @llvm.experimental.deoptimize(<args...>) [ "deopt"(<deopt_args...>) ]

The new intrinsic `@llvm.experimental.widenable.condition` has the semantics of an `undef`, but the intrinsic prevents the optimizer from folding it early. This form should exploit all optimization boons provided to the `br` instruction, and it can still be widened by replacing the result of `@llvm.experimental.widenable.condition()` with its `and` with any arbitrary boolean value (as long as the branch that is taken when it is `false` has a deopt and has no side-effects).

For more motivation, please check the llvm-dev discussion "[llvm-dev] Giving up using implicit control flow in guards".

This patch introduces this new intrinsic with the respective LangRef changes and a pass that converts old-style guards (expressed as intrinsics) into the new form. The naming discussion is still ongoing. Merging this to unblock further items. We can later change the name of this intrinsic. Reviewed By: reames, fedor.sergeev, sanjoy Differential Revision: https://reviews.llvm.org/D51207 llvm-svn: 348593
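The widening argument above can be modelled with booleans. This is a toy Python sketch, not LLVM code: `takes_guarded` stands for the explicit branch on `and i1 %cond, %widenable_condition`, and `extra` is a hypothetical additional check that a widening transform wants to fold in. The sketch enumerates all inputs and confirms that widening never lets execution reach the guarded block when the original condition is false.

```python
from itertools import product


def takes_guarded(cond, widenable):
    # Explicit form: branch to the guarded block only if both are true.
    return cond and widenable


def takes_guarded_widened(cond, widenable, extra):
    # Widening replaces the widenable condition with (widenable and extra).
    return cond and (widenable and extra)


def widening_is_sound():
    for cond, widenable, extra in product([False, True], repeat=3):
        if takes_guarded_widened(cond, widenable, extra):
            # The guarded block is still reached only when the original
            # condition holds, and only on paths the unwidened guard
            # could also have taken.
            if not (cond and takes_guarded(cond, widenable)):
                return False
    return True


sound = widening_is_sound()
```

Widening can only send more executions to the `deopt` successor, which the commit message describes as always legal.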
*	ARM: use correct offset from base pointer (r6) in call frame regions.	Tim Northover	2018-12-07	1	-0/+1
\| \| \| \| \| \| \| \| \|	When we had dynamic call frames (i.e. sp adjustment around each call) we were including that adjustment into offsets calculated based on r6, even though it's only sp that changes. This led to incorrect stack slot accesses. llvm-svn: 348591
*	[Targets] Add errors for tiny and kernel codemodel on targets that don't support them	David Green	2018-12-07	19	-119/+69
\| \| \| \| \| \| \| \| \| \| \|	Adds fatal errors for any target that does not support the Tiny or Kernel codemodels by rejigging the getEffectiveCodeModel calls. Differential Revision: https://reviews.llvm.org/D50141 llvm-svn: 348585
*	Fix gcc7.3 -Wparentheses warning. NFCI.	Simon Pilgrim	2018-12-07	1	-3/+3
\| \| \| \|	llvm-svn: 348581
*	[X86] Add ivybridge to llvm-exegesis PFM counter mappings	Simon Pilgrim	2018-12-07	1	-0/+1
\| \| \| \|	llvm-svn: 348575
*	[SelectionDAG] Don't pass on DemandedElts when handling SCALAR_TO_VECTOR	Simon Pilgrim	2018-12-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Fixes an assertion:
	  llc: lib/CodeGen/SelectionDAG/SelectionDAG.cpp:2200: llvm::KnownBits llvm::SelectionDAG::computeKnownBits(llvm::SDValue, const llvm::APInt&, unsigned int) const: Assertion `(!Op.getValueType().isVector() || NumElts == Op.getValueType().getVectorNumElements()) && "Unexpected vector size"' failed.
	Committed on behalf of: @pendingchaos (Rhys Perry) Differential Revision: https://reviews.llvm.org/D55223 llvm-svn: 348574
*	[IR] Don't assume all functions are 4 byte aligned	Ranjeet Singh	2018-12-07	1	-4/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In some cases different alignments for functions might be used to save space, e.g. thumb mode with -Oz will try to use 2 byte function alignment. A similar patch that fixed this in other areas exists here: https://reviews.llvm.org/D46110 This was approved previously in https://reviews.llvm.org/D55115 (r348215), but when committed it caused failures on the sanitizer buildbots when building llvm with clang (containing this patch). This is now fixed: I've added a check for whether getting the parent module returns null, and if it does, the alignment is set to 0. Differential Revision: https://reviews.llvm.org/D55115 llvm-svn: 348571
*	[PM] Port LoadStoreVectorizer to the new pass manager.	Markus Lavin	2018-12-07	4	-16/+36
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D54848 llvm-svn: 348570
*	[LoopSimplifyCFG] Do not deal with loops with irreducible CFG inside	Max Kazantsev	2018-12-07	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current algorithm that collects live/dead/inloop blocks relies on some invariants related to RPO and PO traversals. In particular, the important fact it requires is that the loop's only latch is the first block in PO traversal. It also relies on the fact that during RPO we visit all predecessors of a block before we visit this block (backedges ignored). If a loop has an irreducible non-loop cycle inside, both these assumptions may break. This patch adds detection for this situation and prohibits the terminator folding for loops with irreducible CFG. In theory we can support this later, but that requires some algorithmic changes. Besides, irreducible CFG is not a frequent situation and we can simply not bother. Thanks @uabelho for finding this! Differential Revision: https://reviews.llvm.org/D55357 Reviewed By: skatkov llvm-svn: 348567
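For readers unfamiliar with the term, here is a small Python sketch of the textbook characterization of irreducibility: a CFG is irreducible if some edge goes backwards in reverse post-order to a block that does not dominate the edge's source. This is an illustration under that definition only; it is not the detection code added by the patch, and all function names and the example graphs are hypothetical.

```python
def reverse_postorder(succs, entry):
    seen, order = set(), []

    def dfs(node):
        seen.add(node)
        for s in succs.get(node, []):
            if s not in seen:
                dfs(s)
        order.append(node)

    dfs(entry)
    return order[::-1]


def dominators(succs, entry):
    """Iterative dominator sets over the blocks reachable from entry."""
    rpo = reverse_postorder(succs, entry)
    preds = {n: [] for n in rpo}
    for n in rpo:
        for s in succs.get(n, []):
            preds[s].append(n)
    dom = {n: set(rpo) for n in rpo}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for n in rpo[1:]:
            new = set.intersection(*(dom[p] for p in preds[n])) | {n}
            if new != dom[n]:
                dom[n] = new
                changed = True
    return rpo, dom


def is_irreducible(succs, entry):
    rpo, dom = dominators(succs, entry)
    index = {n: i for i, n in enumerate(rpo)}
    for n in rpo:
        for s in succs.get(n, []):
            # Retreating edge whose target does not dominate its source.
            if index[s] <= index[n] and s not in dom[n]:
                return True
    return False


natural_loop = {"A": ["B"], "B": ["C"], "C": ["B", "D"], "D": []}
two_entry_cycle = {"A": ["B", "C"], "B": ["C"], "C": ["B"]}
```

In `two_entry_cycle` the cycle between `B` and `C` can be entered at either block, so neither one is a header that dominates the other. In such a cycle a block can be reached in RPO before one of its predecessors even after ignoring backedges, which is the kind of assumption the commit message says may break.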
*	[PowerPC] Fix assert from machine verify pass that missing undef register flag	Zi Xuan Wu	2018-12-07	1	-15/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix an assert about using an undefined physical register in the machine instruction verify pass. The reason is that the undef register flag is missing after the transformation done by the If Conversion pass.
```
Bad machine code: Using an undefined physical register
- function: func_65
- basic block: %bb.0 entry (0x10024740738)
- instruction: BCLR killed $cr5lt, implicit $lr8, implicit $rm, implicit undef $x3
- operand 0: killed $cr5lt
LLVM ERROR: Found 1 machine code errors.
```
There are also other existing testcases with the same issue, so I add the -verify-machineinstrs option to enable verification. Differential Revision: https://reviews.llvm.org/D55408 llvm-svn: 348566
*	[CodeExtractor] Store outputs at the first valid insertion point	Vedant Kumar	2018-12-07	1	-12/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When CodeExtractor outlines values which are used by the original function, it must store those values in some in-out parameter. This store instruction must not be inserted in between a PHI and an EH pad instruction, as that results in invalid IR. This fixes the following verifier failure seen while outlining within ObjC methods with live exit values:
	  The unwind destination does not have an exception handling instruction!
	    %call35 = invoke i8* bitcast (i8* (i8, i8, ...)* @objc_msgSend to i8* (i8, i8))(i8 %exn.adjusted, i8* %1) to label %invoke.cont34 unwind label %lpad33, !dbg !4183
	  The unwind destination does not have an exception handling instruction!
	    invoke void @objc_exception_throw(i8* %call35) #12 to label %invoke.cont36 unwind label %lpad33, !dbg !4184
	  LandingPadInst not the first non-PHI instruction in the block.
	    %3 = landingpad { i8, i32 } catch i8 null, !dbg !1411
	rdar://46540815 llvm-svn: 348562
*	Revert "[llvm-tapi] Don't override SequenceTraits for std::string"	Armando Montanez	2018-12-07	1	-17/+12
\| \| \| \| \| \|	Revert r348551 since it triggered some warnings that don't appear to have a quick fix. llvm-svn: 348560
*	Revert "[DemandedBits][BDCE] Support vectors of integers"	Nikita Popov	2018-12-07	2	-51/+28
\| \| \| \| \| \| \|	This reverts commit r348549. Causing assertion failures during clang build. llvm-svn: 348558
*	[DAGCombiner] use root SDLoc for all nodes created by logic fold	Sanjay Patel	2018-12-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	If this is not a valid way to assign an SDLoc, then we get this wrong all over SDAG. I don't know enough about the SDAG to explain this. IIUC, theoretically, debug info is not supposed to affect codegen. But here it has clearly affected 3 different targets, and the x86 change is an actual improvement. llvm-svn: 348552
*	[llvm-tapi] Don't override SequenceTraits for std::string	Armando Montanez	2018-12-06	1	-12/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Change the ELF YAML implementation of TextAPI so NeededLibs uses a flow sequence vector correctly instead of overriding the YAML implementation for std::vector<std::string>. This should fix the test failure with the LLVM_LINK_LLVM_DYLIB build mentioned in D55381. Still passes existing tests that cover this. Differential Revision: https://reviews.llvm.org/D55390 llvm-svn: 348551
*	[DAGCombiner] don't bother saving a SDLoc for a node that's dead; NFCI	Sanjay Patel	2018-12-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We shouldn't care about the debug location for a node that we're creating, but attaching the root of the pattern should be the best effort. (If this is not true, then we are doing it wrong all over the SDAG). This is no-functional-change-intended, and there are no regression test diffs...and that's what I expected. But there's a similar line above this diff, where those assumptions apparently do not hold. llvm-svn: 348550
*	[DemandedBits][BDCE] Support vectors of integers	Nikita Popov	2018-12-06	2	-28/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DemandedBits and BDCE currently only support scalar integers. This patch extends them to also handle vector integer operations. In this case bits are not tracked for individual vector elements, instead a bit is demanded if it is demanded for any of the elements. This matches the behavior of computeKnownBits in ValueTracking and SimplifyDemandedBits in InstCombine. The getDemandedBits() method can now only be called on instructions that have integer or vector of integer type. Previously it could be called on any sized instruction (even if it was not particularly useful). The size of the return value is now always the scalar size in bits (while previously it was the type size in bits). Differential Revision: https://reviews.llvm.org/D55297 llvm-svn: 348549
*	[DAGCombiner] more clean up in hoistLogicOpWithSameOpcodeHands(); NFC	Sanjay Patel	2018-12-06	1	-41/+34
\| \| \| \| \| \|	This code can still misbehave. llvm-svn: 348547
*	[X86] Directly create ADC/SBB nodes instead of using ADD/SUB with (and SETCC_CARRY, 1)	Craig Topper	2018-12-06	2	-47/+8
\| \| \| \| \| \| \| \| \| \| \| \|	This addresses a FIXME and avoids depending on an isel pattern match I think. I've removed the isel patterns too since we have no lit tests left that cover them. Hopefully that really means they are unused. I'm trying to decide if we need SETCC_CARRY. This removes one of its usages. Differential Revision: https://reviews.llvm.org/D55355 llvm-svn: 348536
*	[DAGCombiner] don't group bswap with casts in logic hoisting fold	Sanjay Patel	2018-12-06	1	-6/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was probably organized as it was because bswap is a unary op. But that's where the similarity to the other opcodes ends. We should not limit this transform to scalars, and we should not try it if either input has other uses. This is another step towards trying to clean this whole function up to prevent it from causing infinite loops and memory explosions. Earlier commits in this series: rL348501 rL348508 rL348518 llvm-svn: 348534
*	[DAGCombiner] reduce indent; NFC	Sanjay Patel	2018-12-06	1	-38/+31
\| \| \| \| \| \| \| \|	Unlike some of the folds in hoistLogicOpWithSameOpcodeHands() above this shuffle transform, this has the expected hasOneUse() checks in place. llvm-svn: 348523
*	[DagCombiner][X86] Simplify a ConcatVectors of a scalar_to_vector with undef.	Andrea Di Biagio	2018-12-06	1	-4/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces a new DAGCombiner rule to simplify concat_vectors nodes:
	  concat_vectors( bitcast (scalar_to_vector %A), UNDEF) --> bitcast (scalar_to_vector %A)
	This patch only partially addresses PR39257. In particular, it is enough to fix one of the two problematic cases mentioned in PR39257. However, it is not enough to fix the original test case posted by Craig; that particular case would probably require a more complicated approach (and knowledge about used bits).
	Before this patch, we used to generate the following code for function PR39257 (-mtriple=x86_64 , -mattr=+avx):
	  vmovsd (%rdi), %xmm0 # xmm0 = mem[0],zero
	  vxorps %xmm1, %xmm1, %xmm1
	  vblendps $3, %xmm0, %xmm1, %xmm0 # xmm0 = xmm0[0,1],xmm1[2,3]
	  vmovaps %ymm0, (%rsi)
	  vzeroupper
	  retq
	Now we generate this:
	  vmovsd (%rdi), %xmm0 # xmm0 = mem[0],zero
	  vmovaps %ymm0, (%rsi)
	  vzeroupper
	  retq
	As a side note: that VZEROUPPER is completely redundant... I guess the vzeroupper insertion pass doesn't realize that the definition of %xmm0 from vmovsd is already zeroing the upper half of %ymm0. Note that on -mcpu=btver2, we don't get that vzeroupper because the vzeroupper insertion pass is disabled. Differential Revision: https://reviews.llvm.org/D55274 llvm-svn: 348522
*	[DAGCombiner] don't hoist logic op if operands have other uses, part 2	Sanjay Patel	2018-12-06	1	-5/+7
\| \| \| \| \| \| \| \| \|	The PPC test with 2 extra uses seems clearly better by avoiding this transform. With 1 extra use, we also prevent an extra register move (although that might be an RA problem). The general rule should be to only make a change here if it is always profitable. The x86 diffs are all neutral. llvm-svn: 348518
*	Fix Wdocumentation warning. NFCI.	Simon Pilgrim	2018-12-06	1	-2/+0
\| \| \| \|	llvm-svn: 348517
*	Reapply "Adapt gcov to changes in CFE."	Adrian Prantl	2018-12-06	2	-17/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r348203 and reapplies D55085 with an additional GCOV bugfix to make the change NFC for relative file paths in .gcno files. Thanks to Ilya Biryukov for additional testing! Original commit message: Update Diagnostic handling for changes in CFE. The clang frontend no longer emits the current working directory for DIFiles containing an absolute path in the filename: and will move the common prefix between current working directory and the file into the directory: component. https://reviews.llvm.org/D55085 llvm-svn: 348512
*	[AArch64] Fix Exynos predicate	Evandro Menezes	2018-12-06	1	-8/+10
\| \| \| \| \| \|	Fix predicate for arithmetic instructions with shift and/or extend. llvm-svn: 348510
*	[DAGCombiner] don't hoist logic op if operands have other uses	Sanjay Patel	2018-12-06	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The AVX512 diffs are neutral, but the bswap test shows a clear overreach in hoistLogicOpWithSameOpcodeHands(). If we don't check for other uses, we can increase the instruction count. This could also fight with transforms trying to go in the opposite direction and possibly blow up/infinite loop. This might be enough to solve the bug noted here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181203/608593.html I did not add the hasOneUse() checks to all opcodes because I see a perf regression for at least one opcode. We may decide that's irrelevant in the face of potential compiler crashing, but I'll see if I can salvage that first. llvm-svn: 348508
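The instruction-count argument above can be made concrete with a toy Python cost model. It assumes the fold rewrites `and (bswap x), (bswap y)` into `bswap (and x, y)`; the function name and the counting scheme are hypothetical and only illustrate why extra uses matter.

```python
# Before the fold: two bswaps feeding one logic op.
COUNT_BEFORE = 3


def count_after_hoist(x_bswap_has_other_uses, y_bswap_has_other_uses):
    """Instructions left after hoisting the logic op above the bswaps."""
    count = 2  # the new 'and' plus the single new 'bswap'
    # An original bswap with another user cannot be deleted.
    if x_bswap_has_other_uses:
        count += 1
    if y_bswap_has_other_uses:
        count += 1
    return count


best_case = count_after_hoist(False, False)
one_extra_use = count_after_hoist(True, False)
two_extra_uses = count_after_hoist(True, True)
```

Under this model the fold pays off only when both operands have a single use. One extra use makes it neutral and two make it a regression, which is the overreach the bswap test is said to show.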
*	[PDB] Move some code around. NFC.	Zachary Turner	2018-12-06	2	-1/+55
\| \| \| \|	llvm-svn: 348505
*	[DAGCombiner] refactor function that hoists bitwise logic; NFCI	Sanjay Patel	2018-12-06	1	-56/+65
\| \| \| \| \| \| \| \| \|	Added FIXME and TODO comments for lack of safety checks. This function is a suspect in out-of-memory errors as discussed in the follow-up thread to r347917: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181203/608593.html llvm-svn: 348501
*	Support skewed stream arrays.	Zachary Turner	2018-12-06	2	-7/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	VarStreamArray was built on the assumption that it is backed by a StreamRef, and offset 0 of that StreamRef is the first byte of the first record in the array. This is a logical and intuitive assumption, but unfortunately we have use cases where it doesn't hold. Specifically, a PDB module's symbol stream is prefixed by 4 bytes containing a magic value, and the first byte of record data in the array is actually at offset 4 of this byte sequence. Previously, we would just truncate the first 4 bytes and then construct the VarStreamArray with the resulting StreamRef, so that offset 0 of the underlying stream did correspond to the first byte of the first record, but this is problematic, because symbol records reference other symbol records by the absolute offset including that initial magic 4 bytes. So if another record wants to refer to the first record in the array, it would say "the record at offset 4". This led to extremely confusing hacks and semantics in loading code, and after spending 30 minutes trying to get some math right and failing, I decided to fix this in the underlying implementation of VarStreamArray. Now, we can say that a stream is skewed by a particular amount. This way, when we access a record by absolute offset, we can use the same values that the records themselves contain, instead of having to do fixups. Differential Revision: https://reviews.llvm.org/D55344 llvm-svn: 348499
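A rough Python sketch of the skew idea described above. The record layout here (a 2-byte little-endian length followed by that many payload bytes) and the class name `SkewedRecordArray` are simplifying assumptions made for illustration; they are not the VarStreamArray interface or the exact PDB record format.

```python
import struct


class SkewedRecordArray:
    """Variable-length records that start `skew` bytes into the buffer.

    Offsets passed to at() are absolute offsets into the whole buffer,
    so they match the offsets stored inside the records themselves.
    """

    def __init__(self, data, skew=0):
        self.data = data
        self.skew = skew

    def at(self, offset):
        (length,) = struct.unpack_from("<H", self.data, offset)
        start = offset + 2
        return self.data[start:start + length]

    def __iter__(self):
        offset = self.skew  # skip the prefix instead of truncating it
        while offset < len(self.data):
            payload = self.at(offset)
            yield offset, payload
            offset += 2 + len(payload)


# 4 bytes of made-up magic, then two records.
stream = b"MAGC" + struct.pack("<H", 3) + b"abc" + struct.pack("<H", 2) + b"xy"
records = list(SkewedRecordArray(stream, skew=4))
first = SkewedRecordArray(stream, skew=4).at(4)
```

Because the prefix stays in the buffer, "the record at offset 4" can be looked up directly, with no fixup for the 4 magic bytes.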
*	[X86] Refactored IsSplatVector to use switch. NFCI.	Simon Pilgrim	2018-12-06	1	-21/+24
\| \| \| \| \| \| \| \|	Initial step towards making the function more generic (and probably move into SelectionDAG). This is necessary to avoid massive codegen bloat for PR38243 (Add modulo rotate support to LowerRotate). llvm-svn: 348498
*	[DEBUGINFO, NVPTX] Disable emission of ',debug' option if only debug directives are allowed.	Alexey Bataev	2018-12-06	1	-1/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If only the output of debug directives is requested, we should drop emission of the ',debug' option from the target directive. Required to support the nvprof profiler. Reviewers: echristo Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46061 llvm-svn: 348497
*	[GVN] Don't perform scalar PRE on GEPs	Alexandros Lamprineas	2018-12-06	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Partial Redundancy Elimination of GEPs prevents CodeGenPrepare from sinking the addressing mode computation of memory instructions back to its uses. The problem comes from the insertion of PHIs, which confuse CGP and make it bail. I've autogenerated the check lines of an existing test and added a store instruction to demonstrate the motivation behind this change. The store is now using the gep instead of a phi. Differential Revision: https://reviews.llvm.org/D55009 llvm-svn: 348496
*	[DEBUGINFO, NVPTX]Emit last debugging directives.	Alexey Bataev	2018-12-06	3	-3/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We may end up with debug directives that have not been emitted at the end of module emission. This patch fixes the problem by emitting those last directives at the end of module emission. Reviewers: echristo Subscribers: jholewinski, llvm-commits Differential Revision: https://reviews.llvm.org/D54320 llvm-svn: 348495
*	DAGCombiner::visitINSERT_VECTOR_ELT - pull out repeated VT.getVectorNumElements(). NFCI.	Simon Pilgrim	2018-12-06	1	-3/+4
\| \| \| \| \| \|	llvm-svn: 348494
*	[NFC][AArch64] Split out backend features	Diogo N. Sampaio	2018-12-06	6	-72/+297
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch splits out backend features currently hidden behind architecture versions. For example, currently the only way to activate the complex numbers extension is to target a v8.3 architecture, whereas after the patch this extension can be added separately. This refactoring is required by the new command lines proposal: http://lists.llvm.org/pipermail/llvm-dev/2018-September/126346.html Reviewers: DavidSpickett, olista01, t.p.northover Subscribers: kristof.beyls, bryanpkc, javed.absar, pbarrio Differential revision: https://reviews.llvm.org/D54633 -- It was reverted in rL348249 due to a build bot failure in one of the regression tests: http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/14386 The problem seems to be that FileCheck behaves differently on Windows and Linux. This new patch splits the test file into multiple files and does more exact pattern matching in an attempt to circumvent the issue. llvm-svn: 348493