bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert "AVX512: Implemented encoding and intrinsics for vextracti64x4 ↵	Renato Golin	2015-09-09	10	-996/+29
\| \| \| \| \| \| \| \|	,vextracti64x2, vextracti32x8, vextracti32x4, vextractf64x4, vextractf64x2, vextractf32x8, vextractf32x4 Added tests for intrinsics and encoding." This reverts commit r247149, as it was breaking numerous buildbots of varied architectures. llvm-svn: 247177
*	[PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible	Chandler Carruth	2015-09-09	30	-35/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	with the new pass manager, and no longer relying on analysis groups. This builds essentially a ground-up new AA infrastructure stack for LLVM. The core ideas are the same that are used throughout the new pass manager: type erased polymorphism and direct composition. The design is as follows: - FunctionAAResults is a type-erasing alias analysis results aggregation interface to walk a single query across a range of results from different alias analyses. Currently this is function-specific as we always assume that aliasing queries are within a function. - AAResultBase is a CRTP utility providing stub implementations of various parts of the alias analysis result concept, notably in several cases in terms of other more general parts of the interface. This can be used to implement only a narrow part of the interface rather than the entire interface. This isn't really ideal, this logic should be hoisted into FunctionAAResults as currently it will cause a significant amount of redundant work, but it faithfully models the behavior of the prior infrastructure. - All the alias analysis passes are ported to be wrapper passes for the legacy PM and new-style analysis passes for the new PM with a shared result object. In some cases (most notably CFL), this is an extremely naive approach that we should revisit when we can specialize for the new pass manager. - BasicAA has been restructured to reflect that it is much more fundamentally a function analysis because it uses dominator trees and loop info that need to be constructed for each function. All of the references to getting alias analysis results have been updated to use the new aggregation interface. All the preservation and other pass management code has been updated accordingly. The way the FunctionAAResultsWrapperPass works is to detect the available alias analyses when run, and add them to the results object. This means that we should be able to continue to respect when various passes are added to the pipeline, for example adding CFL or adding TBAA passes should just cause their results to be available and to get folded into this. The exception to this rule is BasicAA which really needs to be a function pass due to using dominator trees and loop info. As a consequence, the FunctionAAResultsWrapperPass directly depends on BasicAA and always includes it in the aggregation. This has significant implications for preserving analyses. Generally, most passes shouldn't bother preserving FunctionAAResultsWrapperPass because rebuilding the results just updates the set of known AA passes. The exception to this rule are LoopPass instances which need to preserve all the function analyses that the loop pass manager will end up needing. This means preserving both BasicAAWrapperPass and the aggregating FunctionAAResultsWrapperPass. Now, when preserving an alias analysis, you do so by directly preserving that analysis. This is only necessary for non-immutable-pass-provided alias analyses though, and there are only three of interest: BasicAA, GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is preserved when needed because it (like DominatorTree and LoopInfo) is marked as a CFG-only pass. I've expanded GlobalsAA into the preserved set everywhere we previously were preserving all of AliasAnalysis, and I've added SCEVAA in the intersection of that with where we preserve SCEV itself. One significant challenge to all of this is that the CGSCC passes were actually using the alias analysis implementations by taking advantage of a pretty amazing set of loop holes in the old pass manager's analysis management code which allowed analysis groups to slide through in many cases. Moving away from analysis groups makes this problem much more obvious. To fix it, I've leveraged the flexibility the design of the new PM components provides to just directly construct the relevant alias analyses for the relevant functions in the IPO passes that need them. This is a bit hacky, but should go away with the new pass manager, and is already in many ways cleaner than the prior state. Another significant challenge is that various facilities of the old alias analysis infrastructure just don't fit any more. The most significant of these is the alias analysis 'counter' pass. That pass relied on the ability to snoop on AA queries at different points in the analysis group chain. Instead, I'm planning to build printing functionality directly into the aggregation layer. I've not included that in this patch merely to keep it smaller. Note that all of this needs a nearly complete rewrite of the AA documentation. I'm planning to do that, but I'd like to make sure the new design settles, and to flesh out a bit more of what it looks like in the new pass manager first. Differential Revision: http://reviews.llvm.org/D12080 llvm-svn: 247167
*	[WebAssembly] Implement calls with void return types.	Dan Gohman	2015-09-09	1	-0/+10
\| \| \| \|	llvm-svn: 247158
*	AMDGPU/SI: Fold operands through REG_SEQUENCE instructions	Tom Stellard	2015-09-09	3	-14/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This helps mostly when we use add instructions for address calculations that contain immediates. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12256 llvm-svn: 247157
*	[CostModel][AArch64] Remove amortization factor for some of the vector ↵	Silviu Baranga	2015-09-09	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	select instructions Summary: We are not scalarizing the wide selects in codegen for i16 and i32 and therefore we can remove the amortization factor. We still have issues with i64 vectors in codegen though. Reviewers: mcrosier Subscribers: mcrosier, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D12724 llvm-svn: 247156
*	AVX512: Implemented encoding and intrinsics for	Igor Breger	2015-09-09	10	-29/+996
\| \| \| \| \| \| \| \| \|	vextracti64x4 ,vextracti64x2, vextracti32x8, vextracti32x4, vextractf64x4, vextractf64x2, vextractf32x8, vextractf32x4 Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D11802 llvm-svn: 247149
*	[mips][microMIPS] Implement ADDU16, AND16, ANDI16, NOT16, OR16, SLL16 and ↵	Zoran Jovanovic	2015-09-09	2	-0/+21
\| \| \| \| \| \| \| \|	SRL16 instructions Differential Revision: http://reviews.llvm.org/D11178 llvm-svn: 247146
*	Fix PR 24633 - Handle undef values when parsing standalone constants.	Alex Lorenz	2015-09-09	1	-0/+16
\| \| \| \|	llvm-svn: 247145
*	Delay predication of stores until near the end of vector code generation	James Molloy	2015-09-09	1	-9/+34
\| \| \| \| \| \| \| \|	Predicating stores requires creating extra blocks. It's much cleaner if we do this in one pass instead of mutating the CFG while writing vector instructions. Besides which we can make use of helper functions to update domtree for us, reducing the work we need to do. llvm-svn: 247139
*	Fix vector splitting for extract_vector_elt and vector elements of <8-bits.	Daniel Sanders	2015-09-09	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: One of the vector splitting paths for extract_vector_elt tries to lower: define i1 @via_stack_bug(i8 signext %idx) { %1 = extractelement <2 x i1> <i1 false, i1 true>, i8 %idx ret i1 %1 } to: define i1 @via_stack_bug(i8 signext %idx) { %base = alloca <2 x i1> store <2 x i1> <i1 false, i1 true>, <2 x i1>* %base %2 = getelementptr <2 x i1>, <2 x i1>* %base, i32 %idx %3 = load i1, i1* %2 ret i1 %3 } However, the elements of <2 x i1> are not byte-addressible. The result of this is that the getelementptr expands to '%base + %idx * (1 / 8)' which simplifies to '%base + %idx * 0', and then simply '%base' causing all values of %idx to extract element zero. This commit fixes this by promoting the vector elements of <8-bits to i8 before splitting the vector. This fixes a number of test failures in pocl. Reviewers: pekka.jaaskelainen Subscribers: pekka.jaaskelainen, llvm-commits Differential Revision: http://reviews.llvm.org/D12591 llvm-svn: 247128
*	[mips][microMIPS] Implement CACHEE and PREFE instructions	Zoran Jovanovic	2015-09-09	3	-0/+14
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D11628 llvm-svn: 247125
*	[RuntimeDyld] Add support for MachO x86_64 SUBTRACTOR relocation.	Lang Hames	2015-09-09	1	-0/+13
\| \| \| \|	llvm-svn: 247119
*	[WebAssembly] Fix lowering of calls with more than one argument.	Dan Gohman	2015-09-09	1	-0/+13
\| \| \| \|	llvm-svn: 247118
*	SelectionDAG: Support Expand of f16 extloads	Matt Arsenault	2015-09-09	1	-0/+81
\| \| \| \| \| \| \| \| \| \|	Currently this hits an assert that extload should always be supported, which assumes integer extloads. This moves a hack out of SI's argument lowering and is covered by existing tests. llvm-svn: 247113
*	[WebAssembly] Implement WebAssemblyInstrInfo::copyPhysReg	Dan Gohman	2015-09-09	1	-0/+26
\| \| \| \|	llvm-svn: 247110
*	[llvm-readobj] MachO -- dump LinkerOptions load command.	Davide Italiano	2015-09-09	2	-49/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Example output: Linker Options { Size: 32 Count: 2 Strings [ Value: -framework Value: Cocoa ] } There were only two tests using this -- so I converted them as part of this commit rather than separately. Differential Revision: http://reviews.llvm.org/D12702 llvm-svn: 247106
*	[WinEH] Avoid creating MBBs for LLVM BBs that cannot contain code	Reid Kleckner	2015-09-08	1	-6/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Typically these are catchpads, which hold data used to decide whether to catch the exception or continue unwinding. We also shouldn't create MBBs for catchendpads, cleanupendpads, or terminatepads, since no real code can live in them. This fixes a problem where MI passes (like the register allocator) would try to put code into catchpad blocks, which are not executed by the runtime. In the new world, blocks ending in invokes now have many possible successors. llvm-svn: 247102
*	Re-apply r247080 with order of evaluation fix.	Peter Collingbourne	2015-09-08	4	-4/+95
\| \| \| \|	llvm-svn: 247095
*	[WinEH] Emit prologues and epilogues for funclets	Reid Kleckner	2015-09-08	2	-14/+190
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: 32-bit funclets have short prologues that allocate enough stack for the largest call in the whole function. The runtime saves CSRs for the funclet. It doesn't restore CSRs after we finally transfer control back to the parent funciton via a CATCHRET, but that's a separate issue. 32-bit funclets also have to adjust the incoming EBP value, which is what llvm.x86.seh.recoverframe does in the old model. 64-bit funclets need to spill CSRs as normal. For simplicity, this just spills the same set of CSRs as the parent function, rather than trying to compute different CSR sets for the parent function and each funclet. 64-bit funclets also allocate enough stack space for the largest outgoing call frame, like 32-bit. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12546 llvm-svn: 247092
*	Revert r247080, "LowerBitSets: Extend pass to support functions as bitset	Peter Collingbourne	2015-09-08	4	-95/+4
\| \| \| \| \| \|	members." as it causes test failures on a number of bots. llvm-svn: 247088
*	[Bitcode] Add compatibility tests for new instructions	Vedant Kumar	2015-09-08	1	-3/+97
\| \| \| \| \| \| \| \| \|	Adds basic compatibility tests for the following instructions: catchpad, catchendpad, cleanuppad, cleanupendpad, terminatepad, cleanupret, catchret llvm-svn: 247087
*	Fix the PPC CTR Loop pass to look for calls to the intrinsics that	Eric Christopher	2015-09-08	1	-0/+349
\| \| \| \| \| \|	read CTR and count them as reading the CTR. llvm-svn: 247083
*	LowerBitSets: Extend pass to support functions as bitset members.	Peter Collingbourne	2015-09-08	4	-4/+95
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change extends the bitset lowering pass to support bitsets that may contain either functions or global variables. A function bitset is lowered to a jump table that is laid out before one of the functions in the bitset. Also add support for non-string bitset identifier names. This allows for distinct metadata nodes to stand in for names with internal linkage, as done in D11857. Differential Revision: http://reviews.llvm.org/D11856 llvm-svn: 247080
*	AMDGPU/SI: Fix input vcc operand for VOP2b instructions	Matt Arsenault	2015-09-08	2	-18/+67
\| \| \| \| \| \| \| \| \|	Adds vcc to output string input for e32. Allows option of using e64 encoding with assembler. Also fixes these instructions not implicitly reading exec. llvm-svn: 247074
*	Fix comments and RUN line in x86-64 stdarg test leftover from last commit	Derek Schuff	2015-09-08	1	-2/+2
\| \| \| \| \| \|	From http://reviews.llvm.org/D12346 llvm-svn: 247070
*	x32. Fixes a bug in how struct va_list is initialized in x32	Derek Schuff	2015-09-08	4	-2/+153
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch modifies X86TargetLowering::LowerVASTART so that struct va_list is initialized with 32 bit pointers in x32. It also includes tests that call @llvm.va_start() for x32. Patch by João Porto Subscribers: llvm-commits, hjl.tools Differential Revision: http://reviews.llvm.org/D12346 llvm-svn: 247069
*	x32. Fixes a bug in i8mem_NOREX declaration.	Derek Schuff	2015-09-08	4	-8/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The old implementation assumed LP64 which is broken for x32. Specifically, the MOVE8rm_NOREX and MOVE8mr_NOREX, when selected, would cause a 'Cannot emit physreg copy instruction' error message to be reported. This patch also enable the h-register*ll tests for x32. Differential Revision: http://reviews.llvm.org/D12336 Patch by João Porto llvm-svn: 247058
*	AMDGPU: Handle sub of constant for DS offset folding	Matt Arsenault	2015-09-08	1	-0/+125
\| \| \| \| \| \| \| \| \|	sub C, x - > add (sub 0, x), C for DS offsets. This is mostly to fix regressions that show up when SeparateConstOffsetFromGEP is enabled. llvm-svn: 247054
*	Fix PR 24723 - Handle 0-mass backedges in irreducible loops	Diego Novillo	2015-09-08	1	-0/+155
\| \| \| \| \| \| \| \| \| \| \| \|	This corner case happens when we have an irreducible SCC that is deeply nested. As we work down the tree, the backedge masses start getting smaller and smaller until we reach one that is down to 0. Since we distribute the incoming mass using the backedge masses as weight, the distributor does not allow zero weights. So, we simply ignore them (which will just use the weights of the non-zero nodes). llvm-svn: 247050
*	[MC/ELF] Accept zero for .align directive	Davide Italiano	2015-09-08	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	.align directive refuses alignment 0 -- a comment in the code hints this is done for GNU as compatibility, but it seems GNU as accepts .align 0 (and silently rounds up alignment to 1). Differential Revision: http://reviews.llvm.org/D12682 llvm-svn: 247048
*	Fix CPP Backend for GEP API changes for opaque pointer types	David Blaikie	2015-09-08	1	-0/+10
\| \| \| \| \| \|	Based on a patch by Jerome Witmann. llvm-svn: 247047
*	Merge or combine tests and convert to FileCheck.	Benjamin Kramer	2015-09-08	4	-92/+110
\| \| \| \| \| \| \| \|	- Move tests only exercising instsimplify to instsimplify's apint-or.ll - Actually test the CHECK lines in instsimplify's apint-or.ll - Merge the remaining tests in apint-or1.ll and apint-or2.ll, use FileCheck llvm-svn: 247045
*	add tests for De Morgan instcombines based on PR22723	Sanjay Patel	2015-09-08	1	-0/+38
\| \| \| \|	llvm-svn: 247040
*	fix typos, remove noise; NFCI	Sanjay Patel	2015-09-08	1	-23/+24
\| \| \| \|	llvm-svn: 247035
*	[Bitcode] Add compatibility test for llvm 3.7.0	Vedant Kumar	2015-09-08	2	-0/+1280
\| \| \| \| \| \| \|	This patch adds llvm-3.7 IR and generated bitcode for our compatibility test (in accordance with the developer policy). llvm-svn: 247031
*	WebAssembly: NFC rename shr/sar	JF Bastien	2015-09-08	3	-6/+6
\| \| \| \| \| \|	Renamed from: https://github.com/WebAssembly/design/pull/332 llvm-svn: 247028
*	[mips][microMIPS] Implement LLE, LUI, LW and LWE instructions	Zoran Jovanovic	2015-09-08	2	-0/+12
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D1179 llvm-svn: 247017
*	[WebAssembly] Temporarily disable this test, as it depends on additional ↵	Dan Gohman	2015-09-08	1	-0/+3
\| \| \| \| \| \|	patches that aren't yet checked in. llvm-svn: 247011
*	AVX512: kunpck encoding implementation	Igor Breger	2015-09-08	3	-1/+38
\| \| \| \| \| \| \| \|	Added tests for encoding. Differential Revision: http://reviews.llvm.org/D12061 llvm-svn: 247010
*	[WebAssembly] Enable SSA lowering and other pre-regalloc passes	Dan Gohman	2015-09-08	1	-0/+22
\| \| \| \|	llvm-svn: 247008
*	[mips][microMIPS] Implement SB, SBE, SCE, SH and SHE instructions	Zoran Jovanovic	2015-09-08	2	-0/+15
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D11801 llvm-svn: 246999
*	There is a trunc(lshr (zext A), Cst) optimization in InstCombineCasts that	Jakub Kuderski	2015-09-08	1	-0/+37
\| \| \| \| \| \| \| \| \| \| \|	removes cast by performing the lshr on smaller types. However, currently there is no trunc(lshr (sext A), Cst) variant. This patch add such optimization by transforming trunc(lshr (sext A), Cst) to ashr A, Cst. Differential Revision: http://reviews.llvm.org/D12520 llvm-svn: 246997
*	[mips] Reserve address spaces 1-255 for software use.	Daniel Sanders	2015-09-08	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: And define them to have noop casts with address spaces 0-255. Reviewers: pekka.jaaskelainen Subscribers: pekka.jaaskelainen, llvm-commits Differential Revision: http://reviews.llvm.org/D12678 llvm-svn: 246990
*	[mips][microMIPS] Add microMIPS32r6 and microMIPS64r6 tests for existing ↵	Zoran Jovanovic	2015-09-08	6	-0/+61
\| \| \| \| \| \| \| \|	16-bit LBU16, LHU16, LW16, LWGP and LWSP instructions Differential Revision: http://reviews.llvm.org/D10956 llvm-svn: 246987
*	AVX-512: Lowering for 512-bit vector shuffles.	Elena Demikhovsky	2015-09-08	3	-506/+367
\| \| \| \| \| \| \| \|	Vector types: <8 x 64>, <16 x 32>, <32 x 16> float and integer. Differential Revision: http://reviews.llvm.org/D10683 llvm-svn: 246981
*	add missing regression tests for De Morgan's Law transform in InstCombine	Sanjay Patel	2015-09-07	2	-2/+20
\| \| \| \|	llvm-svn: 246973
*	[mips][microMIPS] Implement ABS.fmt, CEIL.L.fmt, CEIL.W.fmt, FLOOR.L.fmt, ↵	Zoran Jovanovic	2015-09-07	3	-1/+90
\| \| \| \| \| \| \| \|	FLOOR.W.fmt, TRUNC.L.fmt, TRUNC.W.fmt, RSQRT.fmt and SQRT.fmt instructions Differential Revision: http://reviews.llvm.org/D11674 llvm-svn: 246968
*	[mips][microMIPS] Implement BC16, BEQZC16 and BNEZC16 instructions	Zoran Jovanovic	2015-09-07	6	-0/+32
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D11181 llvm-svn: 246963
*	[mips][microMIPS] Implement CVT.D.fmt, CVT.L.fmt, CVT.S.fmt, CVT.W.fmt, ↵	Zoran Jovanovic	2015-09-07	4	-0/+300
\| \| \| \| \| \| \| \|	MAX.fmt, MIN.fmt, MAXA.fmt, MINA.fmt and CMP.condn.fmt instructions Differential Revision: http://reviews.llvm.org/D12141 llvm-svn: 246960
*	[X86][MMX] Added missing stack folding tests for MMX/SSSE3 instructions	Simon Pilgrim	2015-09-06	1	-2/+146
\| \| \| \|	llvm-svn: 246949