bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[LV] Sink scalar operands of predicated instructions	Matthew Simpson	2016-10-25	3	-16/+16
\| \| \| \| \| \| \| \| \| \| \| \| \|	When we predicate an instruction (div, rem, store) we place the instruction in its own basic block within the vectorized loop. If a predicated instruction has scalar operands, it's possible to recursively sink these scalar expressions into the predicated block so that they might avoid execution. This patch sinks as much scalar computation as possible into predicated blocks. We previously were able to sink such operands only if they were extractelement instructions. Differential Revision: https://reviews.llvm.org/D25632 llvm-svn: 285097
*	[InstCombine] add tests for missing icmp + shl nuw fold	Sanjay Patel	2016-10-25	1	-0/+80
\| \| \| \| \| \| \| \|	Patch by bryant! Differential Revision: https://reviews.llvm.org/D25952 llvm-svn: 285095
*	Add -strip-nonlinetable-debuginfo capability	Michael Ilseman	2016-10-25	6	-5/+194
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a new function to DebugInfo.cpp that takes an llvm::Module as input and removes all debug info metadata that is not directly needed for line tables, thus effectively stripping all type and variable information from the module. The primary motivation for this feature was the bitcode work flow (cf. http://lists.llvm.org/pipermail/llvm-dev/2016-June/100643.html for more background). This is not wired up yet, but will be in subsequent patches. For testing, the new functionality is exposed to opt with a -strip-nonlinetable-debuginfo option. The secondary use-case (and one that works right now!) is as a reduction pass in bugpoint. I added two new bugpoint options (-disable-strip-debuginfo and -disable-strip-debug-types) to control the new features. By default it will first attempt to remove all debug information, then only the type info, and then proceed to hack at any remaining MDNodes. Thanks to Adrian Prantl for stewarding this patch! llvm-svn: 285094
*	Remove debug location from common tail when tail-merging	Robert Lougher	2016-10-25	2	-6/+80
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The branch folding pass tail merges blocks into a common-tail. However, the tail retains the debug information from one of the original inputs to the merge (chosen randomly). This is a problem for sampled-based PGO, as hits on the common-tail will be attributed to whichever block was chosen, irrespective of which path was actually taken to the common-tail. This patch fixes the issue by nulling the debug location for the common-tail. Differential Revision: https://reviews.llvm.org/D25742 llvm-svn: 285093
*	[llvm-cov] Add support for loading coverage from multiple objects	Vedant Kumar	2016-10-25	2	-1/+12
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25086 llvm-svn: 285088
*	[WebAssembly] Add immediate fields to call_indirect and memory operators.	Dan Gohman	2016-10-25	1	-27/+0
\| \| \| \| \| \| \|	call_indirect, grow_memory, and current_memory now have immediate operands in the 0xd binary encoding. llvm-svn: 285085
*	[IndVarSimplify][Dwarf] When widening the IV increment, correctly set the ↵	Andrea Di Biagio	2016-10-25	1	-0/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	debug loc. When indvars widened an induction variable, the debug location for the loop increment computation was incorrectly set equal to the debug loc of the loop latch terminator. This patch fixes the issue by propagating the correct location from the original loop increment instruction to the new widened increment. Differential Revision: https://reviews.llvm.org/D25872 llvm-svn: 285083
*	[EarlyCSE] Make MemorySSA memory dependency check more aggressive.	Geoff Berry	2016-10-25	1	-0/+39
\| \| \| \| \| \| \| \| \| \|	Now that MemorySSA keeps track of whether MemoryUses are optimized, use getClobberingMemoryAccess() to check MemoryUse memory dependencies since it should no longer be so expensive. This is a follow-up change to https://reviews.llvm.org/D25881 llvm-svn: 285080
*	[SystemZ] Do not use LOC(G) for volatile loads	Ulrich Weigand	2016-10-25	2	-0/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It is not safe to use LOAD ON CONDITION to implement access to a memory location marked "volatile", since the architecture leaves it unspecified whether or not an access happens if the condition is false. The current code already appears to care about that: def LOC : CondUnaryRSY<"loc", 0xEBF2, nonvolatile_load, GR32, 4>; Unfortunately, that "nonvolatile_load" operator is simply ignored by the CondUnaryRSY class, and there was no test to catch it. llvm-svn: 285077
*	[InstCombine] add test and code comment to show potentially misguided icmp ↵	Sanjay Patel	2016-10-25	1	-0/+13
\| \| \| \| \| \|	trunc transform llvm-svn: 285075
*	[X86][SSE] Add support for (V)PMOVSX* constant folding	Simon Pilgrim	2016-10-25	8	-36/+27
\| \| \| \| \| \| \| \| \| \|	We already have (V)PMOVZX* combining support, this is the beginning of handling (V)PMOVSX* similarly - other combines in combineVSZext can be generalized in future patches. This unearthed an interesting bug in that we were generating illegal build vectors on 32-bit targets - it was proving difficult to create a test for it from PMOVZX, but it fired immediately with PMOVSX. I've created a more general form of the existing getConstVector to handle these cases - ideally this should be handled in non-target-specific code but I couldn't find an equivalent. Differential Revision: https://reviews.llvm.org/D25874 llvm-svn: 285072
*	[InstCombine] fix checks for previous commit (r285069)	Sanjay Patel	2016-10-25	1	-6/+9
\| \| \| \| \| \|	Accidentally put in the hoped-for checks ahead of the transform! llvm-svn: 285070
*	[InstCombine] add tests for bitcast interference with min/max (PR28001)	Sanjay Patel	2016-10-25	1	-0/+60
\| \| \| \|	llvm-svn: 285069
*	[DAGCombine] Preserve shuffles when one of the vector operands is constant	Zvi Rackover	2016-10-25	1	-54/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Do not perform combines such as: vector_shuffle<4,1,2,3>(build_vector(Ud, C0, C1 C2), scalar_to_vector(X)) -> build_vector(X, C0, C1, C2) Keeping the shuffle allows lowering the constant build_vector to a materialized constant vector (such as a vector-load from the constant-pool or some other idiom). Reviewers: delena, igorb, spatel, mkuper, andreadb, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25524 llvm-svn: 285063
*	[AVX-512] Add support for creating SIGN_EXTEND_VECTOR_INREG and ↵	Craig Topper	2016-10-25	3	-27/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	ZERO_EXTEND_VECTOR_INREG for 512-bit vectors to support vpmovzxbq and vpmovsxbq. Summary: The one tricky thing about this is that the sign/zero_extend_inreg uses v64i8 as an input type which isn't legal without BWI support. Though the vpmovsxbq and vpmovzxbq instructions themselves don't require BWI. To support this we need to add custom lowering for ZERO_EXTEND_VECTOR_INREG with v64i8 input. This can mostly reuse the existing sign extend code with a couple checks for sign extend vs zero extend added. Reviewers: delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25594 llvm-svn: 285053
*	[InstCombine] auto-generate checks	Sanjay Patel	2016-10-25	1	-39/+66
\| \| \| \|	llvm-svn: 285046
*	[InstCombine] auto-generate checks	Sanjay Patel	2016-10-25	1	-74/+102
\| \| \| \|	llvm-svn: 285045
*	[llvm-cov] Do not print out the filename of the object file	Vedant Kumar	2016-10-25	8	-15/+14
\| \| \| \| \| \| \| \| \|	When we load coverage data from multiple objects, we don't have a way to attribute a source object to a function record. Printing out the object filename next to the source filename is already not very useful: soon, it'll actually become misleading. Stop printing out the filename now. llvm-svn: 285043
*	[InstCombine] regenerate some checks	Sanjay Patel	2016-10-24	1	-75/+86
\| \| \| \|	llvm-svn: 285036
*	Fix regression from my recent GlobalsAA fix.	Eli Friedman	2016-10-24	1	-0/+54
\| \| \| \| \| \| \| \| \| \| \| \| \|	There are two fixes here: one, AnalyzeUsesOfPointer can't return false until it has checked all the uses of the pointer. Two, if a global uses another global, we have to assume the address of the first global escapes. Fixes https://llvm.org/bugs/show_bug.cgi?id=30707 . Differential Revision: https://reviews.llvm.org/D25798 llvm-svn: 285034
*	[SelectionDAG] Update ComputeNumSignBits SRA/SHL handlers to accept scalar ↵	Simon Pilgrim	2016-10-24	1	-8/+2
\| \| \| \| \| \| \| \| \| \|	or vector splats Use isConstOrConstSplat helper. Also use APInt instead of getZExtValue directly to avoid out of range issues. llvm-svn: 285033
*	nother additional error check for an invalid Mach-O file	Kevin Enderby	2016-10-24	2	-0/+3
\| \| \| \| \| \| \|	when contained in a Mach-O universal file and the cputypes in both headers don’t match. llvm-svn: 285026
*	[x86] add tests for {-1,0,1} select of constants	Sanjay Patel	2016-10-24	1	-0/+93
\| \| \| \|	llvm-svn: 285005
*	[llvm] Remove redundant --check-prefix=CHECK from tests	Mandeep Singh Grang	2016-10-24	16	-16/+16
\| \| \| \| \| \| \| \|	Reviewers: MatzeB, mcrosier, rengolin Differential Revision: https://reviews.llvm.org/D25894 llvm-svn: 285003
*	add-discriminators: Fix handling of lexical scopes.	Adrian Prantl	2016-10-24	2	-4/+92
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes a bug in the handling of lexical scopes, when more than one scope is defined on the same line or functions are inlined into call sites that are on the same line as the function definition. This situation can easily happen in macro expansions. The problem is solved by introducing a SmallDenseMap<DIScope , DILexicalBlockFile , 1> that keeps track of all the different lexical scopes that share a line/file location. Fixes PR30681. llvm-svn: 284998
*	[PPC] Generate positive FP zero using xor insn instead of loading from ↵	Ehsan Amiri	2016-10-24	3	-10/+10
\| \| \| \| \| \| \| \| \| \| \|	constant area https://reviews.llvm.org/D23614 Currently we load +0.0 from constant area. That can change to be generated using XOR instruction. llvm-svn: 284995
*	Revert r284580+r284917. ("Synthesize TBB/TBH instructions")	Eli Friedman	2016-10-24	6	-127/+23
\| \| \| \| \| \| \|	The optimization has correctness issues, so reverting for now to fix tests on thumb1 targets. llvm-svn: 284993
*	[AArch64] Optionally use the Newton series for reciprocal estimation	Evandro Menezes	2016-10-24	2	-0/+376
\| \| \| \| \| \| \| \| \|	Add support for estimating the square root or its reciprocal and division or reciprocal using the combiner generic Newton series. Differential revision: https://reviews.llvm.org/D25291 llvm-svn: 284986
*	[EarlyCSE] Optimize MemoryPhis and reduce memory clobber queries w/ MemorySSA	Geoff Berry	2016-10-24	1	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When using MemorySSA, re-optimize MemoryPhis when removing a store since this may create MemoryPhis with all identical arguments. Also, when using MemorySSA to check if two MemoryUses are reading from the same version of the heap, use the defining access instead of calling getClobberingAccess, since the latter can currently result in many more AA calls. Once the MemorySSA use optimization tracking changes are done, we can remove this limitation, which should result in more loads being CSE'd. Reviewers: dberlin Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D25881 llvm-svn: 284984
*	[PPC] Better codegen for AND, ANY_EXT, SRL sequence	Ehsan Amiri	2016-10-24	1	-0/+29
\| \| \| \| \| \| \| \|	https://reviews.llvm.org/D24924 This improves the code generated for a sequence of AND, ANY_EXT, SRL instructions. This is a targetted fix for this special pattern. The pattern is generated by target independet dag combiner and so a more general fix may not be necessary. If we come across other similar cases, some ideas for handling it are discussed on the code review. llvm-svn: 284983
*	[x86] regenerate checks	Sanjay Patel	2016-10-24	1	-10/+11
\| \| \| \|	llvm-svn: 284982
*	AMDGPU: Fix Two Address problems with v_movreld	Nicolai Haehnle	2016-10-24	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The v_movreld machine instruction is used with three operands that are in a sense tied to each other (the explicit VGPR_32 def and the implicit VGPR_NN def and use). There is no way to express that using the currently available operand bits, and indeed there are cases where the Two Address instructions pass does the wrong thing. This patch introduces a new set of pseudo instructions that are identical in intended semantics as v_movreld, but they only have two tied operands. Having to add a new set of pseudo instructions is admittedly annoying, but it's a fairly straightforward and solid approach. The only alternative I see is to try to teach the Two Address instructions pass about Three Address instructions, and I'm afraid that's trickier and is going to end up more fragile. Note that v_movrels does not suffer from this problem, and so this patch does not touch it. This fixes several GL45-CTS.shaders.indexing.* tests. Reviewers: tstellarAMD, arsenm Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25633 llvm-svn: 284980
*	Revert 284971.	Nico Weber	2016-10-24	1	-23/+0
\| \| \| \| \| \| \| \| \|	It seems to break selfhost on some bots, see e.g. http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/21 http://lab.llvm.org:8011/builders/clang-ppc64be-linux-multistage/builds/20 http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/22 llvm-svn: 284979
*	[MC] Fix Various End Of Line Comment checkings	Nirav Dave	2016-10-24	3	-5/+265
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fix AsmParser lines to correctly handle end-of-line pre-processor comments parsing when '#' is not the assembly line comment prefix. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25567 llvm-svn: 284978
*	AArch64 ILP32 relocations for assembly and ELF	Joel Jones	2016-10-24	6	-70/+512
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add relocations for AArch64 ILP32. Includes: - Addition of definitions for R_AARCH32_* - Definition of new -target-abi: ilp32 - Definition of data layout string - Tests for added relocations. Not comprehensive, but matches existing tests for 64-bit. Renames "CHECK-OBJ" to "CHECK-OBJ-LP64". - Tests for llvm-readobj Reviewers: zatrazz, peter.smith, echristo, t.p.northover Subscribers: aemerson, rengolin, mehdi_amini Differential Revision: https://reviews.llvm.org/D25159 llvm-svn: 284973
*	[JumpThreading] Unfold selects that depend on the same condition	Pablo Barrio	2016-10-24	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: These are good candidates for jump threading. This enables later opts (such as InstCombine) to combine instructions from the selects with instructions out of the selects. SimplifyCFG will fold the select again if unfolding wasn't worth it. Patch by James Molloy and Pablo Barrio. Reviewers: reames, bkramer, mcrosier, gberry, haicheng, jmolloy, sebpop Subscribers: jojo, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D25477 llvm-svn: 284971
*	[mips] synci microMIPS instruction definition.	Simon Dardis	2016-10-24	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add synci to the microMIPS instruction definitions, mark the MIPS sync & synci as not being part of microMIPS. This does not cover the sync instruction alias, as that will be handled with a different patch. Add sync to the valid tests for microMIPS. Reviewers: vkalintiris Differential Revision: https://reviews.llvm.org/D25795 llvm-svn: 284962
*	[llvm-opt-report] Fix unroll-count reporting	Hal Finkel	2016-10-24	3	-0/+137
\| \| \| \| \| \| \|	Fix the implementation of OptReportLocationInfo's operator < so that contexts with different unroll counts are reported separately. llvm-svn: 284957
*	[AVX-512] Remove masked pmin/pmax intrinsics and autoupgrade to native IR.	Craig Topper	2016-10-24	8	-871/+836
\| \| \| \| \| \|	Clang patch to replace 512-bit vector and 64-bit element versions with native IR will follow. llvm-svn: 284955
*	[DAG] enhance computeKnownBits to handle SRL/SRA with vector splat constant	Sanjay Patel	2016-10-23	2	-21/+6
\| \| \| \|	llvm-svn: 284953
*	[CostModel][X86] Added tests for current integer signed/unsigned remainder costs	Simon Pilgrim	2016-10-23	1	-0/+116
\| \| \| \|	llvm-svn: 284940
*	[X86][SSE] Add SSE41/AVX1 costs for vector shifts.	Simon Pilgrim	2016-10-23	3	-109/+109
\| \| \| \| \| \|	We were defaulting to SSE2 costs which weren't taking into account the availability of PBLENDW/PBLENDVB to improve merging of per-element shift results. llvm-svn: 284939
*	[CostModel][X86] Added tests for current integer trunc costs	Simon Pilgrim	2016-10-23	1	-0/+141
\| \| \| \|	llvm-svn: 284938
*	[X86][AVX512VL] Added support for combining target 256-bit shuffles to ↵	Simon Pilgrim	2016-10-22	2	-55/+92
\| \| \| \| \| \|	AVX512VL VPERMV3 llvm-svn: 284922
*	[X86][AVX512] Added support for combining target shuffles to AVX512 VPERMV3	Simon Pilgrim	2016-10-22	2	-0/+72
\| \| \| \|	llvm-svn: 284921
*	[ARM] Fix crash in ConstantIslands	James Molloy	2016-10-22	1	-0/+43
\| \| \| \| \| \|	tPCRelJT may not be the first instruction in a block. Check that instead of dereferencing a broken iterator. llvm-svn: 284917
*	[X86] Apply the Update LLC Test Checks tool on the mmx-bitcast test	Zvi Rackover	2016-10-22	1	-10/+15
\| \| \| \|	llvm-svn: 284916
*	[X86] Add support for printing shuffle comments for VALIGN instructions.	Craig Topper	2016-10-22	2	-3/+12
\| \| \| \|	llvm-svn: 284915
*	[BasicAA] Fix - missed alias in GEP expressions	Gerolf Hoflehner	2016-10-22	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \|	In BasicAA GEP operand values get adjusted ("wrap-around") based on the pointersize. Otherwise, in non-64b modes, AA could report false negatives. However, a wrap-around is valid only for a fully evaluated expression. It had been introduced to fix an alias problem in http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160118/326163.html. This commit restricts the wrap-around to constant gep operands only where the value is known at compile-time. llvm-svn: 284908
*	[x86] add test for missing vector SRA combine via computeKnownBits	Sanjay Patel	2016-10-21	1	-0/+18
\| \| \| \|	llvm-svn: 284896