bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	ThinLTO: do not import function whose linkage prevents inlining.	Mehdi Amini	2016-05-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268341
*	Revert "ThinLTO: do not import function whose linkage prevents inlining."	Mehdi Amini	2016-05-02	1	-2/+2
\| \| \| \| \| \| \|	This reverts commit r268315, the tests are not passing. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268317
*	ThinLTO: do not import function whose linkage prevents inlining.	Mehdi Amini	2016-05-02	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	There is not point in importing a "weak" or a "linkonce" function since we won't be able to inline it anyway. We already had a targeted check for WeakAny, this is using the same check on GlobalValue as the inline, i.e. isMayBeOverriddenLinkage() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268315
*	Code refactoring -- preparation for new PM porting /NFC	Xinliang David Li	2016-05-02	1	-9/+19
\| \| \| \|	llvm-svn: 268301
*	Revert "[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for ↵	Reid Kleckner	2016-05-02	2	-153/+59
\| \| \| \| \| \| \| \| \| \| \|	empty block including lifetime intrinsics" This reverts commit r268254. This change causes assertion failures while building Chromium. Reduced test case coming soon. llvm-svn: 268288
*	Typo. NFC.	Chad Rosier	2016-05-02	1	-1/+1
\| \| \| \|	llvm-svn: 268280
*	Use false rather than 0 for a boolean value. NFC.	Chad Rosier	2016-05-02	1	-1/+1
\| \| \| \|	llvm-svn: 268279
*	ReversePostOrderFunctionAttrs is not modifying the call graph, let's ↵	Mehdi Amini	2016-05-02	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	preserve it. When running cc1 with -flto=thin, it is followed by GlobalOpt, which requires the callgraph. This saves rebuilding one. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268266
*	[SimplifyCFG] Extend TryToSimplifyUncondBranchFromEmptyBlock for empty block ↵	Hans Wennborg	2016-05-02	2	-59/+153
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	including lifetime intrinsics Make it possible that TryToSimplifyUncondBranchFromEmptyBlock merges empty basic block including lifetime intrinsics as well as phi nodes and unconditional branch into its successor or predecessor(s). If successor of empty block has single predecessor, all contents including lifetime intrinsics are sinked into the successor. Otherwise, they are hoisted into its predecessor(s) and then merged into the predecessor(s). Patch by Josh Yoon <josh.yoon@samsung.com>! Differential Revision: http://reviews.llvm.org/D19257 llvm-svn: 268254
*	Move createReversePostOrderFunctionAttrsPass right after the inliner is done	Mehdi Amini	2016-05-02	1	-8/+8
\| \| \| \| \| \| \| \| \| \|	This is where it was originally, until LoopVersioningLICM was inserted before in r259986, I don't believe it was on purpose. Differential Revision: http://reviews.llvm.org/D19809 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 268252
*	[LLE] Fix typo from r263058	Adam Nemet	2016-05-02	1	-1/+1
\| \| \| \| \| \| \| \|	This was meant to check unit stride for both the load and the store. Thanks to Roman Shirokiy for noticing this. llvm-svn: 268251
*	[InstCombine][SSE] Added support to VPERMD/VPERMPS to shuffle combine to ↵	Simon Pilgrim	2016-05-01	1	-8/+13
\| \| \| \| \| \|	accept UNDEF elements. llvm-svn: 268206
*	[InstCombine][SSE] Added support to VPERMILVAR to shuffle combine to accept ↵	Simon Pilgrim	2016-05-01	1	-20/+27
\| \| \| \| \| \|	UNDEF elements. llvm-svn: 268204
*	[InstCombine][SSE] Added support to PSHUFB to shuffle combine to accept ↵	Simon Pilgrim	2016-05-01	1	-16/+17
\| \| \| \| \| \|	UNDEF elements. llvm-svn: 268202
*	[InstCombine][AVX2] Combine VPERMD/VPERMPS intrinsics with constant masks to ↵	Simon Pilgrim	2016-05-01	1	-0/+37
\| \| \| \| \| \|	shufflevector. llvm-svn: 268199
*	[ASan] Add shadow offset for SystemZ.	Marcin Koscielnicki	2016-04-30	1	-2/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SystemZ on Linux currently has 53-bit address space. In theory, the hardware could support a full 64-bit address space, but that's not supported due to kernel limitations (it'd require 5-level page tables), and there are no plans for that. The default process layout stays within first 4TB of address space (to avoid creating 4-level page tables), so any offset >= (1 << 42) is fine. Let's use 1 << 52 here, ie. exactly half the address space. I've originally used 7 << 50 (uses top 1/8th of the address space), but ASan runtime assumes there's some space after the shadow area. While this is fixable, it's simpler to avoid the issue entirely. Also, I've originally wanted to have the shadow aligned to 1/8th the address space, so that we can use OR like X86 to assemble the offset. I no longer think it's a good idea, since using ADD enables us to load the constant just once and use it with register + register indexed addressing. Differential Revision: http://reviews.llvm.org/D19650 llvm-svn: 268161
*	[InstCombine][AVX] VPERMILVAR to shuffle combine to use general aggregate ↵	Simon Pilgrim	2016-04-30	1	-18/+20
\| \| \| \| \| \| \| \|	elements. NFCI. Make use of Constant::getAggregateElement instead of checking constant types - first step towards adding support for UNDEF mask elements. llvm-svn: 268158
*	[LowerGuardIntrinsics] Keep track of !make.implicit metadata	Sanjoy Das	2016-04-30	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	If a guard call being lowered by LowerGuardIntrinsics has the `!make.implicit` metadata attached, then reattach the metadata to the branch in the resulting expanded form of the intrinsic. This allows us to implement null checks as guards and still get the benefit of implicit null checks. llvm-svn: 268148
*	Reroll loops with multiple IV and negative step part 3	Lawrence Hu	2016-04-30	1	-9/+155
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	support multiple induction variables This patch enable loop reroll for the following case: for(int i=0; i<N; i += 2) { S += a++; S += a++; }; Differential Revision: http://reviews.llvm.org/D16550 llvm-svn: 268147
*	[LowerGuardIntrinsics] Preserve calling conv when lowering	Sanjoy Das	2016-04-30	1	-0/+2
\| \| \| \|	llvm-svn: 268142
*	Reapply r268107 after fixing a bug breaks debug build.	Xinliang David Li	2016-04-29	1	-70/+80
\| \| \| \| \| \|	Makes the new method to set data needed by debug dump. llvm-svn: 268130
*	Mark guards on true as "trivially dead"	Sanjoy Das	2016-04-29	2	-11/+8
\| \| \| \| \| \| \| \| \|	This moves some logic added to EarlyCSE in rL268120 into `llvm::isInstructionTriviallyDead`. Adds a test case for DCE to demonstrate that passes other than EarlyCSE can now pick up on the new information. llvm-svn: 268126
*	[EarlyCSE] Simplify guard intrinsics	Sanjoy Das	2016-04-29	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change teaches EarlyCSE some basic properties of guard intrinsics: - Guard intrinsics read all memory, but don't write to any memory - After a guard has executed, the condition it was guarding on can be assumed to be true - Guard intrinsics on a constant `true` are no-ops Reviewers: reames, hfinkel Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19578 llvm-svn: 268120
*	Revert r268107 -- debug build failure	Xinliang David Li	2016-04-29	1	-78/+70
\| \| \| \|	llvm-svn: 268116
*	[InstCombine][SSE] PSHUFB to shuffle combine to use general aggregate ↵	Simon Pilgrim	2016-04-29	1	-17/+23
\| \| \| \| \| \| \| \|	elements. NFCI. Make use of Constant::getAggregateElement instead of checking constant types - first step towards adding support for UNDEF mask elements. llvm-svn: 268115
*	[inliner]: Refactor inline deferring logic into its own method /NFC	Xinliang David Li	2016-04-29	1	-70/+78
\| \| \| \| \| \| \| \|	The implemented heuristic has a large body of code which better sits in its own function for better readability. It also allows adding more heuristics easier in the future. llvm-svn: 268107
*	[InstCombine] Determine the result of a select based on a dominating condition.	Chad Rosier	2016-04-29	1	-0/+18
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19550 llvm-svn: 268104
*	[InstCombine] clean up; NFC	Sanjay Patel	2016-04-29	1	-1/+1
\| \| \| \|	llvm-svn: 268099
*	[MemorySSA] Fix bugs in walker; refactor unittests a bit.	George Burgess IV	2016-04-29	1	-8/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes two somewhat related bugs in MemorySSA's caching walker. These bugs were found because D19695 brought up the problem that we'd have defs cached to themselves, which is incorrect. The bugs this fixes are: - We would sometimes skip the nearest clobber of a MemoryAccess, because we would query our cache for a given potential clobber before checking if the potential clobber is the clobber we're looking for. The cache entry for the potential clobber would point to the nearest clobber of the potential clobber, so if that was a cache hit, we'd ignore the potential clobber entirely. - There are times (sometimes in DFS, sometimes in the getClobbering... functions) where we would insert cache entries that say a def clobbers itself. There's a bit of common code between the fixes for the bugs, so they aren't split out into multiple commits. This patch also adds a few unit tests, and refactors existing tests a bit to reduce the duplication of setup code. llvm-svn: 268087
*	Do not read callee name when matching IR to profile as it is not used.	Dehao Chen	2016-04-29	1	-8/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Callee name is not used to identify a callsite now, so do not read it during annotation. Reviewers: davidxl, dnovillo Subscribers: dnovillo, danielcdh, llvm-commits Differential Revision: http://reviews.llvm.org/D19704 llvm-svn: 268069
*	[InstCombine] add helper function for ICmp with constant canonicalization; NFCI	Sanjay Patel	2016-04-29	1	-24/+38
\| \| \| \| \| \| \|	As suggested in http://reviews.llvm.org/D17859 , we should enhance this to support vectors. llvm-svn: 268059
*	Unify XDEBUG and EXPENSIVE_CHECKS (into the latter), and add an option to ↵	Filipe Cabecinhas	2016-04-29	3	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the cmake build to enable them. Summary: Historically, we had a switch in the Makefiles for turning on "expensive checks". This has never been ported to the cmake build, but the (dead-ish) code is still around. This will also make it easier to turn it on in buildbots. Reviewers: chandlerc Subscribers: jyknight, mzolotukhin, RKSimon, gberry, llvm-commits Differential Revision: http://reviews.llvm.org/D19723 llvm-svn: 268050
*	[GlobalOpt] Propagate operand bundles	David Majnemer	2016-04-29	1	-5/+10
\| \| \| \| \| \| \|	We neglected to transfer operand bundles for some transforms. These were found via inspection, I'll try to come up with some test cases. llvm-svn: 268011
*	[InstCombine] Propagate operand bundles	David Majnemer	2016-04-29	2	-3/+9
\| \| \| \| \| \| \|	We neglected to transfer operand bundles for some transforms. These were found via inspection, I'll try to come up with some test cases. llvm-svn: 268010
*	[DeadArgumentElimination] Propagate operand bundles to promoted call sites	David Majnemer	2016-04-29	1	-4/+10
\| \| \| \| \| \| \|	We neglected to transfer operand bundles when performing argument promotion. llvm-svn: 268008
*	[LoopDist] Also emit optimization remark on success (-Rpass=)	Adam Nemet	2016-04-29	1	-0/+3
\| \| \| \| \| \| \|	The option -Rpass=loop-distribute now reports the loops that were distributed. llvm-svn: 268006
*	[LoopDist] Pass 'Function' to main class. NFC	Adam Nemet	2016-04-29	1	-6/+8
\| \| \| \| \| \|	Next patch will add another use for 'Function' inside the class. llvm-svn: 268005
*	[SLPVectorizer] Add operand bundles to vectorized functions	David Majnemer	2016-04-29	1	-2/+16
\| \| \| \| \| \| \|	SLPVectorizing a call site should result in further propagation of its bundles. llvm-svn: 268004
*	[LoopVectorize] Add operand bundles to vectorized functions	David Majnemer	2016-04-29	1	-5/+7
\| \| \| \| \| \| \|	Also, do not crash when calculating a cost model for loop-invariant token values. llvm-svn: 268003
*	[ArgumentPromotion] Propagate operand bundles to promoted call sites	David Majnemer	2016-04-29	1	-2/+5
\| \| \| \| \| \| \| \| \|	We neglected to transfer operand bundles when performing argument promotion. This fixes PR27568. llvm-svn: 267986
*	[PR25281] Remove AAResultsWrapper from preserved analyses of loop vectorizer.	Michael Zolotukhin	2016-04-29	1	-1/+0
\| \| \| \| \| \| \|	We don't preserve AAResults, because, for one, we don't preserve SCEV-AA. That fixes PR25281. llvm-svn: 267980
*	Fix build by casting to the proper int type.	Ivan Krasin	2016-04-29	1	-1/+1
\| \| \| \| \| \| \| \|	Reviewers: eugenis Differential Revision: http://reviews.llvm.org/D19706 llvm-svn: 267974
*	[LoopVectorize] Keep hints from original loop on the vector loop	Hal Finkel	2016-04-29	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We need to keep loop hints from the original loop on the new vector loop. Failure to do this meant that, for example: void foo(int *b) { #pragma clang loop unroll(disable) for (int i = 0; i < 16; ++i) b[i] = 1; } this loop would be unrolled. Why? Because we'd vectorize it, thus dropping the hints that unrolling should be disabled, and then we'd unroll it. llvm-svn: 267970
*	[msan] Handle vector compare x86 intrinsics.	Evgeniy Stepanov	2016-04-29	1	-0/+69
\| \| \| \| \| \|	This handles SSE and SSE2 cmp_* and comiXX_* intrinsics. llvm-svn: 267966
*	[LoopDist] Emit optimization remarks (-Rpass*)	Adam Nemet	2016-04-28	1	-0/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I closely followed the precedents set by the vectorizer: * With -Rpass-missed, the loop is reported with further details pointing to -Rpass--analysis. * -Rpass-analysis reports the details why distribution has failed. * Regardless of -Rpass*, when distribution fails for a loop where distribution was forced with the pragma, a warning is produced according to -Wpass-failed. In this case the analysis info is also printed even without -Rpass-analysis. llvm-svn: 267952
*	[LoopDist] Improve debug messages	Adam Nemet	2016-04-28	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \|	The next patch will start using these for -Rpass-analysis so they won't be internal-only anymore. Move the 'Skipping; ' prefix that some of the message are using into the 'fail' function. We don't want to include this prefix in the -Rpass-analysis report. llvm-svn: 267951
*	[LoopDist] Add helper to print debug message when distribution fails. NFC	Adam Nemet	2016-04-28	1	-23/+20
\| \| \| \| \| \|	This will form the basis to emit optimization remarks (-Rpass*). llvm-svn: 267950
*	[Inliner] Preserve llvm.mem.parallel_loop_access metadata	Hal Finkel	2016-04-28	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When inlining a call site with llvm.mem.parallel_loop_access metadata, this metadata needs to be propagated to all cloned memory-accessing instructions. Otherwise, inlining parts of the loop body will invalidate the annotation. With this functionality, we now vectorize the following as expected: void Body(int res, int c, int d, int p, int i) { res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; } void Test(int res, int c, int d, int p, int n) { int i; #pragma clang loop vectorize(assume_safety) for (i = 0; i < 1600; i++) { Body(res, c, d, p, i); } } llvm-svn: 267949
*	[PGO] Fix incorrect Twine usage in emitting optimization remarks.	Rong Xu	2016-04-28	1	-9/+8
\| \| \| \| \| \| \|	Should not store Twine objects to local variables. This is fixed the test failures with r267815 in VS2015 X64 build. llvm-svn: 267908
*	Minor format change and fixing typos in the comments. NFC.	Rong Xu	2016-04-28	1	-10/+7
\| \| \| \|	llvm-svn: 267905