bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[InstCombine] move folds for shift-shift pairs; NFCI	Sanjay Patel	2017-02-01	1	-48/+34
\| \| \| \| \| \| \| \| \| \| \|	Although this is 'no-functional-change-intended', I'm adding tests for shl-shl and lshr-lshr pairs because there is no existing test coverage for those folds. It seems like we should be able to remove some code from foldShiftedShift() at this point because we're handling those patterns on the general path. llvm-svn: 293814
*	Shut up another GCC warning about operator precedence. NFC.	Michael Kuperstein	2017-02-01	1	-1/+1
\| \| \| \|	llvm-svn: 293812
*	[JumpThread] No need to erase BB from LoopHeaders. NFC.	Jun Bum Lim	2017-02-01	1	-14/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: No need to try to ease BB from LoopHeaders as we already know that BB is not in LoopHeaders. Reviewers: hsung, majnemer, mcrosier, haicheng, rengolin Reviewed By: rengolin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29232 llvm-svn: 293802
*	[IPSCCP] Don't propagate return values of functions marked as noinline.	Davide Italiano	2017-02-01	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This tries to address what Hal defined (in the post-commit review of r293727) a long-standing problem with noinline, where we end up de facto inlining trivial functions e.g. __attribute__((noinline)) int patatino(void) { return 5; } because of return value propagation. llvm-svn: 293799
*	[LV] Move interleaved access helper functions to VectorUtils (NFC)	Matthew Simpson	2017-02-01	1	-99/+3
\| \| \| \| \| \| \| \| \| \| \| \|	This patch moves some helper functions related to interleaved access vectorization out of LoopVectorize.cpp and into VectorUtils.cpp. We would like to use these functions in a follow-on patch that improves interleaved load and store lowering in (ARM/AArch64)ISelLowering.cpp. One of the functions was already duplicated there and has been removed. Differential Revision: https://reviews.llvm.org/D29398 llvm-svn: 293788
*	[InstCombine] Allow InstCombine to merge adjacent guards	Sanjoy Das	2017-02-01	1	-6/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If there are two adjacent guards with different conditions, we can remove one of them and include its condition into the condition of another one. This patch allows InstCombine to merge them by the following pattern: guard(a); guard(b) -> guard(a & b). Reviewers: reames, apilipenko, igor-laevsky, anna, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29378 llvm-svn: 293778
*	[LoopPredication] Add a new line to debug output in LoopPredication pass	Artur Pilipenko	2017-02-01	1	-1/+1
\| \| \| \|	llvm-svn: 293762
*	[LoopUnroll] Use addClonedBlockToLoopInfo to add loop header to LI (NFC).	Florian Hahn	2017-02-01	1	-11/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I have a similar patch up for review already (D29173). If you prefer I can squash them both together. Also I think there more potential for code sharing between LoopUnroll.cpp and LoopUnrollRuntime.cpp. Do you think patches for that would be worthwhile? Reviewers: mkuper, mzolotukhin Reviewed By: mkuper, mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29311 llvm-svn: 293758
*	SanitizerCoverage: Support sanitizer guard section on darwin	Justin Bogner	2017-02-01	1	-13/+30
\| \| \| \| \| \| \|	MachO's sections need a segment as well as a section name, and the section start and end symbols are spelled differently than on ELF. llvm-svn: 293733
*	[IPSCCP] Teach how to not propagate return values of naked functions.	Davide Italiano	2017-02-01	1	-1/+4
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29360 llvm-svn: 293727
*	InferAddressSpaces: Handle select	Matt Arsenault	2017-02-01	1	-0/+8
\| \| \| \| \| \| \|	This fails to handle some cases where one of the inputs is a constant to be fixed in a later commit. llvm-svn: 293723
*	InferAddressSpaces: Remove dead declaration	Matt Arsenault	2017-01-31	1	-1/+0
\| \| \| \|	llvm-svn: 293720
*	InferAddressSpaces: Avoid double map lookup	Matt Arsenault	2017-01-31	1	-6/+4
\| \| \| \|	llvm-svn: 293719
*	InferAddressSpaces: Fix broken casting of constants	Matt Arsenault	2017-01-31	1	-2/+7
\| \| \| \|	llvm-svn: 293718
*	NewGVN: Dead argument cleanup	Daniel Berlin	2017-01-31	1	-91/+63
\| \| \| \|	llvm-svn: 293708
*	NewGVN: Cleanup conditions to match reality	Daniel Berlin	2017-01-31	1	-13/+8
\| \| \| \|	llvm-svn: 293707
*	NewGVN: Add basic support for symbolic comparison evaluation	Daniel Berlin	2017-01-31	1	-3/+20
\| \| \| \|	llvm-svn: 293706
*	NewGVN: Formatting cleanup after lookupOperandLeader change	Daniel Berlin	2017-01-31	1	-6/+3
\| \| \| \|	llvm-svn: 293705
*	NewGVN: Remove the unsued two arguments from lookupOperandLeader.	Daniel Berlin	2017-01-31	1	-21/+17
\| \| \| \|	llvm-svn: 293704
*	NewGVN: Cleanup header files we are using.	Daniel Berlin	2017-01-31	1	-5/+0
\| \| \| \|	llvm-svn: 293703
*	[NewGVN] Preserve TargetLibraryInfo analysis.	Davide Italiano	2017-01-31	1	-0/+2
\| \| \| \| \| \| \|	We can maybe preserve more but this is a first step. Ack'ed by Danny on IRC. llvm-svn: 293694
*	Do not propagate DebugLoc across basic blocks	Taewook Oh	2017-01-31	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: DebugLoc shouldn't be propagated across basic blocks to prevent incorrect stepping and imprecise sample profile result. rL288903 addressed the wrong DebugLoc propagation issue by limiting the copy of DebugLoc when GVN removes a fully redundant load that is dominated by some other load. However, DebugLoc is still incorrectly propagated in the following example: ``` 1: extern int g; 2: 3: void foo(int x, int y, int z) { 4: if (x) 5: g = 0; 6: else 7: g = 1; 8: 9: int i = 0; 10: for ( ; i < y ; i++) 11: if (i > z) 12: g++; 13: } ``` Below is LLVM IR representation of the program before GVN: ``` @g = external local_unnamed_addr global i32, align 4 ; Function Attrs: nounwind uwtable define void @foo(i32 %x, i32 %y, i32 %z) local_unnamed_addr #0 !dbg !4 { entry: %not.tobool = icmp eq i32 %x, 0, !dbg !8 %.sink = zext i1 %not.tobool to i32, !dbg !8 store i32 %.sink, i32* @g, align 4, !tbaa !9 %cmp8 = icmp sgt i32 %y, 0, !dbg !13 br i1 %cmp8, label %for.body.preheader, label %for.end, !dbg !17 for.body.preheader: ; preds = %entry br label %for.body, !dbg !19 for.body: ; preds = %for.body.preheader, %for.inc %i.09 = phi i32 [ %inc4, %for.inc ], [ 0, %for.body.preheader ] %cmp1 = icmp sgt i32 %i.09, %z, !dbg !19 br i1 %cmp1, label %if.then2, label %for.inc, !dbg !21 if.then2: ; preds = %for.body %0 = load i32, i32* @g, align 4, !dbg !22, !tbaa !9 %inc = add nsw i32 %0, 1, !dbg !22 store i32 %inc, i32* @g, align 4, !dbg !22, !tbaa !9 br label %for.inc, !dbg !23 for.inc: ; preds = %for.body, %if.then2 %inc4 = add nuw nsw i32 %i.09, 1, !dbg !24 %exitcond = icmp ne i32 %inc4, %y, !dbg !13 br i1 %exitcond, label %for.body, label %for.end.loopexit, !dbg !17 for.end.loopexit: ; preds = %for.inc br label %for.end, !dbg !26 for.end: ; preds = %for.end.loopexit, %entry ret void, !dbg !26 } ``` where ``` !21 = !DILocation(line: 11, column: 9, scope: !15) !22 = !DILocation(line: 12, column: 8, scope: !20) !23 = !DILocation(line: 12, column: 7, scope: !20) !24 = !DILocation(line: 10, column: 20, scope: !25) ``` And below is after GVN: ``` @g = external local_unnamed_addr global i32, align 4 define void @foo(i32 %x, i32 %y, i32 %z) local_unnamed_addr !dbg !4 { entry: %not.tobool = icmp eq i32 %x, 0, !dbg !8 %.sink = zext i1 %not.tobool to i32, !dbg !8 store i32 %.sink, i32* @g, align 4, !tbaa !9 %cmp8 = icmp sgt i32 %y, 0, !dbg !13 br i1 %cmp8, label %for.body.preheader, label %for.end, !dbg !17 for.body.preheader: ; preds = %entry br label %for.body, !dbg !19 for.body: ; preds = %for.inc, %for.body.preheader %0 = phi i32 [ %1, %for.inc ], [ %.sink, %for.body.preheader ], !dbg !21 %i.09 = phi i32 [ %inc4, %for.inc ], [ 0, %for.body.preheader ] %cmp1 = icmp sgt i32 %i.09, %z, !dbg !19 br i1 %cmp1, label %if.then2, label %for.inc, !dbg !22 if.then2: ; preds = %for.body %inc = add nsw i32 %0, 1, !dbg !21 store i32 %inc, i32* @g, align 4, !dbg !21, !tbaa !9 br label %for.inc, !dbg !23 for.inc: ; preds = %if.then2, %for.body %1 = phi i32 [ %inc, %if.then2 ], [ %0, %for.body ] %inc4 = add nuw nsw i32 %i.09, 1, !dbg !24 %exitcond = icmp ne i32 %inc4, %y, !dbg !13 br i1 %exitcond, label %for.body, label %for.end.loopexit, !dbg !17 for.end.loopexit: ; preds = %for.inc br label %for.end, !dbg !26 for.end: ; preds = %for.end.loopexit, %entry ret void, !dbg !26 } ``` As you see, GVN removes the load in if.then2 block and creates a phi instruction in for.body for it. The problem is that DebugLoc of remove load instruction is propagated to the newly created phi instruction, which is wrong. rL288903 cannot handle this case because ValuesPerBlock.size() is not 1 in this example when the load is removed. Reviewers: aprantl, andreadb, wolfgangp Reviewed By: andreadb Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D29254 llvm-svn: 293688
*	[Instcombine] Combine consecutive identical fences	Davide Italiano	2017-01-31	2	-0/+10
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29314 llvm-svn: 293661
*	Don't combine stores to a swifterror pointer operand to a different type	Arnold Schwaighofer	2017-01-31	1	-1/+2
\| \| \| \|	llvm-svn: 293658
*	Explicitly promote indirect calls before sample profile annotation.	Dehao Chen	2017-01-31	1	-5/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In iterative sample pgo where profile is collected from PGOed binary, we may see indirect call targets promoted and inlined in the profile. Before profile annotation, we need to make this happen in order to annotate correctly on IR. This patch explicitly promotes these indirect calls and inlines them before profile annotation. Reviewers: xur, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29040 llvm-svn: 293657
*	fix formatting; NFC	Sanjay Patel	2017-01-31	5	-17/+17
\| \| \| \|	llvm-svn: 293652
*	[InstCombine] Make sure that LHS and RHS have the same type in	Silviu Baranga	2017-01-31	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	transformToIndexedCompare If they don't have the same type, the size of the constant index would need to be adjusted (and this wouldn't be always possible). Alternatively we could try the analysis with the initial RHS value, which would guarantee that the two sides have the same type. However it is unlikely that in practice this would pass our transformation requirements. Fixes PR31808 (https://llvm.org/bugs/show_bug.cgi?id=31808). llvm-svn: 293629
*	[LoopUnroll] Use addClonedBlockToLoopInfo to clone the top level loop (NFC)	Florian Hahn	2017-01-31	1	-14/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: rL293124 added the necessary infrastructure to properly add the cloned top level loop to LoopInfo, which means we do not have to do it manually in CloneLoopBlocks. @mkuper sorry for not pointing this out during my review of D29156, I just realized that today. Reviewers: mzolotukhin, chandlerc, mkuper Reviewed By: mkuper Subscribers: llvm-commits, mkuper Differential Revision: https://reviews.llvm.org/D29173 llvm-svn: 293615
*	InferAddressSpaces: Rename constant	Matt Arsenault	2017-01-31	1	-6/+6
\| \| \| \|	llvm-svn: 293594
*	InferAddressSpaces: Handle icmp	Matt Arsenault	2017-01-31	1	-8/+64
\| \| \| \|	llvm-svn: 293593
*	InferAddressSpaces: Support memory intrinsics	Matt Arsenault	2017-01-31	1	-14/+146
\| \| \| \|	llvm-svn: 293587
*	InferAddressSpaces: Support atomics	Matt Arsenault	2017-01-31	1	-16/+44
\| \| \| \|	llvm-svn: 293584
*	InferAddressSpaces: Don't replace volatile users	Matt Arsenault	2017-01-31	1	-2/+5
\| \| \| \|	llvm-svn: 293582
*	NVPTX: Move InferAddressSpaces to generic code	Matt Arsenault	2017-01-31	3	-0/+612
\| \| \| \|	llvm-svn: 293579
*	[InstCombine] enable (X <<nsw C1) >>s C2 --> X <<nsw (C1 - C2) for vectors ↵	Sanjay Patel	2017-01-30	1	-54/+19
\| \| \| \| \| \|	with splat constants llvm-svn: 293570
*	[ICP] Fix bool conversion warning and actually write out the reason instead ↵	Benjamin Kramer	2017-01-30	1	-1/+1
\| \| \| \| \| \|	of dropping it. llvm-svn: 293564
*	[InstCombine] enable more lshr(shl X, C1), C2 folds for vectors with splat ↵	Sanjay Patel	2017-01-30	1	-23/+17
\| \| \| \| \| \|	constants llvm-svn: 293562
*	Expose isLegalToPromot as a global helper function so that SamplePGO pass ↵	Dehao Chen	2017-01-30	1	-46/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	can call it for legality check. Summary: SamplePGO needs to check if it is legal to promote a target before it actually promotes it. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29306 llvm-svn: 293559
*	Revert r292979 which causes compile time failure.	Dehao Chen	2017-01-30	1	-19/+5
\| \| \| \|	llvm-svn: 293557
*	LSR: Don't drop address space when type doesn't match	Matt Arsenault	2017-01-30	1	-4/+7
\| \| \| \| \| \| \| \| \| \|	For targets with different addressing modes in each address space, if this is dropped querying isLegalAddressingMode later with this will give a nonsense result, breaking the isLegalUse assertions. This is a candidate for the 4.0 release branch. llvm-svn: 293542
*	[InstCombine] enable (X >>?exact C1) << C2 --> X >>?exact (C1-C2) for ↵	Sanjay Patel	2017-01-30	1	-24/+22
\| \| \| \| \| \|	vectors with splat constants llvm-svn: 293524
*	NewGVN: Instead of changeToUnreachable, insert an instruction SimplifyCFG ↵	Daniel Berlin	2017-01-30	1	-4/+5
\| \| \| \| \| \|	will turn into unreachable when it runs llvm-svn: 293515
*	[InstCombine] use auto with obvious type; NFC	Sanjay Patel	2017-01-30	1	-3/+3
\| \| \| \|	llvm-svn: 293508
*	[InstCombine] enable (X <<nsw C1) >>s C2 --> X <<nsw (C1-C2) for vectors ↵	Sanjay Patel	2017-01-30	1	-20/+16
\| \| \| \| \| \|	with splat constants llvm-svn: 293507
*	Revert "NewGVN: Make unreachable blocks be marked with unreachable"	Daniel Berlin	2017-01-30	1	-13/+18
\| \| \| \| \| \| \| \| \|	This reverts commit r293196 Besides making things look nicer, ATM, we'd like to preserve analysis more than we'd like to destroy the CFG. We'll probably revisit in the future llvm-svn: 293501
*	[InstCombine] fixed to propagate 'exact' on lshr	Sanjay Patel	2017-01-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original shift is bigger, so this may qualify as 'obvious', but here's an attempt at an Alive-based proof: Name: exact Pre: (C1 u< C2) %a = shl i8 %x, C1 %b = lshr exact i8 %a, C2 => %c = lshr exact i8 %x, C2 - C1 %b = and i8 %c, ((1 << width(C1)) - 1) u>> C2 Optimization is correct! llvm-svn: 293498
*	[Coroutines] Add header guard to header that's missing one.	Benjamin Kramer	2017-01-30	1	-0/+5
\| \| \| \|	llvm-svn: 293494
*	[Inliner] Fold analysis remarks into missed remarks	Adam Nemet	2017-01-30	1	-15/+12
\| \| \| \| \| \|	This significantly reduces the noise level of these messages. llvm-svn: 293492
*	[Inliner] Fix a comment to match the code. NFC.	Haicheng Wu	2017-01-30	1	-2/+2
\| \| \| \| \| \| \| \|	TotalAltCost => TotalSecondaryCost Differential Revision: https://reviews.llvm.org/D29231 llvm-svn: 293490
*	[InstCombine] enable lshr(shl X, C1), C2 folds for vectors with splat constants	Sanjay Patel	2017-01-30	1	-25/+25
\| \| \| \|	llvm-svn: 293489