bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86] Lowering sqrt intrinsics to native IR	Tomasz Krupa	2018-06-15	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Complementary patch to lowering sqrt intrinsics in Clang. Reviewers: craig.topper, spatel, RKSimon, DavidKreitzer, uriel.k Reviewed By: craig.topper Subscribers: tkrupa, mike.dvoretsky, llvm-commits Differential Revision: https://reviews.llvm.org/D41599 llvm-svn: 334849
*	[InstCombine] Avoid iteration/mutation conflict	Joseph Tremoulet	2018-06-15	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When iterating users of a multiply in processUMulZExtIdiom, the call to setOperand in the truncation case may replace the use being visited; make sure the iterator has been advanced before doing that replacement. Reviewers: majnemer, davide Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48192 llvm-svn: 334844
*	[LV] Prevent LV to run cost model twice for VF=2	Diego Caballero	2018-06-15	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a minor fix for LV cost model, where the cost for VF=2 was computed twice when the vectorization of the loop was forced without specifying a VF. Reviewers: xusx595, hsaito, fhahn, mkuper Reviewed By: hsaito, xusx595 Differential Revision: https://reviews.llvm.org/D48048 llvm-svn: 334840
*	Re-apply "[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue"	Bjorn Pettersson	2018-06-15	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is r334704 (which was reverted in r334732) with a fix for types like x86_fp80. We need to use getTypeAllocSizeInBits and not getTypeStoreSizeInBits to avoid dropping debug info for such types. Original commit msg: > Summary: > Do not convert a DbgDeclare to DbgValue if the store > instruction only refer to a fragment of the variable > described by the DbgDeclare. > > Problem was seen when for example having an alloca for an > array or struct, and there were stores to individual elements. > In the past we inserted a DbgValue intrinsics for each store, > just as if the store wrote the whole variable. > > When handling store instructions we insert a DbgValue that > indicates that the variable is "undefined", as we do not know > which part of the variable that is updated by the store. > > When ConvertDebugDeclareToDebugValue is used with a load/phi > instruction we assert that the referenced value is large enough > to cover the whole variable. Afaict this should be true for all > scenarios where those methods are used on trunk. If the assert > blows in the future I guess we could simply skip to insert a > dbg.value instruction. > > In the future I think we should examine which part of the variable > that is accessed, and add a DbgValue instrinsic with an appropriate > DW_OP_LLVM_fragment expression. > > Reviewers: dblaikie, aprantl, rnk > > Reviewed By: aprantl > > Subscribers: JDevlieghere, llvm-commits > > Tags: #debug-info > > Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334830
*	[InstCombine] Recommit: Fold (x << y) >> y -> x & (-1 >> y)	Roman Lebedev	2018-06-15	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We already do it for splat constants, but not just values. Also, undef cases are mostly non-functional. The original commit was reverted because it broke tests for amdgpu backend, which i didn't check. Now, the backed was updated to recognize these new patterns, so we are good. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX Reviewers: spatel, craig.topper, mareko, bogner, rampitec, nhaehnle, arsenm Reviewed By: spatel, rampitec, nhaehnle Subscribers: wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D47980 llvm-svn: 334818
*	Revert rL334704: "[DebugInfo] Check size of variable in ↵	Bjorn Pettersson	2018-06-14	1	-38/+0
\| \| \| \| \| \| \| \| \| \|	ConvertDebugDeclareToDebugValue" This reverts commit r334704. Buildbots detected an assertion in "test tsan in debug compiler-rt build". llvm-svn: 334732
*	[EarlyCSE] Fix MSVC build. NFCI.	Simon Pilgrim	2018-06-14	1	-9/+5
\| \| \| \| \| \|	MSVC doesn't let you assign different lambdas through a ternary operator. llvm-svn: 334715
*	[EarlyCSE] Propagate conditions of AND and OR instructions	Max Kazantsev	2018-06-14	1	-14/+43
\| \| \| \| \| \| \| \| \| \| \|	This patches teaches EarlyCSE to figure out that if `and i1 %x, %y` is true then both `%x` and `%y` are true in the taken branch, and if `or i1 %x, %y` is false then both `%x` and `%y` are false in non-taken branch. Fix for PR37635. Differential Revision: https://reviews.llvm.org/D47574 Reviewed By: reames llvm-svn: 334707
*	[DebugInfo] Check size of variable in ConvertDebugDeclareToDebugValue	Bjorn Pettersson	2018-06-14	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Do not convert a DbgDeclare to DbgValue if the store instruction only refer to a fragment of the variable described by the DbgDeclare. Problem was seen when for example having an alloca for an array or struct, and there were stores to individual elements. In the past we inserted a DbgValue intrinsics for each store, just as if the store wrote the whole variable. When handling store instructions we insert a DbgValue that indicates that the variable is "undefined", as we do not know which part of the variable that is updated by the store. When ConvertDebugDeclareToDebugValue is used with a load/phi instruction we assert that the referenced value is large enough to cover the whole variable. Afaict this should be true for all scenarios where those methods are used on trunk. If the assert blows in the future I guess we could simply skip to insert a dbg.value instruction. In the future I think we should examine which part of the variable that is accessed, and add a DbgValue instrinsic with an appropriate DW_OP_LLVM_fragment expression. Reviewers: dblaikie, aprantl, rnk Reviewed By: aprantl Subscribers: JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D48024 llvm-svn: 334704
*	[SLPVectorizer] Remove RawInstructionsData/getMainOpcode and merge into ↵	Simon Pilgrim	2018-06-14	1	-49/+20
\| \| \| \| \| \| \| \| \| \| \| \|	getSameOpcode This is part of the work to cleanup use of 'alternate' ops so we can use the more general SK_Select shuffle type. Only getSameOpcode calls getMainOpcode and much of the logic is repeated in both functions. This will require some reworking of D28907 but that patch has hit trouble and is unlikely to be completed anytime soon. Differential Revision: https://reviews.llvm.org/D48120 llvm-svn: 334701
*	[NFC] fix trivial typos in comments	Hiroshi Inoue	2018-06-14	14	-31/+31
\| \| \| \|	llvm-svn: 334687
*	[WinASan] Don't instrument globals in sections containing '$'	Reid Kleckner	2018-06-13	1	-5/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	Such globals are very likely to be part of a sorted section array, such the .CRT sections used for dynamic initialization. The uses its own sorted sections called ATL$__a, ATL$__m, and ATL$__z. Instead of special casing them, just look for the dollar sign, which is what invokes linker section sorting for COFF. Avoids issues with ASan and the ATL uncovered after we started instrumenting comdat globals on COFF. llvm-svn: 334653
*	[SLPVectorizer] getSameOpcode - remove useless cast [NFC]	Simon Pilgrim	2018-06-13	1	-3/+2
\| \| \| \| \| \|	There's no need to cast the base Value to an Instruction llvm-svn: 334588
*	[SLPVectorizer] getSameOpcode - remove unusued alternate code [NFC]	Simon Pilgrim	2018-06-13	1	-4/+1
\| \| \| \| \| \|	We early-out for the case where we don't use alternate opcodes, so no need to check for it later. llvm-svn: 334587
*	[SimplifyIndVars] Ignore dead users	Max Kazantsev	2018-06-13	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	IndVarSimplify sometimes makes transforms basing on users that are trivially dead. In particular, if DCE wasn't run before it, there may be a dead `sext/zext` in loop that will trigger widening transforms, however it makes no sense to do it. This patch teaches IndVarsSimplify ignore the mist trivial cases of that. Differential Revision: https://reviews.llvm.org/D47974 Reviewed By: sanjoy llvm-svn: 334567
*	[CostModel] Replace ShuffleKind::SK_Alternate with ShuffleKind::SK_Select ↵	Simon Pilgrim	2018-06-12	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR33744) As discussed on PR33744, this patch relaxes ShuffleKind::SK_Alternate which requires shuffle masks to only match an alternating pattern from its 2 sources: e.g. v4f32: <0,5,2,7> or <4,1,6,3> This seems far too restrictive as most SIMD hardware which will implement it using a general blend/bit-select instruction, so replaces it with SK_Select, permitting elements from either source as long as they are inline: e.g. v4f32: <0,5,2,7>, <4,1,6,3>, <0,1,6,7>, <4,1,2,3> etc. This initial patch just updates the name and cost model shuffle mask analysis, later patch reviews will update SLP to better utilise this - it still limits itself to SK_Alternate style patterns. Differential Revision: https://reviews.llvm.org/D47985 llvm-svn: 334513
*	Use SmallPtrSet explicitly for SmallSets with pointer types (NFC).	Florian Hahn	2018-06-12	15	-39/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently SmallSet<PointerTy> inherits from SmallPtrSet<PointerTy>. This patch replaces such types with SmallPtrSet, because IMO it is slightly clearer and allows us to get rid of unnecessarily including SmallSet.h Reviewers: dblaikie, craig.topper Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D47836 llvm-svn: 334492
*	[SampleFDO] Add a new compact binary format for sample profile.	Wei Mi	2018-06-11	1	-3/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Name table occupies a big chunk of size in current binary format sample profile. In order to reduce its size, the patch changes the sample writer/reader to save/restore MD5Hash of names in the name table. Sample annotation phase will also use MD5Hash of name to query samples accordingly. Experiment shows compact binary format can reduce the size of sample profile by 2/3 compared with binary format generally. Differential Revision: https://reviews.llvm.org/D47955 llvm-svn: 334447
*	Revert rL334371 / D47980: "[InstCombine] Fold (x << y) >> y -> x & (-1 >> y)"	Roman Lebedev	2018-06-10	1	-9/+0
\| \| \| \| \| \| \|	test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll broke, and i did not notice because i did not build that backend. llvm-svn: 334373
*	[InstCombine] Fold (x >> y) << y -> x & (-1 << y)	Roman Lebedev	2018-06-10	1	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We already do it for matching splat constants, but not just values. Further improvements for non-matching splat constants, as noted in https://reviews.llvm.org/D46760#1123713 will be needed, but i'd prefer to do that as a follow-up. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX https://rise4fun.com/Alive/0HF Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47981 llvm-svn: 334372
*	[InstCombine] Fold (x << y) >> y -> x & (-1 >> y)	Roman Lebedev	2018-06-10	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We already do it for splat constants, but not just values. Also, undef cases are mostly non-functional. https://bugs.llvm.org/show_bug.cgi?id=37603 https://rise4fun.com/Alive/cplX Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47980 llvm-svn: 334371
*	[X86] Remove masking from the 512-bit masked floating point add/sub/mul/div ↵	Craig Topper	2018-06-10	1	-50/+17
\| \| \| \| \| \|	intrinsics. Use a select in IR instead. llvm-svn: 334358
*	Use SmallPtrSet instead of SmallSet in places where we iterate over the set.	Craig Topper	2018-06-09	4	-6/+6
\| \| \| \| \| \| \| \|	SmallSet forwards to SmallPtrSet for pointer types. SmallPtrSet supports iteration, but a normal SmallSet doesn't. So if it wasn't for the forwarding, this wouldn't work. These places were found by hiding the begin/end methods in the SmallSet forwarding llvm-svn: 334343
*	[InstCombine] Skip dbg.value(s) when looking at stack{save,restore}.	Davide Italiano	2018-06-08	1	-1/+8
\| \| \| \| \| \|	Fixes PR37713. llvm-svn: 334317
*	[asan] Instrument comdat globals on COFF targets	Reid Kleckner	2018-06-08	1	-8/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we can use comdats, then we can make it so that the global metadata is thrown away if the prevailing definition of the global was uninstrumented. I have only tested this on COFF targets, but in theory, there is no reason that we cannot also do this for ELF. This will allow us to re-enable string merging with ASan on Windows, reducing the binary size cost of ASan on Windows. Reviewers: eugenis, vitalybuka Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47841 llvm-svn: 334313
*	[VPlan] Move recipe construction to VPRecipeBuilder.	Florian Hahn	2018-06-08	4	-153/+218
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch moves the recipe-creation functions out of LoopVectorizationPlanner, which should do the high-level orchestration of the transformations. Reviewers: dcaballe, rengolin, hsaito, Ayal Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D47595 llvm-svn: 334305
*	reapply r334209 with fixes for harfbuzz in Chromium	Daniil Fukalov	2018-06-08	1	-16/+26
\| \| \| \| \| \| \| \| \| \| \|	r334209 description: [LSR] Check yet more intrinsic pointer operands the patch fixes another assertion in isLegalUse() Differential Revision: https://reviews.llvm.org/D47794 llvm-svn: 334300
*	[VPlan] Move recipe based VPlan generation to separate function.	Florian Hahn	2018-06-08	2	-41/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This first step separates VPInstruction-based and VPRecipe-based VPlan creation, which should make it easier to migrate to VPInstruction based code-gen step by step. Reviewers: Ayal, rengolin, dcaballe, hsaito, mkuper, mzolotukhin Reviewed By: dcaballe Subscribers: bollu, tschuett, rkruppe, llvm-commits Differential Revision: https://reviews.llvm.org/D47477 llvm-svn: 334284
*	[LV] Fix PR36983. For a given recurrence, fix all phis in exit block	Roman Shirokiy	2018-06-08	1	-2/+1
\| \| \| \| \| \| \| \| \|	There could be more than one PHIs in exit block using same loop recurrence. Don't assume there is only one and fix each user. Differential Revision: https://reviews.llvm.org/D47788 llvm-svn: 334271
*	Revert r334209 "[LSR] Check yet more intrinsic pointer operands"	Reid Kleckner	2018-06-08	1	-12/+4
\| \| \| \| \| \| \|	This causes cast failures when compiling harfbuzz in Chromium. Reproducer on the way. llvm-svn: 334254
*	[LSR] Check yet more intrinsic pointer operands	Daniil Fukalov	2018-06-07	1	-4/+12
\| \| \| \| \| \| \| \|	the patch fixes another assertion in isLegalUse() Differential Revision: https://reviews.llvm.org/D47794 llvm-svn: 334209
*	[Mem2Reg] Avoid replacing load with itself in promoteSingleBlockAlloca.	Florian Hahn	2018-06-07	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We do the same thing in rewriteSingleStoreAlloca. Fixes PR37632. Reviewers: chandlerc, davide, efriedma Reviewed By: davide Differential Revision: https://reviews.llvm.org/D47825 llvm-svn: 334187
*	[NFC] Use variable instead of accessing pair many times	Max Kazantsev	2018-06-07	1	-6/+6
\| \| \| \|	llvm-svn: 334173
*	SpeculativeExecution Pass: Set PreserveCFG to avoid unnecessary analyses ↵	Michael Zolotukhin	2018-06-07	1	-0/+2
\| \| \| \| \| \| \| \| \|	invalidation. The pass doesn't touch CFG in any way, only moves instructions between blocks. llvm-svn: 334150
*	[ThinLTO] Rename index IsAnalysis flag to HaveGVs (NFC)	Teresa Johnson	2018-06-06	2	-2/+2
\| \| \| \| \| \| \| \| \|	With the upcoming patch to add summary parsing support, IsAnalysis would be true in contexts where we are not performing module summary analysis. Rename to the more specific and approprate HaveGVs, which is essentially what this flag is indicating. llvm-svn: 334140
*	[InstCombine] fold another shifty abs pattern to cmp+sel (PR36036)	Sanjay Patel	2018-06-06	2	-1/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The bug report: https://bugs.llvm.org/show_bug.cgi?id=36036 ...requests a DAG change for this, but an IR canonicalization probably handles most cases. If we still want to match this pattern in the backend, there's a proposal for that too: D47831 Alive proofs including nsw/nuw cases that were first noted in: D46988 https://rise4fun.com/Alive/Kmp This patch is largely copied from the existing code that was initially added with: D40984 ...but I didn't see much gain from trying to share code. llvm-svn: 334137
*	[InstCombine] PR37603: low bit mask canonicalization	Roman Lebedev	2018-06-06	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is [[ https://bugs.llvm.org/show_bug.cgi?id=37603 \| PR37603 ]]. https://godbolt.org/g/VCMNpS https://rise4fun.com/Alive/idM When doing bit manipulations, it is quite common to calculate some bit mask, and apply it to some value via `and`. The typical C code looks like: ``` int mask_signed_add(int nbits) { return (1 << nbits) - 1; } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_add(int)(i32) local_unnamed_addr #0 { %2 = shl i32 1, %0 %3 = add nsw i32 %2, -1 ret i32 %3 } ``` But there is a second, less readable variant: ``` int mask_signed_xor(int nbits) { return ~(-(1 << nbits)); } ``` which is translated into (with `-O3`) ``` define dso_local i32 @mask_signed_xor(int)(i32) local_unnamed_addr #0 { %2 = shl i32 -1, %0 %3 = xor i32 %2, -1 ret i32 %3 } ``` Since we created such a mask, it is quite likely that we will use it in `and` next. And then we may get rid of `not` op by folding into `andn`. But now that i have actually looked: https://godbolt.org/g/VTUDmU _some_ backend changes will be needed too. We clearly loose `bzhi` recognition. Reviewers: spatel, craig.topper, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47428 llvm-svn: 334127
*	InstCombine: ignore debug instructions during fence combine	Tim Northover	2018-06-06	1	-1/+5
\| \| \| \| \| \| \| \| \| \|	We should never get different CodeGen based on whether the code is being compiled in debug mode so we must skip over @llvm.dbg.value (and similar) calls. Should fix at least the worst part of PR37690. llvm-svn: 334090
*	[InstCombine] Correct the cmp operand type used when canonicalizing abs/nabs	John Brawn	2018-06-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	When adjusting a cmp in order to canonicalize an abs/nabs select pattern we need to use the type of the existing operand when creating a new operand not the type of a select operand, as the two may be different. This fixes PR37686. llvm-svn: 334019
*	[InstCombine] refine UB-handling in shuffle-binop transform	Sanjay Patel	2018-06-04	1	-14/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As noted in rL333782, we can be both better for optimization and safer with this transform: BinOp (shuffle V1, Mask), C --> shuffle (BinOp V1, NewC), Mask The only potentially unsafe-to-speculate binops are integer div/rem. All other binops are always safe (although I don't see a way to assert that in code here). For opcodes like shifts that can produce poison, it can't matter here because we know the lanes with undef are dropped by the subsequent shuffle. Differential Revision: https://reviews.llvm.org/D47686 llvm-svn: 333962
*	Move Analysis/Utils/Local.h back to Transforms	David Blaikie	2018-06-04	72	-72/+72
\| \| \| \| \| \| \| \| \| \|	Review feedback from r328165. Split out just the one function from the file that's used by Analysis. (As chandlerc pointed out, the original change only moved the header and not the implementation anyway - which was fine for the one function that was used (since it's a template/inlined in the header) but not in general) llvm-svn: 333954
*	In thin and full LTO + CFI, direct function calls may go through jump table	Dmitry Mikulin	2018-06-04	1	-16/+97
\| \| \| \| \| \| \| \| \| \|	entries to reach the target. Since these calls don't require type checks, we can short-circuit them to their real targets, except in cases when they can be pre-empted. Differential Revision: https://reviews.llvm.org/D46326 llvm-svn: 333937
*	[InstCombine] Fix div handling	Serguei Katkov	2018-06-04	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	When we optimize select basing on fact that div by 0 is undef we should not traverse the instruction which are not guaranteed to transfer execution to next instruction. Guard intrinsic is an example. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47576 llvm-svn: 333864
*	[InstCombine] improve sub with bool folds	Sanjay Patel	2018-06-03	1	-13/+14
\| \| \| \| \| \| \| \|	There's a patchwork of existing transforms trying to handle these cases, but as seen in the changed test, we weren't catching them all. llvm-svn: 333845
*	[InstCombine] call simplify before trying vector folds	Sanjay Patel	2018-06-02	6	-76/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As noted in the review thread for rL333782, we could have made a bug harder to hit if we were simplifying instructions before trying other folds. The shuffle transform in question isn't ever a simplification; it's just a canonicalization. So I've renamed that to make that clearer. This is NFCI at this point, but I've regenerated the test file to show the cosmetic value naming difference of using instcombine's RAUW vs. the builder. Possible follow-ups: 1. Move reassociation folds after simplifies too. 2. Refactor common code; we shouldn't have so much repetition. llvm-svn: 333820
*	[PM/LoopUnswitch] Fix how the cloned loops are handled when updating analyses.	Chandler Carruth	2018-06-02	1	-44/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I noticed this issue because we didn't put the primary cloned loop into the `NonChildClonedLoops` vector and so never iterated on it. Once I fixed that, it made it clear why I had to do a really complicated and unnecesasry dance when updating the loops to remain in canonical form -- I was unwittingly working around the fact that the primary cloned loop wasn't in the expected list of cloned loops. Doh! Now that we include it in this vector, we don't need to return it and we can consolidate the update logic as we correctly have a single place where it can be handled. I've just added a test for the iteration order aspect as every time I changed the update logic partially or incorrectly here, an existing test failed and caught it so that seems well covered (which is also evidenced by the extensive working around of this missing update). Reviewers: asbirlea, sanjoy Subscribers: mcrosier, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D47647 llvm-svn: 333811
*	[InstCombine] fix vector shuffle transform to replace undef elements (PR37648)	Sanjay Patel	2018-06-01	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This bug: https://bugs.llvm.org/show_bug.cgi?id=37648 ...was created with the enhancement to this transform with rL332479. The urem test shows the disaster potential: any undef divisor lane makes the whole op undef. The test diffs show that vector demanded elements turns some of the potential, but not all, unused binop operands back into undef already. llvm-svn: 333782
*	[ThinLTOBitcodeWriter] Emit summaries for regular LTO modules	Vlad Tsyrklevich	2018-06-01	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Emit summaries for bitcode modules that are only destined for the regular LTO portion of the build so they can participate in summary-based dead stripping. This change reduces the size of a nacl_helper build with cfi-icall enabled by 7%, removing the majority of the overhead due to enabling cfi-icall. The cfi-icall size increase was caused by compiling in lots of unused code and cfi-icall generating jumptable references to unused symbols that could no longer be removed by -Wl,-gc-sections. Increasing the visibility of summary-based dead stripping prevented jumptable entries being created for unused symbols from the regular LTO portion of the build. Reviewers: pcc Reviewed By: pcc Subscribers: dschuff, mehdi_amini, inglorion, eraman, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D47594 llvm-svn: 333768
*	Revert r333740: IPSCCP] Use PredicateInfo to propagate facts from cmp.	Florian Hahn	2018-06-01	2	-134/+10
\| \| \| \| \| \|	This is breaking the clang-with-thin-lto-ubuntu bot. llvm-svn: 333745
*	Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp ↵	Florian Hahn	2018-06-01	2	-10/+134
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instructions. This patch updates IPSCCP to use PredicateInfo to propagate facts to true branches predicated by EQ and to false branches predicated by NE. As a follow up, we should be able to extend it to also propagate additional facts about nonnull. Reviewers: davide, mssimpso, dberlin, efriedma Reviewed By: davide, dberlin Differential Revision: https://reviews.llvm.org/D45330 llvm-svn: 333740