bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[BPF] Generate array dimension size properly for zero-size elements	Yonghong Song	2019-09-24	1	-26/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, if an array element type size is 0, the number of array elements will be set to 0, regardless of what user specified. This implementation is done in the beginning where BTF is mostly used to calculate the member offset. For example, struct s {}; struct s1 { int b; struct s a[2]; }; struct s1 s1; The BTF will have struct "s1" member "a" with element count 0. Now BTF types are used for compile-once and run-everywhere relocations and we need more precise type representation for type comparison. Andrii reported the issue as there are differences between original structure and BTF-generated structure. This patch made the change to correctly assign "2" as the number elements of member "a". Some dead codes related to ElemSize compuation are also removed. Differential Revision: https://reviews.llvm.org/D67979 llvm-svn: 372785
*	[PGO][PGSO] ProfileSummary changes.	Hiroshi Yamauchi	2019-09-24	1	-0/+67
\| \| \| \| \| \| \| \| \| \|	(Split of off D67120) ProfileSummary changes for profile guided size optimization. Differential Revision: https://reviews.llvm.org/D67377 llvm-svn: 372783
*	Extends the expansion of the LWZtoc pseduo op for AIX.	Sean Fertile	2019-09-24	1	-15/+38
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D67853 llvm-svn: 372772
*	[GCRelocate] Add a peephole to canonicalize base pointer relocation	Philip Reames	2019-09-24	1	-1/+12
\| \| \| \| \| \|	If we generate the gc.relocate, and then later prove two arguments to the statepoint are equivalent, we should canonicalize the gc.relocate to the form we would have produced if this had been known before rewriting. llvm-svn: 372771
*	[X86] Add MMX MOVD/MOVQ stores to folding tables to support stack folding	Simon Pilgrim	2019-09-24	1	-0/+2
\| \| \| \|	llvm-svn: 372770
*	[InstCombine] (a+b) < a && (a+b) != 0 -> (0-b) < a iff a/b != 0 (PR43259)	Roman Lebedev	2019-09-24	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is again motivated by D67122 sanitizer check enhancement. That patch seemingly worsens `-fsanitize=pointer-overflow` overhead from 25% to 50%, which strongly implies missing folds. For ``` #include <cassert> char* test(char& base, signed long offset) { __builtin_assume(offset < 0); return &base + offset; } ``` We produce https://godbolt.org/z/r40U47 and again those two icmp's can be merged: ``` Name: 0 Pre: C != 0 %adjusted = add i8 %base, C %not_null = icmp ne i8 %adjusted, 0 %no_underflow = icmp ult i8 %adjusted, %base %r = and i1 %not_null, %no_underflow => %neg_offset = sub i8 0, C %r = icmp ugt i8 %base, %neg_offset ``` https://rise4fun.com/Alive/ALap https://rise4fun.com/Alive/slnN There are 3 other variants of this pattern, i believe they all will go into InstSimplify. https://bugs.llvm.org/show_bug.cgi?id=43259 Reviewers: spatel, xbolva00, nikic Reviewed By: spatel Subscribers: efriedma, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67849 llvm-svn: 372768
*	[InstCombine] (a+b) <= a && (a+b) != 0 -> (0-b) < a (PR43259)	Roman Lebedev	2019-09-24	1	-2/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is again motivated by D67122 sanitizer check enhancement. That patch seemingly worsens `-fsanitize=pointer-overflow` overhead from 25% to 50%, which strongly implies missing folds. This pattern isn't exactly what we get there (strict vs. non-strict predicate), but this pattern does not require known-bits analysis, so it is best to handle it first. ``` Name: 0 %adjusted = add i8 %base, %offset %not_null = icmp ne i8 %adjusted, 0 %no_underflow = icmp ule i8 %adjusted, %base %r = and i1 %not_null, %no_underflow => %neg_offset = sub i8 0, %offset %r = icmp ugt i8 %base, %neg_offset ``` https://rise4fun.com/Alive/knp There are 3 other variants of this pattern, they all will go into InstSimplify: https://rise4fun.com/Alive/bIDZ https://bugs.llvm.org/show_bug.cgi?id=43259 Reviewers: spatel, xbolva00, nikic Reviewed By: spatel Subscribers: hiraditya, majnemer, vsk, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67846 llvm-svn: 372767
*	[TextAPI] Remove redundant checking causing warnings. NFC.	Michael Liao	2019-09-24	1	-4/+4
\| \| \| \| \| \|	- Minor coding format. llvm-svn: 372765
*	Regex: Make "match" and "sub" const member functions	Thomas Preud'homme	2019-09-24	5	-18/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The Regex "match" and "sub" member functions were previously not "const" because they wrote to the "error" member variable. This commit removes those assignments, and instead assumes that the validity of the regex is already known after the initial compilation of the regular expression. As a result, these member functions were possible to make "const". This makes it easier to do things like pre-compile Regexes up-front, and makes "match" and "sub" thread-safe. The error status is now returned as an optional output, which also makes the API of "match" and "sub" more consistent with each other. Also, some uses of Regex that could be refactored to be const were made const. Patch by Nicolas Guillemot Reviewers: jankratochvil, thopre Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67241 llvm-svn: 372764
*	[yaml2obj/obj2yaml] - Add support for .stack_sizes sections.	George Rimar	2019-09-24	2	-8/+67
\| \| \| \| \| \| \| \| \| \| \|	.stack_sizes is a SHT_PROGBITS section that contains pairs of <address (4/8 bytes), stack size (uleb128)>. This patch teach tools to parse and dump it. Differential revision: https://reviews.llvm.org/D67757 llvm-svn: 372762
*	AggressiveAntiDepBreaker - silence static analyzer null dereference warning. ↵	Simon Pilgrim	2019-09-24	1	-1/+1
\| \| \| \| \| \| \| \|	NFCI. Assert that we've found the critical path. llvm-svn: 372759
*	SafepointIRVerifier - silence static analyzer dyn_cast<Instruction> null ↵	Simon Pilgrim	2019-09-24	1	-2/+2
\| \| \| \| \| \| \| \|	dereference warnings. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<Instruction> directly and if not assert will fire for us. llvm-svn: 372758
*	Revert r372333: [DAG][X86] Convert isNegatibleForFree/GetNegatedExpression ↵	Ilya Biryukov	2019-09-24	4	-401/+293
\| \| \| \| \| \| \| \| \| \|	to a target hook (PR42863) Reason: this caused severe compile time regressions in JAX. See email thread of original revision on llvm-commits for details: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190923/697042.html llvm-svn: 372756
*	[Orc] Silence static analyzer dyn_cast<ConstantInt> null dereference ↵	Simon Pilgrim	2019-09-24	1	-1/+1
\| \| \| \| \| \|	warning. NFCI. llvm-svn: 372746
*	ConstantFold - silence static analyzer dyn_cast<> null dereference warning. ↵	Simon Pilgrim	2019-09-24	1	-0/+1
\| \| \| \| \| \| \| \|	NFCI. Early out if the vector element is not Constant. llvm-svn: 372743
*	Fix cppcheck "reduce variable scope" warning. NFCI.	Simon Pilgrim	2019-09-24	1	-2/+1
\| \| \| \|	llvm-svn: 372742
*	[IR] IntrinsicInst - silence static analyzer dyn_cast<> null dereference ↵	Simon Pilgrim	2019-09-24	1	-2/+2
\| \| \| \| \| \| \| \|	warnings. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<> directly and if not assert will fire for us. llvm-svn: 372733
*	LoopVectorize - silence static analyzer dyn_cast<CmpInst> null dereference ↵	Simon Pilgrim	2019-09-24	1	-1/+1
\| \| \| \| \| \| \| \|	warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<CmpInst> directly and if not assert will fire for us. llvm-svn: 372732
*	[SimplifyCFG] FoldTwoEntryPHINode - silence static analyzer null dereference ↵	Simon Pilgrim	2019-09-24	1	-0/+1
\| \| \| \| \| \| \| \|	warning. NFCI. Assert that we've found the DomBlock. llvm-svn: 372728
*	SimplifyCFG - silence static analyzer dyn_cast<LandingPadInst> null ↵	Simon Pilgrim	2019-09-24	1	-1/+1
\| \| \| \| \| \| \| \|	dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<LandingPadInst> directly and if not assert will fire for us. llvm-svn: 372727
*	SimplifyCFG - silence static analyzer dyn_cast<Instruction> null dereference ↵	Simon Pilgrim	2019-09-24	1	-2/+1
\| \| \| \| \| \| \| \|	warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<Instruction> directly and if not assert will fire for us. llvm-svn: 372726
*	[ModuloSchedule] KernelRewriter::rewrite - silence static analyzer ↵	Simon Pilgrim	2019-09-24	1	-0/+1
\| \| \| \| \| \| \| \|	dyn_cast<> null dereference warning. NFCI. Assert that we've found the start of the MI schedule list. llvm-svn: 372723
*	[ARM] Split large widening MVE loads	David Green	2019-09-24	1	-3/+72
\| \| \| \| \| \| \| \| \| \| \| \|	Similar to rL372717, we can force the splitting of extends of vector loads in MVE, in order to use the better widening loads as opposed to going through expensive extends. This adds a combine to early-on detect extends of loads and split the load in two, from where normal legalisation will kick in and we get a series of widening loads. Differential Revision: https://reviews.llvm.org/D67909 llvm-svn: 372721
*	lowerObjCCall - silence static analyzer dyn_cast<CallInst> null dereference ↵	Simon Pilgrim	2019-09-24	1	-1/+1
\| \| \| \| \| \| \| \|	warnings. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<CallInst> directly and if not assert will fire for us. llvm-svn: 372720
*	[ARM] Split large truncating MVE stores	David Green	2019-09-24	1	-82/+148
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MVE does not have a simple sign extend instruction that can move elements across lanes. We currently often end up moving each lane into and out of a GPR, in order to get elements into the correct places. When we have a store of a trunc (or a extend of a load), we can instead just split the store/load in two, using the narrowing/widening load/store instructions from each half of the vector. This does that for stores. It happens very early in a store combine, so as to easily detect the truncates. (It would be possible to do this later, but that would involve looking through a buildvector of extract elements. Not impossible but this way seemed simpler). By enabling store combines we also get a vmovdrr combine for free, helping some other tests. Differential Revision: https://reviews.llvm.org/D67828 llvm-svn: 372717
*	MCRegisterInfo: Merge getLLVMRegNum and getLLVMRegNumFromEH	Pavel Labath	2019-09-24	8	-44/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The functions different in two ways: - getLLVMRegNum could return both "eh" and "other" dwarf register numbers, while getLLVMRegNumFromEH only returned the "eh" number. - getLLVMRegNum asserted if the register was not found, while the second function returned -1. The second distinction was pretty important, but it was very hard to infer that from the function name. Aditionally, for the use case of dumping dwarf expressions, we needed a function which can work with both kinds of number, but does not assert. This patch solves both of these issues by merging the two functions into one, returning an Optional<unsigned> value. While the same thing could be achieved by adding an "IsEH" argument to the (renamed) getLLVMRegNumFromEH function, it seemed better to avoid the confusion of two functions and put the choice of asserting into the hands of the caller -- if he checks the Optional value, he can safely process "untrusted" input, and if he blindly dereferences the Optional, he gets the assertion. I've updated all call sites to the new API, choosing between the two options according to the function they were calling originally, except that I've updated the usage in DWARFExpression.cpp to use the "safe" method instead, and added a test case which would have previously triggered an assertion failure when processing (incorrect?) dwarf expressions. Reviewers: dsanders, arsenm, JDevlieghere Subscribers: wdng, aprantl, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67154 llvm-svn: 372710
*	[Debuginfo] dbg.value points to undef value after Induction Variable ↵	Alexey Lapshin	2019-09-24	1	-9/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Simplification. Induction Variable Simplification pass does not update dbg.value intrinsic. Before: %add = add nuw nsw i32 %ArgIndex.06, 1 call void @llvm.dbg.value(metadata i32 %add, metadata !17, metadata !DIExpression()) After: %indvars.iv.next = add nuw nsw i64 %indvars.iv, 1 call void @llvm.dbg.value(metadata i64 undef, metadata !17, metadata !DIExpression()) There should be: %indvars.iv.next = add nuw nsw i64 %indvars.iv, 1 call void @llvm.dbg.value(metadata i64 %indvars.iv.next, metadata !17, metadata !DIExpression()) Differential Revision: https://reviews.llvm.org/D67770 llvm-svn: 372703
*	[LV] Forced vectorization with runtime checks and OptForSize	Sjoerd Meijer	2019-09-24	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When vectorisation is forced with a pragma, we optimise for min size, and we need to emit runtime memory checks, then allow this code growth and don't run in an assert like we currently do. This is the result of D65197 and D66803, and was a use-case not really considered before. If this now happens, we emit an optimisation remark warning about the code-size expansion, which can be avoided by not forcing vectorisation or possibly source-code modifications. Differential Revision: https://reviews.llvm.org/D67764 llvm-svn: 372694
*	[InstCombine] Fold a shifty implementation of clamp-to-allones.	Huihui Zhang	2019-09-24	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fold or(ashr(subNSW(Y, X), ScalarSizeInBits(Y)-1), X) into X s> Y ? -1 : X https://rise4fun.com/Alive/d8Ab clamp255 is a common operator in image processing, can be implemented in a shifty way "(255 - X) >> 31 \| X & 255". Fold shift into select enables more optimization, e.g., vmin generation for ARM target. Reviewers: lebedev.ri, efriedma, spatel, kparzysz, bcahoon Reviewed By: lebedev.ri Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67800 llvm-svn: 372678
*	[InstCombine] Fold a shifty implementation of clamp-to-zero.	Huihui Zhang	2019-09-24	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fold and(ashr(subNSW(Y, X), ScalarSizeInBits(Y)-1), X) into X s> Y ? X : 0 https://rise4fun.com/Alive/lFH Fold shift into select enables more optimization, e.g., vmax generation for ARM target. Reviewers: lebedev.ri, efriedma, spatel, kparzysz, bcahoon Reviewed By: lebedev.ri Subscribers: xbolva00, andreadb, craig.topper, RKSimon, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67799 llvm-svn: 372676
*	[GlobalISel][IRTranslator] Fix switch table lowering to use signed LE not ↵	Amara Emerson	2019-09-24	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	unsigned. We were miscompiling switch value comparisons with the wrong signedness, which shows up when we have things like switch case values with i1 types, which end up being legalized incorrectly. Fixes PR43383 llvm-svn: 372675
*	[MemorySSA] Update Phi insertion.	Alina Sbirlea	2019-09-23	1	-43/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: MemoryPhis may be needed following a Def insertion inthe IDF of all the new accesses added (phis + potentially a def). Ensure this also occurs when only the new MemoryPhis are the defining accesses. Note: The need for computing IDF here is because of new Phis added with edges incoming from unreachable code, Phis that had previously been simplified. The preferred solution is to not reintroduce such Phis. This patch is the needed fix while working on the preferred solution. Reviewers: george.burgess.iv Subscribers: Prazek, sanjoy.google, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67927 llvm-svn: 372673
*	HotColdSplitting: invalidate the AssumptionCache on split	Saleem Abdulrasool	2019-09-23	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \|	When a cold path is outlined, the value tracking in the assumption cache may be invalidated due to the code motion. We would previously trip an assertion in subsequent passes (but required the passes to happen in a single run as the assumption cache is shared across the passes). Invalidating the cache ensures that we get the correct information when needed with the legacy pass manager as well. llvm-svn: 372667
*	[SampleFDO] Treat names in profile as not cold only when profile symbol list	Wei Mi	2019-09-23	1	-20/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	is available In rL372232, we treated names showing up in profile as not cold when profile-sample-accurate is enabled. This caused 70k size regression in Chrome/Android. The patch put a guard and only enable the change when profile symbol list is available, i.e., keep the old behavior when profile symbol list is not available. Differential Revision: https://reviews.llvm.org/D67931 llvm-svn: 372665
*	Fix uninitialized variable warning. NFCI.	Simon Pilgrim	2019-09-23	1	-1/+1
\| \| \| \|	llvm-svn: 372662
*	[WebAssembly] vNxM.load_splat instructions	Thomas Lively	2019-09-23	4	-1/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Adds the new load_splat instructions as specified at https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#load-and-splat. DAGISel does not allow matching multiple copies of the same load in a single pattern, so we use a new node in WebAssemblyISD to wrap loads that should be splatted. Depends on D67783. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67784 llvm-svn: 372655
*	[InstCombine] foldOrOfICmps(): Acquire SimplifyQuery with set CxtI	Roman Lebedev	2019-09-23	1	-2/+4
\| \| \| \| \| \|	Extracted from https://reviews.llvm.org/D67849#inline-610377 llvm-svn: 372654
*	[InstCombine] foldAndOfICmps(): Acquire SimplifyQuery with set CxtI	Roman Lebedev	2019-09-23	1	-2/+4
\| \| \| \| \| \|	Extracted from https://reviews.llvm.org/D67849#inline-610377 llvm-svn: 372653
*	[WebAssembly] Remove unused memory instructions and patterns	Thomas Lively	2019-09-23	3	-130/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Removes duplicated SIMD loads and store instructions and removes patterns involving GlobalAddresses that were not used in any tests. Reviewers: aheejin, sunfish Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67783 llvm-svn: 372648
*	[InstCombine] Annotate strndup calls with dereferenceable_or_null	David Bolvansky	2019-09-23	1	-9/+18
\| \| \| \| \| \|	"Implementations are free to malloc() a buffer containing either (size + 1) bytes or (strnlen(s, size) + 1) bytes. Applications should not assume that strndup() will allocate (size + 1) bytes when strlen(s) is smaller than size." llvm-svn: 372647
*	[X86] Use TargetConstant for condition code on X86ISD::SETCC/CMOV/BRCOND nodes.	Craig Topper	2019-09-23	4	-141/+136
\| \| \| \| \| \| \| \| \| \|	This removes the need for ConvertToTarget opcodes in the isel table. It's also consistent with the recent changes to use TargetConstant for intrinsic nodes that always take immediates. Differential Revision: https://reviews.llvm.org/D67902 llvm-svn: 372645
*	[IR] Add getExtendedType() to IntegerType and Type (dispatching to ↵	Roman Lebedev	2019-09-23	1	-10/+2
\| \| \| \| \| \|	IntegerType or VectorType) llvm-svn: 372638
*	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): improve comment	Roman Lebedev	2019-09-23	1	-4/+4
\| \| \| \|	llvm-svn: 372637
*	[SLC] Convert some strndup calls to strdup calls	David Bolvansky	2019-09-23	3	-3/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Motivation: - If we can fold it to strdup, we should (strndup does more things than strdup). - Annotation mechanism. (Works for strdup well). strdup and strndup are part of C 20 (currently posix fns), so we should optimize them. Reviewers: efriedma, jdoerfert Reviewed By: jdoerfert Subscribers: lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67679 llvm-svn: 372636
*	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): pat. c/d/e with mask ↵	Roman Lebedev	2019-09-23	1	-3/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR42563) Summary: If we have a pattern `(x & (-1 >> maskNbits)) << shiftNbits`, we already know (have a fold) that will drop the `& (-1 >> maskNbits)` mask iff `(shiftNbits-maskNbits) s>= 0` (i.e. `shiftNbits u>= maskNbits`). So even if `(shiftNbits-maskNbits) s< 0`, we can still fold, we will just need to apply a constant mask afterwards: ``` Name: c, normal+mask %t0 = lshr i32 -1, C1 %t1 = and i32 %t0, %x %r = shl i32 %t1, C2 => %n0 = shl i32 %x, C2 %n1 = i32 ((-(C2-C1))+32) %n2 = zext i32 %n1 to i64 %n3 = lshr i64 -1, %n2 %n4 = trunc i64 %n3 to i32 %r = and i32 %n0, %n4 ``` https://rise4fun.com/Alive/gslRa Naturally, old `%masked` will have to be one-use. This is not valid for pattern f - where "masking" is done via `ashr`. https://bugs.llvm.org/show_bug.cgi?id=42563 Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67725 llvm-svn: 372630
*	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): pat. a/b with mask ↵	Roman Lebedev	2019-09-23	1	-3/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(PR42563) Summary: And this is finally the interesting part of that fold! If we have a pattern `(x & (~(-1 << maskNbits))) << shiftNbits`, we already know (have a fold) that will drop the `& (~(-1 << maskNbits))` mask iff `(maskNbits+shiftNbits) u>= bitwidth(x)`. But that is actually ignorant, there's more general fold here: In this pattern, `(maskNbits+shiftNbits)` actually correlates with the number of low bits that will remain in the final value. So even if `(maskNbits+shiftNbits) u< bitwidth(x)`, we can still fold, we will just need to apply a constant mask afterwards: ``` Name: a, normal+mask %onebit = shl i32 -1, C1 %mask = xor i32 %onebit, -1 %masked = and i32 %mask, %x %r = shl i32 %masked, C2 => %n0 = shl i32 %x, C2 %n1 = add i32 C1, C2 %n2 = zext i32 %n1 to i64 %n3 = shl i64 -1, %n2 %n4 = xor i64 %n3, -1 %n5 = trunc i64 %n4 to i32 %r = and i32 %n0, %n5 ``` https://rise4fun.com/Alive/F5R Naturally, old `%masked` will have to be one-use. Similar fold exists for patterns c,d,e, will post patch later. https://bugs.llvm.org/show_bug.cgi?id=42563 Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67677 llvm-svn: 372629
*	[BreakFalseDeps] ignore function with minsize attribute	Sanjay Patel	2019-09-23	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \|	This came up in the x86-specific: https://bugs.llvm.org/show_bug.cgi?id=43239 ...but it is a general problem for the BreakFalseDeps pass. Dependencies may be broken by adding some other instruction, so that should be avoided if the overall goal is to minimize size. Differential Revision: https://reviews.llvm.org/D67363 llvm-svn: 372628
*	[SLP] Fix for PR31847: Assertion failed: (isLoopInvariant(Operands[i], L) && ↵	Alexey Bataev	2019-09-23	1	-66/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	"SCEVAddRecExpr operand is not loop-invariant!") Summary: Initially SLP vectorizer replaced all going-to-be-vectorized instructions with Undef values. It may break ScalarEvaluation and may cause a crash. Reworked SLP vectorizer so that it does not replace vectorized instructions by UndefValue anymore. Instead vectorized instructions are marked for deletion inside if BoUpSLP class and deleted upon class destruction. Reviewers: mzolotukhin, mkuper, hfinkel, RKSimon, davide, spatel Subscribers: RKSimon, Gerolf, anemet, hans, majnemer, llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D29641 llvm-svn: 372626
*	[InstCombine] foldUnsignedUnderflowCheck(): s/Subtracted/ZeroCmpOp/	Roman Lebedev	2019-09-23	1	-7/+7
\| \| \| \|	llvm-svn: 372625
*	[AMDGPU][MC] Corrected handling of relocatable expressions	Dmitry Preobrazhensky	2019-09-23	1	-11/+20
\| \| \| \| \| \| \| \| \| \|	See bug 43359: https://bugs.llvm.org//show_bug.cgi?id=43359 Reviewers: rampitec Differential Revision: https://reviews.llvm.org/D67829 llvm-svn: 372622