bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[InstCombine] -X / C --> X / -C for FP	Sanjay Patel	2018-02-20	1	-10/+8
\| \| \| \| \| \| \|	We already do this in DAGCombiner, but it should also be good to eliminate the fsub use in IR. llvm-svn: 325648
*	[InstCombine] add tests for fdiv with negated op and constant op; NFC	Sanjay Patel	2018-02-20	1	-0/+44
\| \| \| \|	llvm-svn: 325644
*	[PatternMatch] allow vector matches with m_FNeg	Sanjay Patel	2018-02-20	2	-10/+3
\| \| \| \|	llvm-svn: 325642
*	[DSE] Don't DSE stores that subsequent memmove calls read from	Sanjoy Das	2018-02-20	1	-0/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We used to remove the first memmove in cases like this: memmove(p, p+2, 8); memmove(p, p+2, 8); which is incorrect. Fix this by changing isPossibleSelfRead to what was most likely the intended behavior. Historical note: the buggy code was added in https://reviews.llvm.org/rL120974 to address PR8728. Reviewers: rsmith Subscribers: mcrosier, llvm-commits, jlebar Differential Revision: https://reviews.llvm.org/D43425 llvm-svn: 325641
*	[InstCombine] auto-generate full checks; NFC	Sanjay Patel	2018-02-20	1	-41/+47
\| \| \| \|	llvm-svn: 325639
*	[InstCombine] add test for vector -X/-Y; NFC	Sanjay Patel	2018-02-20	1	-4/+17
\| \| \| \| \| \|	m_FNeg doesn't match vector types. llvm-svn: 325637
*	Fix broken test from r325630.	Benjamin Kramer	2018-02-20	1	-1/+1
\| \| \| \|	llvm-svn: 325634
*	[MemoryBuiltins] Check nobuiltin status when identifying calls to free.	Benjamin Kramer	2018-02-20	1	-1/+21
\| \| \| \| \| \| \| \|	This is usually not a problem because this code's main purpose is eliminating unused new/delete pairs. We got deletes of nullptr or nobuiltin deletes of builtin new wrong though. llvm-svn: 325630
*	[PatternMatch] enhance m_SignMask() to ignore undef elements in vectors	Sanjay Patel	2018-02-20	1	-6/+2
\| \| \| \|	llvm-svn: 325623
*	[InstSimplify] add tests for m_SignMask with undef vector elements; NFC	Sanjay Patel	2018-02-20	1	-2/+28
\| \| \| \|	llvm-svn: 325622
*	[LV] Fix test checks, NFC.	Alexey Bataev	2018-02-20	2	-140/+3506
\| \| \| \|	llvm-svn: 325617
*	[SLP] Fix tests checks, NFC.	Alexey Bataev	2018-02-20	5	-74/+249
\| \| \| \|	llvm-svn: 325605
*	[InstCombine] fold fdiv with non-splat divisor to fmul: X/C --> X * (1/C)	Sanjay Patel	2018-02-20	2	-4/+16
\| \| \| \|	llvm-svn: 325590
*	[InstCombine] allow fdiv with constant dividend folds with less than full ↵	Sanjay Patel	2018-02-19	1	-6/+29
\| \| \| \| \| \| \| \| \| \| \| \|	-ffast-math It's possible that we could allow this either 'arcp' or 'reassoc' alone, but this should be conservatively better than what we have right now. GCC allows this with only -freciprocal-math. The last test is changed to show a case that is expected to fold, but we need D43398. llvm-svn: 325533
*	[InstCombine] move fdiv tests; NFC	Sanjay Patel	2018-02-19	2	-33/+37
\| \| \| \| \| \|	Also, use vector constants just to prove that already works. llvm-svn: 325530
*	[TTI CostModel] change default cost of FP ops to 1 (PR36280)	Sanjay Patel	2018-02-19	7	-183/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This change was mentioned at least as far back as: https://bugs.llvm.org/show_bug.cgi?id=26837#c26 ...and I found a real program that is harmed by this: Himeno running on AMD Jaguar gets 6% slower with SLP vectorization: https://bugs.llvm.org/show_bug.cgi?id=36280 ...but the change here appears to solve that bug only accidentally. The div/rem costs for x86 look very wrong in some cases, but that's already true, so we can fix those in follow-up patches. There's also evidence that more cost model changes are needed to solve SLP problems as shown in D42981, but that's an independent problem (though the solution may be adjusted after this change is made). Differential Revision: https://reviews.llvm.org/D43079 llvm-svn: 325515
*	[Transforms] Propagate new-format TBAA tags on simplification of ↵	Ivan A. Kosarev	2018-02-19	1	-0/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	memory-transfer intrinsics With this patch in place, when a new-format TBAA tag is available for a memory-transfer intrinsic call, we prefer propagating that new-format tag. Otherwise, we fallback to the old approach where we try to construct a proper TBAA access tag from 'tbaa.struct' metadata. Differential Revision: https://reviews.llvm.org/D41543 llvm-svn: 325488
*	[PatternMatch, InstSimplify] enhance m_AllOnes() to ignore undef elements in ↵	Sanjay Patel	2018-02-18	2	-4/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vectors Loosening the matcher definition reveals a subtle bug in InstSimplify (we should not assume that because an operand constant matches that it's safe to return it as a result). So I'm making that change here too (that diff could be independent, but I'm not sure how to reveal it before the matcher change). This also seems like a good reason to not include matchers that capture the value. We don't want to encourage the potential misstep of propagating undef values when it's not allowed/intended. I didn't include the capture variant option here or in the related rL325437 (m_One), but it already exists for other constant matchers. llvm-svn: 325466
*	[InstSimplify] add tests with vector undef elts; NFC	Sanjay Patel	2018-02-18	2	-7/+18
\| \| \| \|	llvm-svn: 325465
*	[PatternMatch] enhance m_One() to ignore undef elements in vectors	Sanjay Patel	2018-02-17	3	-9/+7
\| \| \| \|	llvm-svn: 325437
*	[InstSimplify, InstCombine] add tests with vector undef elts; NFC	Sanjay Patel	2018-02-17	2	-6/+43
\| \| \| \| \| \|	These would fold if the m_One pattern matcher accounted for undef elts. llvm-svn: 325436
*	[InstSimplify] add vector select tests with undef elts in condition; NFC	Sanjay Patel	2018-02-17	1	-0/+20
\| \| \| \|	llvm-svn: 325419
*	[InstCombine] add FMF to better show current fdiv fold behavior; NFC	Sanjay Patel	2018-02-16	1	-4/+4
\| \| \| \|	llvm-svn: 325365
*	[ThinLTO] Fix data race in test #2	Eugene Leviant	2018-02-16	1	-1/+1
\| \| \| \| \| \|	Switched to the right option (-thinlto-threads) llvm-svn: 325362
*	[ThinLTO] Fix data race in test	Eugene Leviant	2018-02-16	1	-1/+1
\| \| \| \|	llvm-svn: 325361
*	[JumpThreading] PR36133 enable/disable DominatorTree for LVI analysis	Brian M. Rzycki	2018-02-16	1	-0/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The LazyValueInfo pass caches a copy of the DominatorTree when available. Whenever there are pending DominatorTree updates within JumpThreading's DeferredDominance object we cannot use the cached DT for LVI analysis. This commit adds the new methods enableDT() and disableDT() to LVI. JumpThreading also sets the appropriate usage model before calling LVI analysis methods. Fixes https://bugs.llvm.org/show_bug.cgi?id=36133 Reviewers: sebpop, dberlin, kuhar Reviewed by: sebpop, kuhar Subscribers: uabelho, llvm-commits, aprantl, hiraditya, a.elovikov Differential Revision: https://reviews.llvm.org/D42717 llvm-svn: 325356
*	[Transforms] Propagate TBAA info in SROA	Ivan A. Kosarev	2018-02-16	1	-151/+274
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that we have the new TBAA metadata format that is capable of representing accesses to aggregates, we can propagate TBAA access tags from memory setting and transferring intrinsics to load and store instructions and vice versa. Since SROA produces lots of new loads and stores on optimized builds, this change significantly decreases the share of undecorated memory accesses on such builds. Differential Revision: https://reviews.llvm.org/D41563 llvm-svn: 325329
*	[ThinLTO] Import global variables	Eugene Leviant	2018-02-16	1	-2/+11
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D43077 llvm-svn: 325320
*	Remove brittle check lines from a test, NFC	Vedant Kumar	2018-02-16	1	-41/+0
\| \| \| \|	llvm-svn: 325310
*	[GVN] Partially revert debug info salvage change (r325063)	Vedant Kumar	2018-02-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	In r325063, we salvaged debug values from dying instructions in GVN::processBlock() and GVN::performScalarPRE(). The change in performScalarPRE(), while correct, is unhelpful. It introduced a call to salvageDebugInfo() which was immediately followed by a RAUW, meaning it prevented the RAUW from efficiently updating dbg.value intrinsics. This commit reverts the mistake and tightens up the affected test case. llvm-svn: 325308
*	[DCE] Salvage debug info from dead insts	Vedant Kumar	2018-02-15	1	-4/+8
\| \| \| \| \| \| \|	This results in small increases in the size of the .debug_loc section and the number of unique source variables in a stage2 build of opt. llvm-svn: 325301
*	[Coroutines] Don't move stores for allocator args	Brian Gesiak	2018-02-15	1	-0/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The behavior described in Coroutines TS `[dcl.fct.def.coroutine]/7` allows coroutine parameters to be passed into allocator functions. The instructions to store values into the alloca'd parameters must not be moved past the frame allocation, otherwise uninitialized values are passed to the allocator. Test Plan: `check-llvm` Reviewers: rsmith, GorNishanov, eric_niebler Reviewed By: GorNishanov Subscribers: compnerd, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D43000 llvm-svn: 325285
*	[SCCP] Test that constant propagation updates debug info, NFC	Vedant Kumar	2018-02-15	1	-5/+17
\| \| \| \| \| \| \|	This extends an existing test to check that SCCP updates the operands of relevant dbg.value instructions as it does its work. llvm-svn: 325281
*	[SLP] Fix the test for the reversed stores, NFC.	Alexey Bataev	2018-02-15	1	-18/+11
\| \| \| \|	llvm-svn: 325268
*	[SLP] Added test for reversed stores, NFC.	Alexey Bataev	2018-02-15	1	-1/+64
\| \| \| \|	llvm-svn: 325265
*	[InstCombine] test fdiv folds better; NFC	Sanjay Patel	2018-02-15	2	-33/+43
\| \| \| \| \| \| \|	We had redundant tests, but no tests for extra uses or vectors. 'fast' is an overly conservative requirement for these folds. llvm-svn: 325262
*	[InstCombine] allow sin/cos transforms with 'reassoc'	Sanjay Patel	2018-02-15	2	-83/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The variable name 'AllowReassociate' is a lie at this point because it's set to 'isFast()' which is more than the 'reassoc' FMF after rL317488. In D41286, we showed that this transform may be valid even with strict math by brute force checking every 32-bit float result. There's a potential problem here because we're replacing with a tan() libcall rather than a hypothetical LLVM tan intrinsic. So we might set errno when we should be guaranteed not to do that. But that's independent of this change. llvm-svn: 325247
*	[InstCombine] allow X / C -> X * (1.0/C) for vector splat FP constants	Sanjay Patel	2018-02-15	1	-3/+14
\| \| \| \|	llvm-svn: 325237
*	[NFC] Fix metadata placement in test	Max Kazantsev	2018-02-15	1	-3/+1
\| \| \| \|	llvm-svn: 325215
*	[SCEV] Favor isKnownViaSimpleReasoning over constant ranges check	Max Kazantsev	2018-02-15	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There is a more powerful but still simple function `isKnownViaSimpleReasoning ` that does constant range check and few more additional checks. We use it some places (e.g. when proving implications) and in some other places we only check constant ranges. Currently, indvar simplifier fails to remove the check in following loop: int inc = ...; for (int i = inc, j = inc - 1; i < 200; ++i, ++j) if (i > j) { ... } This patch replaces all usages of `isKnownPredicateViaConstantRanges` with `isKnownViaSimpleReasoning` to have smarter proofs. In particular, it fixes the case above. Reviewed-By: sanjoy Differential Revision: https://reviews.llvm.org/D43175 llvm-svn: 325214
*	[InstCombine] add tests and comments for fdiv X, C; NFC	Sanjay Patel	2018-02-14	1	-10/+77
\| \| \| \|	llvm-svn: 325161
*	[InstCombine] Don't fold select(C, Z, binop(select(C, X, Y), W)) -> ↵	Craig Topper	2018-02-14	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \|	select(C, Z, binop(Y, W)) if the binop is rem or div. The select may have been preventing a division by zero or INT_MIN/-1 so removing it might not be safe. Fixes PR36362. Differential Revision: https://reviews.llvm.org/D43276 llvm-svn: 325148
*	[InstCombine] regenerate checks; NFC	Sanjay Patel	2018-02-14	1	-227/+352
\| \| \| \|	llvm-svn: 325144
*	[SLP] Allow vectorization of reversed loads.	Alexey Bataev	2018-02-14	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Reversed loads are handled as gathering. But we can just reshuffle these values. Patch adds support for vectorization of reversed loads. Reviewers: RKSimon, spatel, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43022 llvm-svn: 325134
*	Recommit r325001: [CallSiteSplitting] Support splitting of blocks with ↵	Florian Hahn	2018-02-14	5	-128/+351
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instrs before call. For basic blocks with instructions between the beginning of the block and a call we have to duplicate the instructions before the call in all split blocks and add PHI nodes for uses of the duplicated instructions after the call. Currently, the threshold for the number of instructions before a call is quite low, to keep the impact on binary size low. Reviewers: junbuml, mcrosier, davidxl, davide Reviewed By: junbuml Differential Revision: https://reviews.llvm.org/D41860 llvm-svn: 325126
*	[LoopInterchange] Incrementally update the dominator tree.	Florian Hahn	2018-02-14	11	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \| \|	We can use incremental dominator tree updates to avoid re-calculating the dominator tree after interchanging 2 loops. Reviewers: dmgreen, kuhar Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D43176 llvm-svn: 325122
*	[Utils] Salvage the debug info of DCE'ed 'and' instructions	Petar Jovanovic	2018-02-14	1	-0/+10
\| \| \| \| \| \| \| \| \| \|	Preserve debug info from a dead 'and' instruction with a constant. Patch by Djordje Todorovic. Differential Revision: https://reviews.llvm.org/D43163 llvm-svn: 325119
*	Adding a width of the GEP index to the Data Layout.	Elena Demikhovsky	2018-02-14	6	-0/+1419
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Making a width of GEP Index, which is used for address calculation, to be one of the pointer properties in the Data Layout. p[address space]:size:memory_size:alignment:pref_alignment:index_size_in_bits. The index size parameter is optional, if not specified, it is equal to the pointer size. Till now, the InstCombiner normalized GEPs and extended the Index operand to the pointer width. It works fine if you can convert pointer to integer for address calculation and all registered targets do this. But some ISAs have very restricted instruction set for the pointer calculation. During discussions were desided to retrieve information for GEP index from the Data Layout. http://lists.llvm.org/pipermail/llvm-dev/2018-January/120416.html I added an interface to the Data Layout and I changed the InstCombiner and some other passes to take the Index width into account. This change does not affect any in-tree target. I added tests to cover data layouts with explicitly specified index size. Differential Revision: https://reviews.llvm.org/D42123 llvm-svn: 325102
*	[InstCombine] put tests of mul with neg operand(s) together; NFC	Sanjay Patel	2018-02-13	4	-58/+64
\| \| \| \|	llvm-svn: 325066
*	[GVN] Salvage debug info from dead insts	Vedant Kumar	2018-02-13	2	-2/+11
\| \| \| \| \| \| \| \| \| \|	This preserves an additional 581 unique source variables in a stage2 build of clang (according to `llvm-dwarfdump --statistics`). It increases the size of the .debug_loc section by 0.1% (or 87139 bytes). Differential Revision: https://reviews.llvm.org/D43255 llvm-svn: 325063