bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SimplifyCFG] Re-apply Relax restriction for folding unconditional branches	Serguei Katkov	2018-02-08	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The commit rL308422 introduces a restriction for folding unconditional branches. Specifically if empty block with unconditional branch leads to header of the loop then elimination of this basic block is prohibited. However it seems this condition is redundantly strict. If elimination of this basic block does not introduce more back edges then we can eliminate this block. The patch implements this relax of restriction. The test profile/Linux/counter_promo_nest.c in compiler-rt project is updated to meet this change. Reviewers: efriedma, mcrosier, pacxx, hsung, davidxl Reviewed By: pacxx Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42691 llvm-svn: 324572
*	[InlineFunction] Update deprecated use of IRBuilder CreateMemCpy (NFC)	Daniel Neilson	2018-02-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the InlineFunction pass to ceause using the old IRBuilder CreateMemCpy single-alignment API in favour of the new API that allows setting source and destination alignments independently. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. ( rL323886, rL323891, rL324148, rL324273, rL324278 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 324384
*	[LowerMemIntrinsics] Update uses of deprecated MemIntrinsic::getAlignment ↵	Daniel Neilson	2018-02-05	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	API (NFC) Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the LowerMemIntrinsics pass to cease using the old getAlignment() API of MemoryIntrinsic in favour of getting source & dest specific alignments through the new API. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. ( rL323886, rL323891, rL324148, rL324273 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 324278
*	[SimplifyLibCalls] Update from deprecated IRBuilder API for creating memory ↵	Daniel Neilson	2018-02-05	1	-25/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	intrinsics (NFC) Summary: This change is part of step five in the series of changes to remove alignment argument from memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the SimplifyLibCalls pass to cease using the old IRBuilder createMemCpy/createMemMove single-alignment APIs in favour of the new API that allows setting source and destination alignments independently. Steps: Step 1) Remove alignment parameter and create alignment parameter attributes for memcpy/memmove/memset. ( rL322965, rC322964, rL322963 ) Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing source and dest alignments. ( rL323597 ) Step 3) Update Clang to use the new IRBuilder API. ( rC323617 ) Step 4) Update Polly to use the new IRBuilder API. ( rL323618 ) Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API, and those that use use MemIntrinsicInst::[get\|set]Alignment() to use [get\|set]DestAlignment() and [get\|set]SourceAlignment() instead. ( rL323886, rL323891, r3L24148 ) Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the MemIntrinsicInst::[get\|set]Alignment() methods. Reference http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html llvm-svn: 324273
*	LTO: Include dso-local bit in ThinLTO cache key.	Peter Collingbourne	2018-02-05	1	-10/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D42713 llvm-svn: 324253
*	Revert r323472 "[Debug] Add dbg.value intrinsics for PHIs created during LCSSA."	Hans Wennborg	2018-02-05	1	-7/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This broke the Chromium build; see PR36238. > This patch is an enhancement to propagate dbg.value information when > Phis are created on behalf of LCSSA. I noticed a case where a value > carried across a loop was reported as <optimized out>. > > Specifically this case: > > int bar(int x, int y) { > return x + y; > } > > int foo(int size) { > int val = 0; > for (int i = 0; i < size; ++i) { > val = bar(val, i); // Both val and i are correct > } > return val; // <optimized out> > } > > In the above case, after all of the interesting computation completes > our value is reported as "optimized out." This change will add a > dbg.value to correct this. > > This patch also moves the dbg.value insertion routine from > LoopRotation.cpp into Local.cpp, so that we can share it in both places > (LoopRotation and LCSSA). > > Patch by Matt Davis! > > Differential Revision: https://reviews.llvm.org/D42551 llvm-svn: 324247
*	Revert [SimplifyCFG] Relax restriction for folding unconditional branches	Serguei Katkov	2018-02-05	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \|	The patch causes the failure of the test compiler-rt/test/profile/Linux/counter_promo_nest.c To unblock buildbot, revert the patch while investigation is in progress. Differential Revision: https://reviews.llvm.org/D42691 llvm-svn: 324214
*	[SimplifyCFG] Relax restriction for folding unconditional branches	Serguei Katkov	2018-02-05	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The commit rL308422 introduces a restriction for folding unconditional branches. Specifically if empty block with unconditional branch leads to header of the loop then elimination of this basic block is prohibited. However it seems this condition is redundantly strict. If elimination of this basic block does not introduce more back edges then we can eliminate this block. The patch implements this relax of restriction. Reviewers: efriedma, mcrosier, pacxx, hsung, davidxl Reviewed By: pacxx Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42691 llvm-svn: 324208
*	[InlineFunction] Set arg attrs even if there only are VarArg attrs.	Florian Hahn	2018-02-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	When using the partial inliner, we might have attributes for forwarded varargs, but the CodeExtractor does not create an empty argument attribute set for regular arguments in that case, because it does not know of the additional arguments. So in case we have attributes for VarArgs, we also have to make sure we create (empty) attributes for all regular arguments. This fixes PR36210. llvm-svn: 324197
*	[LV] Use Demanded Bits and ValueTracking for reduction type-shrinking	Chad Rosier	2018-02-04	1	-72/+144
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The type-shrinking logic in reduction detection, although narrow in scope, is also rather ad-hoc, which has led to bugs (e.g., PR35734). This patch modifies the approach to rely on the demanded bits and value tracking analyses, if available. We currently perform type-shrinking separately for reductions and other instructions in the loop. Long-term, we should probably think about computing minimal bit widths in a more complete way for the loops we want to vectorize. PR35734 Differential Revision: https://reviews.llvm.org/D42309 llvm-svn: 324195
*	Utils: Fix DomTree update for entry block	Matt Arsenault	2018-01-31	1	-5/+14
\| \| \| \| \| \| \|	If SplitBlockPredecessors was used on a function entry block, it wouldn't update the dominator tree. llvm-svn: 323928
*	[GlobalOpt] Fix exponential compile-time with selects.	Eli Friedman	2018-01-31	1	-17/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If you have a long chain of select instructions created from something like `int* p = &g; if (foo()) p += 4; if (foo2()) p += 4;` etc., a naive recursive visitor will recursively visit each select twice, which is O(2^N) in the number of select instructions. Use the visited set to cut off recursion in this case. (No testcase because this doesn't actually change the behavior, just the time.) Differential Revision: https://reviews.llvm.org/D42451 llvm-svn: 323910
*	Teach ValueMapper to use ODR uniqued types when available	Teresa Johnson	2018-01-30	1	-4/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is exposed during ThinLTO compilation, when we import an alias by creating a clone of the aliasee. Without this fix the debug type is unnecessarily cloned and we get a duplicate, undoing the uniquing. Fixes PR36089. Reviewers: mehdi_amini, pcc Subscribers: eraman, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D41669 llvm-svn: 323813
*	[Debug] LCSSA: Insert dbg.value at the first available insertion point	Vedant Kumar	2018-01-25	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \|	Inserting a dbg.value instruction at the start of a basic block with a landingpad instruction triggers a verifier failure. We should be OK if we insert the instruction a bit later. Speculative fix for the bot failure described here: https://reviews.llvm.org/D42551 llvm-svn: 323482
*	[Debug] Add dbg.value intrinsics for PHIs created during LCSSA.	Vedant Kumar	2018-01-25	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch is an enhancement to propagate dbg.value information when Phis are created on behalf of LCSSA. I noticed a case where a value carried across a loop was reported as <optimized out>. Specifically this case: int bar(int x, int y) { return x + y; } int foo(int size) { int val = 0; for (int i = 0; i < size; ++i) { val = bar(val, i); // Both val and i are correct } return val; // <optimized out> } In the above case, after all of the interesting computation completes our value is reported as "optimized out." This change will add a dbg.value to correct this. This patch also moves the dbg.value insertion routine from LoopRotation.cpp into Local.cpp, so that we can share it in both places (LoopRotation and LCSSA). Patch by Matt Davis! Differential Revision: https://reviews.llvm.org/D42551 llvm-svn: 323472
*	[Debug] Add a utility to propagate dbg.value to new PHIs, NFC	Vedant Kumar	2018-01-25	1	-0/+38
\| \| \| \| \| \| \| \| \| \|	This simply moves an existing utility to Utils for reuse. Split out of: https://reviews.llvm.org/D42551 Patch by Matt Davis! llvm-svn: 323471
*	[NFC] fix trivial typos in comments	Hiroshi Inoue	2018-01-19	2	-2/+2
\| \| \| \| \| \|	"the the" -> "the" llvm-svn: 322934
*	Add a ProfileCount class to represent entry counts.	Easwaran Raman	2018-01-17	2	-17/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The class wraps a uint64_t and an enum to represent the type of profile count (real and synthetic) with some helper methods. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41883 llvm-svn: 322771
*	[JumpThreading] Preservation of DT and LVI across the pass	Brian M. Rzycki	2018-01-12	2	-50/+177
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: See D37528 for a previous (non-deferred) version of this patch and its description. Preserves dominance in a deferred manner using a new class DeferredDominance. This reduces the performance impact of updating the DominatorTree at every edge insertion and deletion. A user may call DDT->flush() within JumpThreading for an up-to-date DT. This patch currently has one flush() at the end of runImpl() to ensure DT is preserved across the pass. LVI is also preserved to help subsequent passes such as CorrelatedValuePropagation. LVI is simpler to maintain and is done immediately (not deferred). The code to perform the preversation was minimally altered and simply marked as preserved for the PassManager to be informed. This extends the analysis available to JumpThreading for future enhancements such as threading across loop headers. Reviewers: dberlin, kuhar, sebpop Reviewed By: kuhar, sebpop Subscribers: mgorny, dmgreen, kuba, rnk, rsmith, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40146 llvm-svn: 322401
*	[LoopDeletion] Handle users in unreachable block	Serguei Katkov	2018-01-12	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a fix for PR35884. When we want to delete dead loop we must clean uses in unreachable blocks otherwise we'll get an assert during deletion of instructions from the loop. Reviewers: anna, davide Reviewed By: anna Subscribers: llvm-commits, lebedev.ri Differential Revision: https://reviews.llvm.org/D41943 llvm-svn: 322357
*	[InstCombine] Missed optimization in math expression: sin(x) / cos(x) => tan(x)	Dmitry Venikov	2018-01-11	2	-15/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch enables folding sin(x) / cos(x) -> tan(x), cos(x) / sin(x) -> 1 / tan(x) under -ffast-math flag Reviewers: hfinkel, spatel Reviewed By: spatel Subscribers: andrew.w.kaylor, efriedma, scanon, llvm-commits Differential Revision: https://reviews.llvm.org/D41286 llvm-svn: 322255
*	[NFC] Commit to mention that r322248 is actually made by AndrewScheidecker	Marcello Maggioni	2018-01-11	1	-1/+1
\| \| \| \|	llvm-svn: 322249
*	[SimplifyCFG] Add cut-off for InitializeUniqueCases.	Marcello Maggioni	2018-01-11	1	-13/+25
\| \| \| \| \| \| \| \| \| \| \| \| \|	The function can take a significant amount of time on some complicated test cases, but for the currently only use of the function we can stop the initialization much earlier when we find out we are going to discard the result anyway in the caller of the function. Adding configurable cut-off points so that we avoid wasting time. NFCI. llvm-svn: 322248
*	[CodeExtractor] Use subset of function attributes for extracted function.	Florian Hahn	2018-01-07	1	-4/+74
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In addition to target-dependent attributes, we can also preserve a white-listed subset of target independent function attributes. The white-list excludes problematic attributes, most prominently: * attributes related to memory accesses, as alloca instructions could be moved in/out of the extracted block * control-flow dependent attributes, like no_return or thunk, as the relerelevant instructions might or might not get extracted. Thanks @efriedma and @aemerson for providing a set of attributes that cannot be propagated. Reviewers: efriedma, davidxl, davide, silvas Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D41334 llvm-svn: 321961
*	[InlineFunction] Preserve calling convention when forwarding VarArgs.	Florian Hahn	2018-01-06	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Reviewers: efriedma, rnk, davide Reviewed By: rnk, davide Differential Revision: https://reviews.llvm.org/D41556 llvm-svn: 321943
*	[InlineFunction] Preserve attributes when forwarding VarArgs.	Florian Hahn	2018-01-06	1	-4/+23
\| \| \| \| \| \| \| \| \| \|	Reviewers: rnk, efriedma Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D41555 llvm-svn: 321942
*	[InlineFunction] Inline vararg functions that do not access varargs.	Florian Hahn	2018-01-06	1	-17/+20
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the varargs are not accessed by a function, we can inline the function. Reviewers: dblaikie, chandlerc, davide, efriedma, rnk, hfinkel Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D41335 llvm-svn: 321940
*	[Utils] Simplify salvageDebugInfo, NFCI	Vedant Kumar	2018-01-05	1	-34/+30
\| \| \| \| \| \| \| \| \|	Having a single call to findDbgUsers() allows salvageDebugInfo() to return earlier. Differential Revision: https://reviews.llvm.org/D41787 llvm-svn: 321915
*	Revert "[JumpThreading] Preservation of DT and LVI across the pass"	Reid Kleckner	2018-01-04	2	-173/+50
\| \| \| \| \| \| \|	This reverts r321825, it causes crashes in Chromium. Reproducer forthcoming. llvm-svn: 321832
*	[JumpThreading] Preservation of DT and LVI across the pass	Brian M. Rzycki	2018-01-04	2	-50/+173
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: See D37528 for a previous (non-deferred) version of this patch and its description. Preserves dominance in a deferred manner using a new class DeferredDominance. This reduces the performance impact of updating the DominatorTree at every edge insertion and deletion. A user may call DDT->flush() within JumpThreading for an up-to-date DT. This patch currently has one flush() at the end of runImpl() to ensure DT is preserved across the pass. LVI is also preserved to help subsequent passes such as CorrelatedValuePropagation. LVI is simpler to maintain and is done immediately (not deferred). The code to perfom the preversation was minimally altered and was simply marked as preserved for the PassManager to be informed. This extends the analysis available to JumpThreading for future enhancements. One example is loop boundary threading. Reviewers: dberlin, kuhar, sebpop Reviewed By: kuhar, sebpop Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40146 llvm-svn: 321825
*	Add assertion on DT availability during LI update in UpdateAnalysisInformation	Anna Thomas	2018-01-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	This came up during discussions in llvm-commits for rL321653: Check for unreachable preds before updating LI in UpdateAnalysisInformation The assert provides hints to passes to require both DT and LI if we plan on updating LI through this function. Tests run: make check llvm-svn: 321805
*	[BasicBlockUtils] Check for unreachable preds before updating LI in ↵	Anna Thomas	2018-01-02	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	UpdateAnalysisInformation Summary: We are incorrectly updating the LI when loop-simplify generates dedicated exit blocks for a loop. The issue is that there's an implicit assumption that the Preds passed into UpdateAnalysisInformation are reachable. However, this is not true and breaks LI by incorrectly updating the header of a loop. One such case is when we generate dedicated exits when the exit block is a landing pad (through SplitLandingPadPredecessors). There maybe other cases as well, since we do not guarantee that Preds passed in are reachable basic blocks. The added test case shows how loop-simplify breaks LI for the outer loop (and DT in turn) after we try to generate the LoopSimplifyForm. Reviewers: davide, chandlerc, sanjoy Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41519 llvm-svn: 321653
*	[SimplifyCFG] Return to the pass manager the correct value.	Davide Italiano	2017-12-31	1	-1/+1
\| \| \| \| \| \| \|	I wanted to commit this with r321603, but I failed to squash the two commits. llvm-svn: 321606
*	[Utils/Local] Use `auto` when the type is obvious. NFCI.	Davide Italiano	2017-12-31	1	-6/+6
\| \| \| \|	llvm-svn: 321605
*	[Utils] Remove commented debug message. NFCI.	Davide Italiano	2017-12-31	1	-4/+0
\| \| \| \|	llvm-svn: 321604
*	[SimplifyCFG] Stop hoisting musttail calls incorrectly.	Davide Italiano	2017-12-31	1	-0/+11
\| \| \| \| \| \|	PR35774. llvm-svn: 321603
*	Use phi ranges to simplify code. No functionality change intended.	Benjamin Kramer	2017-12-30	9	-173/+112
\| \| \| \|	llvm-svn: 321585
*	Revert r321377, it causes regression to https://reviews.llvm.org/P8055.	Guozhi Wei	2017-12-28	1	-179/+0
\| \| \| \|	llvm-svn: 321528
*	[Unroll][DebugInfo] Propagate loop body's debug location to epilog preheader	Zhaoshi Zheng	2017-12-26	1	-1/+6
\| \| \| \| \| \| \|	NewExit and epilog PreHeader should has the same debug loc as the original loop body, instead of original loop exit. llvm-svn: 321465
*	Make helpers static. No functionality change.	Benjamin Kramer	2017-12-24	1	-3/+4
\| \| \| \|	llvm-svn: 321425
*	[SimplifyCFG] Don't do if-conversion if there is a long dependence chain	Guozhi Wei	2017-12-22	1	-0/+179
\| \| \| \| \| \| \| \| \| \|	If after if-conversion, most of the instructions in this new BB construct a long and slow dependence chain, it may be slower than cmp/branch, even if the branch has a high miss rate, because the control dependence is transformed into data dependence, and control dependence can be speculated, and thus, the second part can execute in parallel with the first part on modern OOO processor. This patch checks for the long dependence chain, and give up if-conversion if find one. Differential Revision: https://reviews.llvm.org/D39352 llvm-svn: 321377
*	Add hasProfileData() to check if a function has profile data. NFC.	Easwaran Raman	2017-12-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This replaces calls to getEntryCount().hasValue() with hasProfileData that does the same thing. This refactoring is useful to do before adding synthetic function entry counts but also a useful cleanup IMO even otherwise. I have used hasProfileData instead of hasRealProfileData as David had earlier suggested since I think profile implies "real" and I use the phrase "synthetic entry count" and not "synthetic profile count" but I am fine calling it hasRealProfileData if you prefer. Reviewers: davidxl, silvas Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41461 llvm-svn: 321331
*	[SimplifyCFG] Avoid quadratic on a predecessors number behavior in ↵	Michael Zolotukhin	2017-12-21	1	-14/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instruction sinking. If a block has N predecessors, then the current algorithm will try to sink common code to this block N times (whenever we visit a predecessor). Every attempt to sink the common code includes going through all predecessors, so the complexity of the algorithm becomes O(N^2). With this patch we try to sink common code only when we visit the block itself. With this, the complexity goes down to O(N). As a side effect, the moment the code is sunk is slightly different than before (the order of simplifications has been changed), that's why I had to adjust two tests (note that neither of the tests is supposed to test SimplifyCFG): * test/CodeGen/AArch64/arm64-jumptable.ll - changes in this test mimic the changes that previous implementation of SimplifyCFG would do. * test/CodeGen/ARM/avoid-cpsr-rmw.ll - in this test I disabled common code sinking by a command line flag. llvm-svn: 321236
*	[ICP] Expose unconditional call promotion interface	Matthew Simpson	2017-12-20	1	-77/+178
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch modifies the indirect call promotion utilities by exposing and using an unconditional call promotion interface. The unconditional promotion interface (i.e., call promotion without creating an if-then-else) can be used if it's known that an indirect call has only one possible callee. The existing conditional promotion interface uses this unconditional interface to promote an indirect call after it has been versioned and placed within the "then" block. A consequence of unconditional promotion is that the fix-up operations for phi nodes in the normal destination of invoke instructions are changed. This is necessary because the existing implementation assumed that an invoke had been versioned, creating a "merge" block where a return value bitcast could be placed. In the new implementation, the edge between a promoted invoke's parent block and its normal destination is split if needed to add a bitcast for the return value. If the invoke is also versioned, the phi node merging the return value of the promoted and original invoke instructions is placed in the "merge" block. Differential Revision: https://reviews.llvm.org/D40751 llvm-svn: 321210
*	Fix more inconsistent line endings. NFC.	Dimitry Andric	2017-12-18	1	-9/+9
\| \| \| \|	llvm-svn: 321016
*	[Memcpy Loop Lowering] Remove the fixed int8 lowering.	Sean Fertile	2017-12-18	1	-80/+13
\| \| \| \| \| \| \| \|	Switch over to the lowering that uses target supplied operand types. Differential Revision: https://reviews.llvm.org/D41201 llvm-svn: 320989
*	[Memcpy Loop Lowering] Only calculate residual size/bytes copied when needed.	Sean Fertile	2017-12-16	1	-6/+10
\| \| \| \| \| \| \| \|	If the loop operand type is int8 then there will be no residual loop for the unknown size expansion. Dont create the residual-size and bytes-copied values when they are not needed. llvm-svn: 320929
*	[SimplifyLibCalls] Inline calls to cabs when it's safe to do so	Hal Finkel	2017-12-16	1	-0/+33
\| \| \| \| \| \| \| \| \| \| \| \|	When unsafe algerbra is allowed calls to cabs(r) can be replaced by: sqrt(creal(r)creal(r) + cimag(r)cimag(r)) Patch by Paul Walker, thanks! Differential Revision: https://reviews.llvm.org/D40069 llvm-svn: 320901
*	[Memcpy Loop Lowering] Insert loop BB inbetween the split BB.	Sean Fertile	2017-12-15	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \|	The original memcpy expansion inserted the loop basic block inbetween the 2 new basic blocks created by splitting the original block the memcpy call was in. This commit makes the new memcpy expansion do the same to keep the layout of the IR matching between the old and new implementations. Differential Review: https://reviews.llvm.org/D41197 llvm-svn: 320848
*	fix typo in comment and remove inaccurate comment; NFC	Sanjay Patel	2017-12-15	1	-2/+0
\| \| \| \|	llvm-svn: 320838