bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Remove unnecessary fallthrough annotation after unreachable	Reid Kleckner	2018-11-01	1	-2/+0
\| \| \| \| \| \| \| \| \| \|	Clang's -Wimplicit-fallthrough implementation warns on this. I built clang with GCC 7.3 in +asserts and -asserts mode, and GCC doesn't warn on this in either configuration. I think it is unnecessary. I separated it from the large mechanical patch (https://reviews.llvm.org/D53950) in case I am wrong and it has to be reverted. llvm-svn: 345876
*	ADT/STLExtras: Introduce llvm::empty; NFC	Matthias Braun	2018-10-31	2	-3/+3
\| \| \| \| \| \| \| \|	This is modeled after C++17 std::empty(). Differential Revision: https://reviews.llvm.org/D53909 llvm-svn: 345679
*	[Local] Keep K's range if K does not move when combining metadata.	Florian Hahn	2018-10-27	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As K has to dominate I, IIUC I's range metadata must be a subset of K's. After Eli's recent clarification to the LangRef, loading a value outside of the range is undefined behavior. Therefore if I's range contains elements outside of K's range and we would load one such value, K would cause undefined behavior. In cases like hoisting/sinking, we still want the most generic range over all code paths to/from the hoist/sink point. As suggested in the patches related to D47339, I will refactor the handling of those scenarios and try to decouple it from this function as follow up, once we switched to a similar handling of metadata in most of combineMetadata. I updated some tests checking mostly the merging of metadata to keep the metadata of to dominating load. The most interesting one is probably test8 in test/Transforms/JumpThreading/thread-loads.ll. It contained a comment about the alias metadata preventing us to eliminate the branch, but it seem like the actual problem currently is that we merge the ranges of both loads and cannot eliminate the icmp afterwards. With this patch, we manage to eliminate the icmp, as the range of the first load excludes 8. Reviewers: efriedma, nlopes, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D51629 llvm-svn: 345456
*	[DebugInfo][Dexter] Unreachable line stepped onto after SimplifyCFG.	Carlos Alberto Enciso	2018-10-25	2	-18/+45
\| \| \| \| \| \| \| \|	When SimplifyCFG changes the PHI node into a select instruction, the debug line records becomes ambiguous. It causes the debugger to display unreachable source lines. Differential Revision: https://reviews.llvm.org/D53287 llvm-svn: 345250
*	Update MemorySSA in LoopRotate.	Alina Sbirlea	2018-10-24	1	-9/+51
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach LoopRotate to preserve MemorySSA. Enable tests for correctness, dependency disabled by default. Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D51718 llvm-svn: 345216
*	[HotColdSplitting] Identify larger cold regions using domtree queries	Vedant Kumar	2018-10-24	1	-16/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current splitting algorithm works in three stages: 1) Identify cold blocks, then 2) Use forward/backward propagation to mark hot blocks, then 3) Grow a SESE region of blocks outside of the set of hot blocks and start outlining. While testing this pass on Apple internal frameworks I noticed that some kinds of control flow (e.g. loops) are never outlined, even though they unconditionally lead to / follow cold blocks. I noticed two other issues related to how cold regions are identified: - An inconsistency can arise in the internal state of the hotness propagation stage, as a block may end up in both the ColdBlocks set and the HotBlocks set. Further inconsistencies can arise as these sets do not match what's in ProfileSummaryInfo. - It isn't necessary to limit outlining to single-exit regions. This patch teaches the splitting algorithm to identify maximal cold regions and outline them. A maximal cold region is defined as the set of blocks post-dominated by a cold sink block, or dominated by that sink block. This approach can successfully outline loops in the cold path. As a side benefit, it maintains less internal state than the current approach. Due to a limitation in CodeExtractor, blocks within the maximal cold region which aren't dominated by a single entry point (a so-called "max ancestor") are filtered out. Results: - X86 (LNT + -Os + externals): 134KB of TEXT were outlined compared to 47KB pre-patch, or a ~3x improvement. Did not see a performance impact across two runs. - AArch64 (LNT + -Os + externals + Apple-internal benchmarks): 149KB of TEXT were outlined. Ditto re: performance impact. - Outlining results improve marginally in the internal frameworks I tested. Follow-ups: - Outline more than once per function, outline large single basic blocks, & try to remove unconditional branches in outlined functions. Differential Revision: https://reviews.llvm.org/D53627 llvm-svn: 345209
*	[hot-cold-split] Name split functions with ".cold" suffix	Teresa Johnson	2018-10-24	1	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The current default of appending "_"+entry block label to the new extracted cold function breaks demangling. Change the deliminator from "_" to "." to enable demangling. Because the header block label will be empty for release compile code, use "extracted" after the "." when the label is empty. Additionally, add a mechanism for the client to pass in an alternate suffix applied after the ".", and have the hot cold split pass use "cold."+Count, where the Count is currently 1 but can be used to uniquely number multiple cold functions split out from the same function with D53588. Reviewers: sebpop, hiraditya Subscribers: llvm-commits, erik.pilkington Differential Revision: https://reviews.llvm.org/D53534 llvm-svn: 345178
*	[NFC][InstCombine] Undo stray change	Evandro Menezes	2018-10-19	1	-2/+2
\| \| \| \| \| \|	Undo stray change introduced by r344725. llvm-svn: 344814
*	Add a emitUnaryFloatFnCall version that fetches the function name from TLI	Mikael Holmen	2018-10-18	2	-10/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In several places in the code we use the following pattern: if (hasUnaryFloatFn(&TLI, Ty, LibFunc_tan, LibFunc_tanf, LibFunc_tanl)) { [...] Value Res = emitUnaryFloatFnCall(X, TLI.getName(LibFunc_tan), B, Attrs); [...] } In short, we check if there is a lib-function for a certain type, and then we _always_ fetch the name of the "double" version of the lib function and construct a call to the appropriate function, that we just checked exists, using that "double" name as a basis. This is of course a problem in cases where the target doesn't support the "double" version, but e.g. only the "float" version. In that case TLI.getName(LibFunc_tan) returns "", and emitUnaryFloatFnCall happily appends an "f" to "", and we erroneously end up with a call to a function called "f". To solve this, the above pattern is changed to if (hasUnaryFloatFn(&TLI, Ty, LibFunc_tan, LibFunc_tanf, LibFunc_tanl)) { [...] Value Res = emitUnaryFloatFnCall(X, &TLI, LibFunc_tan, LibFunc_tanf, LibFunc_tanl, B, Attrs); [...] } I.e instead of first fetching the name of the "double" version and then letting emitUnaryFloatFnCall() add the final "f" or "l", we let emitUnaryFloatFnCall() fetch the right name from TLI. Reviewers: eli.friedman, efriedma Reviewed By: efriedma Subscribers: efriedma, bjope, llvm-commits Differential Revision: https://reviews.llvm.org/D53370 llvm-svn: 344725
*	[TI removal] Use `Instruction` instead of `TerminatorInst` for	Chandler Carruth	2018-10-18	1	-2/+2
\| \| \| \| \| \|	a variable's type. llvm-svn: 344717
*	[TI removal] Update CodeExtractor to use Instruction directly.	Chandler Carruth	2018-10-18	1	-4/+4
\| \| \| \|	llvm-svn: 344716
*	[InstCombine] Cleanup libfunc attribute inferring	David Bolvansky	2018-10-16	2	-53/+70
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53338 llvm-svn: 344645
*	[NFC] Make LoopSafetyInfo abstract to allow alternative implementations	Max Kazantsev	2018-10-16	1	-1/+1
\| \| \| \|	llvm-svn: 344592
*	[DebugInfo][LCSSA] Rewrite pre-existing debug values outside loop	David Stenberg	2018-10-16	2	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extend LCSSA so that debug values outside loops are rewritten to use the PHI nodes that the pass creates. This fixes PR39019. In that case, we ran LCSSA on a loop that was later on vectorized, which left us with something like this: for.cond.cleanup: %add.lcssa = phi i32 [ %add, %for.body ], [ %34, %middle.block ] call void @llvm.dbg.value(metadata i32 %add, ret i32 %add.lcssa for.body: %add = [...] br i1 %exitcond, label %for.cond.cleanup, label %for.body which later resulted in the debug.value becoming undef when removing the scalar loop (and the location would have probably been wrong for the vectorized case otherwise). As we now may need to query the AvailableVals cache more than once for a basic block, FindAvailableVals() in SSAUpdaterImpl is changed so that it updates the cache for blocks that we do not create a PHI node for, regardless of the block's number of predecessors. The debug value in the attached IR reproducer would not be properly rewritten without this. Debug values residing in blocks where we have not inserted any PHI nodes are currently left as-is by this patch. I'm not sure what should be done with those uses. Reviewers: mattd, aprantl, vsk, probinson Reviewed By: mattd, aprantl Subscribers: jmorse, gbedwell, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D53130 llvm-svn: 344589
*	[CodeExtractor] Erase debug intrinsics in outlined thunks (fix PR22900)	Vedant Kumar	2018-10-15	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Variable updates within the outlined function are invisible to debuggers. This could be improved by defining a DISubprogram for the new function. For the moment, simply erase the debug intrinsics instead. This fixes verifier failures about function-local metadata being used in the wrong function, seen while testing the hot/cold splitting pass. rdar://45142482 Differential Revision: https://reviews.llvm.org/D53267 llvm-svn: 344545
*	[TI removal] Make variables declared as `TerminatorInst` and initialized	Chandler Carruth	2018-10-15	13	-63/+63
\| \| \| \| \| \| \| \| \| \| \| \| \|	by `getTerminator()` calls instead be declared as `Instruction`. This is the biggest remaining chunk of the usage of `getTerminator()` that insists on the narrow type and so is an easy batch of updates. Several files saw more extensive updates where this would cascade to requiring API updates within the file to use `Instruction` instead of `TerminatorInst`. All of these were trivial in nature (pervasively using `Instruction` instead just worked). llvm-svn: 344502
*	[TI removal] Remove `TerminatorInst` from BasicBlockUtils.h	Chandler Carruth	2018-10-15	4	-16/+17
\| \| \| \| \| \| \| \| \|	This requires updating a number of .cpp files to adapt to the new API. I've just systematically updated all uses of `TerminatorInst` within these files te `Instruction` so thta I won't have to touch them again in the future. llvm-svn: 344498
*	[TI removal] Remove TerminatorInst as an input parameter from all public	Chandler Carruth	2018-10-15	1	-1/+1
\| \| \| \| \| \| \| \| \|	LLVM APIs. There weren't very many. We still have the instruction visitor, and APIs with TerminatorInst as a return type or an output parameter. llvm-svn: 344494
*	[InstCombine] Fixed crash with aliased functions	David Bolvansky	2018-10-13	2	-22/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes PR39177 Reviewers: spatel, jbuening Reviewed By: jbuening Subscribers: jbuening, llvm-commits Differential Revision: https://reviews.llvm.org/D53129 llvm-svn: 344454
*	[InstCombine] Fix SimplifyLibCalls erasing an instruction while IC still had ↵	Amara Emerson	2018-10-11	1	-10/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	references to it. InstCombine keeps a worklist and assumes that optimizations don't eraseFromParent() the instruction, which SimplifyLibCalls violates. This change adds a new callback to SimplifyLibCalls to let clients specify their own hander for erasing actions. Differential Revision: https://reviews.llvm.org/D52729 llvm-svn: 344251
*	[IndVars] Drop "exact" flag from lshr and udiv when substituting their args	Max Kazantsev	2018-10-11	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \|	There is a transform that may replace `lshr (x+1), 1` with `lshr x, 1` in case if it can prove that the result will be the same. However the initial instruction might have an `exact` flag set, and it now should be dropped unless we prove that it may hold. Incorrectly set `exact` attribute may then produce poison. Differential Revision: https://reviews.llvm.org/D53061 Reviewed By: sanjoy llvm-svn: 344223
*	Relax trivial cast requirements in CallPromotionUtils	Scott Linder	2018-10-10	1	-6/+8
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D52792 llvm-svn: 344153
*	Revert "[DebugInfo][Dexter] Unreachable line stepped onto after SimplifyCFG."	Carlos Alberto Enciso	2018-10-10	2	-32/+18
\| \| \| \| \| \| \| \|	This reverts commit r344120. It was causing buildbot failures. llvm-svn: 344135
*	[DebugInfo][Dexter] Unreachable line stepped onto after SimplifyCFG.	Carlos Alberto Enciso	2018-10-10	2	-18/+32
\| \| \| \| \| \| \| \|	When SimplifyCFG changes the PHI node into a select instruction, the debug line records becomes ambiguous. It causes the debugger to display unreachable source lines. Differential Revision: https://reviews.llvm.org/D52887 llvm-svn: 344120
*	[NFC] Make a variable const	Max Kazantsev	2018-10-10	1	-1/+1
\| \| \| \|	llvm-svn: 344113
*	[SimplifyCFG] Pass AggressiveInsts to DominatesMergePoint by reference. ↵	Craig Topper	2018-10-04	1	-11/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove null check. Summary: At some point in the past the recursion in DominatesMergePoint used to pass null for AggressiveInsts as part of the recursion. It no longer does this. So there is no way for AggressiveInsts to be null. This passes it by reference and removes the null check to make this explicit. Reviewers: efriedma, reames Reviewed By: efriedma Subscribers: xbolva00, llvm-commits Differential Revision: https://reviews.llvm.org/D52575 llvm-svn: 343828
*	[SimplifyCFG] Change recursive calls to llvm::SimplifyCFG to instead use an ↵	Craig Topper	2018-10-04	1	-29/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	outer while loop to revisit. Summary: The llvm::SimplifyCFG function creates a SimplifyCFGOpt object and calls run on it. There were numerous places reached from this run function that called back out llvm::SimplifyCFG which would create another SimplifyCFGOpt object. This is an inefficient use of stack space at minimum. We are also not passing along the LoopHeaders pointer passed into the outer llvm::SimplifyCFG call. So if its not null we lose it on the first recursion and get nullptr from there on. This patch adds an outer loop around the main BasicBlock simplifying code and adds a flag to the SimplifyCFGOpt class that can be set by to request another iteration. I don't think we can iterate based just on the change flag alone since some of the simplifications delete a basic block entirely leaving nothing to iterate on. Reviewers: bogner, eli.friedman, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52760 llvm-svn: 343816
*	[SimplifyCFG] Use Value::hasNUses instead of 'getNumUses() =='. NFCI	Craig Topper	2018-10-01	1	-1/+1
\| \| \| \| \| \|	getNumUses is linear in the number of uses. Since we're looking for a specific use count, we can use hasNUses which will stop as soon as it determines there are more than N uses instead of walking all of them. llvm-svn: 343550
*	[SimplifyCFG] Update comments that refer to CondBB to say ThenBB instead. NFC	Craig Topper	2018-10-01	1	-4/+4
\| \| \| \| \| \|	There is no variable in this function named CondBB, but there is one named ThenBB and I believe the comments are all refering to it. llvm-svn: 343548
*	Use the container form llvm::sort(C, ...)	Fangrui Song	2018-09-30	1	-12/+9
\| \| \| \| \| \| \|	There are a few leftovers in rL343163 which span two lines. This commit changes these llvm::sort(C.begin(), C.end, ...) to llvm::sort(C, ...) llvm-svn: 343426
*	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	Fangrui Song	2018-09-27	6	-18/+15
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: The convenience wrapper in STLExtras is available since rL342102. Reviewers: dblaikie, javed.absar, JDevlieghere, andreadb Subscribers: MatzeB, sanjoy, arsenm, dschuff, mehdi_amini, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, eraman, aheejin, kbarton, JDevlieghere, javed.absar, gbedwell, jrtc27, mgrang, atanasyan, steven_wu, george.burgess.iv, dexonsmith, kristina, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D52573 llvm-svn: 343163
*	Remove LoopID metadata from the branch instruction	Vyacheslav Zakharin	2018-09-26	1	-1/+5
\| \| \| \| \| \| \| \|	that follows the peeled iterations. Differential Revision: https://reviews.llvm.org/D52176 llvm-svn: 343054
*	[LoopUnroll] Add check to Latch's terminator in UnrollRuntimeLoopRemainder	David Green	2018-09-25	1	-5/+19
\| \| \| \| \| \| \| \| \| \| \| \| \|	In this patch, I'm adding an extra check to the Latch's terminator in llvm::UnrollRuntimeLoopRemainder, similar to how it is already done in the llvm::UnrollLoop. The compiler would crash if this function is called with a malformed loop. Patch by Rodrigo Caetano Rocha! Differential Revision: https://reviews.llvm.org/D51486 llvm-svn: 342958
*	[InstCombine] Disable strcmp->memcmp transform for MSan.	Matt Morehouse	2018-09-19	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The strcmp->memcmp transform can make the resulting memcmp read uninitialized data, which MSan doesn't like. Resolves https://github.com/google/sanitizers/issues/993. Reviewers: eugenis, xbolva00 Reviewed By: eugenis Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D52272 llvm-svn: 342582
*	[InstCombine] Don't transform sin/cos -> tanl if for half types	Benjamin Kramer	2018-09-19	1	-0/+2
\| \| \| \| \| \| \|	This is still unsafe for long double, we will transform things into tanl even if tanl is for another type. But that's for someone else to fix. llvm-svn: 342542
*	[DebugInfo][Dexter] Speculated BB presents illegal variable value to debugger.	Carlos Alberto Enciso	2018-09-19	2	-2/+13
\| \| \| \| \| \| \| \|	When SimplifyCFG changes the PHI node into a select instruction, the debug information becomes ambiguous. It causes the debugger to display wrong variable value. Differential Revision: https://reviews.llvm.org/D51976 llvm-svn: 342527
*	[SimplifyCFG] Put an alignment on generated switch tables	David Green	2018-09-12	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	Previously the alignment on the newly created switch table data was not set, meaning that DataLayout::getPreferredAlignment was free to overalign it to 16 bytes. This causes unnecessary code bloat. Differential Revision: https://reviews.llvm.org/D51800 llvm-svn: 342039
*	Break LoopUtils into an Analysis file.	Vikram TV	2018-09-12	1	-988/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The InductionDescriptor and RecurrenceDescriptor classes basically analyze the IR to identify the respective IVs. So, it is better to have them in the "Analysis" directory instead of the "Transforms" directory. The rationale for this is to make the Induction and Recurrence descriptor classes available for analysis passes. Currently including them in an analysis pass produces link error (http://lists.llvm.org/pipermail/llvm-dev/2018-July/124456.html). Induction and Recurrence descriptors are moved from Transforms/Utils/LoopUtils.h\|cpp to Analysis/IVDescriptors.h\|cpp. Reviewers: dmgreen, llvm-commits, hfinkel Reviewed By: dmgreen Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D51153 llvm-svn: 342016
*	Don't create a temporary vector of loop blocks just to iterate over them.	Benjamin Kramer	2018-09-10	1	-2/+1
\| \| \| \| \| \|	Loop's getBlocks returns an ArrayRef. llvm-svn: 341821
*	Move a transformation routine from LoopUtils to LoopVectorize.	Vikram TV	2018-09-10	1	-68/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Move InductionDescriptor::transform() routine from LoopUtils to its only uses in LoopVectorize.cpp. Specifically, the function is renamed as InnerLoopVectorizer::emitTransformedIndex(). This is a child to D51153. Reviewers: dmgreen, llvm-commits Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D51837 llvm-svn: 341776
*	Move createMinMaxOp() out of RecurrenceDescriptor.	Vikram TV	2018-09-10	1	-48/+47
\| \| \| \| \| \| \| \| \| \|	Reviewers: dmgreen, llvm-commits Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D51838 llvm-svn: 341773
*	[MemorySSA] Update MemoryPhi wiring for block splitting to consider if ↵	Alina Sbirlea	2018-09-07	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	identical edges were merged. Summary: Block splitting is done with either identical edges being merged, or not. Only critical edges can be split without merging identical edges based on an option. Teach the memoryssa updater to take this into account: for the same edge between two blocks only move one entry from the Phi in Old to the new Phi in New. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D51563 llvm-svn: 341709
*	[x86/SLH] Add a real Clang flag and LLVM IR attribute for Speculative	Chandler Carruth	2018-09-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Load Hardening. Wires up the existing pass to work with a proper IR attribute rather than just a hidden/internal flag. The internal flag continues to work for now, but I'll likely remove it soon. Most of the churn here is adding the IR attribute. I talked about this Kristof Beyls and he seemed at least initially OK with this direction. The idea of using a full attribute here is that we do expect at least some forms of this for other architectures. There isn't anything inherently x86-specific about this technique, just that we only have an implementation for x86 at the moment. While we could potentially expose this as a Clang-level attribute as well, that seems like a good question to defer for the moment as it isn't 100% clear whether that or some other programmer interface (or both?) would be best. We'll defer the programmer interface side of this for now, but at least get to the point where the feature can be enabled without relying on implementation details. This also allows us to do something that was really hard before: we can enable just the indirect call retpolines when using SLH. For x86, we don't have any other way to mitigate indirect calls. Other architectures may take a different approach of course, and none of this is surfaced to user-level flags. Differential Revision: https://reviews.llvm.org/D51157 llvm-svn: 341363
*	[SLC] Support expanding pow(x, n+0.5) to x * x * ... * sqrt(x)	Florian Hahn	2018-09-03	1	-14/+52
\| \| \| \| \| \| \| \| \| \|	Reviewers: evandro, efriedma, spatel Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D51435 llvm-svn: 341330
*	[InstCombine] Expand the simplification of pow() into exp2()	Evandro Menezes	2018-08-30	1	-5/+27
\| \| \| \| \| \| \| \| \| \| \| \| \|	Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 341095
*	Revert "[SimplifyCFG] Common debug handling [NFC]"	Martin Storsjo	2018-08-30	1	-0/+8
\| \| \| \| \| \| \| \| \|	This reverts commit r340997. This change turned out not to be NFC after all, but e.g. causes clang to crash when building the linux kernel for aarch64. llvm-svn: 341031
*	[NFC] Move OrderedInstructions and InstructionPrecedenceTracking to Analysis	Max Kazantsev	2018-08-30	4	-153/+0
\| \| \| \| \| \| \| \|	These classes don't make any changes to IR and have no reason to be in Transform/Utils. This patch moves them to Analysis folder. This will allow us reusing these classes in some analyzes, like MustExecute. llvm-svn: 341015
*	[SimplifyCFG] Rename a variable for readibility of a future change [NFC]	Philip Reames	2018-08-30	1	-8/+9
\| \| \| \|	llvm-svn: 341004
*	[SimplifyCFG] Fix a cost modeling oversight in branch commoning	Philip Reames	2018-08-30	1	-2/+8
\| \| \| \| \| \| \| \|	The cost modeling was not accounting for the fact we were duplicating the instruction once per predecessor. With a default threshold of 1, this meant we were actually creating #pred copies. Adding to the fun, there is absolutely no test coverage for this. Simply bailing for more than one predecessor passes all checked in tests. llvm-svn: 341001
*	[SimplifyCFG] Common debug handling [NFC]	Philip Reames	2018-08-29	1	-8/+0
\| \| \| \|	llvm-svn: 340997