bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Variable names start with an upper case letter; NFC	Sanjay Patel	2015-12-31	1	-4/+4
\| \| \| \|	llvm-svn: 256676
*	fix formatting; NFC	Sanjay Patel	2015-12-31	1	-17/+22
\| \| \| \|	llvm-svn: 256675
*	[ThinLTO] Rename variables used in metadata linking (NFC)	Teresa Johnson	2015-12-30	1	-5/+5
\| \| \| \| \| \| \| \|	As suggested in review for r255909, rename MDMaterialized to AllowTemps, and identify the name of the boolean flag being set in calls to saveMetadataList. llvm-svn: 256653
*	[Transforms] Use asserts instead of ifs around llvm_unreachable. NFC	Craig Topper	2015-12-25	1	-34/+20
\| \| \| \|	llvm-svn: 256405
*	Nonnull elements in OperandBundleCallSites are not all Instructions	Sanjoy Das	2015-12-19	1	-3/+2
\| \| \| \| \| \| \| \| \| \|	`CloneAndPruneIntoFromInst` sometimes RAUW's dead instructions with `undef` before erasing them (to avoid deleting instructions that still have uses). This changes the `WeakVH` in `OperandBundleCallSites` to hold an `undef`, and we need to guard against this eventuality in `llvm::InlineFunction`. llvm-svn: 256110
*	Clean up the processing of dbg.value in various places	Keno Fischer	2015-12-19	1	-4/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: First up is instcombine, where in the dbg.declare -> dbg.value conversion, the llvm.dbg.value needs to be called on the actual loaded value, rather than the address (since the whole point of this transformation is to be able to get rid of the alloca). Further, now that that's cleaned up, we can remove a hack in the backend that would add an implicit OP_deref if the argument to dbg.value was an alloca. This stems from before the existence of DIExpression and is no longer necessary since the deref can be expressed explicitly. Now, in order to make sure that the tests pass with this change, we need to correct the printing of DEBUG_VALUE comments to take into account the expression, which wasn't taken into account before. Unfortunately, for both these changes, there were a number of incorrect test cases (mostly the wrong number of DW_OP_derefs, but also a couple where the test itself was broken more badly). aprantl and I have gone through and adjusted these test cases in order to make them pass with these fixes and in some cases to make sure they're actually testing what they are meant to test. Reviewers: aprantl Subscribers: dsanders Differential Revision: http://reviews.llvm.org/D14186 llvm-svn: 256077
*	[WinEH] Update LCSSA to handle catchswitch with handlers inside and outside ↵	Andrew Kaylor	2015-12-18	1	-0/+7
\| \| \| \| \| \| \| \|	a loop Differential Revision: http://reviews.llvm.org/D15630 llvm-svn: 256005
*	[ThinLTO/LTO] Don't link in unneeded metadata	Teresa Johnson	2015-12-18	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Third patch split out from http://reviews.llvm.org/D14752. Only map in needed DISubroutine metadata (imported or otherwise linked in functions and other DISubroutine referenced by inlined instructions). This is supported for ThinLTO, LTO and llvm-link --only-needed, with associated tests for each one. Depends on D14838. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14843 llvm-svn: 256003
*	[ThinLTO] Metadata linking for imported functions	Teresa Johnson	2015-12-17	1	-20/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Second patch split out from http://reviews.llvm.org/D14752. Maps metadata as a post-pass from each module when importing is complete, suturing up final metadata to the temporary metadata left on the imported instructions. This entails saving the mapping from bitcode value id to temporary metadata in the importing pass, and from bitcode value id to final metadata during the metadata linking postpass. Depends on D14825. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14838 llvm-svn: 255909
*	LPM: Make callers of LPM.deleteLoopFromQueue update LoopInfo directly. NFC	Justin Bogner	2015-12-16	1	-8/+5
\| \| \| \| \| \| \| \| \| \| \|	As of r255720, the loop pass manager will DTRT when passes update the loop info for removed loops, so they no longer need to reach into LPPassManager APIs to do this kind of transformation. This change very nearly removes the need for the LPPassManager to even be passed into loop passes - the only remaining pass that uses the LPM argument is LoopUnswitch. llvm-svn: 255797
*	[SimplifyCFG] Don't create unnecessary PHIs	James Molloy	2015-12-16	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	In conditional store merging, we were creating PHIs when we didn't need to. If the value to be predicated isn't defined in the block we're predicating, then it doesn't need a PHI at all (because we only deal with triangles and diamonds, any value not in the predicated BB must dominate the predicated BB). This fixes a large code size increase in some benchmarks in a popular embedded benchmark suite. Now with a fix (and fixed tests) for the conformance issue seen in Chromium. llvm-svn: 255767
*	[WinEH] Use operand bundles to describe call sites	David Majnemer	2015-12-15	2	-35/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SimplifyCFG allows tail merging with code which terminates in unreachable which, in turn, makes it possible for an invoke to end up in a funclet which it was not originally part of. Using operand bundles on invokes allows us to determine whether or not an invoke was part of a funclet in the source program. Furthermore, it allows us to unambiguously answer questions about the legality of inlining into call sites which the personality may have trouble with. Differential Revision: http://reviews.llvm.org/D15517 llvm-svn: 255674
*	LPM: Stop threading `Pass *` through all of the loop utility APIs. NFC	Justin Bogner	2015-12-15	5	-100/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A large number of loop utility functions take a `Pass *` and reach into it to find out which analyses to preserve. There are a number of problems with this: - The APIs have access to pretty well any Pass state they want, so it's hard to tell what they may or may not do. - Other APIs have copied these and pass around a `Pass *` even though they don't even use it. Some of these just hand a nullptr to the API since the callers don't even have a pass available. - Passes in the new pass manager don't work like the current ones, so the APIs can't be used as is there. Instead, we should explicitly thread the analysis results that we actually care about through these APIs. This is both simpler and more reusable. llvm-svn: 255669
*	[SimplifyCFG] allow speculation of exactly one expensive instruction (PR24818)	Sanjay Patel	2015-12-15	1	-4/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the last general step to allow more IR-level speculation with a safety harness in place in CodeGenPrepare. The intent is to restore the behavior enabled by: http://reviews.llvm.org/rL228826 but prevent bad performance such as: https://llvm.org/bugs/show_bug.cgi?id=24818 Earlier patches in this sequence: D12882 (disable SimplifyCFG speculation for expensive instructions) D13297 (have CGP despeculate expensive ops) D14630 (have CGP despeculate special versions of cttz/ctlz) As shown in the test cases, we only have two instructions currently affected: ctz for some x86 and fdiv generally. Allowing exactly one expensive instruction is a bit of a hack, but it lines up with what is currently implemented in CGP. If we make the despeculation more general in CGP, we can make the speculation here more liberal. A follow-up patch will adjust the cost for sqrt and possibly other typically expensive math intrinsics (currently everything is cheap by default). GPU targets would likely want to override those expensive default costs (just as they probably should already override the cost of div/rem) because just about any math is cheaper than control-flow on those targets. Differential Revision: http://reviews.llvm.org/D15213 llvm-svn: 255660
*	Revert "Don't create unnecessary PHIs"	Reid Kleckner	2015-12-14	1	-5/+0
\| \| \| \| \| \| \| \| \|	This reverts commit r255489. It causes test failures in Chromium and does not appear to respect the AlternativeV parameter. llvm-svn: 255562
*	[IR] Remove terminatepad	David Majnemer	2015-12-14	3	-21/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	It turns out that terminatepad gives little benefit over a cleanuppad which calls the termination function. This is not sufficient to implement fully generic filters but MSVC doesn't support them which makes terminatepad a little over-designed. Depends on D15478. Differential Revision: http://reviews.llvm.org/D15479 llvm-svn: 255522
*	getParent() ^ 3 == getModule() ; NFCI	Sanjay Patel	2015-12-14	2	-5/+3
\| \| \| \|	llvm-svn: 255511
*	Don't create unnecessary PHIs	James Molloy	2015-12-14	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \|	In conditional store merging, we were creating PHIs when we didn't need to. If the value to be predicated isn't defined in the block we're predicating, then it doesn't need a PHI at all (because we only deal with triangles and diamonds, any value not in the predicated BB must dominate the predicated BB). This fixes a large code size increase in some benchmarks in a popular embedded benchmark suite. llvm-svn: 255489
*	[IR] Reformulate LLVM's EH funclet IR	David Majnemer	2015-12-12	5	-62/+135
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While we have successfully implemented a funclet-oriented EH scheme on top of LLVM IR, our scheme has some notable deficiencies: - catchendpad and cleanupendpad are necessary in the current design but they are difficult to explain to others, even to seasoned LLVM experts. - catchendpad and cleanupendpad are optimization barriers. They cannot be split and force all potentially throwing call-sites to be invokes. This has a noticeable effect on the quality of our code generation. - catchpad, while similar in some aspects to invoke, is fairly awkward. It is unsplittable, starts a funclet, and has control flow to other funclets. - The nesting relationship between funclets is currently a property of control flow edges. Because of this, we are forced to carefully analyze the flow graph to see if there might potentially exist illegal nesting among funclets. While we have logic to clone funclets when they are illegally nested, it would be nicer if we had a representation which forbade them upfront. Let's clean this up a bit by doing the following: - Instead, make catchpad more like cleanuppad and landingpad: no control flow, just a bunch of simple operands; catchpad would be splittable. - Introduce catchswitch, a control flow instruction designed to model the constraints of funclet oriented EH. - Make funclet scoping explicit by having funclet instructions consume the token produced by the funclet which contains them. - Remove catchendpad and cleanupendpad. Their presence can be inferred implicitly using coloring information. N.B. The state numbering code for the CLR has been updated but the veracity of its output cannot be spoken for. An expert should take a look to make sure the results are reasonable. Reviewers: rnk, JosephTremoulet, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D15139 llvm-svn: 255422
*	[Mem2Reg] Respect optnone	James Molloy	2015-12-11	1	-0/+3
\| \| \| \| \| \| \| \|	Mem2Reg shouldn't be optimizing a function that is marked optnone. There is a test checking this that fails when mem2reg is explicitly added to the standard pass pipeline. llvm-svn: 255336
*	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI	Sanjoy Das	2015-12-10	2	-5/+3
\| \| \| \| \| \| \| \| \| \|	- This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226
*	Use WeakVH to keep track of calls with operand bundles in CloneCodeInfo	Sanjoy Das	2015-12-09	1	-1/+3
\| \| \| \| \| \| \| \|	`CloneAndPruneIntoFromInst` can DCE instructions after cloning them into the new function, and so an AssertingVH is too strong. This change switches CloneCodeInfo to use a std::vector<WeakVH>. llvm-svn: 255148
*	Delete trailing whitespace; NFC	Sanjoy Das	2015-12-09	1	-1/+1
\| \| \| \|	llvm-svn: 255147
*	Revert "Revert r253253 and r253126: "Don't recompute LCSSA after ↵	Michael Zolotukhin	2015-12-09	1	-2/+12
\| \| \| \| \| \| \| \| \| \| \|	loop-unrolling when possible."" The bug in IndVarSimplify was fixed in r254976, r254977, so I'm reapplying the original patch for avoiding redundant LCSSA recomputation. This reverts commit ffe3b434e505e403146aff00be0c177bb6d13466. llvm-svn: 255133
*	Re-commit r255115, with the PredicatedScalarEvolution class moved to	Silviu Baranga	2015-12-09	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ScalarEvolution.h, in order to avoid cyclic dependencies between the Transform and Analysis modules: [LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewriting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and then P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, thereby avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember previous SCEV rewriting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255122
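The ordering problem described in this message can be illustrated with a toy model. This is only a sketch: the tuple encoding and the helper names `apply_p1` and `apply_p2` are hypothetical and are not LLVM APIs, and the rewrite rules follow the simplified notation of the commit message rather than real ScalarEvolution folding.

```python
# Toy expressions (hypothetical encoding, not LLVM's):
#   ("addrec", start, step)  stands for {start,+,step}
#   ("sext", inner)          stands for sext(inner)

def apply_p1(expr):
    """P1: the AddRec {a,+,b} has nsw, so a sext wrapped around exactly
    that AddRec can be dropped. Any other expression is left alone."""
    if expr == ("sext", ("addrec", "a", "b")):
        return ("addrec", "a", "b")
    return expr

def apply_p2(expr):
    """P2: b == 1. Replace the symbolic step b with the constant 1."""
    if expr[0] == "sext":
        return ("sext", apply_p2(expr[1]))
    if expr[0] == "addrec":
        start, step = expr[1], expr[2]
        return ("addrec", start, 1 if step == "b" else step)
    return expr

expr = ("sext", ("addrec", "a", "b"))

# P1 first: the sext is removed, then the step is specialized.
p1_then_p2 = apply_p2(apply_p1(expr))

# P2 first: the AddRec becomes {a,+,1}, which P1 no longer recognizes,
# so the sext stays in place.
p2_then_p1 = apply_p1(apply_p2(expr))
```

In this toy, `p1_then_p2` ends up as the bare AddRec while `p2_then_p1` keeps the `sext`, which is the order dependence the message says the layer avoids by feeding earlier results into later rewrites.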
*	Revert r255115 until we figure out how to fix the bot failures.	Silviu Baranga	2015-12-09	2	-45/+2
\| \| \| \|	llvm-svn: 255117
*	[LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV ↵	Silviu Baranga	2015-12-09	2	-2/+45
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewriting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and then P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, thereby avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember previous SCEV rewriting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255115
*	Return a std::unique_ptr from CloneModule. NFC.	Rafael Espindola	2015-12-08	1	-13/+15
\| \| \| \|	llvm-svn: 255078
*	[OperandBundles] Fix a transform in simplifycfg	Sanjoy Das	2015-12-08	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	Reviewers: pcc, majnemer, reames Subscribers: reames, llvm-commits Differential Revision: http://reviews.llvm.org/D15345 llvm-svn: 255062
*	[OperandBundles] Remove unnecessary constructor	Sanjoy Das	2015-12-08	1	-1/+1
\| \| \| \| \| \| \| \|	The StringRef constructor is unnecessary (since we're converting to std::string anyway), and having it requires an explicit call to StringRef's or std::string's constructor. llvm-svn: 255000
*	Create llvm.global_ctors in the new format.	Rafael Espindola	2015-12-06	1	-2/+2
\| \| \| \|	llvm-svn: 254878
*	[SimplifyLibCalls] Optimization for pow(x, n) where n is some constant	Weiming Zhao	2015-12-04	1	-0/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In order to avoid calling the pow function, we generate repeated fmuls when n is a positive or negative whole number. For each exponent we pre-compute addition chains in order to minimize the number of fmuls. Refer: http://wwwhomes.uni-bielefeld.de/achim/addition_chain.html We pre-compute addition chains for exponents up to 32 (which results in a max of 7 fmuls). For example: 4 = 2+2, 5 = 2+3, 6 = 3+3, and so on. Hence, pow(x, 4.0) ==> y = fmul x, x; x = fmul y, y; ret x. For negative exponents, we simply compute the reciprocal of the final result. Note: This transformation is only enabled under fast-math. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: weimingz, majnemer, escha, davide, scanon, joerg Subscribers: probinson, escha, llvm-commits Differential Revision: http://reviews.llvm.org/D13994 llvm-svn: 254776
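The addition-chain idea in this message can be sketched outside of LLVM as follows. This is an illustration under stated assumptions, not the patch's code: the `SPLITS` table and the names `pow_by_chain` and `pow_const` are hypothetical, only the splits for 4, 5 and 6 come from the message, and the sketch covers nonzero exponents with magnitude up to 6 rather than 32.

```python
# Hypothetical split table: exponent -> (i, j) with i + j == exponent.
# The entries for 4, 5 and 6 are the ones quoted in the commit message;
# 2 = 1+1 and 3 = 1+2 are the base splits assumed for this sketch.
SPLITS = {2: (1, 1), 3: (1, 2), 4: (2, 2), 5: (2, 3), 6: (3, 3)}

def pow_by_chain(x, n):
    """Return (x**n, multiply_count) for 1 <= n <= 6.

    Each power is computed once and reused, so the multiply count is the
    number of distinct chain entries needed to reach n."""
    cache = {1: x}
    muls = 0

    def build(k):
        nonlocal muls
        if k not in cache:
            i, j = SPLITS[k]
            cache[k] = build(i) * build(j)
            muls += 1
        return cache[k]

    return build(n), muls

def pow_const(x, n):
    """Negative whole exponents take the reciprocal of the positive result."""
    value, muls = pow_by_chain(x, abs(n))
    return (1.0 / value if n < 0 else value), muls
```

For example, `pow_const(2.0, 4)` needs two multiplies (x*x, then the square of that), matching the two fmuls in the pow(x, 4.0) expansion quoted in the message.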
*	Move EH-specific helper functions to a more appropriate place	David Majnemer	2015-12-02	1	-1/+1
\| \| \| \| \| \|	No functionality change is intended. llvm-svn: 254562
*	Bring r254336 back:	Rafael Espindola	2015-12-01	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The difference is that now we don't error on out-of-comdat access to internal global values. We copy them instead. This seems to match the expectation of COFF linkers (see pr25686). Original message: Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct calls to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via MapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254418
*	[safestack] Protect byval function arguments.	Evgeniy Stepanov	2015-12-01	1	-5/+11
\| \| \| \| \| \| \|	Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353
*	This reverts commit r254336 and r254344.	Rafael Espindola	2015-11-30	1	-3/+3
\| \| \| \| \| \|	They broke a bot and I am debugging why. llvm-svn: 254347
*	Start deciding earlier what to link.	Rafael Espindola	2015-11-30	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct calls to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via MapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254336
*	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math.	Davide Italiano	2015-11-30	1	-1/+9
\| \| \| \|	llvm-svn: 254317
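The identity behind this transform, log(exp2(y)) = y*log(2), can be checked numerically. The sketch below is plain Python rather than LLVM code, and the function names are hypothetical; the two forms agree only up to floating-point rounding, which is consistent with the rewrite being limited to fast-math.

```python
import math

def log_of_exp2(y):
    # Original form: natural log of 2**y.
    return math.log(2.0 ** y)

def y_times_log2(y):
    # Rewritten form: one multiply by the constant log(2).
    return y * math.log(2.0)
```

For moderate values such as y = 0.5 or y = 10.0 the two functions return results that differ by at most a few units in the last place.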
*	[SimplifyLibCalls] Don't crash if the function doesn't have a name.	Davide Italiano	2015-11-29	1	-3/+2
\| \| \| \|	llvm-svn: 254265
*	[SimplifyLibCalls] Cross out implemented transformations.	Davide Italiano	2015-11-29	1	-2/+0
\| \| \| \|	llvm-svn: 254264
*	[SimplifyLibCalls] Transform log(pow(x, y)) -> y*log(x).	Davide Italiano	2015-11-29	1	-5/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This one is enabled only under -ffast-math. There are cases where the difference between the value computed and the correct value is huge even for ffast-math, e.g. as Steven pointed out: x = -1, y = -4: log(pow(-1, -4)) = 0 but -4*log(-1) = NaN. I checked what GCC does and apparently they do the same optimization (which results in the dramatic difference). Future work might try to make this (slightly) less bad. Differential Revision: http://reviews.llvm.org/D14400 llvm-svn: 254263
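The x = -1, y = -4 case mentioned in this message can be reproduced with a small sketch. It is written in Python, not LLVM or C, and the function names are hypothetical. One deliberate difference is labeled in the code: Python's `math.log` raises `ValueError` for a negative argument where C's `log` returns NaN, so the sketch maps that error to NaN.

```python
import math

def log_of_pow(x, y):
    # Original form: log(pow(x, y)).
    return math.log(math.pow(x, y))

def y_times_log(x, y):
    # Rewritten form: y * log(x).
    # Assumption for this sketch: emulate C's NaN result for log of a
    # negative number, since Python raises ValueError instead.
    try:
        return y * math.log(x)
    except ValueError:
        return math.nan
```

With x = -1.0 and y = -4.0, `log_of_pow` evaluates log(1.0) while `y_times_log` takes the log of a negative number, so the two forms disagree completely. For positive x, such as x = 2.0 and y = 3.0, they agree up to rounding.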
*	[SimplifyLibCalls] Use any_of(). Suggested by David Blaikie!	Davide Italiano	2015-11-28	1	-4/+3
\| \| \| \|	llvm-svn: 254239
*	[SimplifyLibCalls] Fix inverted condition that led to an uninitialized ↵	Benjamin Kramer	2015-11-28	1	-2/+2
\| \| \| \| \| \| \| \|	memory read below. Found by msan! llvm-svn: 254238
*	Simplify the linking of recursive data.	Rafael Espindola	2015-11-27	1	-2/+10
\| \| \| \| \| \| \| \|	Now the ValueMapper has two callbacks. The first one maps the declaration. The ValueMapper records the mapping and then materializes the body/initializer. llvm-svn: 254209
*	[SimplifyLibCalls] Use range-based loop. NFC.	Davide Italiano	2015-11-27	1	-4/+2
\| \| \| \|	llvm-svn: 254193
*	[SimplifyLibCalls] Don't depend on a called function having a name, it might ↵	Benjamin Kramer	2015-11-26	1	-11/+8
\| \| \| \| \| \| \| \|	be an indirect call. Fixes the crasher in PR25651 and related crashers using the same pattern. llvm-svn: 254145
*	[OperandBundles] Extract duplicated code into a helper function, NFC	Sanjoy Das	2015-11-25	1	-5/+1
\| \| \| \|	llvm-svn: 254047
*	[Utils] Put includes in correct order. NFC.	Weiming Zhao	2015-11-24	8	-10/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Followed the guidelines in: http://llvm.org/docs/CodingStandards.html#include-style However, I noticed that uppercase named headers come before lowercase ones throughout the codebase. So kept them as is. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide, jmolloy, atrick Subscribers: sanjoy Differential Revision: http://reviews.llvm.org/D14939 llvm-svn: 254005
*	[SimplifyLibCalls] Removed some TODOs which are already implemented. NFC.	Weiming Zhao	2015-11-21	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: D14302 implements tan(atan(x)) -> x D14045 implements pow(exp(x), y) -> exp(x*y) Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide Differential Revision: http://reviews.llvm.org/D14882 llvm-svn: 253768
*	Fix the debug build breakage that getDiscriminator is called by mistake.	Dehao Chen	2015-11-19	1	-1/+2
\| \| \| \|	llvm-svn: 253597