path: root/llvm/lib/Transforms
Commit message | Author | Age | Files | Lines
* [LDist] Port to new streaming API for opt remarks | Adam Nemet | 2016-09-30 | 1 | -17/+26
    llvm-svn: 282838
* [LoopUnroll] Port to the new streaming interface for opt remarks. | Adam Nemet | 2016-09-30 | 2 | -31/+41
    llvm-svn: 282834
* [thinlto] Don't decay threshold for hot callsites | Piotr Padlewski | 2016-09-30 | 1 | -5/+22
    Summary: We don't want to decay hot callsites to import chains of hot
    callsites. The same mechanism is used in LIPO.
    Reviewers: tejohnson, eraman, mehdi_amini
    Subscribers: llvm-commits, mehdi_amini
    Differential Revision: https://reviews.llvm.org/D24976
    llvm-svn: 282833
* [LoopDataPrefetch] Port to new streaming API for opt remarks | Adam Nemet | 2016-09-30 | 1 | -1/+2
    llvm-svn: 282826
* [LV] Port the remarks in processLoop to the new streaming API | Adam Nemet | 2016-09-30 | 1 | -22/+39
    This completes LV.
    llvm-svn: 282821
* [LV] Port the last opt remark in Hints to the new streaming interface | Adam Nemet | 2016-09-30 | 1 | -5/+6
    llvm-svn: 282820
* [LAA, LV] Port to new streaming interface for opt remarks. Update LV | Adam Nemet | 2016-09-30 | 1 | -3/+6
    (Recommit after making sure IsVerbose gets properly initialized in
    DiagnosticInfoOptimizationBase. See previous commit that takes care of this.)
    OptimizationRemarkAnalysis directly takes the role of the report that is
    generated by LAA. Then we need the magic to be able to turn an LAA remark
    into an LV remark. This is done via a new OptimizationRemark ctor.
    llvm-svn: 282813
* [InstCombine] fix function names; NFC | Sanjay Patel | 2016-09-29 | 2 | -44/+48
    Also, make foldSelectExtConst() a member of InstCombiner, remove unnecessary
    parameters from its interface, and group visitSelectInst helpers together in
    the header file.
    llvm-svn: 282796
* Revert "[LAA, LV] Port to new streaming interface for opt remarks. Update LV" | Adam Nemet | 2016-09-29 | 1 | -6/+3
    This reverts commit r282758. There are some clang failures I haven't seen.
    llvm-svn: 282759
* [LAA, LV] Port to new streaming interface for opt remarks. Update LV | Adam Nemet | 2016-09-29 | 1 | -3/+6
    OptimizationRemarkAnalysis directly takes the role of the report that is
    generated by LAA. Then we need the magic to be able to turn an LAA remark
    into an LV remark. This is done via a new OptimizationRemark ctor.
    llvm-svn: 282758
* [LV] Port OptimizationRemarkAnalysisFPCommute and OptimizationRemarkAnalysisAliasing to new streaming API for opt remarks | Adam Nemet | 2016-09-29 | 1 | -10/+12
    llvm-svn: 282742
* [LV] Convert processLoop to new streaming API for opt remarks | Adam Nemet | 2016-09-29 | 1 | -10/+10
    llvm-svn: 282740
* fix formatting; NFC | Sanjay Patel | 2016-09-29 | 1 | -11/+9
    llvm-svn: 282737
* [sanitizer-coverage/libFuzzer] make the guards for trace-pc 32-bit; create one array of guards per function, instead of one guard per BB; reorganize the code so that trace-pc-guard does not create unneeded globals | Kostya Serebryany | 2016-09-29 | 1 | -64/+91
    llvm-svn: 282735
* [thinlto] Add cold-callsite import heuristic | Piotr Padlewski | 2016-09-29 | 1 | -6/+16
    Summary: Not a tuned-up heuristic yet, but even this small heuristic gives
    about a +0.10% improvement on SPEC 2006.
    Reviewers: tejohnson, mehdi_amini, eraman
    Subscribers: mehdi_amini, llvm-commits
    Differential Revision: https://reviews.llvm.org/D24940
    llvm-svn: 282733
* [LV] Move static createMissedAnalysis from anonymous to global namespace | Adam Nemet | 2016-09-29 | 1 | -26/+26
    This is an attempt to fix a Windows bot.
    llvm-svn: 282730
* [LV] Convert CostModel to use the new streaming opt remark API | Adam Nemet | 2016-09-29 | 1 | -21/+20
    Here we can already remove the member function emitAnalysis.
    llvm-svn: 282729
* [LV] Split most of createMissedAnalysis into a static function. NFC | Adam Nemet | 2016-09-29 | 1 | -15/+28
    This will be shared between Legality and CostModel.
    llvm-svn: 282728
* [LV] Convert all but one opt remark in Legality to new streaming interface | Adam Nemet | 2016-09-29 | 1 | -46/+72
    The last one remaining, after which emitAnalysis can be removed, is when we
    convert the LAA's report to a vectorization report. This requires converting
    LAA to the new interface first.
    llvm-svn: 282726
* [LV] Convert emitRemark to new opt remark streaming interface | Adam Nemet | 2016-09-29 | 1 | -14/+17
    Also renamed the function to emitRemarkWithHints to better reflect what the
    function actually does.
    llvm-svn: 282723
* Test commit. NFC. | Volkan Keles | 2016-09-29 | 1 | -1/+1
    llvm-svn: 282717
* Wisely choose sext or zext when widening IV. | Evgeny Stupachenko | 2016-09-28 | 1 | -37/+88
    Summary: The patch fixes a regression caused by two earlier patches, D18777
    and D18867.
    Reviewers: reames, sanjoy
    Differential Revision: http://reviews.llvm.org/D24280
    From: Li Huang
    llvm-svn: 282650
* Refactor the ProfileSummaryInfo to use doInitialization and doFinalization to handle Module update. | Dehao Chen | 2016-09-28 | 1 | -1/+1
    Summary: This refactors the change in r282616.
    Reviewers: davidxl, eraman, mehdi_amini
    Subscribers: mehdi_amini, davide, llvm-commits
    Differential Revision: https://reviews.llvm.org/D25041
    llvm-svn: 282630
* [SystemZ] Implementation of getUnrollingPreferences(). | Jonas Paulsson | 2016-09-28 | 1 | -6/+3
    This commit enables more unrolling for SystemZ by implementing the
    SystemZTargetTransformInfo::getUnrollingPreferences() method. It has been
    found that it is better to only unroll moderately, so the
    DefaultUnrollRuntimeCount has been moved into UnrollingPreferences in order
    to set this to a lower value for SystemZ (4).
    Reviewers: Evgeny Stupachenko, Ulrich Weigand.
    https://reviews.llvm.org/D24451
    llvm-svn: 282570
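    [Editor's note] For readers unfamiliar with this TTI hook, a minimal sketch
    of the shape of such an override follows. Only the DefaultUnrollRuntimeCount
    field and the value 4 are taken from the commit text above; the free-function
    form and everything else are assumptions of this sketch, not the actual
    SystemZ patch.

        // Hedged sketch only: models a target's getUnrollingPreferences hook.
        #include "llvm/Analysis/TargetTransformInfo.h"

        using namespace llvm;

        static void getUnrollingPreferencesSketch(
            Loop *L, TargetTransformInfo::UnrollingPreferences &UP) {
          (void)L; // a real target hook would inspect the loop here
          // Unroll only moderately: prefer a runtime unroll count of 4 instead
          // of the larger generic default (the value the commit gives for SystemZ).
          UP.DefaultUnrollRuntimeCount = 4;
        }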
* [Inliner] Port all opt remarks to new streaming API | Adam Nemet | 2016-09-27 | 1 | -34/+35
    llvm-svn: 282559
* Shorten DiagnosticInfoOptimizationRemark* to OptimizationRemark*. NFC | Adam Nemet | 2016-09-27 | 4 | -7/+5
    With the new streaming interface, these class names need to be typed a lot
    and it's way too looong.
    llvm-svn: 282544
* [Inliner] Fold the analysis remark into the missed remark | Adam Nemet | 2016-09-27 | 1 | -6/+4
    There is really no reason for these to be separate. The vectorizer started
    this pretty bad tradition that the text of the missed remarks is pretty
    meaningless, i.e. "vectorization failed". There, you have to query analysis
    to get the full picture. I think we should just explain the reason for
    missing the optimization in the missed remark when possible. Analysis
    remarks should provide information that the pass gathers regardless of
    whether the optimization is passing or not.
    llvm-svn: 282542
* [LoopSimplify] When simplifying phis in loop-simplify, do it only if it preserves LCSSA form. | Michael Zolotukhin | 2016-09-27 | 1 | -2/+4
    llvm-svn: 282541
* Output optimization remarks in YAML | Adam Nemet | 2016-09-27 | 1 | -6/+8
    (Re-committed after moving the template specialization under the yaml
    namespace. GCC was complaining about this.)

    This allows various presentations of this data using an external tool. This
    was first recommended here[1].

    As an example, consider this module:

      1 int foo();
      2 int bar();
      3
      4 int baz() {
      5   return foo() + bar();
      6 }

    The inliner generates these missed-optimization remarks today (the hotness
    information is pulled from PGO):

      remark: /tmp/s.c:5:10: foo will not be inlined into baz (hotness: 30)
      remark: /tmp/s.c:5:18: bar will not be inlined into baz (hotness: 30)

    Now with -pass-remarks-output=<yaml-file>, we generate this YAML file:

      --- !Missed
      Pass:     inline
      Name:     NotInlined
      DebugLoc: { File: /tmp/s.c, Line: 5, Column: 10 }
      Function: baz
      Hotness:  30
      Args:
        - Callee: foo
        - String: will not be inlined into
        - Caller: baz
      ...
      --- !Missed
      Pass:     inline
      Name:     NotInlined
      DebugLoc: { File: /tmp/s.c, Line: 5, Column: 18 }
      Function: baz
      Hotness:  30
      Args:
        - Callee: bar
        - String: will not be inlined into
        - Caller: baz
      ...

    This is a summary of the high-level decisions:

    * There is a new streaming interface to emit optimization remarks, e.g. for
      the inliner remark above:

        ORE.emit(DiagnosticInfoOptimizationRemarkMissed(DEBUG_TYPE, "NotInlined", &I)
                 << NV("Callee", Callee) << " will not be inlined into "
                 << NV("Caller", CS.getCaller()) << setIsVerbose());

      NV stands for named value and allows the YAML client to process a remark
      using its name (NotInlined) and the named arguments (Callee and Caller)
      without parsing the text of the message. Subsequent patches will update
      ORE users to use the new streaming API.

    * I am using YAML I/O for writing the YAML file. YAML I/O requires you to
      specify reading and writing at once, but reading is highly non-trivial for
      some of the more complex LLVM types. Since it's not clear that we (ever)
      want to use LLVM to parse this YAML file, the code supports and asserts
      that we're writing only. On the other hand, I did verify experimentally
      that the class hierarchy starting at DiagnosticInfoOptimizationBase can be
      mapped back from the YAML generated here (see D24479).

    * The YAML stream is stored in the LLVM context.

    * In the example, we can probably further specify the IR value used, i.e.
      print "Function" rather than "Value".

    * As before, hotness is computed in the analysis pass instead of
      DiagnosticInfo. This avoids the layering problem since BFI is in Analysis
      while DiagnosticInfo is in IR.

    [1] https://reviews.llvm.org/D19678#419445
    Differential Revision: https://reviews.llvm.org/D24587
    llvm-svn: 282539
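    [Editor's note] To make the streaming interface above concrete, here is a
    minimal, hedged sketch of a caller. It adapts the ORE.emit example from the
    commit message, using the shortened OptimizationRemarkMissed name from the
    r282544 rename listed above; the header paths and this helper's signature are
    assumptions of the sketch, not part of the commit.

        // Hedged sketch adapted from the example in the commit message; only the
        // ORE.emit(...) << NV(...) pattern is taken from the commit text.
        #include "llvm/Analysis/OptimizationDiagnosticInfo.h" // OptimizationRemarkEmitter (assumed path)
        #include "llvm/IR/DiagnosticInfo.h"                   // OptimizationRemarkMissed, ore::NV
        #include "llvm/IR/Function.h"
        #include "llvm/IR/Instruction.h"

        using namespace llvm;
        using namespace llvm::ore; // brings in NV ("named value")

        // Emit a missed-optimization remark saying that Callee was not inlined
        // into Caller at call-site instruction I.  With
        // -pass-remarks-output=<file>, the named arguments (Callee, Caller)
        // become structured YAML fields as shown above.
        static void emitNotInlinedRemark(OptimizationRemarkEmitter &ORE,
                                         Function &Callee, Function &Caller,
                                         Instruction &I) {
          ORE.emit(OptimizationRemarkMissed("inline", "NotInlined", &I)
                   << NV("Callee", &Callee) << " will not be inlined into "
                   << NV("Caller", &Caller));
        }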
* [DebugInfo] Add comments to phi dbg.value tracking code, NFC | Reid Kleckner | 2016-09-27 | 1 | -6/+8
    LLVM developers might be surprised to learn that there are blocks without
    valid insertion points (catchswitch), so it seems worth calling that out
    explicitly. Also add a FIXME about what we should really be doing if we ever
    need to make optimized Windows EH code debuggable.
    While I'm here, make auto usage more consistent with LLVM standards and
    avoid an unnecessary call to insertBefore.
    llvm-svn: 282521
* Revert "Output optimization remarks in YAML" | Adam Nemet | 2016-09-27 | 1 | -8/+6
    This reverts commit r282499. The GCC bots are failing.
    llvm-svn: 282503
* Output optimization remarks in YAML | Adam Nemet | 2016-09-27 | 1 | -6/+8
    This allows various presentations of this data using an external tool. This
    was first recommended here[1].

    As an example, consider this module:

      1 int foo();
      2 int bar();
      3
      4 int baz() {
      5   return foo() + bar();
      6 }

    The inliner generates these missed-optimization remarks today (the hotness
    information is pulled from PGO):

      remark: /tmp/s.c:5:10: foo will not be inlined into baz (hotness: 30)
      remark: /tmp/s.c:5:18: bar will not be inlined into baz (hotness: 30)

    Now with -pass-remarks-output=<yaml-file>, we generate this YAML file:

      --- !Missed
      Pass:     inline
      Name:     NotInlined
      DebugLoc: { File: /tmp/s.c, Line: 5, Column: 10 }
      Function: baz
      Hotness:  30
      Args:
        - Callee: foo
        - String: will not be inlined into
        - Caller: baz
      ...
      --- !Missed
      Pass:     inline
      Name:     NotInlined
      DebugLoc: { File: /tmp/s.c, Line: 5, Column: 18 }
      Function: baz
      Hotness:  30
      Args:
        - Callee: bar
        - String: will not be inlined into
        - Caller: baz
      ...

    This is a summary of the high-level decisions:

    * There is a new streaming interface to emit optimization remarks, e.g. for
      the inliner remark above:

        ORE.emit(DiagnosticInfoOptimizationRemarkMissed(DEBUG_TYPE, "NotInlined", &I)
                 << NV("Callee", Callee) << " will not be inlined into "
                 << NV("Caller", CS.getCaller()) << setIsVerbose());

      NV stands for named value and allows the YAML client to process a remark
      using its name (NotInlined) and the named arguments (Callee and Caller)
      without parsing the text of the message. Subsequent patches will update
      ORE users to use the new streaming API.

    * I am using YAML I/O for writing the YAML file. YAML I/O requires you to
      specify reading and writing at once, but reading is highly non-trivial for
      some of the more complex LLVM types. Since it's not clear that we (ever)
      want to use LLVM to parse this YAML file, the code supports and asserts
      that we're writing only. On the other hand, I did verify experimentally
      that the class hierarchy starting at DiagnosticInfoOptimizationBase can be
      mapped back from the YAML generated here (see D24479).

    * The YAML stream is stored in the LLVM context.

    * In the example, we can probably further specify the IR value used, i.e.
      print "Function" rather than "Value".

    * As before, hotness is computed in the analysis pass instead of
      DiagnosticInfo. This avoids the layering problem since BFI is in Analysis
      while DiagnosticInfo is in IR.

    [1] https://reviews.llvm.org/D19678#419445
    Differential Revision: https://reviews.llvm.org/D24587
    llvm-svn: 282499
* [sanitizer-coverage] fix a bug in trace-gep | Kostya Serebryany | 2016-09-27 | 1 | -1/+1
    llvm-svn: 282467
* [sanitizer-coverage] don't emit the CTOR function if nothing has been instrumented | Kostya Serebryany | 2016-09-27 | 1 | -17/+21
    llvm-svn: 282465
* Revert r277556. Add -lowertypetests-bitsets-level to control bitsets generation | Ivan Krasin | 2016-09-27 | 1 | -9/+2
    Summary: We don't currently need this facility for CFI. Disabling individual
    hot methods proved to be a better strategy in Chrome. Also, the design of
    the feature is suboptimal, as pointed out by Peter Collingbourne.
    Reviewers: pcc
    Subscribers: kcc
    Differential Revision: https://reviews.llvm.org/D24948
    llvm-svn: 282461
* LowerTypeTests: Remove unused variable. | Peter Collingbourne | 2016-09-26 | 1 | -1/+0
    llvm-svn: 282456
* LowerTypeTests: Create LowerTypeTestsModule class and move implementation there. Related simplifications. | Peter Collingbourne | 2016-09-26 | 1 | -82/+74
    llvm-svn: 282455
* [thinlto] Basic thinlto fdo heuristic | Piotr Padlewski | 2016-09-26 | 1 | -3/+12
    Summary: This patch improves the thinlto importer by importing 3x larger
    functions that are called from a hot block.
    I compared performance with trunk on SPEC, and there was about a 2%
    improvement on povray and 3.33% on milc. These results seem to be consistent
    and match the results Teresa got with her simple heuristic. Some benchmarks
    got slower, but I think they are just noisy (mcf, xalancbmk, omnetpp); I am
    running the benchmarks again with more iterations to confirm. The geomean
    over all benchmarks, including the noisy ones, was about +0.02%.
    I see a much better improvement on the google branch with Easwaran's patch
    for PGO callsite inlining (the inliner actually inlines those big
    functions): overall I see a +0.5% improvement, and I get +8.65% on povray.
    So I guess we will see a much bigger change when Easwaran's patch lands (it
    depends on the new pass manager), but it is still worth putting this in
    trunk before it.
    Implementation details changes:
    - Removed CallsiteCount.
    - ProfileCount got replaced by Hotness.
    - hot-import-multiplier is set to 3.0 for now; I didn't have time to tune it
      up, but I see that we get most of the interesting functions with 3, so
      there is not much performance difference with higher values, and binary
      size doesn't grow as much as with 10.0.
    Reviewers: eraman, mehdi_amini, tejohnson
    Subscribers: mehdi_amini, llvm-commits
    Differential Revision: https://reviews.llvm.org/D24638
    llvm-svn: 282437
* Remove pruning of phi nodes in MemorySSA - it makes updating harder | Daniel Berlin | 2016-09-26 | 1 | -40/+5
    Reviewers: george.burgess.iv
    Subscribers: llvm-commits
    Differential Revision: https://reviews.llvm.org/D24923
    llvm-svn: 282419
* [LV] Scalarize instructions marked scalar after vectorization | Matthew Simpson | 2016-09-26 | 1 | -0/+9
    This patch ensures that we actually scalarize instructions marked scalar
    after vectorization. Previously, such instructions may have been vectorized
    instead.
    Differential Revision: https://reviews.llvm.org/D23889
    llvm-svn: 282418
* [Coroutines] Part14: Handle coroutines with no suspend points. | Gor Nishanov | 2016-09-26 | 2 | -0/+120
    Summary: If a coroutine has no suspend points, remove the heap allocation and
    turn the coroutine into a normal function. Also, if a pattern is detected
    where the coroutine resumes or destroys itself prior to the coro.suspend
    call, turn the suspend point into a simple jump to the resume or cleanup
    label. This pattern occurs when coroutines are used to propagate errors in
    functions that return expected<T>.
    Reviewers: majnemer
    Subscribers: mehdi_amini, llvm-commits
    Differential Revision: https://reviews.llvm.org/D24408
    llvm-svn: 282414
* [InstCombine] Fixed bug introduced in r282237 | Alexey Bataev | 2016-09-26 | 1 | -6/+8
    The index of the new insertelement instruction was evaluated in the wrong
    way: it was treated as the index of the inserted value rather than as the
    index of the position where the value should be inserted.
    llvm-svn: 282401
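    [Editor's note] Since the fix above hinges on which operand of an
    insertelement is the index, a small illustration may help. The IRBuilder call
    is real, but the helper and its names are invented for this log, not taken
    from the patch.

        // Invented illustration of the operand order the fix is about.
        #include "llvm/IR/IRBuilder.h"

        using namespace llvm;

        // Builds: %res = insertelement <N x T> %Vec, T %Elt, i32 2
        // The last operand is the *position* written into (lane 2 here), not an
        // index describing the inserted value; confusing the two was the bug.
        static Value *insertAtLane2(IRBuilder<> &Builder, Value *Vec, Value *Elt) {
          return Builder.CreateInsertElement(Vec, Elt, Builder.getInt32(2));
        }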
* [InstCombine] Teach the udiv folding logic how to handle constant expressions. | Andrea Di Biagio | 2016-09-26 | 1 | -11/+14
    This patch fixes PR30366.
    Function foldUDivShl() worked under the assumption that one of the values in
    input to the function was always an instance of llvm::Instruction. However,
    function visitUDivOperand() (the only user of foldUDivShl) was clearly
    violating that precondition; internally, visitUDivOperand() uses pattern
    matches to check the operands of a udiv. Pattern matchers for binary
    operators know how to handle both Instruction and ConstantExpr values.
    This patch fixes the problem in foldUDivShl(). Now we use pattern matchers
    instead of explicit casts to Instruction. The reduced test case from PR30366
    has been added to test file InstCombine/udiv-simplify.ll.
    Differential Revision: https://reviews.llvm.org/D24565
    llvm-svn: 282398
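    [Editor's note] As a hedged aside on why matchers sidestep the
    Instruction-only assumption: the binary-operator matchers in
    llvm/IR/PatternMatch.h accept either an Instruction or a ConstantExpr with
    the right opcode. The helper below is an invented illustration in that
    spirit, not code from the patch.

        // Invented illustration: a matcher-based check that works whether Op is
        // a `shl` Instruction or an equivalent ConstantExpr.
        #include "llvm/ADT/APInt.h"
        #include "llvm/IR/PatternMatch.h"
        #include "llvm/IR/Value.h"

        using namespace llvm;
        using namespace llvm::PatternMatch;

        // Returns true if Op has the form (C1 << X) for some constant C1,
        // regardless of how that shift is materialized in the IR.
        static bool isShlOfConstant(Value *Op) {
          const APInt *C1;
          Value *X;
          return match(Op, m_Shl(m_APInt(C1), m_Value(X)));
        }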
* ObjCARC: Don't look at users of ConstantData | Duncan P. N. Exon Smith | 2016-09-24 | 1 | -0/+11
    Stop looking at users of UndefValue and ConstantPointerNull in the
    Objective-C ARC optimizers. The other users aren't actually interesting,
    since they're not pointing at a particular object. I imagine these calls
    could be optimized through -instcombine... maybe they already are?
    These early returns will be required at some point in the future, with a WIP
    patch that asserts when someone accesses a use-list on ConstantData.
    llvm-svn: 282338
* Scalar: Ignore ConstantData in processAssumption | Duncan P. N. Exon Smith | 2016-09-24 | 1 | -0/+5
    Assumptions on UndefValue and ConstantPointerNull aren't relevant to other
    users. Ignore them entirely to avoid wasting cycles walking through their
    (possibly extremely extensive (cross-module)) use-lists.
    It wasn't clear how to add a specific test for this, and it'll be covered
    anyway by an eventual patch that asserts when trying to access the use-list
    of an instance of ConstantData.
    llvm-svn: 282334
* GlobalStatus: Don't walk use-lists of ConstantData | Duncan P. N. Exon Smith | 2016-09-24 | 1 | -1/+1
    Return early from llvm::isSafeToDestroyConstant() whenever the value
    `isa<ConstantData>()`. These constants are shared across the LLVMContext. We
    never really want to delete them here, and walking their use-lists can be
    very expensive.
    (This is motivated by an eventual goal of removing use-lists entirely from
    ConstantData.)
    llvm-svn: 282320
* [InstCombine] Fix for PR29124: reduce insertelements to shufflevector | Alexey Bataev | 2016-09-23 | 2 | -50/+138
    If inserting more than one constant into a vector:

      define <4 x float> @foo(<4 x float> %x) {
        %ins1 = insertelement <4 x float> %x, float 1.0, i32 1
        %ins2 = insertelement <4 x float> %ins1, float 2.0, i32 2
        ret <4 x float> %ins2
      }

    InstCombine could reduce that to a shufflevector:

      define <4 x float> @goo(<4 x float> %x) {
        %shuf = shufflevector <4 x float> %x,
                              <4 x float> <float undef, float 1.0, float 2.0, float undef>,
                              <4 x i32> <i32 0, i32 5, i32 6, i32 3>
        ret <4 x float> %shuf
      }

    Also, InstCombine tries to convert a shuffle instruction to a single
    insertelement if one of the vectors is a constant vector and only a single
    element from this constant should be used in the shuffle, i.e.

      shufflevector <4 x float> %v,
                    <4 x float> <float undef, float 1.0, float undef, float undef>,
                    <4 x i32> <i32 0, i32 5, i32 undef, i32 undef>
      ->
      insertelement <4 x float> %v, float 1.0, i32 1

    Differential Revision: https://reviews.llvm.org/D24182
    llvm-svn: 282237
* [InstCombine] fold X urem C -> X < C ? X : X - C when C is big (PR28672) | Sanjay Patel | 2016-09-22 | 1 | -0/+8
    We already have the udiv variant of this transform, so I think this is ok
    for InstCombine too even though there is an increase in IR instructions. As
    the tests and TODO comments show, the transform can lead to follow-on
    combines.
    This should fix: https://llvm.org/bugs/show_bug.cgi?id=28672
    Differential Revision: https://reviews.llvm.org/D24527
    llvm-svn: 282209
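    [Editor's note] A quick way to convince yourself the fold is sound when C is
    "big" (larger than half the unsigned range, so X can never reach 2*C): the
    small standalone program below, written for this log rather than taken from
    the patch, checks the two forms against each other.

        // Standalone check (not from the patch) that for unsigned 32-bit X and a
        // "big" divisor C (C > UINT32_MAX / 2, so X < 2*C always holds):
        //   X % C  ==  (X < C ? X : X - C)
        #include <cassert>
        #include <cstdint>
        #include <cstdio>

        static uint32_t urem_as_select(uint32_t X, uint32_t C) {
          return X < C ? X : X - C; // the select form produced by the fold
        }

        int main() {
          const uint32_t C = 0x80000001u; // "big": greater than UINT32_MAX / 2
          const uint32_t Samples[] = {0u, 1u, C - 1, C, C + 1, 0xFFFFFFFFu};
          for (uint32_t X : Samples)
            assert(X % C == urem_as_select(X, C)); // both forms agree
          std::puts("urem and select forms agree on all samples");
          return 0;
        }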
* Revert r282168 "GVN-hoist: fix store past load dependence analysis (PR30216)" | Hans Wennborg | 2016-09-22 | 1 | -35/+28
    This also reverts the dependent r282175, "GVN-hoist: do not dereference null
    pointers". It's causing compiler crashes building Harfbuzz (PR30499).
    llvm-svn: 282199
* GVN-hoist: do not dereference null pointers | Sebastian Pop | 2016-09-22 | 1 | -0/+3
    There may be basic blocks without memory accesses, in which case the list of
    accesses is a null pointer.
    llvm-svn: 282175