bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Adding a width of the GEP index to the Data Layout.	Elena Demikhovsky	2018-02-14	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Making a width of GEP Index, which is used for address calculation, to be one of the pointer properties in the Data Layout. p[address space]:size:memory_size:alignment:pref_alignment:index_size_in_bits. The index size parameter is optional, if not specified, it is equal to the pointer size. Till now, the InstCombiner normalized GEPs and extended the Index operand to the pointer width. It works fine if you can convert pointer to integer for address calculation and all registered targets do this. But some ISAs have very restricted instruction set for the pointer calculation. During discussions were desided to retrieve information for GEP index from the Data Layout. http://lists.llvm.org/pipermail/llvm-dev/2018-January/120416.html I added an interface to the Data Layout and I changed the InstCombiner and some other passes to take the Index width into account. This change does not affect any in-tree target. I added tests to cover data layouts with explicitly specified index size. Differential Revision: https://reviews.llvm.org/D42123 llvm-svn: 325102
*	[InlineCost] Mark functions accessing varargs as not viable.	Florian Hahn	2018-01-28	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	This prevents functions accessing varargs from being inlined if they have the alwaysinline attribute. Reviewers: efriedma, rnk, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D42556 llvm-svn: 323619
*	Avoid inlining if there is byval arguments with non-alloca address space	Bjorn Pettersson	2018-01-10	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: After teaching InlineCost more about address spaces () another fault was detected in the inliner. If an argument has the byval attribute the parameter might be copied to an alloca. That part seems to work fine even if the argument has a different address space than the alloca address space. However, if the address spaces differ, then the inlined function still might refer to the parameter using the original address space (the inliner does not handle that situation very well). This patch avoids the problem by simply disallowing inlining when there are byval arguments with address space that differs from the alloca address space. I'm not really sure how to transform the code if we want to get inlining for this situation. I assume that it never has been working, and that the fixes in r321809 just exposed an old problem. Fault found by skatkov (Serguei Katkov). It is mentioned in follow up comments to https://reviews.llvm.org/D40455. Reviewers: skatkov Reviewed By: skatkov Subscribers: uabelho, eraman, llvm-commits, haicheng Differential Revision: https://reviews.llvm.org/D41898 llvm-svn: 322181
*	[InlineFunction] Inline vararg functions that do not access varargs.	Florian Hahn	2018-01-06	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the varargs are not accessed by a function, we can inline the function. Reviewers: dblaikie, chandlerc, davide, efriedma, rnk, hfinkel Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D41335 llvm-svn: 321940
*	Teach InlineCost about address spaces	Bjorn Pettersson	2018-01-04	1	-8/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: I basically copied this patch from here: https://reviews.llvm.org/D1251 But I skipped some of the refactoring to make the patch more clean. The new outer3/inner3 test case in ptr-diff.ll triggers the following assert without this patch: lib/IR/Constants.cpp:1834: static llvm::Constant llvm::ConstantExpr::getCompare(unsigned short, llvm::Constant , llvm::Constant *, bool): Assertion `C1->getType() == C2->getType() && "Op types should be identical!"' failed. The other new test cases makes sure that there is code coverage for all modifications in InlineCost.cpp (getting different values due to not fetching sizes for address space zero). I only guarantee code coverage for those tests. The tests are not written in a way that they would break if not having the corrections in InlineCost.cpp. I found it quite hard to fine tune the tests into getting different results based on the pointer sizes (except for the test case where we hit an assert if not teaching InlineCost about address spaces). Reviewers: chandlerc, arsenm, haicheng Reviewed By: arsenm Subscribers: wdng, eraman, llvm-commits, haicheng Differential Revision: https://reviews.llvm.org/D40455 llvm-svn: 321809
*	[InlineCost] Find more free binary operations	Haicheng Wu	2017-12-22	1	-40/+18
\| \| \| \| \| \| \| \| \| \| \| \|	Currently, inline cost model considers a binary operator as free only if both its operands are constants. Some simple cases are missing such as a + 0, a - a, etc. This patch modifies visitBinaryOperator() to call SimplifyBinOp() without going through simplifyInstruction() to get rid of the constant restriction. Thus, visitAnd() and visitOr() are not needed. Differential Revision: https://reviews.llvm.org/D41494 llvm-svn: 321366
*	[Inliner] Restrict soft-float inlining penalty.	Eli Friedman	2017-12-22	1	-11/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The penalty is currently getting applied in a bunch of places where it doesn't make sense, like bitcasts (which are free) and calls (which were getting the call penalty applied twice). Instead, just apply the penalty to binary operators and floating-point casts. While I'm here, also fix getFPOpCost() to do the right thing in more cases, so we don't have to dig into function attributes. Differential Revision: https://reviews.llvm.org/D41522 llvm-svn: 321332
*	[InlineCost] Skip volatile loads when looking for repeated loads	Haicheng Wu	2017-12-19	1	-1/+2
\| \| \| \| \| \|	This is a follow-up fix of r320814. A test case is also added. llvm-svn: 321075
*	[InlineCost] Find repeated loads in the callee	Haicheng Wu	2017-12-15	1	-5/+51
\| \| \| \| \| \| \| \| \| \| \|	SROA analysis of InlineCost can figure out that some stores can be removed after inlining and then the repeated loads clobbered by these stores are also free. This patch finds these clobbered loads and adjust the inline cost accordingly. Differential Revision: https://reviews.llvm.org/D33946 llvm-svn: 320814
*	[InlineCost] Tracking Values through PHI Nodes	Haicheng Wu	2017-12-14	1	-6/+138
\| \| \| \| \| \| \| \| \| \| \| \|	This patch fix this FIXME in visitPHI() FIXME: We should potentially be tracking values through phi nodes, especially when they collapse to a single value due to deleted CFG edges during inlining. Differential Revision: https://reviews.llvm.org/D38594 llvm-svn: 320699
*	[InlineCost] Prefer getFunction() to two calls to getParent().	Davide Italiano	2017-11-30	1	-3/+3
\| \| \| \| \| \|	Improves clarity, also slightly cheaper. NFCI. llvm-svn: 319481
*	Reverting r315590; it did not include changes for llvm-tblgen, which is ↵	Aaron Ballman	2017-10-15	1	-1/+1
\| \| \| \| \| \| \| \|	causing link errors for several people. Error LNK2019 unresolved external symbol "public: void __cdecl `anonymous namespace'::MatchableInfo::dump(void)const " (?dump@MatchableInfo@?A0xf4f1c304@@QEBAXXZ) referenced in function "public: void __cdecl `anonymous namespace'::AsmMatcherEmitter::run(class llvm::raw_ostream &)" (?run@AsmMatcherEmitter@?A0xf4f1c304@@QEAAXAEAVraw_ostream@llvm@@@Z) llvm-tblgen D:\llvm\2017\utils\TableGen\AsmMatcherEmitter.obj 1 llvm-svn: 315854
*	[dump] Remove NDEBUG from test to enable dump methods [NFC]	Don Hinton	2017-10-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add LLVM_FORCE_ENABLE_DUMP cmake option, and use it along with LLVM_ENABLE_ASSERTIONS to set LLVM_ENABLE_DUMP. Remove NDEBUG and only use LLVM_ENABLE_DUMP to enable dump methods. Move definition of LLVM_ENABLE_DUMP from config.h to llvm-config.h so it'll be picked up by public headers. Differential Revision: https://reviews.llvm.org/D38406 llvm-svn: 315590
*	[NFC] Convert OptimizationRemarkEmitter old emit() calls to new closure	Vivek Pandya	2017-10-11	1	-10/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	parameterized emit() calls Summary: This is not functional change to adopt new emit() API added in r313691. Reviewed By: anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38285 llvm-svn: 315476
*	[InlineCost, NFC] Extract code dealing with inbounds GEPs from ↵	Evgeny Astigeevich	2017-10-03	1	-29/+24
\| \| \| \| \| \| \| \| \| \| \| \| \|	visitGetElementPtr into a function The code responsible for analysis of inbounds GEPs is extracted into a separate function: CallAnalyzer::canFoldInboundsGEP. With the patch SROA enabling/disabling code is localized at one place instead of spreading across the code of CallAnalyzer::visitGetElementPtr. Differential Revision: https://reviews.llvm.org/D38233 llvm-svn: 314787
*	[InlineCost] add visitSelectInst()	Haicheng Wu	2017-09-27	1	-0/+82
\| \| \| \| \| \| \| \| \| \|	InlineCost can understand Select IR now. This patch finds free Select IRs and continue the propagation of SimplifiedValues, ConstantOffsetPtrs, and SROAArgValues. Differential Revision: https://reviews.llvm.org/D37198 llvm-svn: 314307
*	[Inliner] Add another way to compute full inline cost.	Easwaran Raman	2017-09-13	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Full inline cost is computed when -inline-cost-full is true or ORE is non-null. This patch adds another way to compute full inline cost by adding a field to InlineParams. This will be used by SampleProfileLoader to check legality of inlining a callee that it wants to inline. Reviewers: danielcdh, haicheng Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37819 llvm-svn: 313185
*	[InlineCost] Small changes to early exit condition. NFC.	Haicheng Wu	2017-08-25	1	-3/+3
\| \| \| \| \| \| \| \| \|	Change the early exit condition from Cost > Threshold to Cost >= Threshold because the inline condition is Cost < Threshold. Differential Revision: https://reviews.llvm.org/D37087 llvm-svn: 311791
*	[InlineCost] Add cl::opt to allow full inline cost to be computed for ↵	Haicheng Wu	2017-08-21	1	-14/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	debugging purposes. Currently, the inline cost model will bail once the inline cost exceeds the inline threshold in order to avoid unnecessary compile-time. However, when debugging it is useful to compute the full cost, so this command line option is added to override the default behavior. I took over this work from Chad Rosier (mcrosier@codeaurora.org). Differential Revision: https://reviews.llvm.org/D35850 llvm-svn: 311371
*	[InlineCost] Add more debug during inline cost computation.	Chad Rosier	2017-08-21	1	-1/+1
\| \| \| \|	llvm-svn: 311370
*	[InlineCost] Refactor the checks for different analyses to be a bit more	Chandler Carruth	2017-08-14	1	-62/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	localized to the code that uses those analyses. Technically, this can change behavior as we no longer require the existence of the ProfileSummaryInfo analysis to use local profile information via BFI. We didn't actually require the PSI to have an interesting profile though, so this only really impacts the behavior in non-default pass pipelines. IMO, this makes it substantially less surprising how everything works -- before an analysis that wasn't actually used had to exist to trigger any profile aware inlining. I think the new organization makes it more obvious where various checks for profile signals happen. Differential Revision: https://reviews.llvm.org/D36710 llvm-svn: 310888
*	[Inliner] Fix a typo in option description. NFC.	Easwaran Raman	2017-08-04	1	-1/+1
\| \| \| \|	llvm-svn: 310073
*	[Inliner] Increase threshold for hot callsites without PGO.	Easwaran Raman	2017-08-03	1	-3/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This increases the inlining threshold for hot callsites. Hotness is defined in terms of block frequency of the callsite relative to the caller's entry block's frequency. Since this requires BFI in the inliner, this only affects the new PM pipeline. This is enabled by default at -O3. This improves the performance of some internal benchmarks. Notably, an internal benchmark for Gipfeli compression (https://github.com/google/gipfeli) improves by ~7%. Povray in SPEC2006 improves by ~2.5%. I am running more experiments and will update the thread if other benchmarks show improvement/regression. In terms of text size, LLVM test-suite shows an 1.22% text size increase. Diving into the results, 13 of the benchmarks in the test-suite increases by > 10%. Most of these are small, but Adobe-C++/loop_unroll (17.6% increases) and tramp3d(20.7% size increase) have >250K text size. On a large application, the text size increases by 2% Reviewers: chandlerc, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36199 llvm-svn: 309994
*	[InlineCost] Remove redundant call. NFC.	Chad Rosier	2017-08-02	1	-2/+3
\| \| \| \|	llvm-svn: 309819
*	[InlineCost] Simplify more 'and' and 'or' operations.	Chad Rosier	2017-08-02	1	-0/+30
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D35856 llvm-svn: 309817
*	[Inliner] Do not apply any bonus for cold callsites.	Easwaran Raman	2017-07-28	1	-28/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Inlining threshold is increased by application of bonuses when the callee has a single reachable basic block or is rich in vector instructions. Similarly, inlining cost is reduced by applying a large bonus when the last call to a static function is considered for inlining. This patch disables the application of these bonuses when the callsite or the callee is cold. The intention here is to prevent a large cold callsite from being inlined to a non-cold caller that could prevent the caller from being inlined. This is especially important when the cold callsite is a last call to a static since the associated bonus is very high. Reviewers: chandlerc, davidxl Subscribers: danielcdh, llvm-commits Differential Revision: https://reviews.llvm.org/D35823 llvm-svn: 309441
*	[InlineCost, NFC] Change CallAnalyzer::isGEPFree to use TTI::getUserCost ↵	Evgeny Astigeevich	2017-07-27	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	instead of TTI::getGEPCost Currently CallAnalyzer::isGEPFree uses TTI::getGEPCost to check if GEP is free. TTI::getGEPCost cannot handle cases when GEPs participate in Def-Use dependencies (see https://reviews.llvm.org/D31186 for example). There is TTI::getUserCost which can calculate the cost more accurately by taking dependencies into account. Differential Revision: https://reviews.llvm.org/D33685 llvm-svn: 309268
*	Fix a typo.	Eric Christopher	2017-06-28	1	-1/+1
\| \| \| \|	llvm-svn: 306599
*	[NewPM/Inliner] Reduce threshold for cold callsites in the non-PGO case	Easwaran Raman	2017-06-27	1	-1/+30
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D34312 llvm-svn: 306484
*	[InlineCost] Do not take INT_MAX when Cost is negative	Jun Bum Lim	2017-06-23	1	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: visitSwitchInst should not take INT_MAX when Cost is negative. Instead of INT_MAX , we also use a valid upperbound cost when overflow occurs in Cost. Reviewers: hans, echristo, dmgreen Reviewed By: dmgreen Subscribers: mcrosier, javed.absar, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D34436 llvm-svn: 306118
*	[InstSimplify] Don't constant fold or DCE calls that are marked nobuiltin	Andrew Kaylor	2017-06-09	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D33737 llvm-svn: 305132
*	[InlineCost] Enable the new switch cost heuristic	Jun Bum Lim	2017-06-02	1	-76/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is to enable the new switch inline cost heuristic (r301649) by removing the old heuristic as well as the flag itself. In my experiment for LLVM test suite and spec2000/2006, +17.82% performance and 8% code size reduce was observed in spec2000/vertex with O3 LTO in AArch64. No significant code size / performance regression was found in O3/O2/Os. No significant complain was reported from the llvm-dev thread. Reviewers: hans, chandlerc, eraman, haicheng, mcrosier, bmakam, eastig, ddibyend, echristo Reviewed By: echristo Subscribers: javed.absar, kristof.beyls, echristo, aemerson, rengolin, mehdi_amini Differential Revision: https://reviews.llvm.org/D32653 llvm-svn: 304594
*	[Inliner] Do not mix callsite and callee hotness based updates.	Easwaran Raman	2017-05-16	1	-15/+27
\| \| \| \| \| \| \| \| \| \|	Update threshold based on callee's hotness only when BFI is not available. Otherwise use only callsite's hotness. This makes it easier to reason about hotness related threshold updates. Differential revision: https://reviews.llvm.org/D33157 llvm-svn: 303210
*	Decrease inlinecold-threshold to 45	Easwaran Raman	2017-05-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I ran the test-suite (including SPEC 2006) in PGO mode comparing cold thresholds of 225 and 45. Here are some stats on the text size: Out of 904 tests that ran, 197 see a change in text size. The average text size reduction (of all the 904 binaries) is 1.07%. Of the 197 binaries, 19 see a text size increase, as high as 18%, but most of them are small single source benchmarks. There are 3 multisource benchmarks with a >0.5% size increase (0.7, 1.3 and 2.1 are their % increases). On the other side of the spectrum, 31 benchmarks see >10% size reduction and 6 of them are MultiSource. I haven't run the test-suite with other values of inlinecold-threshold. Since we have a cold callsite threshold of 45, I picked this value. Differential revision: https://reviews.llvm.org/D33106 llvm-svn: 302829
*	Refactor callsite cost computation into a helper function /NFC	Xinliang David Li	2017-05-02	1	-29/+35
\| \| \| \| \| \| \|	Makes code more readable. The function will also be used by the partial inlining's cost analysis. llvm-svn: 301899
*	[InlineCost] Improve the cost heuristic for Switch	Jun Bum Lim	2017-04-28	1	-5/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The motivation example is like below which has 13 cases but only 2 distinct targets ``` lor.lhs.false2: ; preds = %if.then switch i32 %Status, label %if.then27 [ i32 -7012, label %if.end35 i32 -10008, label %if.end35 i32 -10016, label %if.end35 i32 15000, label %if.end35 i32 14013, label %if.end35 i32 10114, label %if.end35 i32 10107, label %if.end35 i32 10105, label %if.end35 i32 10013, label %if.end35 i32 10011, label %if.end35 i32 7008, label %if.end35 i32 7007, label %if.end35 i32 5002, label %if.end35 ] ``` which is compiled into a balanced binary tree like this on AArch64 (similar on X86) ``` .LBB853_9: // %lor.lhs.false2 mov w8, #10012 cmp w19, w8 b.gt .LBB853_14 // BB#10: // %lor.lhs.false2 mov w8, #5001 cmp w19, w8 b.gt .LBB853_18 // BB#11: // %lor.lhs.false2 mov w8, #-10016 cmp w19, w8 b.eq .LBB853_23 // BB#12: // %lor.lhs.false2 mov w8, #-10008 cmp w19, w8 b.eq .LBB853_23 // BB#13: // %lor.lhs.false2 mov w8, #-7012 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_14: // %lor.lhs.false2 mov w8, #14012 cmp w19, w8 b.gt .LBB853_21 // BB#15: // %lor.lhs.false2 mov w8, #-10105 add w8, w19, w8 cmp w8, #9 // =9 b.hi .LBB853_17 // BB#16: // %lor.lhs.false2 orr w9, wzr, #0x1 lsl w8, w9, w8 mov w9, #517 and w8, w8, w9 cbnz w8, .LBB853_23 .LBB853_17: // %lor.lhs.false2 mov w8, #10013 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_18: // %lor.lhs.false2 mov w8, #-7007 add w8, w19, w8 cmp w8, #2 // =2 b.lo .LBB853_23 // BB#19: // %lor.lhs.false2 mov w8, #5002 cmp w19, w8 b.eq .LBB853_23 // BB#20: // %lor.lhs.false2 mov w8, #10011 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_21: // %lor.lhs.false2 mov w8, #14013 cmp w19, w8 b.eq .LBB853_23 // BB#22: // %lor.lhs.false2 mov w8, #15000 cmp w19, w8 b.ne .LBB853_3 ``` However, the inline cost model estimates the cost to be linear with the number of distinct targets and the cost of the above switch is just 2 InstrCosts. The function containing this switch is then inlined about 900 times. This change use the general way of switch lowering for the inline heuristic. It etimate the number of case clusters with the suitability check for a jump table or bit test. Considering the binary search tree built for the clusters, this change modifies the model to be linear with the size of the balanced binary tree. The model is off by default for now : -inline-generic-switch-cost=false This change was originally proposed by Haicheng in D29870. Reviewers: hans, bmakam, chandlerc, eraman, haicheng, mcrosier Reviewed By: hans Subscribers: joerg, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D31085 llvm-svn: 301649
*	Remove a repeated comment line. NFC.	Easwaran Raman	2017-04-21	1	-1/+0
\| \| \| \|	llvm-svn: 301059
*	Tidy checking for the soft float attribute.	Eric Christopher	2017-04-15	1	-10/+1
\| \| \| \|	llvm-svn: 300394
*	Cache the DataLayout rather than looking it up frequently.	Eric Christopher	2017-04-15	1	-20/+14
\| \| \| \|	llvm-svn: 300393
*	[IR] Make paramHasAttr to use arg indices instead of attr indices	Reid Kleckner	2017-04-14	1	-2/+1
\| \| \| \| \| \| \| \| \|	This avoids the confusing 'CS.paramHasAttr(ArgNo + 1, Foo)' pattern. Previously we were testing return value attributes with index 0, so I introduced hasReturnAttr() for that use case. llvm-svn: 300367
*	[IR] Redesign the case iterator in SwitchInst to actually be an iterator	Chandler Carruth	2017-04-12	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and to expose a handle to represent the actual case rather than having the iterator return a reference to itself. All of this allows the iterator to be used with common STL facilities, standard algorithms, etc. Doing this exposed some missing facilities in the iterator facade that I've fixed and required some work to the actual iterator to fully support the necessary API. Differential Revision: https://reviews.llvm.org/D31548 llvm-svn: 300032
*	Remove unused functions. Remove static qualifier from functions in header ↵	Vassil Vassilev	2017-04-11	1	-7/+0
\| \| \| \| \| \|	files. NFC. llvm-svn: 299947
*	Do not inline hot callsites for samplepgo in thinlto compile phase.	Dehao Chen	2017-03-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Because SamplePGO passes will be invoked twice in ThinLTO build: once at compile phase, the other at backend. We want to make sure the IR at the 2nd phase matches the hot part in profile, thus we do not want to inline hot callsites in the first phase. Reviewers: tejohnson, eraman Reviewed By: tejohnson Subscribers: mehdi_amini, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31201 llvm-svn: 298428
*	[InlineCost] Move the code in isGEPOffsetConstant to a lambda.	Easwaran Raman	2017-02-25	1	-13/+9
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D30112 llvm-svn: 296208
*	Refactor instruction simplification code in visitors. NFC.	Easwaran Raman	2017-02-18	1	-88/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Several visitors check if operands to the instruction are constants, either as it is or after looking up SimplifiedValues, check if the result is a constant and update the SimplifiedValues map. This refactoring splits it into a common function that does the checking of whether the operands are constants and updating of the SimplifiedValues table, and an instruction specific part that is implemented by each instruction visitor as a lambda and passed to the common function. Differential revision: https://reviews.llvm.org/D30104 llvm-svn: 295552
*	Improve PGO support for the new inliner	Easwaran Raman	2017-01-20	1	-17/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the following to the new PM based inliner in PGO mode: * Use block frequency analysis to derive callsite's profile count and use that to adjust thresholds of hot and cold callsites. * Incrementally update the BFI of the caller after a callee gets inlined into it. This incremental update is only within an invocation of the run method - BFI is not preserved across calls to run. Update the function entry count of the callee after inlining it into a caller. * I've tuned the thresholds for the hot and cold callsites using a hacked up version of the old inliner that explicitly computes BFI on a set of internal benchmarks and spec. Once the new PM based pipeline stabilizes (IIRC Chandler mentioned there are known issues) I'll benchmark this again and adjust the thresholds if required. Inliner PGO support. Differential revision: https://reviews.llvm.org/D28331 llvm-svn: 292666
*	Recommit "[InlineCost] Use TTI to check if GEP is free." #3	Haicheng Wu	2017-01-20	1	-2/+18
\| \| \| \| \| \| \| \| \| \| \| \|	This is the third attemp to recommit r292526. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292633
*	Revert "Recommit "[InlineCost] Use TTI to check if GEP is free." #2"	Haicheng Wu	2017-01-20	1	-18/+2
\| \| \| \| \| \|	This reverts commit r292616 because the test case still has problem. llvm-svn: 292618
*	Recommit "[InlineCost] Use TTI to check if GEP is free." #2	Haicheng Wu	2017-01-20	1	-2/+18
\| \| \| \| \| \| \| \| \| \| \| \|	This is the second attemp to recommit r292526. The original summary: Currently, a GEP is considered free only if its indices are all constant. TTI::getGEPCost() can give target-specific more accurate analysis. TTI is already used for the cost of many other instructions. llvm-svn: 292616
*	Revert "Recommit "[InlineCost] Use TTI to check if GEP is free.""	Haicheng Wu	2017-01-20	1	-18/+2
\| \| \| \| \| \|	This reverts commit r292570. The test still has problem. llvm-svn: 292572