bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[ARM] Don't pessimize i32 vselect.	Charlie Turner	2015-11-17	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The underlying issues surrounding codegen for 32-bit vselects have been resolved. The pessimistic costs for 64-bit vselects remain due to the bad scalarization that is still happening there. I tested this on A57 in T32, A32 and A64 modes. I saw no regressions, and some improvements. From my benchmarks, I saw these improvements in A57 (T32) spec.cpu2000.ref.177_mesa 5.95% lnt.SingleSource/Benchmarks/Shootout/strcat 12.93% lnt.MultiSource/Benchmarks/MiBench/telecomm-CRC32/telecomm-CRC32 11.89% I also measured A57 A32, A53 T32 and A9 T32 and found no performance regressions. I see much bigger wins in third-party benchmarks with this change Differential Revision: http://reviews.llvm.org/D14743 llvm-svn: 253349
*	Revert "Revert "[FunctionAttrs] Identify norecurse functions""	James Molloy	2015-11-12	1	-3/+5
\| \| \| \| \| \|	This reapplies this patch, with test fixes. llvm-svn: 252871
*	Revert "[FunctionAttrs] Identify norecurse functions"	James Molloy	2015-11-12	1	-5/+3
\| \| \| \| \| \|	This reverts commit r252862. This introduced test failures and I'm reverting while I investigate how this happened. llvm-svn: 252863
*	[FunctionAttrs] Identify norecurse functions	James Molloy	2015-11-12	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	A function can be marked as norecurse if: * The SCC to which it belongs has cardinality 1; and either a) It does not call any non-norecurse function. This includes self-recursion; or b) It only has one callsite and the function that callsite is within is marked norecurse. a) is best propagated bottom-up and b) is best propagated top-down. We build up the norecurse attributes bottom-up using the existing SCC pass, and mark functions with no obvious recursion (but not provably norecurse) to sweep later, top-down. llvm-svn: 252862
*	Sort the enums in Attributes.h in case insensitive alphabetical order.	Akira Hatanaka	2015-11-11	2	-4/+4
\| \| \| \| \| \| \| \| \|	Sort the enums in preparation for moving the attributes to a table-gen file. rdar://problem/19836465 llvm-svn: 252692
*	Revert "Strip metadata when speculatively hoisting instructions"	Renato Golin	2015-11-10	1	-1/+1
\| \| \| \| \| \| \|	This reverts commit r252604, as it broke all ARM and AArch64 buildbots, as well as some x86, et al. llvm-svn: 252623
*	Strip metadata when speculatively hoisting instructions	Igor Laevsky	2015-11-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is fix for PR24059. When we are hoisting instruction above some condition it may turn out that metadata on this instruction was control dependant on the condition. This metadata becomes invalid and we need to drop it. This patch should cover most obvious places of speculative execution (which I have found by greping isSafeToSpeculativelyExecute). I think there are more cases but at least this change covers the severe ones. Differential Revision: http://reviews.llvm.org/D14398 llvm-svn: 252604
*	Fix LoopAccessAnalysis when potentially nullptr check are involved	Mehdi Amini	2015-11-05	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: GetUnderlyingObjects() can return "null" among its list of objects, we don't want to deduce that two pointers can point to the same memory in this case, so filter it out. Reviewers: anemet Subscribers: dexonsmith, llvm-commits From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 252149
*	[LAA] LLE 3/6: Rename InterestingDependence to Dependences, NFC	Adam Nemet	2015-11-03	8	-22/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We now collect all types of dependences including lexically forward deps not just "interesting" ones. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13256 llvm-svn: 251985
*	[LAA] LLE 2/6: Fix a NoDep case that should be a Forward dependence	Adam Nemet	2015-11-03	1	-0/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When the dependence distance in zero then we have a loop-independent dependence from the earlier to the later access. No current client of LAA uses forward dependences so other than potentially hitting the MaxDependences threshold earlier, this change shouldn't affect anything right now. This and the previous patch were tested together for compile-time regression. None found in LNT/SPEC. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13255 llvm-svn: 251973
*	[LAA] LLE 1/6: Expose Forward dependences	Adam Nemet	2015-11-03	2	-0/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Before this change, we didn't use to collect forward dependences since none of the current clients (LV, LDist) required them. The motivation to also collect forward dependences is a new pass LoopLoadElimination (LLE) which discovers store-to-load forwarding opportunities across the loop's backedge. The pass uses both lexically forward or backward loop-carried dependences to detect these opportunities. The new pass also analyzes loop-independent (forward) dependences since they can conflict with the loop-carried dependences in terms of how the data flows through memory. The newly added test only covers loop-carried forward dependences because loop-independent ones are currently categorized as NoDep. The next patch will fix this. The two patches were tested together for compile-time regression. None found in LNT/SPEC. Note that with this change LAA provides all dependences rather than just "interesting" ones. A subsequent NFC patch will remove the now trivial isInterestingDependence and rename the APIs. Reviewers: hfinkel Subscribers: jmolloy, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13254 llvm-svn: 251972
*	[SCEV] Fix PR25369	Sanjoy Das	2015-11-02	1	-0/+78
\| \| \| \| \| \| \| \| \| \| \| \| \|	Have `getConstantEvolutionLoopExitValue` work correctly with multiple entry loops. As far as I can tell, `getConstantEvolutionLoopExitValue` never did the right thing for multiple entry loops; and before r249712 it would silently return an incorrect answer. r249712 changed SCEV to fail an assert on a multiple entry loop, and this change fixes the underlying issue. llvm-svn: 251770
*	[SCEV] Don't create SCEV expressions that break LCSSA	Sanjoy Das	2015-10-31	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \| \|	Prevent `createNodeFromSelectLikePHI` from creating SCEV expressions that break LCSSA. A better fix for the same issue is to teach SCEVExpander to not break LCSSA by inserting PHI nodes at appropriate places. That's planned for the future. Fixes PR25360. llvm-svn: 251756
*	[SCEV] Generalize the SCEV algorithm for creating expressions for PHI nodes	Silviu Baranga	2015-10-30	1	-0/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When forming expressions for phi nodes having an incoming value from outside the loop A and a value coming from the previous iteration B we were forming an AddRec if: - B was an AddRec - the value A was equal to the value for B at iteration -1 (or equal to the value of B shifted by one iteration, at iteration 0) In this case, we were computing the expression to be the expression of B, shifted by one iteration. This changes generalizes the logic above by removing the restriction that B needs to be an AddRec. For this we introduce two expression rewriters that allow us to - shift an expression by one iteration - get the value of an expression at iteration 0 This allows us to get SCEV expressions for PHI nodes when these expressions are not AddRecExprs. Reviewers: sanjoy Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D14175 llvm-svn: 251700
*	[SCEV] Compute max backedge count for loops with "shift ivs"	Sanjoy Das	2015-10-28	1	-0/+164
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This teaches SCEV to compute //max// backedge taken counts for loops like for (int i = k; i != 0; i >>>= 1) whatever(); SCEV yet cannot represent the exact backedge count for these loops, and this patch does not change that. This is really geared towards teaching SCEV that loops like the above are not infinite. llvm-svn: 251558
*	[AliasAnalysis] Take into account readnone attribute for the function arguments	Igor Laevsky	2015-10-28	1	-0/+11
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D13992 llvm-svn: 251535
*	[AliasAnalysis] Take into account readonly attribute for the function arguments	Igor Laevsky	2015-10-28	1	-0/+26
\| \| \| \| \| \| \| \| \| \|	In getArgModRefInfo we consider all arguments as having MRI_ModRef. However for arguments marked with readonly attribute we can return more precise answer - MRI_Ref. Differential Revision: http://reviews.llvm.org/D13992 llvm-svn: 251525
*	Add newline to test. NFC.	Chad Rosier	2015-10-28	1	-1/+1
\| \| \| \|	llvm-svn: 251511
*	[GlobalsAA] An indirect global that is initialized is not fair game	James Molloy	2015-10-28	1	-0/+27
\| \| \| \| \| \| \| \|	When checking if an indirect global (a global with pointer type) is only assigned by allocation functions, first check if the global is itself initialized. If it is, it's not only assigned by allocation functions. This fixes PR25309. Thanks to David Majnemer for reducing the test case! llvm-svn: 251508
*	[ValueTracking] Use !range metadata more aggressively in KnownBits	Sanjoy Das	2015-10-28	1	-0/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach `computeKnownBitsFromRangeMetadata` to use `!range` metadata more aggressively. Reviewers: majnemer, nlewycky, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14100 llvm-svn: 251487
*	[ValueTracking] Extend r251146 to catch a fairly common case	James Molloy	2015-10-26	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	Even though we may not know the value of the shifter operand, it's possible we know the shifter operand is non-zero. This can allow us to infer more known bits - for example: %1 = load %p !range {1, 5} %2 = shl %q, %1 We don't know %1, but we do know that it is nonzero so %2[0] is known zero, and importantly %2 is known non-zero. Calling isKnownNonZero is nontrivially expensive so use an Optional to run it lazily and cache its result. llvm-svn: 251294
*	[BasicAA] Bugfix for r251016	James Molloy	2015-10-23	1	-0/+11
\| \| \| \| \| \| \| \|	If the loaded type sizes don't match the element type of the sequential type, all bets are off and the addresses may, indeed, overlap. Surprisingly, this just got caught in one test, on one builder, out of the 30+ builders testing this change. Congratulations go to http://lab.llvm.org:8011/builders/clang-aarch64-lnt/builds/5205. llvm-svn: 251112
*	[SCEV] Commute zero extends through <nuw> additions	Sanjoy Das	2015-10-22	1	-0/+37
\| \| \| \|	llvm-svn: 251052
*	[SCEV] Commute sign extends through nsw additions	Sanjoy Das	2015-10-22	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Depends on D13613. Reviewers: atrick, hfinkel, reames, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13685 llvm-svn: 251049
*	[SCEV] Mark AddExprs as nsw or nuw if legal	Sanjoy Das	2015-10-22	5	-8/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This uses `ScalarEvolution::getRange` and not potentially control dependent `nsw` and `nuw` bits on the arithmetic instruction. Reviewers: atrick, hfinkel, nlewycky Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D13613 llvm-svn: 251048
*	[GlobalsAA] Loosen an overly conservative bailout	James Molloy	2015-10-22	1	-0/+17
\| \| \| \| \| \| \| \| \| \|	Instead of bailing out when we see loads, analyze them. If we can prove that the loaded-from address must escape, then we can conclude that a load from that address must escape too and therefore cannot alias a non-addr-taken global. When checking if a Value can alias a non-addr-taken global, if the Value is a LoadInst of a non-global, recurse instead of bailing. If we can follow a trail of loads up to some base that is captured, we know by inference that all the loads we followed are also captured. llvm-svn: 251017
*	[BasicAA] Non-equal indices in a GEP of a SequentialType don't overlap	James Molloy	2015-10-22	1	-0/+43
\| \| \| \| \| \| \| \|	If the final indices of two GEPs can be proven to not be equal, and the GEP is of a SequentialType (not a StructType), then the two GEPs do not alias. llvm-svn: 251016
*	[ValueTracking] Add a new predicate: isKnownNonEqual()	James Molloy	2015-10-22	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	isKnownNonEqual(A, B) returns true if it can be determined that A != B. At the moment it only knows two facts, that a non-wrapping add of nonzero to a value cannot be that value: A + B != A [where B != 0, addition is nsw or nuw] and that contradictory known bits imply two values are not equal. This patch also hooks this up to InstSimplify; InstSimplify had a peephole for the first fact but not the second so this teaches InstSimplify a new trick too (alas no measured performance impact!) llvm-svn: 251012
*	[CostModel] Fixed AVX integer shift costs	Simon Pilgrim	2015-10-17	3	-34/+34
\| \| \| \| \| \|	Targets with AVX but without AVX2 were incorrectly reporting costs of 256-bit integer shifts. llvm-svn: 250611
*	[GlobalsAA] Don't assume anything about functions that may be overridden	James Molloy	2015-10-13	1	-0/+24
\| \| \| \| \| \| \| \|	Weak linkage and friends allow a symbol to be overriden outside the code generator's model, so GlobalsAA shouldn't assume that anything it can compute about such a symbol is valid. llvm-svn: 250156
*	SCEV: Allow simple AddRec * Parameter products in delinearization	Tobias Grosser	2015-10-12	1	-0/+56
\| \| \| \| \| \| \| \| \|	This patch also allows the -delinearize pass to delinearize expressions that do not have an outermost SCEVAddRec expression. The SCEV::delinearize infrastructure allowed this since r240952, but the -delinearize pass was not updated yet. llvm-svn: 250018
*	[X86] Completed SHL cost model tests	Simon Pilgrim	2015-10-11	1	-1/+399
\| \| \| \| \| \|	As discussed in D8690. llvm-svn: 249990
*	[X86] Renamed SHL cost model tests	Simon Pilgrim	2015-10-11	1	-0/+0
\| \| \| \| \| \| \| \|	Matches naming conventions for ASHR/LSHR cost tests As discussed in D8690. llvm-svn: 249984
*	[X86] Added LSHR cost model tests	Simon Pilgrim	2015-10-11	1	-0/+400
\| \| \| \| \| \| \| \|	There are several dodgy costings due to AVX1 legalizing 256-bit integer vectors that need fixing. As discussed in D8690. llvm-svn: 249983
*	[X86] Added ASHR cost model tests	Simon Pilgrim	2015-10-11	1	-0/+392
\| \| \| \| \| \| \| \|	There are several dodgy costings due to AVX1 legalizing 256-bit integer vectors that need fixing. As discussed in D8690. llvm-svn: 249981
*	[WinEH] Delete the old landingpad implementation of Windows EH	Reid Kleckner	2015-10-09	1	-278/+0
\| \| \| \| \| \| \| \| \| \| \|	The new implementation works at least as well as the old implementation did. Also delete the associated preparation tests. They don't exercise interesting corner cases of the new implementation. All the codegen tests of the EH tables have already been ported. llvm-svn: 249918
*	Compute demanded bits for icmp instructions	James Molloy	2015-10-08	1	-0/+22
\| \| \| \| \| \| \| \| \|	Instead of bailing out when we see an icmp, we can instead at least say that if the upper bits of both operands are known zero, they are not demanded. This doesn't help with signed comparisons, but it's at least better than bailing out. llvm-svn: 249687
*	Treat Mul just like Add and Subtract	James Molloy	2015-10-08	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	Like adds and subtracts, muls ripple only to the left so we can use the same logic. While we're here, add a print method to DemandedBits so it can be used with -analyze, which we'll use in the testcase. llvm-svn: 249686
*	Revert "Revert "This patch builds on top of D13378 to handle constant ↵	Mehdi Amini	2015-10-07	1	-0/+51
\| \| \| \| \| \| \| \| \| \|	condition."" This reverts commit r249528 and reapply r249431. The fix for the fallout has been commited in r249575. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 249581
*	Revert "This patch builds on top of D13378 to handle constant condition."	James Molloy	2015-10-07	1	-51/+0
\| \| \| \| \| \|	This reverts commit r249431. This caused failures in sqlite3: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/14453 llvm-svn: 249528
*	This patch builds on top of D13378 to handle constant condition.	Mehdi Amini	2015-10-06	1	-0/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With this patch, clang -O3 optimizes correctly providing > 1000x speedup on this artificial benchmark): for (a=0; a<n; a++) for (b=0; b<n; b++) for (c=0; c<n; c++) for (d=0; d<n; d++) for (e=0; e<n; e++) for (f=0; f<n; f++) x++; From test-suite/SingleSource/Benchmarks/Shootout/nestedloop.c Reviewers: sanjoyd Differential Revision: http://reviews.llvm.org/D13390 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 249431
*	[SCEV] Recognize simple br-phi patterns	Sanjoy Das	2015-10-02	1	-0/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach SCEV to match patterns like ``` br %cond, label %left, label %right left: br label %merge right: br label %merge merge: V = phi [ %x, %left ], [ %y, %right ] ``` as "select %cond, %x, %y". Before this SCEV would match PHI nodes exclusively to add recurrences. This addresses PR25005. Reviewers: joker.eph, joker-eph, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13378 llvm-svn: 249211
*	[ARM][NEON] Use address space in vld([1234]\|[234]lane) and ↵	Jeroen Ketema	2015-09-30	3	-36/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vst([1234]\|[234]lane) instructions This commit changes the interface of the vld[1234], vld[234]lane, and vst[1234], vst[234]lane ARM neon intrinsics and associates an address space with the pointer that these intrinsics take. This changes, e.g., <2 x i32> @llvm.arm.neon.vld1.v2i32(i8, i32) to <2 x i32> @llvm.arm.neon.vld1.v2i32.p0i8(i8, i32) This change ensures that address spaces are fully taken into account in the ARM target during lowering of interleaved loads and stores. Differential Revision: http://reviews.llvm.org/D12985 llvm-svn: 248887
*	[X86][XOP] Added support for the lowering of 128-bit vector shifts to XOP ↵	Simon Pilgrim	2015-09-30	1	-2/+17
\| \| \| \| \| \| \| \| \| \| \| \|	shift instructions The XOP shifts just have logical/arithmetic versions and the left/right shifts are controlled by whether the value is positive/negative. Because of this I've added new X86ISD nodes instead of trying to force them to use the existing shift nodes. Additionally Excavator cores (bdver4) support XOP and AVX2 - meaning that it should use the AVX2 shifts when it can and fall back to XOP in other cases. Differential Revision: http://reviews.llvm.org/D8690 llvm-svn: 248878
*	[ValueTracking] Teach isKnownNonZero about monotonically increasing PHIs	James Molloy	2015-09-29	1	-0/+49
\| \| \| \| \| \| \| \|	If a PHI starts at a non-negative constant, monotonically increases (only adds of a constant are supported at the moment) and that add does not wrap, then the PHI is known never to be zero. llvm-svn: 248796
*	Introduce !align metadata for load instruction	Artur Pilipenko	2015-09-28	1	-0/+8
\| \| \| \| \| \| \| \|	Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D12853 llvm-svn: 248721
*	[SCEV] Reapply 'Teach isLoopBackedgeGuardedByCond to exploit trip counts'	Sanjoy Das	2015-09-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the trip count of a specific backedge is `N`, then we know that backedge is effectively guarded by the condition `{0,+,1} u< N`. This change teaches SCEV to use this condition to prove things in `isLoopBackedgeGuardedByCond`. Depends on D12948 Depends on D12949 The original checkin, r248608 had to be backed out due to an issue with a ObjCXX unit test. That issue is now fixed, so re-landing. Reviewers: atrick, reames, majnemer, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12950 llvm-svn: 248638
*	Use fixed-point representation for BranchProbability.	Cong Hou	2015-09-25	7	-131/+131
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	BranchProbability now is represented by its numerator and denominator in uint32_t type. This patch changes this representation into a fixed point that is represented by the numerator in uint32_t type and a constant denominator 1<<31. This is quite similar to the representation of BlockMass in BlockFrequencyInfoImpl.h. There are several pros and cons of this change: Pros: 1. It uses only a half space of the current one. 2. Some operations are much faster like plus, subtraction, comparison, and scaling by an integer. Cons: 1. Constructing a probability using arbitrary numerator and denominator needs additional calculations. 2. It is a little less precise than before as we use a fixed denominator. For example, 1 - 1/3 may not be exactly identical to 1 / 3 (this will lead to many BranchProbability unit test failures). This should not matter when we only use it for branch probability. If we use it like a rational value for some precise calculations we may need another construct like ValueRatio. One important reason for this change is that we propose to store branch probabilities instead of edge weights in MachineBasicBlock. We also want clients to use probability instead of weight when adding successors to a MBB. The current BranchProbability has more space which may be a concern. Differential revision: http://reviews.llvm.org/D12603 llvm-svn: 248633
*	Revert two SCEV changes that caused test failures in clang.	Sanjoy Das	2015-09-25	1	-1/+1
\| \| \| \| \| \|	r248606: "[SCEV] Exploit A < B => (A+K) < (B+K) when possible" r248608: "[SCEV] Teach isLoopBackedgeGuardedByCond to exploit trip counts." llvm-svn: 248614
*	[SCEV] Teach isLoopBackedgeGuardedByCond to exploit trip counts.	Sanjoy Das	2015-09-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the trip count of a specific backedge is `N`, then we know that backedge is effectively guarded by the condition `{0,+,1} u< N`. This change teaches SCEV to use this condition to prove things in `isLoopBackedgeGuardedByCond`. Depends on D12948 Depends on D12949 Reviewers: atrick, reames, majnemer, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12950 llvm-svn: 248608