bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	Make sure that any new and optimized objects created during GlobalOPT copy ↵	Sergei Larin	2016-01-22	1	-0/+4
	all the attributes from the base object. Summary: Make sure that any new and optimized objects created during GlobalOPT copy all the attributes from the base object. A good example of improper behavior in the current implementation is section information associated with the GlobalObject. If a section was set for it, and GlobalOpt is creating/modifying a new object based on this one (often copying the original name), without this change the new object will be placed in a default section, resulting in inappropriate properties of the new variable. The argument here is that if the customer specified a section for a variable, any changes to it that the compiler makes should not cause it to change that section allocation. Moreover, any other properties worth representing in copyAttributesFrom() should also be propagated. Reviewers: jmolloy, joker-eph, joker.eph Subscribers: slarin, joker.eph, rafael, tobiasvk, llvm-commits Differential Revision: http://reviews.llvm.org/D16074 llvm-svn: 258556
*	function names start with a lowercase letter; NFC	Sanjay Patel	2016-01-22	1	-8/+8
	llvm-svn: 258552
*	[PlaceSafepoints] Introduce a -spp-no-statepoints flag	Sanjoy Das	2016-01-22	1	-0/+14
	Summary: This change adds a `-spp-no-statepoints` flag to PlaceSafepoints that bypasses the code that wraps newly introduced polls and existing calls in gc.statepoint. With `-spp-no-statepoints` enabled, PlaceSafepoints effectively becomes a safepoint poll insertion pass. The eventual goal is to "constant fold" this option, along with `-rs4gc-use-deopt-bundles` to `true`, once clients using gc.statepoint are okay doing so. Reviewers: pgavlin, reames, JosephTremoulet Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16439 llvm-svn: 258551
*	[PGO] Remove use of static variable. /NFC	Xinliang David Li	2016-01-22	1	-11/+15
	Make the variable a member of the writer trait object owned now by the writer. Also use a different generator interface to pass the infoObject from the writer. llvm-svn: 258544
*	Revert 258486 -- for a better fix coming soon	Xinliang David Li	2016-01-22	1	-10/+7
	llvm-svn: 258538
*	AMDGPU: Fix crash with invariant markers	Matt Arsenault	2016-01-22	1	-0/+8
	The promote alloca pass didn't handle these intrinsics and crashed. These intrinsics should accept any address space, but for now just erase them to avoid breaking. llvm-svn: 258537
*	[NVPTX] expand mul_lohi to mul_lo and mul_hi	Jingyue Wu	2016-01-22	1	-0/+4
	Summary: Fixes PR26186. Reviewers: grosser, jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D16479 llvm-svn: 258536
*	[AArch64] Simplify emitConditionalCompare calls. NFC.	Ahmed Bougacha	2016-01-22	1	-13/+9
	Now that both callsites are identical, we can simplify the prototype and make it easier to reason about the 2-CC case. llvm-svn: 258534
*	[AArch64] Lower 2-CC FCCMPs (one/ueq) using AND'ed CCs.	Ahmed Bougacha	2016-01-22	1	-8/+36
	The current behavior is incorrect, as the two CCs returned by changeFPCCToAArch64CC, intended to be OR'ed, are instead used in an AND ccmp chain.

	Consider:
	  define i32 @t(float %a, float %b, float %c, float %d, i32 %e, i32 %f) {
	    %cc1 = fcmp one float %a, %b
	    %cc2 = fcmp olt float %c, %d
	    %and = and i1 %cc1, %cc2
	    %r = select i1 %and, i32 %e, i32 %f
	    ret i32 %r
	  }

	Assuming (%a < %b) and (%c < %d); we used to do:
	  fcmp  s0, s1                # nzcv <- 1000
	  orr   w8, wzr, #0x1         # w8 <- 1
	  csel  w9, w8, wzr, mi       # w9 <- 1
	  csel  w8, w8, w9, gt        # w8 <- 1
	  fcmp  s2, s3                # nzcv <- 1000
	  cset  w9, mi                # w9 <- 1
	  tst   w8, w9                # (w8 & w9) == 1, so: nzcv <- 0000
	  csel  w0, w0, w1, ne        # w0 <- w0

	We now do:
	  fcmp  s2, s3                # nzcv <- 1000
	  fccmp s0, s1, #0, mi        # mi, so: nzcv <- 1000
	  fccmp s0, s1, #8, le        # !le, so: nzcv <- 1000
	  csel  w0, w0, w1, pl        # !pl, so: w0 <- w1

	In other words, we transformed:
	  (c < d) && ((a < b) || (a > b))
	into:
	  (c < d) && (a u>= b) && (a u<= b)
	whereas, per De Morgan's, we wanted:
	  (c < d) && !((a u>= b) && (a u<= b))

	Note that this problem doesn't occur in the test-suite.

	changeFPCCToAArch64CC produces disjunct CCs; here, one -> mi/gt. We can't represent that in the fccmp chain; it can't express arbitrary OR sequences, as one comment explains:
	  In general we can create code for arbitrary "... (and (and A B) C)" sequences. We can also implement some "or" expressions, because "(or A B)" is equivalent to "not (and (not A) (not B))" and we can implement some negation operations. [...] However there is no way to negate the result of a partial sequence.

	Instead, introduce changeFPCCToANDAArch64CC, which produces the conjunct cond codes:
	- (a one b) == ((a olt b) || (a ogt b)) == ((a ord b) && (a une b))
	- (a ueq b) == ((a uno b) || (a oeq b)) == ((a ule b) && (a uge b))

	Note that, at first, one might think that, when PushNegate is true, we should use the disjunct CCs, in effect doing:
	  (a || b)
	  = !(!a && !(b))
	  = !(!a && !(b1 || b2))  <- changeFPCCToAArch64CC(b, b1, b2)
	  = !(!a && !b1 && !b2)

	However, we can take advantage of the fact that the CC is already negated, which lets us avoid special-casing PushNegate and doing the simpler to reason about:
	  (a || b)
	  = !(!a && (!b))
	  = !(!a && (b1 && b2))   <- changeFPCCToANDAArch64CC(!b, b1, b2)
	  = !(!a && b1 && b2)

	This makes both emitConditionalCompare cases behave identically, and produces correct ccmp sequences for the 2-CC fcmps.

	llvm-svn: 258533
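The two identities quoted in this commit message can be checked by brute force. The following is a minimal sketch, not LLVM code: the helper names (`one`, `ueq`, `ordered`, and so on) are hypothetical Python stand-ins for the IR `fcmp` predicates, and a NaN operand models the "unordered" case.

```python
import itertools
import math


def unordered(a, b):
    # fcmp uno: true when either operand is NaN
    return math.isnan(a) or math.isnan(b)


def ordered(a, b):
    # fcmp ord: true when neither operand is NaN
    return not unordered(a, b)


def olt(a, b):
    return ordered(a, b) and a < b


def ogt(a, b):
    return ordered(a, b) and a > b


def oeq(a, b):
    return ordered(a, b) and a == b


def une(a, b):
    return unordered(a, b) or a != b


def ule(a, b):
    return unordered(a, b) or a <= b


def uge(a, b):
    return unordered(a, b) or a >= b


def one(a, b):
    # Disjunctive form: (a olt b) || (a ogt b)
    return olt(a, b) or ogt(a, b)


def ueq(a, b):
    # Disjunctive form: (a uno b) || (a oeq b)
    return unordered(a, b) or oeq(a, b)


# Hypothetical sample values chosen to cover less, equal, greater, unordered.
SAMPLES = [float("nan"), -1.0, 0.0, 1.0, float("inf")]


def identities_hold():
    """Check that each disjunctive form equals its conjunctive rewrite."""
    for a, b in itertools.product(SAMPLES, repeat=2):
        if one(a, b) != (ordered(a, b) and une(a, b)):
            return False
        if ueq(a, b) != (ule(a, b) and uge(a, b)):
            return False
    return True
```

Calling `identities_hold()` walks every pair of sample values, including pairs with NaN, and reports whether both conjunctive rewrites agree with the disjunctive definitions.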
*	[AArch64] Assert that CCMP isel didn't fail inconsistently.	Ahmed Bougacha	2016-01-22	1	-0/+2
	We verify that the op tree is eligible for CCMP emission in isConjunctionDisjunctionTree, but it's also possible that emitConjunctionDisjunctionTree fails later. The initial check is useful, as it avoids building nodes that will get discarded. Still, make sure that inconsistencies don't happen with an assert. llvm-svn: 258532
*	[RS4GC] Use OB_deopt instead of "deopt"	Sanjoy Das	2016-01-22	1	-1/+2
	llvm-svn: 258529
*	[Hexagon] Use general purpose registers to spill pred/mod registers into	Krzysztof Parzyszek	2016-01-22	4	-78/+310
	Patch by Tobias Edler Von Koch. llvm-svn: 258527
*	AMDGPU: Fix getArchTypePrefix	Matt Arsenault	2016-01-22	1	-2/+2
	llvm-svn: 258525
*	AMDGPU: Rename some r600 intrinsics to use correct TargetPrefix	Matt Arsenault	2016-01-22	3	-39/+44
	These aren't emitted directly by Mesa; they are inserted by a pass. llvm-svn: 258523
*	AMDGPU: Remove unused R600 intrinsics	Matt Arsenault	2016-01-22	2	-48/+0
	llvm-svn: 258522
*	[WinEH] Make collectFuncletMembers non-recursive	David Majnemer	2016-01-22	1	-22/+20
	Use a worklist for the pre-order DFS instead of using recursion. No functionality change is intended. llvm-svn: 258521
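The general technique named here, replacing a recursive pre-order DFS with an explicit worklist, can be sketched as follows. This is an illustration only and does not reproduce collectFuncletMembers: the adjacency-dict graph and both function names are assumptions made for the example.

```python
def preorder_recursive(graph, node, visited=None, order=None):
    """Reference version: pre-order DFS using the call stack."""
    if visited is None:
        visited, order = set(), []
    visited.add(node)
    order.append(node)
    for succ in graph.get(node, []):
        if succ not in visited:
            preorder_recursive(graph, succ, visited, order)
    return order


def preorder_worklist(graph, root):
    """Same traversal order, but with an explicit stack instead of recursion."""
    visited = set()
    order = []
    worklist = [root]
    while worklist:
        node = worklist.pop()
        if node in visited:
            continue  # the node was reached earlier through another edge
        visited.add(node)
        order.append(node)
        # Push successors in reverse so the first successor is popped first.
        for succ in reversed(graph.get(node, [])):
            if succ not in visited:
                worklist.append(succ)
    return order
```

Marking a node as visited when it is popped, rather than when it is pushed, is what keeps the visiting order identical to the recursive version; the worklist form also avoids deep call stacks on large graphs.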
*	Fix MachOObjectFile::getSymbolName() to not call report_fatal_error()	Kevin Enderby	2016-01-22	1	-2/+1
	but to return object_error::parse_failed. Then made the code in llvm-nm do for Mach-O files what is done in the darwin native tools which is to print "bad string index" for bad string indexes. Updated the error message in the llvm-objdump test, and added tests to show llvm-nm prints "bad string index" and a test to print the actual bad string index value which in this case is 0xfe000002 when printing the fields as raw hex. llvm-svn: 258520
*	AMDGPU: Change control flow intrinsics to use amdgcn prefix	Matt Arsenault	2016-01-22	3	-21/+23
	These aren't supposed to be used outside of the backend, so there aren't any users to worry about. llvm-svn: 258516
*	AMDGPU: Don't use separate mulhu/mulhs Pats	Matt Arsenault	2016-01-22	1	-12/+2
	llvm-svn: 258515
*	AMDGPU: Remove random TGSI intrinsic	Matt Arsenault	2016-01-22	3	-14/+0
	I don't think this was ever used. llvm-svn: 258514
*	AMDGPU: Remove AMDGPU.fract intrinsic	Matt Arsenault	2016-01-22	4	-7/+1
	Mesa doesn't use this, and this is pattern matched already from fsub x, (ffloor x). llvm-svn: 258513
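The pattern mentioned here, `fsub x, (ffloor x)`, computes the fractional part of `x`. A tiny sketch of that arithmetic, where the function name `fract` is only a label for the illustration and not the removed intrinsic itself:

```python
import math


def fract(x):
    """Fractional part written as x - floor(x), mirroring fsub x, (ffloor x)."""
    return x - math.floor(x)
```

With this definition the result for a negative input such as -0.25 is 0.75, because floor rounds toward negative infinity.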
*	[PGO] eliminate use of static variable	Xinliang David Li	2016-01-22	1	-7/+10
	llvm-svn: 258486
*	NFC WebAssembly: update links	JF Bastien	2016-01-22	1	-2/+2
	I got a vanity URL, and moved the github waterfall repo. llvm-svn: 258484
*	[SelectionDAG] Fold more offsets into GlobalAddresses	Dan Gohman	2016-01-22	2	-75/+123
	This reapplies r258296 and r258366, and also fixes an existing bug in SelectionDAG.cpp's isMemSrcFromString, neglecting to account for the offset in a GlobalAddressSDNode, which is uncovered by those patches. llvm-svn: 258482
*	Replace Type::getInt32Ty() and comparison by isIntegerTy(32). NFC.	Manuel Jacob	2016-01-22	1	-3/+1
	llvm-svn: 258480
*	Revert r258473 as it's breaking the build with libc++	Ivan Krasin	2016-01-22	4	-57/+17
	Reviewers: kcc Differential Revision: http://reviews.llvm.org/D16441 llvm-svn: 258479
*	[opaque pointer types] [NFC] DataLayout::getIndexedOffset: take source ↵	Eduard Burtescu	2016-01-22	4	-31/+25
	element type instead of pointer type and rename to getIndexedOffsetInType. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16282 llvm-svn: 258478
*	[opaque pointer types] [NFC] FindAvailableLoadedValue: take LoadInst instead ↵	Eduard Burtescu	2016-01-22	4	-7/+7
	of just the pointer. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16422 llvm-svn: 258477
*	[opaque pointer types] [NFC] gep_type_{begin,end} now take source element ↵	Eduard Burtescu	2016-01-22	1	-1/+4
	type and address space. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16436 llvm-svn: 258474
*	Use std::piecewise_constant_distribution instead of ad-hoc binary search.	Ivan Krasin	2016-01-22	4	-17/+57
	Summary: Fix the issue with the most recently discovered unit receiving much less attention. Note: I had to change the seed for one test to make it pass. Alternatively, the number of runs could be increased. I believe that the average time of 'foo' discovery is not increased, just seed=1 was particularly convenient for the previous PRNG scheme used. Reviewers: aizatsky, kcc Subscribers: llvm-commits, kcc Differential Revision: http://reviews.llvm.org/D16419 llvm-svn: 258473
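For readers unfamiliar with `std::piecewise_constant_distribution`: it picks interval `i` with probability proportional to its weight and then draws uniformly inside that interval. The sketch below imitates that behaviour in Python. It is not the libFuzzer change itself; the function name, the boundaries and the weights are all assumptions made for illustration.

```python
import bisect
import itertools
import random


def sample_piecewise_constant(boundaries, weights, rng):
    """Draw one value: choose an interval by weight, then a uniform point in it."""
    if len(boundaries) != len(weights) + 1:
        raise ValueError("need one more boundary than weights")
    cumulative = list(itertools.accumulate(weights))
    target = rng.random() * cumulative[-1]
    # First interval whose cumulative weight exceeds the target.
    i = bisect.bisect_right(cumulative, target)
    i = min(i, len(weights) - 1)  # guard against floating-point rounding
    lo, hi = boundaries[i], boundaries[i + 1]
    return lo + (hi - lo) * rng.random()


def draw_many(count, seed):
    """Usage example with made-up data: three intervals, the first has weight 0."""
    rng = random.Random(seed)
    boundaries = [0.0, 1.0, 2.0, 3.0]
    weights = [0.0, 1.0, 3.0]
    return [sample_piecewise_constant(boundaries, weights, rng) for _ in range(count)]
```

In the usage example the interval from 0 to 1 has weight zero and is never chosen, while the interval from 2 to 3 is chosen about three times as often as the one from 1 to 2.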
*	[opaque pointer types] [NFC] Add an explicit type argument to ↵	Eduard Burtescu	2016-01-22	6	-28/+28
	ConstantFoldLoadFromConstPtr. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16418 llvm-svn: 258472
*	Do not lower VSETCC if operand is an f16 vector	Pirama Arumuga Nainar	2016-01-22	1	-0/+3
	Summary: SETCC with f16 vectors has OperationAction set to Expand but still gets lowered to FCM* intrinsics based on its result type. This patch skips lowering of VSETCC if the operand is an f16 vector. v4 and v8 tests included. Reviewers: ab, jmolloy Subscribers: srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D15361 llvm-svn: 258471
*	Revert "[SelectionDAG] Fold more offsets into GlobalAddresses"	Reid Kleckner	2016-01-22	2	-120/+73
	This reverts r258296 and the follow up r258366. With this change, we miscompiled the following program on Windows:
	  #include <string>
	  #include <iostream>
	  static const char kData[] = "asdf jkl;";
	  int main() {
	    std::string s(kData + 3, sizeof(kData) - 3);
	    std::cout << s << '\n';
	  }
	llvm-svn: 258465
*	[libFuzzer] don't do expensive memmem if the result will not be used	Kostya Serebryany	2016-01-22	1	-0/+2
	llvm-svn: 258462
*	[ThinLTO] Do metadata linking during batch function importing	Teresa Johnson	2016-01-22	2	-29/+7
	Summary: Since we are currently not doing incremental importing there is no need to link metadata as a postpass. The module linker will only link in the imported subroutines due to the functionality added by r256003. (Note that the metadata postpass linking functionality is still used by llvm-link, and may be needed here in the future if a more incremental strategy is adopted.) Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16424 llvm-svn: 258458
*	[opaque pointer types] [NFC] Take advantage of get{Source,Result}ElementType ↵	Eduard Burtescu	2016-01-21	1	-45/+58
	when folding GEPs. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16302 llvm-svn: 258456
*	move function definitions so we don't need separate declarations ; NFCI	Sanjay Patel	2016-01-21	1	-67/+63
	llvm-svn: 258455
*	[LibCallSimplifier] refactor FP function signature checks ; NFCI	Sanjay Patel	2016-01-21	1	-60/+24
	Use the helper function added in r258428. The check should really be hoisted to the caller of all of these optimize* functions, but that's another step. llvm-svn: 258446
*	avoid variable shadowing; NFC	Sanjay Patel	2016-01-21	1	-2/+2
	llvm-svn: 258445
*	remove unnecessary variable; NFC	Sanjay Patel	2016-01-21	1	-2/+1
	llvm-svn: 258444
*	Avoid unnecessary stack realignment in musttail thunks with SSE2 enabled	Reid Kleckner	2016-01-21	1	-1/+2
	The X86 musttail implementation finds register parameters to forward by running the calling convention algorithm until a non-register location is returned. However, assigning a vector memory location has the side effect of increasing the function's stack alignment. We shouldn't increase the stack alignment when we are only looking for register parameters, so this change conditionalizes it. llvm-svn: 258442
*	[X86][SSE] Improve i16 splatting shuffles	Simon Pilgrim	2016-01-21	1	-0/+20
	Better handling of the annoying pshuflw/pshufhw ops which only shuffle lower/upper halves of a vector. Added vXi16 unary shuffle support for cases where i16 elements (from the same half of the source) are being splatted to the whole of one of the halves. This avoids the general lowering case which must shuffle the 32-bit elements first - meaning that we used to end up with unnecessary duplicate pshuflw/pshufhw shuffles. Note this has the side effect of a lot of SSSE3 test cases no longer needing to use PSHUFB, as it falls below the 3 op combine threshold for when PSHUFB is typically worth it. I've raised PR26183 to discuss if the threshold should be changed and whether we need to make it more specific to the target CPU. Differential Revision: http://reviews.llvm.org/D14901 llvm-svn: 258440
*	[RuntimeDyld][AArch64] Add support for the MachO ARM64_RELOC_SUBTRACTOR reloc.	Lang Hames	2016-01-21	1	-1/+53
	llvm-svn: 258438
*	Fix for two constant propagation problems in GVN with the assume intrinsic	David L Kreitzer	2016-01-21	2	-4/+5
	instruction. Patch by Yuanrui Zhang. Differential Revision: http://reviews.llvm.org/D16100 llvm-svn: 258435
*	Fix MachOObjectFile::getSymbolSection() to not call report_fatal_error()	Kevin Enderby	2016-01-21	1	-1/+1
	but to return object_error::parse_failed. Then made the code in llvm-nm do for Mach-O files what is done in the darwin native tools which is to print "(?,?)" or just "s" for bad section indexes. Also added a test to show it prints the bad section index of "42" when printing the fields as raw hex. llvm-svn: 258434
*	[LibCallSimplifier] don't get fooled by a fake fmin()	Sanjay Patel	2016-01-21	1	-9/+25
	This is similar to the bug/fix: https://llvm.org/bugs/show_bug.cgi?id=26211 http://reviews.llvm.org/rL258325 The fmin() test case reveals another bug caused by sloppy code duplication. It will crash without this patch because fp128 is a valid floating-point type, but we would think that we had matched a function that used doubles. The new helper function can be used to replace similar checks that are used in several other places in this file. llvm-svn: 258428
*	[InstCombine] Simplify (x >> y) <= x	David Majnemer	2016-01-21	1	-2/+4
	This commit extends the patterns recognised by InstSimplify to also handle (x >> y) <= x in the same way as (x /u y) <= x. The missing optimisation was found investigating why LLVM did not optimise away bound checks in a binary search: https://github.com/rust-lang/rust/pull/30917 Patch by Andrea Canciani! Differential Revision: http://reviews.llvm.org/D16402 llvm-svn: 258422
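The inequality behind this simplification can be checked exhaustively for a small bit width. The sketch below assumes unsigned values and a logical right shift, matching the `/u` in the message; the function name and the 8-bit width are choices made for the example, not part of the patch.

```python
def shift_and_udiv_never_exceed(bits=8):
    """Check (x >> y) <= x and (x // d) <= x for every unsigned value of the given width."""
    limit = 1 << bits
    for x in range(limit):
        for y in range(bits):  # shift amounts below the bit width
            if (x >> y) > x:
                return False
        for d in range(1, limit):  # nonzero unsigned divisors
            if x // d > x:
                return False
    return True
```

Both comparisons hold for every pair, so a comparison of this shape always evaluates to true for unsigned operands. The same does not hold for an arithmetic shift of a negative value: in Python, `-8 >> 1` is `-4`, which is greater than `-8`.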
*	Partially revert "Add command line options to force function/loop alignments."	Chad Rosier	2016-01-21	1	-10/+0
	This partially reverts r256571 in favor of the solution in r258409. llvm-svn: 258421
*	[PGO] Passmanagerbuilder change that enables IR level PGO instrumentation	Rong Xu	2016-01-21	2	-1/+30
	This patch includes the passmanagerbuilder change that enables IR level PGO instrumentation. It adds two passmanagerbuilder options: -profile-generate=<profile_filename> and -profile-use=<profile_filename>. The new options are primarily for debugging purposes. Reviewers: davidxl, silvas Differential Revision: http://reviews.llvm.org/D15828 llvm-svn: 258420
*	[TTI] Add getCacheLineSize	Adam Nemet	2016-01-21	4	-5/+20
	Summary: And use it in PPCLoopDataPrefetch.cpp. @hfinkel, please let me know if your preference would be to preserve the ppc-loop-prefetch-cache-line option in order to be able to override the value of TTI::getCacheLineSize for PPC. Reviewers: hfinkel Subscribers: hulx2000, mcrosier, mssimpso, hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D16306 llvm-svn: 258419