bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[IPO/MergeFunctions] changes so it doesn't try to bitcast a struct return ↵	Carlo Kok	2014-04-30	1	-1/+16
\| \| \| \| \| \|	type but instead recreates it with insert/extract value. llvm-svn: 207679
*	Add a <tuple> include to more files that aren't getting it transitively on MSVC.	Benjamin Kramer	2014-04-30	2	-0/+2
\| \| \| \|	llvm-svn: 207617
*	ConstantHoisting.cpp: Add <tuple> for std::tie, since r207593 removed ↵	NAKAMURA Takumi	2014-04-30	1	-0/+1
\| \| \| \| \| \|	FileSystem.h, it includes <tuple>. llvm-svn: 207614
*	Tidy up.	Jim Grosbach	2014-04-29	1	-2/+2
\| \| \| \|	llvm-svn: 207585
*	Spelling.	Jim Grosbach	2014-04-29	1	-1/+1
\| \| \| \|	llvm-svn: 207584
*	Also handle ConstantAggregateZero when optimizing vpermilvar*.	Rafael Espindola	2014-04-29	1	-20/+22
\| \| \| \|	llvm-svn: 207582
*	Remove tabs.	Rafael Espindola	2014-04-29	1	-4/+4
\| \| \| \| \| \|	Sorry, new machine and I forgot to change the editor setting. llvm-svn: 207578
*	Two fixes to the vpermilvar optimization.	Rafael Espindola	2014-04-29	1	-1/+24
\| \| \| \| \| \| \| \|	The instcomine logic to handle vpermilvar's pd and 256 variants was incorrect. The _256 variants have indexes into the individual 128 bit lanes and in all cases it also has to mask out unused bits. llvm-svn: 207577
*	Fix vectorization remarks.	Diego Novillo	2014-04-29	1	-6/+13
\| \| \| \| \| \| \| \| \|	This patch changes the vectorization remarks to also inform when vectorization is possible but not beneficial. Added tests to exercise some loop remarks. llvm-svn: 207574
*	Continue slp vectorization even the BB already has vectorized store ↵	Yi Jiang	2014-04-29	1	-1/+1
\| \| \| \| \| \|	radar://16641956 llvm-svn: 207572
*	Add slp vectorization to LTO passes	Yi Jiang	2014-04-29	1	-0/+3
\| \| \| \|	llvm-svn: 207571
*	Reapply r207271 without the testcase	Adam Nemet	2014-04-29	1	-9/+12
\| \| \| \| \| \|	PR19608 was filed to find a suitable testcase. llvm-svn: 207569
*	Add optimization remarks to the loop unroller and vectorizer.	Diego Novillo	2014-04-29	2	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This calls emitOptimizationRemark from the loop unroller and vectorizer at the point where they make a positive transformation. For the vectorizer, it reports vectorization and interleave factors. For the loop unroller, it reports all the different supported types of unrolling. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3456 llvm-svn: 207528
*	[BUG] Fix -Wunused-variable warning in Release mode. Thnx to Kostya ↵	Zinovy Nis	2014-04-29	1	-2/+3
\| \| \| \| \| \|	Serebryany for pointing. llvm-svn: 207516
*	fix -Wunused-variable warning in Release mode	Kostya Serebryany	2014-04-29	1	-0/+1
\| \| \| \|	llvm-svn: 207514
*	[OPENMP][LV][D3423] Respect Hints.Force meta-data for loops in LoopVectorizer	Zinovy Nis	2014-04-29	1	-27/+57
\| \| \| \|	llvm-svn: 207512
*	Fix a typo in comment	Michael Zolotukhin	2014-04-29	1	-1/+1
\| \| \| \|	llvm-svn: 207499
*	Revert r207271 for now. This commit introduced a test case that ran	Chandler Carruth	2014-04-28	1	-12/+9
\| \| \| \| \| \| \| \|	clang directly from the LLVM test suite! That doesn't work. I've followed up on the review thread to try and get a viable solution sorted out, but trying to get the tree clean here. llvm-svn: 207462
*	InstCombine: don't drop 'inalloca' in PromoteCastOfAllocation (PR19569)	Hans Wennborg	2014-04-28	1	-0/+1
\| \| \| \|	llvm-svn: 207426
*	Fix rampant quadratic behavior in UpdatePHINodes. The operation of	Chandler Carruth	2014-04-28	1	-23/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	mapping from a basic block to an incoming value, either for removal or just lookup, is linear in the number of predecessors, and we were doing this for every entry in the 'Preds' list which is in many cases almost all of them! Unfortunately, the fixes are quite ugly. PHI nodes just don't make this operation easy. The efficient way to fix this is to have a clever 'remove_if' operation on PHI nodes that lets us do a single pass over all the incoming values of the original PHI node, extracting the ones we care about. Then we could quickly construct the new phi node from this list. This would remove the remaining underlying quadratic movement of unrelated incoming values and the need for silly backwards looping to "minimize" how often we hit the quadratic case. This is the last obvious fix for PR19499. It shaves another 20% off the compile time for me, and while UpdatePHINodes remains in the profile, most of the time is now stemming from the well known inefficiencies of LVI and jump threading. llvm-svn: 207409
*	[C++] Use 'nullptr'.	Craig Topper	2014-04-28	14	-44/+44
\| \| \| \|	llvm-svn: 207394
*	RecursivelyDeleteTriviallyDeadInstructions() could remove	Gerolf Hoflehner	2014-04-26	2	-2/+18
\| \| \| \| \| \| \| \| \| \| \|	more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 Repaired r207302. llvm-svn: 207309
*	Restore CloneFunction.cpp which got accidently	Gerolf Hoflehner	2014-04-26	1	-92/+33
\| \| \| \| \| \|	overwritten by previous backout of r207303 llvm-svn: 207308
*	Revert commit r207302 since build failures	Gerolf Hoflehner	2014-04-26	3	-51/+94
\| \| \| \| \| \|	have been reported. llvm-svn: 207303
*	RecursivelyDeleteTriviallyDeadInstructions() could remove	Gerolf Hoflehner	2014-04-26	2	-2/+18
\| \| \| \| \| \| \| \| \|	more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 llvm-svn: 207302
*	[InstCombine][X86] Teach how to fold calls to SSE2/AVX2 packed logical shift	Andrea Di Biagio	2014-04-26	1	-9/+41
\| \| \| \| \| \| \| \| \| \|	right intrinsics. A packed logical shift right with a shift count bigger than or equal to the element size always produces a zero vector. In all other cases, it can be safely replaced by a 'lshr' instruction. llvm-svn: 207299
*	Unbreak the gdb buildbot by not lowering dbg.declare intrinsics for arrays.	Adrian Prantl	2014-04-25	1	-1/+7
\| \| \| \|	llvm-svn: 207284
*	[LoopStrengthReduce] Don't trim formula that uses a subset of required registers	Adam Nemet	2014-04-25	1	-9/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Consider this use from the new testcase: LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i32 reg({1000,+,-1}<nw><%for.body>) -3003 + reg({3,+,3}<nw><%for.body>) -1001 + reg({1,+,1}<nuw><nsw><%for.body>) -1000 + reg({0,+,1}<nw><%for.body>) -3000 + reg({0,+,3}<nuw><%for.body>) reg({-1000,+,1}<nw><%for.body>) reg({-3000,+,3}<nsw><%for.body>) This is the last use we consider for a solution in SolveRecurse, so CurRegs is a large set. (CurRegs is the set of registers that are needed by the previously visited uses in the in-progress solution.) ReqRegs is { {3,+,3}<nw><%for.body>, {1,+,1}<nuw><nsw><%for.body> } This is the intersection of the regs used by any of the formulas for the current use and CurRegs. Now, the code requires a formula to contain all these regs (the comment is simply wrong), otherwise the formula is immediately disqualified. Obviously, no formula for this use contains two regs so they will all get disqualified. The fix modifies the check to allow the formula in this case. The idea is that neither of these formulae is introducing any new registers which is the point of this early pruning as far as I understand. In terms of set arithmetic, we now allow formulas whose used regs are a subset of the required regs not just the other way around. There are few more loops in the test-suite that are now successfully LSRed. I have benchmarked those and found very minimal change. Fixes <rdar://problem/13965777> llvm-svn: 207271
*	This reapplies r207235 with an additional bugfixes caught by the msan	Adrian Prantl	2014-04-25	1	-8/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	buildbot - do not insert debug intrinsics before phi nodes. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207269
*	SCC: Change clients to use const, NFC	Duncan P. N. Exon Smith	2014-04-25	2	-8/+7
\| \| \| \| \| \| \| \| \| \|	It's fishy to be changing the `std::vector<>` owned by the iterator, and no one actual does it, so I'm going to remove the ability in a subsequent commit. First, update the users. <rdar://problem/14292693> llvm-svn: 207252
*	Revert "This reapplies r207130 with an additional testcase+and a missing ↵	Adrian Prantl	2014-04-25	1	-14/+8
\| \| \| \| \| \| \| \|	check for" This reverts commit 207235 to investigate msan buildbot breakage. llvm-svn: 207250
*	[inline cold threshold] Command line argument for inline threshold will	Manman Ren	2014-04-25	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \|	override the default cold threshold. When we use command line argument to set the inline threshold, the default cold threshold will not be used. This is in line with how we use OptSizeThreshold. When we want a higher threshold for all functions, we do not have to set both inline threshold and cold threshold. llvm-svn: 207245
*	Reapply r207135 without modifications.	Adrian Prantl	2014-04-25	1	-17/+3
\| \| \| \| \| \| \| \| \| \|	Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the location of the dbg.value. This gets rid of tons of redundant variable DIEs in subscopes. rdar://problem/14874886, rdar://problem/16679936 llvm-svn: 207236
*	This reapplies r207130 with an additional testcase+and a missing check for	Adrian Prantl	2014-04-25	1	-8/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207235
*	[C++] Use 'nullptr'. Transforms edition.	Craig Topper	2014-04-25	100	-1600/+1628
\| \| \| \|	llvm-svn: 207196
*	Allow vectorization of bit intrinsics in BB Vectorizer.	Karthik Bhat	2014-04-25	1	-8/+21
\| \| \| \| \| \|	This patch adds support for vectorization of bit intrinsics such as bswap,ctpop,ctlz,cttz. llvm-svn: 207174
*	Revert "This reapplies r207130 with an additional testcase+and a missing ↵	Adrian Prantl	2014-04-25	1	-14/+8
\| \| \| \| \| \| \| \|	check for" Typo in testcase. llvm-svn: 207166
*	This reapplies r207130 with an additional testcase+and a missing check for	Adrian Prantl	2014-04-25	1	-8/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207165
*	Revert "Debug info for optimized code: Support variables that are on the ↵	Adrian Prantl	2014-04-25	1	-14/+8
\| \| \| \| \| \| \| \|	stack and" This reverts commit 207130 for buildbot breakage. llvm-svn: 207162
*	Revert "Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the ↵	Adrian Prantl	2014-04-24	1	-3/+17
\| \| \| \| \| \| \| \|	location" This reverts commit 207130 for buildbot breakage. llvm-svn: 207159
*	Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the location	Adrian Prantl	2014-04-24	1	-17/+3
\| \| \| \| \| \| \| \| \|	of the dbg.value. This gets rid of tons of redundant variable DIEs in subscopes. rdar://problem/14874886, rdar://problem/16679936 llvm-svn: 207135
*	Debug info for optimized code: Support variables that are on the stack and	Adrian Prantl	2014-04-24	1	-8/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine-intrinsics testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207130
*	Allow vectorization of few missed llvm intrinsic calls in BBVectorizor by ↵	Karthik Bhat	2014-04-24	1	-0/+8
\| \| \| \| \| \|	handling them in isVectorizableIntrinsic function. llvm-svn: 207085
*	[InstCombine][x86] Constant fold psll intrinsics.	Michael J. Spencer	2014-04-24	1	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \|	This excludes avx512 as I don't have hardware to verify. It excludes _dq variants because they are represented in the IR as <{2,4} x i64> when it's actually a byte shift of the entire i{128,265}. This also excludes _dq_bs as they aren't at all supported by the backend. There are also no corresponding instructions in the ISA. I have no idea why they exist... llvm-svn: 207058
*	Optimize some special cases for SSE4a insertqi	Filipe Cabecinhas	2014-04-24	1	-0/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Since the upper 64 bits of the destination register are undefined when performing this operation, we can substitute it and let the optimizer figure out that only a copy is needed. Also added range merging, if an instruction copies a range that can be merged with a previous copied range. Added test cases for both optimizations. Reviewers: grosbach, nadav CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3357 llvm-svn: 207055
*	Handle addrspacecast when looking at memcpys from globals	Matt Arsenault	2014-04-24	1	-3/+6
\| \| \| \|	llvm-svn: 207054
*	Remove more default address space argument usage.	Matt Arsenault	2014-04-23	4	-7/+13
\| \| \| \| \| \|	These places are inconsequential in practice. llvm-svn: 207021
*	Don't use default address space arguments in GlobalOpt	Matt Arsenault	2014-04-23	1	-3/+7
\| \| \| \|	llvm-svn: 207019
*	[ASan] Move the shadow range on 32-bit iOS (and iOS Simulator)	Alexander Potapenko	2014-04-23	1	-1/+4
\| \| \| \| \| \| \| \|	to 0x40000000-0x60000000 to avoid address space clash with system libraries. The solution has been proposed by tahabekireren@gmail.com in https://code.google.com/p/address-sanitizer/issues/detail?id=210 This is also known to fix some Chromium iOS tests. llvm-svn: 207002
*	Remove dead code in instcombine.	Matt Arsenault	2014-04-23	1	-11/+2
\| \| \| \| \| \| \| \| \|	Don't replace shifts greater than the type with the maximum shift. This isn't hit anywhere in the tests, and somewhere else is replacing these with undef. llvm-svn: 207000