bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	InstCombine: allow unmasked icmps to be combined with logical ops	Tim Northover	2013-09-04	1	-9/+29
\| \| \| \| \| \| \| \| \| \|	"(icmp op i8 A, B)" is equivalent to "(icmp op i8 (A & 0xff), B)" as a degenerate case. Allowing this as a "masked" comparison when analysing "(icmp) &/\| (icmp)" allows us to combine them in more cases. rdar://problem/7625728 llvm-svn: 189931
*	InstCombine: look for masked compares with subset relation	Tim Northover	2013-09-04	1	-11/+75
\| \| \| \| \| \| \| \| \| \| \|	Even in cases which aren't universally optimisable like "(A & B) != 0 && (A & C) != 0", the masks can make one of the comparisons completely redundant. In this case, since we've gone to the effort of spotting masked comparisons we should combine them. rdar://problem/7625728 llvm-svn: 189930
*	Teach InstCombineLoadCast about address spaces.	Matt Arsenault	2013-09-03	1	-2/+2
\| \| \| \| \| \| \| \|	This is another one that doesn't matter much, but uses the right GEP index types in the first place. llvm-svn: 189854
*	Use type form of getIntPtrType in alloca visitor.	Matt Arsenault	2013-09-03	1	-2/+2
\| \| \| \| \| \| \|	This doesn't actually matter, since alloca is always 0 address space, but this is more consistent. llvm-svn: 189853
*	InstCombine: Check for zero shift amounts before subtracting one causing ↵	Benjamin Kramer	2013-08-30	1	-10/+15
\| \| \| \| \| \| \| \| \|	integer overflow. PR17026. Also avoid undefined shifts and shift amounts larger than 64 bits (those are always undef because we can't represent integer types that large). llvm-svn: 189672
*	Fix typo.	Matt Arsenault	2013-08-28	1	-2/+2
\| \| \| \|	llvm-svn: 189524
*	Teach InstCombine about address spaces	Matt Arsenault	2013-08-21	2	-21/+33
\| \| \| \|	llvm-svn: 188926
*	Use pop_back_val() instead of both back() and pop_back().	Jakub Staszak	2013-08-19	1	-2/+1
\| \| \| \|	llvm-svn: 188723
*	Teach InstCombine visitGetElementPtr about address spaces	Matt Arsenault	2013-08-19	3	-20/+26
\| \| \| \|	llvm-svn: 188721
*	Cleanup visitGetElementPtr to make address space change easier	Matt Arsenault	2013-08-19	1	-11/+13
\| \| \| \|	llvm-svn: 188720
*	commonPointerCast cleanups to make address space change easier	Matt Arsenault	2013-08-19	1	-5/+11
\| \| \| \|	llvm-svn: 188719
*	Revert non-test parts of r188507	Matt Arsenault	2013-08-19	1	-1/+9
\| \| \| \| \| \|	Re-add the inboundsless tests I didn't add originally llvm-svn: 188710
*	InstCombine: Use isAllOnesValue() instead of explicit -1.	Jim Grosbach	2013-08-16	1	-1/+1
\| \| \| \|	llvm-svn: 188563
*	InstCombine: Simplify if(x!=0 && x!=-1).	Jim Grosbach	2013-08-16	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \|	When both constants are positive or both constants are negative, InstCombine already simplifies comparisons like this, but when it's exactly zero and -1, the operand sorting ends up reversed and the pattern fails to match. Handle that special case. Follow up for rdar://14689217 llvm-svn: 188512
*	Don't do FoldCmpLoadFromIndexedGlobal for non inbounds GEPs	Matt Arsenault	2013-08-15	1	-9/+2
\| \| \| \| \| \| \|	This path wasn't tested before without a datalayout, so add some more tests and re-run with and without one. llvm-svn: 188507
*	Fix always creating GEP with i32 indices	Matt Arsenault	2013-08-14	2	-10/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use the pointer size if datalayout is available. Use i64 if it's not, which is consistent with what other places do when the pointer size is unknown. The test doesn't really test this in a useful way since it will be transformed to that later anyway, but this now tests it for non-zero arrays and when datalayout isn't available. The cases in visitGetElementPtrInst should save an extra re-visit to the newly created GEP since it won't need to cleanup after itself. llvm-svn: 188339
*	Use type helper functions instead of cast	Matt Arsenault	2013-08-14	2	-11/+8
\| \| \| \|	llvm-svn: 188338
*	Use array initializer, space around operator	Matt Arsenault	2013-08-14	1	-5/+3
\| \| \| \|	llvm-svn: 188337
*	Fix big-endian handling of integer-to-vector bitcasts in InstCombine	Richard Sandiford	2013-08-12	1	-20/+32
\| \| \| \| \| \| \| \| \| \|	These functions used to assume that the lsb of an integer corresponds to vector element 0, whereas for big-endian it's the other way around: the msb is in the first element and the lsb is in the last element. Fixes MultiSource/Benchmarks/mediabench/gsm/toast for z. llvm-svn: 188155
*	Fix missing -- C++ --s	Matt Arsenault	2013-08-06	1	-1/+1
\| \| \| \|	llvm-svn: 187758
*	Preserve fast-math flags when folding (fsub x, (fneg y)) to (fadd x, y).	Owen Anderson	2013-07-30	1	-4/+11
\| \| \| \|	llvm-svn: 187462
*	Change behavior of calling bitcasted alias functions.	Matt Arsenault	2013-07-30	1	-9/+9
\| \| \| \| \| \| \| \|	It will now only convert the arguments / return value and call the underlying function if the types are able to be bitcasted. This avoids using fp<->int conversions that would occur before. llvm-svn: 187444
*	Fix variable name.	Owen Anderson	2013-07-26	1	-2/+2
\| \| \| \|	llvm-svn: 187253
*	When InstCombine tries to fold away (fsub x, (fneg y)) into (fadd x, y), it is	Owen Anderson	2013-07-26	1	-1/+13
\| \| \| \| \| \| \|	also worthwhile for it to look through FP extensions and truncations, whose application commutes with fneg. llvm-svn: 187249
*	Correct case of m_UIToFp to m_UIToFP to match instruction name, add m_SIToFP ↵	Stephen Lin	2013-07-26	1	-4/+4
\| \| \| \| \| \|	for consistency. llvm-svn: 187225
*	InstCombine: call FoldOpIntoSelect for all floating binops, not just fmul	Stephen Lin	2013-07-20	2	-3/+23
\| \| \| \|	llvm-svn: 186759
*	Restore r181216, which was partially reverted in r182499.	Stephen Lin	2013-07-17	2	-43/+29
\| \| \| \|	llvm-svn: 186533
*	Use llvm::array_lengthof to replace sizeof(array)/sizeof(array[0]).	Craig Topper	2013-07-15	1	-1/+2
\| \| \| \|	llvm-svn: 186301
*	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector ↵	Craig Topper	2013-07-14	1	-1/+1
\| \| \| \| \| \|	size. llvm-svn: 186274
*	Add a microoptimization for urem.	Nick Lewycky	2013-07-13	1	-0/+7
\| \| \| \|	llvm-svn: 186235
*	Fix a crash in EvaluateInDifferentElementOrder where it would generate an	Joey Gouly	2013-07-12	1	-1/+3
\| \| \| \| \| \| \| \|	undef vector of the wrong type. LGTM'd by Nick Lewycky on IRC. llvm-svn: 186224
*	Don't use a potentially expensive shift if all we want is one set bit.	Benjamin Kramer	2013-07-11	1	-1/+1
\| \| \| \| \| \|	No functionality change. llvm-svn: 186095
*	InstCombine: Fix typo in comment for visitICmpInstWithInstAndIntCst	David Majnemer	2013-07-09	1	-2/+2
\| \| \| \|	llvm-svn: 185916
*	InstCombine: variations on 0xffffffff - x >= 4	David Majnemer	2013-07-09	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	The following transforms are valid if -C is a power of 2: (icmp ugt (xor X, C), ~C) -> (icmp ult X, C) (icmp ult (xor X, C), -C) -> (icmp uge X, C) These are nice, they get rid of the xor. llvm-svn: 185915
*	InstCombine: X & -C != -C -> X <= u ~C	David Majnemer	2013-07-09	1	-0/+9
\| \| \| \| \| \|	Tests were added in r185910 somehow. llvm-svn: 185912
*	Commit r185909 was a misapplied patch, fix it	David Majnemer	2013-07-09	1	-21/+13
\| \| \| \|	llvm-svn: 185910
*	InstCombine: add more transforms	David Majnemer	2013-07-09	1	-0/+42
\| \| \| \| \| \| \| \| \|	C1-X <u C2 -> (X\|(C2-1)) == C1 C1-X >u C2 -> (X\|C2) == C1 X-C1 <u C2 -> (X & -C2) == C1 X-C1 >u C2 -> (X & ~C2) == C1 llvm-svn: 185909
*	InstCombine: Fold X-C1 <u 2 -> (X & -2) == C1	David Majnemer	2013-07-08	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \|	Back in r179493 we determined that two transforms collided with each other. The fix back then was to reorder the transforms so that the preferred transform would give it a try and then we would try the secondary transform. However, it was noted that the best approach would canonicalize one transform into the other, removing the collision and allowing us to optimize IR given to us in that form. llvm-svn: 185808
*	InstCombine: (icmp eq B, 0) \| (icmp ult A, B) -> (icmp ule A, B-1)	David Majnemer	2013-07-05	1	-1/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This transform allows us to turn IR that looks like: %1 = icmp eq i64 %b, 0 %2 = icmp ult i64 %a, %b %3 = or i1 %1, %2 ret i1 %3 into: %0 = add i64 %b, -1 %1 = icmp uge i64 %0, %a ret i1 %1 which means we go from lowering: cmpq %rsi, %rdi setb %cl testq %rsi, %rsi sete %al orb %cl, %al ret to lowering: decq %rsi cmpq %rdi, %rsi setae %al ret llvm-svn: 185677
*	InstCombine: Reimplementation of visitUDivOperand	David Majnemer	2013-07-04	1	-56/+139
\| \| \| \| \| \| \| \| \| \| \|	This transform was originally added in r185257 but later removed in r185415. The original transform would create instructions speculatively and then discard them if the speculation was proved incorrect. This has been replaced with a scheme that splits the transform into two parts: preflight and fold. While we preflight, we build up fold actions that inform the folding stage on how to act. llvm-svn: 185667
*	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid ↵	Craig Topper	2013-07-04	1	-1/+1
\| \| \| \| \| \|	specifying the vector size. llvm-svn: 185606
*	Revert r185257 (InstCombine: Be more agressive optimizing 'udiv' instrs with ↵	Hal Finkel	2013-07-02	1	-77/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	'select' denoms) I'm reverting this commit because: 1. As discussed during review, it needs to be rewritten (to avoid creating and then deleting instructions). 2. This is causing optimizer crashes. Specifically, I'm seeing things like this: While deleting: i1 % Use still stuck around after Def is destroyed: <badref> = select i1 <badref>, i32 0, i32 1 opt: /src/llvm-trunk/lib/IR/Value.cpp:79: virtual llvm::Value::~Value(): Assertion `use_empty() && "Uses remain when a value is destroyed!"' failed. I'd guess that these will go away once we're no longer creating/deleting instructions here, but just in case, I'm adding a regression test. Because the code is bring rewritten, I've just XFAIL'd the original regression test. Original commit message: InstCombine: Be more agressive optimizing 'udiv' instrs with 'select' denoms Real world code sometimes has the denominator of a 'udiv' be a 'select'. LLVM can handle such cases but only when the 'select' operands are symmetric in structure (both select operands are a constant power of two or a left shift, etc.). This falls apart if we are dealt a 'udiv' where the code is not symetric or if the select operands lead us to more select instructions. Instead, we should treat the LHS and each select operand as a distinct divide operation and try to optimize them independently. If we can to simplify each operation, then we can replace the 'udiv' with, say, a 'lshr' that has a new select with a bunch of new operands for the select. llvm-svn: 185415
*	InstCombine: Also turn selects fed by an and into arithmetic when the types ↵	Benjamin Kramer	2013-06-29	1	-4/+9
\| \| \| \| \| \| \| \| \|	don't match. Inserting a zext or trunc is sufficient. This pattern is somewhat common in LLVM's pointer mangling code. llvm-svn: 185270
*	InstCombine: FoldGEPICmp shouldn't change sign of base pointer comparison	David Majnemer	2013-06-29	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Changing the sign when comparing the base pointer would introduce all sorts of unexpected things like: %gep.i = getelementptr inbounds [1 x i8]* %a, i32 0, i32 0 %gep2.i = getelementptr inbounds [1 x i8]* %b, i32 0, i32 0 %cmp.i = icmp ult i8* %gep.i, %gep2.i %cmp.i1 = icmp ult [1 x i8]* %a, %b %cmp = icmp ne i1 %cmp.i, %cmp.i1 ret i1 %cmp into: %cmp.i = icmp slt [1 x i8]* %a, %b %cmp.i1 = icmp ult [1 x i8]* %a, %b %cmp = xor i1 %cmp.i, %cmp.i1 ret i1 %cmp By preserving the original sign, we now get: ret i1 false This fixes PR16483. llvm-svn: 185259
*	InstCombine: Small whitespace cleanup in FoldGEPICmp	David Majnemer	2013-06-29	1	-1/+1
\| \| \| \|	llvm-svn: 185258
*	InstCombine: Be more agressive optimizing 'udiv' instrs with 'select' denoms	David Majnemer	2013-06-29	1	-44/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Real world code sometimes has the denominator of a 'udiv' be a 'select'. LLVM can handle such cases but only when the 'select' operands are symmetric in structure (both select operands are a constant power of two or a left shift, etc.). This falls apart if we are dealt a 'udiv' where the code is not symetric or if the select operands lead us to more select instructions. Instead, we should treat the LHS and each select operand as a distinct divide operation and try to optimize them independently. If we can to simplify each operation, then we can replace the 'udiv' with, say, a 'lshr' that has a new select with a bunch of new operands for the select. llvm-svn: 185257
*	InstCombine: Optimize (1 << X) Pred CstP2 to X Pred Log2(CstP2)	David Majnemer	2013-06-28	1	-2/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We may, after other optimizations, find ourselves with IR that looks like: %shl = shl i32 1, %y %cmp = icmp ult i32 %shl, 32 Instead, we should just compare the shift count: %cmp = icmp ult i32 %y, 5 llvm-svn: 185242
*	Fix using arg_end() - arg_begin() instead of arg_size()	Matt Arsenault	2013-06-28	1	-3/+3
\| \| \| \|	llvm-svn: 185121
*	Revert "Revert "[APFloat] Removed APFloat constructor which initialized to ↵	Michael Gottesman	2013-06-27	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	either zero/NaN but allowed you to arbitrarily set the category of the float."" This reverts commit r185099. Looks like both the ppc-64 and mips bots are still failing after I reverted this change. Since: 1. The mips bot always performs a clean build, 2. The ppc64-bot failed again after a clean build (I asked the ppc-64 maintainers to clean the bot which they did... Thanks Will!), I think it is safe to assume that this change was not the cause of the failures that said builders were seeing. Thus I am recomitting. llvm-svn: 185111
*	Revert "[APFloat] Removed APFloat constructor which initialized to either ↵	Michael Gottesman	2013-06-27	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	zero/NaN but allowed you to arbitrarily set the category of the float." This reverts commit r185095. This is causing a FileCheck failure on the 3dnow intrinsics on at least the mips/ppc bots but not on the x86 bots. Reverting while I figure out what is going on. llvm-svn: 185099