bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix a typo in AddAliasScopeMetadata	Hal Finkel	2014-08-29	1	-1/+1
\| \| \| \|	llvm-svn: 216741
*	Revert two GEP-related InstCombine commits	David Majnemer	2014-08-29	1	-40/+11
\| \| \| \| \| \| \|	This reverts commit r216523 and r216598; people have reported regressions. llvm-svn: 216698
*	Don't promote byval pointer arguments when padding matters	Reid Kleckner	2014-08-28	1	-3/+81
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Don't promote byval pointer arguments when when their size in bits is not equal to their alloc size in bits. This can happen for x86_fp80, where the size in bits is 80 but the alloca size in bits in 128. Promoting these types can break passing unions of x86_fp80s and other types. Patch by Thomas Jablin! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D5057 llvm-svn: 216693
*	InstCombine: Remove redundant combines	David Majnemer	2014-08-28	1	-15/+0
\| \| \| \| \| \| \| \|	InstSimplify already handles icmp (X+Y), X (and things like it) appropriately. The first thing that InstCombine does is run InstSimplify on the instruction. llvm-svn: 216659
*	Fix: SLPVectorizer tried to move an instruction which was replaced by a ↵	Erik Eckstein	2014-08-28	1	-4/+0
\| \| \| \| \| \| \| \| \| \|	vector instruction. For a detailed description of the problem see the comment in the test file. The problematic moveBefore() calls are not required anymore because the new scheduling algorithm ensures a correct ordering anyway. llvm-svn: 216656
*	InstSimplify: Move a transform from InstCombine to InstSimplify	David Majnemer	2014-08-28	1	-10/+0
\| \| \| \| \| \| \| \|	Several combines involving icmp (shl C2, %X) C1 can be simplified without introducing any new instructions. Move them to InstSimplify; while we are at it, make them more powerful. llvm-svn: 216642
*	InstCombine: Combine gep X, (Y-X) to Y	David Majnemer	2014-08-27	1	-14/+25
\| \| \| \| \| \| \| \|	We try to perform this transform in InstSimplify but we aren't always able to. Sometimes, we need to insert a bitcast if X and Y don't have the same time. llvm-svn: 216598
*	[SLP] Re-enable vectorization of GEP expressions (re-apply r210342 with a fix).	Michael Zolotukhin	2014-08-27	1	-0/+101
\| \| \| \|	llvm-svn: 216549
*	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or ↵	Craig Topper	2014-08-27	12	-42/+27
\| \| \| \| \| \|	just letting them be implicitly created. llvm-svn: 216525
*	Fix some cases were ArrayRefs were being passed by reference. Also remove ↵	Craig Topper	2014-08-27	1	-4/+4
\| \| \| \| \| \|	'const' from some other ArrayRef uses since its implicitly const already. llvm-svn: 216524
*	InstCombine: Optimize GEP's involving ptrtoint better	David Majnemer	2014-08-27	1	-11/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We supported transforming: (gep i8* X, -(ptrtoint Y)) to: (inttoptr (sub (ptrtoint X), (ptrtoint Y))) However, this only fired if 'X' had type i8*. Generalize this to support various types of different sizes. This results in much better CodeGen, especially for pointers to packed structs. llvm-svn: 216523
*	Revert r210342 and r210343, add test case for the crasher.	Joerg Sonnenberger	2014-08-26	1	-91/+0
\| \| \| \| \| \|	PR 20642. llvm-svn: 216475
*	This patch enables SimplifyUsingDistributiveLaws() to handle following pattens.	Dinesh Dwivedi	2014-08-26	2	-54/+45
\| \| \| \| \| \| \| \| \| \| \| \|	(X >> Z) & (Y >> Z) -> (X&Y) >> Z for all shifts. (X >> Z) \| (Y >> Z) -> (X\|Y) >> Z for all shifts. (X >> Z) ^ (Y >> Z) -> (X^Y) >> Z for all shifts. These patterns were previously handled separately in visitAnd()/visitOr()/visitXor(). Differential Revision: http://reviews.llvm.org/D4951 llvm-svn: 216443
*	musttail: Don't eliminate varargs packs if there is a forwarding call	Reid Kleckner	2014-08-26	1	-2/+7
\| \| \| \| \| \|	Also clean up and beef up this grep test for the feature. llvm-svn: 216425
*	fix typos in comments	Sanjay Patel	2014-08-26	1	-4/+4
\| \| \| \|	llvm-svn: 216424
*	ArgPromotion: Don't touch variadic functions	Reid Kleckner	2014-08-25	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adding, removing, or changing non-pack parameters can change the ABI classification of pack parameters. Clang and other frontends encode the classification in the IR of the call site, but the callee side determines it dynamically based on the number of registers consumed so far. Changing the prototype affects the number of registers consumed would break such code. Dead argument elimination performs a similar task and already has a similar check to avoid this problem. Patch by Thomas Jablin! llvm-svn: 216421
*	Modernize raw_fd_ostream's constructor a bit.	Rafael Espindola	2014-08-25	2	-5/+4
\| \| \| \| \| \| \| \| \| \|	Take a StringRef instead of a "const char *". Take a "std::error_code &" instead of a "std::string &" for error. A create static method would be even better, but this patch is already a bit too big. llvm-svn: 216393
*	Remove dangling initializers in GlobalDCE	Bruno Cardoso Lopes	2014-08-25	2	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	GlobalDCE deletes global vars and updates their initializers to nullptr while leaving underlying constants to be cleaned up later by its uses. The clean up may never happen, fix this by forcing it every time it's safe to destroy constants. Final patch by Rafael Espindola http://reviews.llvm.org/D4931 <rdar://problem/17523868> llvm-svn: 216390
*	MergeFunctions, tiny refactoring:	Stepan Dyatkovskiy	2014-08-25	1	-3/+3
\| \| \| \| \| \|	cmpAPFloat has been renamed to cmpAPFloats (multiple form). llvm-svn: 216376
*	MergeFunctions, tiny refactoring:	Stepan Dyatkovskiy	2014-08-25	1	-5/+5
\| \| \| \| \| \|	cmpAPInt has been renamed to cmpAPInts (multiple form). llvm-svn: 216375
*	MergeFunctions, tiny refactoring:	Stepan Dyatkovskiy	2014-08-25	1	-12/+11
\| \| \| \| \| \|	cmpType has been renamed to cmpTypes (multiple form). llvm-svn: 216374
*	MergeFunctions, tiny refactoring:	Stepan Dyatkovskiy	2014-08-25	1	-5/+5
\| \| \| \| \| \|	cmpGEP has been renamed to cmpGEPs (multiple form). llvm-svn: 216373
*	Allow vectorization of division by uniform power of 2.	Karthik Bhat	2014-08-25	2	-8/+31
\| \| \| \| \| \| \| \|	This patch adds support to recognize division by uniform power of 2 and modifies the cost table to vectorize division by uniform power of 2 whenever possible. Updates Cost model for Loop and SLP Vectorizer.The cost table is currently only updated for X86 backend. Thanks to Hal, Andrea, Sanjay for the review. (http://reviews.llvm.org/D4971) llvm-svn: 216371
*	Use range based for loops to avoid needing to re-mention SmallPtrSet size.	Craig Topper	2014-08-24	13	-103/+58
\| \| \| \|	llvm-svn: 216351
*	InstCombine: Properly optimize or'ing bittests together	David Majnemer	2014-08-24	1	-0/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CFE, with -03, would turn: bool f(unsigned x) { bool a = x & 1; bool b = x & 2; return a \| b; } into: %1 = lshr i32 %x, 1 %2 = or i32 %1, %x %3 = and i32 %2, 1 %4 = icmp ne i32 %3, 0 This sort of thing exposes a nasty pathology in GCC, ICC and LLVM. Instead, we would rather want: %1 = and i32 %x, 3 %2 = icmp ne i32 %1, 0 Things get a bit more interesting in the following case: %1 = lshr i32 %x, %y %2 = or i32 %1, %x %3 = and i32 %2, 1 %4 = icmp ne i32 %3, 0 Replacing it with the following sequence is better: %1 = shl nuw i32 1, %y %2 = or i32 %1, 1 %3 = and i32 %2, %x %4 = icmp ne i32 %3, 0 This sequence is preferable because %1 doesn't involve %x and could potentially be hoisted out of loops if it is invariant; only perform this transform in the non-constant case if we know we won't increase register pressure. llvm-svn: 216343
*	[SROA] Fold a PHI node if all its incoming values are the same	Jingyue Wu	2014-08-22	1	-41/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes PR20425. During slice building, if all of the incoming values of a PHI node are the same, replace the PHI node with the common value. This simplification makes alloca's used by PHI nodes easier to promote. Test Plan: Added three more tests in phi-and-select.ll Reviewers: nlewycky, eliben, meheff, chandlerc Reviewed By: chandlerc Subscribers: zinovy.nis, hfinkel, baldrick, llvm-commits Differential Revision: http://reviews.llvm.org/D4659 llvm-svn: 216299
*	InstCombine: Don't unconditionally preserve 'nuw' when shrinking constants	David Majnemer	2014-08-22	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Consider: %add = add nuw i32 %a, -16777216 %and = and i32 %add, 255 Regardless of whether or not we demand the sign bit of %add, we cannot replace -16777216 with 2130706432 without also removing 'nuw' from the instruction. llvm-svn: 216273
*	InstCombine: sub nsw %x, C -> add nsw %x, -C if C isn't INT_MIN	David Majnemer	2014-08-22	1	-1/+4
\| \| \| \| \| \|	We can preserve nsw during this transform if -C won't overflow. llvm-svn: 216269
*	InstCombine: Don't unconditionally preserve 'nsw' when shrinking constants	David Majnemer	2014-08-22	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Consider: %add = add nsw i32 %a, -16777216 %and = and i32 %add, 255 Regardless of whether or not we demand the sign bit of %add, we cannot replace -16777216 with 2130706432 without also removing 'nsw' from the instruction. This fixes PR20377. llvm-svn: 216261
*	fix: SLPVectorizer crashes for unreachable blocks containing not schedulable ↵	Erik Eckstein	2014-08-22	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	instructions. In unreachable blocks it's legal to have instructions like "%x = op %x". Such instuctions are not schedulable. Therefore the SLPVectorizer has to check for unreachable blocks and ignore them. Fixes bug 20646. llvm-svn: 216256
*	[dfsan] Fix non-determinism bug in non-zero label check annotator.	Peter Collingbourne	2014-08-22	1	-10/+8
\| \| \| \| \| \| \|	We now use a std::vector instead of a DenseSet to store the list of label checks so that we can iterate over it deterministically. llvm-svn: 216255
*	SROA: Handle a case of store size being smaller than allocation size	Reid Kleckner	2014-08-22	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In this case, we are creating an x86_fp80 slice for a union from C where the padding bytes may contain real data. An x86_fp80 alloca is 16 bytes, and that's just fine. We can't, however, use regular loads and stores to access the slice, because the store size is only 10 bytes / 80 bits. Instead, use memcpy and memset. Fixes PR18726. Reviewed By: chandlerc Differential Revision: http://reviews.llvm.org/D5012 llvm-svn: 216248
*	Use DILexicalBlockFile, rather than DILexicalBlock, to track discriminator ↵	David Blaikie	2014-08-21	1	-4/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	changes to ensure discriminator changes don't introduce new DWARF DW_TAG_lexical_blocks. Somewhat unnoticed in the original implementation of discriminators, but it could cause instructions to end up in new, small, DW_TAG_lexical_blocks due to the use of DILexicalBlock to track discriminator changes. Instead, use DILexicalBlockFile which we already use to track file changes without introducing new scopes, so it works well to track discriminator changes in the same way. llvm-svn: 216239
*	Move some logic to populateLTOPassManager.	Rafael Espindola	2014-08-21	1	-5/+36
\| \| \| \| \| \| \|	This will avoid code duplication in the next commit which calls it directly from the gold plugin. llvm-svn: 216211
*	Respect LibraryInfo in populateLTOPassManager and use it. NFC.	Rafael Espindola	2014-08-21	1	-0/+4
\| \| \| \|	llvm-svn: 216203
*	Handle inlining in populateLTOPassManager like in populateModulePassManager.	Rafael Espindola	2014-08-21	1	-5/+13
\| \| \| \| \| \|	No functionality change. llvm-svn: 216178
*	[CLNUP] Remove return after llvm_unreachable. Thanks to Hal Finkel for pointing.	Zinovy Nis	2014-08-21	1	-1/+0
\| \| \| \|	llvm-svn: 216176
*	Move DisableGVNLoadPRE from populateLTOPassManager to PassManagerBuilder.	Rafael Espindola	2014-08-21	1	-6/+6
\| \| \| \|	llvm-svn: 216174
*	Reassociate x + -0.1234 * y into x - 0.1234 * y	Erik Verbruggen	2014-08-21	1	-2/+49
\| \| \| \| \| \| \| \| \| \| \|	This does not require -ffast-math, and it gives CSE/GVN more options to eliminate duplicate expressions in, e.g.: return ((x + 0.1234 * y) * (x - 0.1234 * y)); Differential Revision: http://reviews.llvm.org/D4904 llvm-svn: 216169
*	[INDVARS] Extend using of widening of induction variables for the cases of ↵	Zinovy Nis	2014-08-21	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	"sub nsw" and "mul nsw" instructions. Currently only "add nsw" are widened. This patch eliminates tons of "sext" instructions for 64 bit code (and the corresponding target code) in cases like: int N = 100; float *A; void foo(int x0, int x1) { float A_cur = &A[0][0]; float * A_next = &A[1][0]; for(int x = x0; x < x1; ++x). { // Currently only [x+N] case is widened. Others 2 cases lead to sext. // This patch fixes it, so all 3 cases do not need sext. const float div = A_cur[x + N] + A_cur[x - N] + A_cur[x * N]; A_next[x] = div; } } ... > clang++ test.cpp -march=core-avx2 -Ofast -fno-unroll-loops -fno-tree-vectorize -S -o - Differential Revision: http://reviews.llvm.org/D4695 llvm-svn: 216160
*	Repace SmallPtrSet with SmallPtrSetImpl in function arguments to avoid ↵	Craig Topper	2014-08-21	22	-75/+73
\| \| \| \| \| \|	needing to mention the size. llvm-svn: 216158
*	InstCombine: Fold ((A \| B) & C1) ^ (B & C2) -> (A & C1) ^ B if C1^C2=-1	David Majnemer	2014-08-21	2	-0/+46
\| \| \| \| \| \|	Adapted from a patch by Richard Smith, test-case written by me. llvm-svn: 216157
*	[LoopVectorizer] Limit unroll factor in the presence of nested reductions.	James Molloy	2014-08-20	1	-0/+17
\| \| \| \| \| \|	If we have a scalar reduction, we can increase the critical path length if the loop we're unrolling is inside another loop. Limit, by default to 2, so the critical path only gets increased by one reduction operation. llvm-svn: 216140
*	New InstCombine pattern: (icmp ult/ule (A + C1), C3) \| (icmp ult/ule (A + ↵	Yi Jiang	2014-08-20	1	-0/+55
\| \| \| \| \| \|	C2), C3) to (icmp ult/ule ((A & ~(C1 ^ C2)) + max(C1, C2)), C3) under certain condition llvm-svn: 216135
*	InstCombine: Annotate sub with nuw when we prove it's safe	David Majnemer	2014-08-20	2	-0/+19
\| \| \| \| \| \| \|	We can prove that a 'sub' can be a 'sub nuw' if the left-hand side is negative and the right-hand side is non-negative. llvm-svn: 216045
*	[dfsan] Treat vararg custom functions like unimplemented functions.	Peter Collingbourne	2014-08-20	1	-1/+1
\| \| \| \| \| \| \| \| \|	Because declarations of these functions can appear in places like autoconf checks, they have to be handled somehow, even though we do not support vararg custom functions. We do so by printing a warning and calling the uninstrumented function, as we do for unimplemented functions. llvm-svn: 216042
*	InstCombine: Annotate sub with nsw when we prove it's safe	David Majnemer	2014-08-19	2	-1/+40
\| \| \| \| \| \| \| \| \| \|	We can prove that a 'sub' can be a 'sub nsw' under certain conditions: - The sign bits of the operands is the same. - Both operands have more than 1 sign bit. The subtraction cannot be a signed overflow in either case. llvm-svn: 216037
*	Revert "Small refactor on VectorizerHint for deduplication"	Renato Golin	2014-08-19	1	-147/+91
\| \| \| \| \| \|	This reverts commit r215994 because MSVC 2012 can't cope with its C++11 goodness. llvm-svn: 215999
*	Small refactor on VectorizerHint for deduplication	Renato Golin	2014-08-19	1	-91/+147
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, the hint mechanism relied on clean up passes to remove redundant metadata, which still showed up if running opt at low levels of optimization. That also has shown that multiple nodes of the same type, but with different values could still coexist, even if temporary, and cause confusion if the next pass got the wrong value. This patch makes sure that, if metadata already exists in a loop, the hint mechanism will never append a new node, but always replace the existing one. It also enhances the algorithm to cope with more metadata types in the future by just adding a new type, not a lot of code. llvm-svn: 215994
*	InstCombine: ((A & ~B) ^ (~A & B)) to A ^ B	Mayur Pandey	2014-08-19	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \|	Proof using CVC3 follows: $ cat t.cvc A, B : BITVECTOR(32); QUERY BVXOR((A & ~B),(~A & B)) = BVXOR(A,B); $ cvc3 t.cvc Valid. Differential Revision: http://reviews.llvm.org/D4898 llvm-svn: 215974