bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[Statepoints 3/4] Statepoint infrastructure for garbage collection: ↵	Philip Reames	2014-12-02	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SelectionDAGBuilder This is the third patch in a small series. It contains the CodeGen support for lowering the gc.statepoint intrinsic sequences (223078) to the STATEPOINT pseudo machine instruction (223085). The change also includes the set of helper routines and classes for working with gc.statepoints, gc.relocates, and gc.results since the lowering code uses them. With this change, gc.statepoints should be functionally complete. The documentation will follow in the fourth change, and there will likely be some cleanup changes, but interested parties can start experimenting now. I'm not particularly happy with the amount of code or complexity involved with the lowering step, but at least it's fairly well isolated. The statepoint lowering code is split into its own files and anyone not working on the statepoint support itself should be able to ignore it. During the lowering process, we currently spill aggressively to stack. This is not entirely ideal (and we have plans to do better), but it's functional, relatively straightforward, and matches closely the implementations of the patchpoint intrinsics. Most of the complexity comes from trying to keep relocated copies of values in the same stack slots across statepoints. Doing so avoids the insertion of pointless load and store instructions to reshuffle the stack. The current implementation isn't as effective as I'd like, but it is functional and 'good enough' for many common use cases. In the long term, I'd like to figure out how to integrate the statepoint lowering with the register allocator. In principle, we shouldn't need to eagerly spill at all. The register allocator should do any spilling required and the statepoint should simply record that fact. Depending on how challenging that turns out to be, we may invest in a smarter global stack slot assignment mechanism as a stopgap measure. Reviewed by: atrick, ributzka llvm-svn: 223137
*	[SwitchLowering] Handle destinations on multiple phi instructions	Bruno Cardoso Lopes	2014-12-02	1	-2/+3
\| \| \| \| \| \| \| \| \|	Follow up from r222926. Also handle multiple destinations from merged cases on multiple and subsequent phi instructions. rdar://problem/19106978 llvm-svn: 223135
*	[LICM] Avoid store sinking if no preheader is available	Bruno Cardoso Lopes	2014-12-02	1	-2/+4
\| \| \| \| \| \| \| \| \|	Load instructions are inserted into loop preheaders when sinking stores and later removed if not used by the SSA updater. Avoid sinking if the loop has no preheader and avoid crashes. This fixes one more side effect of not handling indirectbr instructions properly on LoopSimplify. llvm-svn: 223119
*	Revert r223049, r223050 and r223051 while investigating test failures.	Hans Wennborg	2014-12-01	1	-8/+56
\| \| \| \| \| \|	I didn't foresee affecting the Clang test suite :/ llvm-svn: 223054
*	SimplifyCFG: Omit range checks for switch lookup tables when default is ↵	Hans Wennborg	2014-12-01	1	-7/+8
\| \| \| \| \| \| \| \|	unreachable They would get optimized away later, but we might as well not emit them. llvm-svn: 223051
*	SimplifyCFG: don't remove unreachable default switch destinations	Hans Wennborg	2014-12-01	1	-49/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	An unreachable default destination can be exploited by other optimizations, and SDag lowering is now prepared to handle them efficiently. For example, branches to the unreachable destination will be optimized away, such as in the case of range checks for switch lookup tables. On 64-bit Linux, this reduces the size of a clang bootstrap by 80 kB (and Chromium by 30 kB). llvm-svn: 223050
*	[msan] Add compile-time checks for missing origins.	Evgeniy Stepanov	2014-12-01	1	-9/+10
\| \| \| \| \| \| \| \| \| \| \|	This change makes MemorySanitizer instrumentation a bit more strict about instructions that have no origin id assigned to them. This would have caught the bug that was fixed in r222918. No functional change. llvm-svn: 222997
*	[asan] Change dynamic alloca instrumentation to only consider allocas that ↵	Yury Gribov	2014-12-01	1	-3/+35
\| \| \| \| \| \| \| \|	are dominating all exits from function. Reviewed in http://reviews.llvm.org/D6412 llvm-svn: 222991
*	DebugIR: Delete -debug-ir	Duncan P. N. Exon Smith	2014-11-29	3	-716/+0
\| \| \| \|	llvm-svn: 222945
*	Revert "Masked Vector Load and Store Intrinsics."	Duncan P. N. Exon Smith	2014-11-28	1	-83/+15
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit r222632 (and follow-up r222636), which caused a host of LNT failures on an internal bot. I'll respond to the commit on the list with a reproduction of one of the failures. Conflicts: lib/Target/X86/X86TargetTransformInfo.cpp llvm-svn: 222936
*	InstCombine: FoldOrOfICmps harder	David Majnemer	2014-11-28	1	-2/+25
\| \| \| \| \| \| \| \| \| \| \| \|	We may be in a situation where the icmps might not be near each other in a tree of or instructions. Try to dig out related compare instructions and see if they combine. N.B. This won't fire on deep trees of compares because rewriting the tree might end up creating a net increase of IR. We may have to resort to something more sophisticated if this is a real problem. llvm-svn: 222928
*	[LICM] Store sink and indirectbr instructions	Bruno Cardoso Lopes	2014-11-28	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Loop simplify skips exit-block insertion when exits contain indirectbr instructions. This leads to an assertion in LICM when trying to sink stores out of non-dedicated loop exits containing indirectbr instructions. This patch fixes this issue by re-checking for dedicated exits in LICM prior to store sink attempts. Differential Revision: http://reviews.llvm.org/D6414 rdar://problem/18943047 llvm-svn: 222927
*	[SwitchLowering] Handle multiple destinations on condensed case stmts	Bruno Cardoso Lopes	2014-11-28	1	-12/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Switch case statements with sequential values that branch to the same destination BB may often be handled together in a single new source BB. In this scenario we need to remove the remaining incoming values from PHI instructions in the destination BB, so as to match the number of source branches. Differential Revision: http://reviews.llvm.org/D6415 rdar://problem/19040894 llvm-svn: 222926
*	[msan] Fix origin propagation for select of floats.	Evgeniy Stepanov	2014-11-28	1	-3/+4
\| \| \| \| \| \| \| \| \| \|	MSan does not assign origin for instrumentation temps (i.e. the ones that do not come from the application code), but "select" instrumentation erroneously tried to use one of those. https://code.google.com/p/memory-sanitizer/issues/detail?id=78 llvm-svn: 222918
*	Removed extra line from a comment to test first commit. NFC.	Ankur Garg	2014-11-28	1	-1/+0
\| \| \| \|	llvm-svn: 222916
*	reinstate r222872: Peephole optimization in switch table lookup: reuse the ↵	Erik Eckstein	2014-11-27	1	-7/+100
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	guarding table comparison if possible. Fixed missing dominance check. Original commit message: This optimization tries to reuse the generated compare instruction, if there is a comparison against the default value after the switch. Example: if (idx < tablesize) r = table[idx]; // table does not contain default_value else r = default_value; if (r != default_value) ... Is optimized to: cond = idx < tablesize; if (cond) r = table[idx]; else r = default_value; if (cond) ... Jump threading will then eliminate the second if(cond). llvm-svn: 222891
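The peephole described in the commit message above can be modeled outside LLVM. The following is a minimal sketch, not the SimplifyCFG implementation: `kTable`, `kDefault`, `hit_before`, and `hit_after` are hypothetical names chosen for illustration. It relies on the same precondition the message states, namely that the table does not contain the default value, so the test `r != default_value` is true exactly when the range check passed.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical lookup table and default value; the table must not
// contain kDefault for the rewrite to be valid.
constexpr int kDefault = -1;
const std::array<int, 4> kTable = {10, 20, 30, 40};

// Before: the second test compares the loaded result with the default.
bool hit_before(std::size_t idx) {
    int r;
    if (idx < kTable.size())
        r = kTable[idx];
    else
        r = kDefault;
    return r != kDefault;
}

// After: the second test reuses the range-check condition directly.
bool hit_after(std::size_t idx) {
    const bool cond = idx < kTable.size();
    const int r = cond ? kTable[idx] : kDefault;
    (void)r;  // r is still computed, as in the commit's example
    return cond;
}

// Both forms agree for in-range and out-of-range indices.
void check_peephole_equivalence() {
    for (std::size_t idx = 0; idx < 8; ++idx)
        assert(hit_before(idx) == hit_after(idx));
}
```

If the table did contain the default value, an in-range lookup could return it, and the two forms would disagree for that index.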
*	[msan] Remove indirect call wrapping code.	Evgeniy Stepanov	2014-11-27	1	-89/+1
\| \| \| \| \| \|	This functionality was only used in MSanDR, which is deprecated. llvm-svn: 222889
*	Revert "Peephole optimization in switch table lookup: reuse the guarding ↵	Erik Eckstein	2014-11-27	1	-89/+7
\| \| \| \| \| \| \| \|	table comparison if possible." It is breaking the clang bootstrap. llvm-svn: 222877
*	Peephole optimization in switch table lookup: reuse the guarding table ↵	Erik Eckstein	2014-11-27	1	-7/+89
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	comparison if possible. This optimization tries to reuse the generated compare instruction, if there is a comparison against the default value after the switch. Example: if (idx < tablesize) r = table[idx]; // table does not contain default_value else r = default_value; if (r != default_value) ... Is optimized to: cond = idx < tablesize; if (cond) r = table[idx]; else r = default_value; if (cond) ... Jump threading will then eliminate the second if(cond). llvm-svn: 222872
*	InstCombine: Restore optimizations lost in r210006	David Majnemer	2014-11-27	1	-0/+28
\| \| \| \| \| \| \| \|	This restores our ability to optimize: (X & C) == 0 ? X ^ C : X into X \| C, and (X & C) != 0 ? X ^ C : X into X & ~C llvm-svn: 222871
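The two folds above can be checked exhaustively on a small bit width. This is a sketch under one assumption that the message itself does not state: C is a single-bit mask, which the title of the reverted commit ("single bit tests") suggests. The function names are made up for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Select form from the commit message: (X & C) == 0 ? X ^ C : X
constexpr std::uint8_t select_set(std::uint8_t x, std::uint8_t c) {
    return (x & c) == 0 ? static_cast<std::uint8_t>(x ^ c) : x;
}

// Select form from the commit message: (X & C) != 0 ? X ^ C : X
constexpr std::uint8_t select_clear(std::uint8_t x, std::uint8_t c) {
    return (x & c) != 0 ? static_cast<std::uint8_t>(x ^ c) : x;
}

// Compare each select form with its folded form for every 8-bit x
// and every single-bit mask c (assumption: C has exactly one bit set).
void check_single_bit_folds() {
    for (int bit = 0; bit < 8; ++bit) {
        const std::uint8_t c = static_cast<std::uint8_t>(1u << bit);
        for (int v = 0; v < 256; ++v) {
            const std::uint8_t x = static_cast<std::uint8_t>(v);
            assert(select_set(x, c) == static_cast<std::uint8_t>(x | c));
            assert(select_clear(x, c) == static_cast<std::uint8_t>(x & ~c));
        }
    }
}
```

With a multi-bit mask the first fold no longer holds: for X = 1 and C = 3 the select yields 1, while X | C is 3.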
*	Revert "Added inst combine transforms for single bit tests from Chris's note"	David Majnemer	2014-11-26	1	-28/+1
\| \| \| \| \| \| \| \| \| \| \|	This reverts commit r210006, it miscompiled libapr which is used in who knows how many projects. A test has been added to ensure that we don't regress again. I'll work on a rewrite of what the optimization was trying to do later. llvm-svn: 222856
*	[InstCombine] Change LLVM To canonicalize toward the value type being	Chandler Carruth	2014-11-25	1	-100/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	stored rather than the pointer type. This change is analogous to r220138 which changed the canonicalization for loads. The rationale is the same: memory does not have a type, operations (and thus the values they produce) have a type. We should match that type as closely as possible rather than reading some form of semantics into the pointer type. With this change, loads and stores should no longer be made with nonsensical types for the values that they load and store. This is particularly important when trying to match specific loaded and stored types in the process of doing other instcombines, which is what led me down this twisty maze of miscanonicalization. I've put quite some effort into looking through IR to find places where LLVM's optimizer was being unreasonably conservative in the face of mismatched load and store types, however it is possible (let's say, likely!) I have missed some. If you see regressions here, or from r220138, the likely cause is some part of LLVM failing to cope with load and store types differing. Test cases appreciated, it is important that we root all of these out of LLVM. llvm-svn: 222748
*	Revert r220349 to re-instate r220277 with a fix for PR21330 -- quite	Chandler Carruth	2014-11-25	2	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	clearly only exactly equal width ptrtoint and inttoptr casts are no-op casts, it says so right there in the langref. Make the code agree. Original log from r220277: Teach the load analysis to allow finding available values which require inttoptr or ptrtoint cast provided there is datalayout available. Eventually, the datalayout can just be required but in practice it will always be there today. To go with the ability to expose available values requiring a ptrtoint or inttoptr cast, helpers are added to perform one of these three casts. These smarts are necessary to finish canonicalizing loads and stores to the operational type requirements without regressing fundamental combines. I've added some test cases. These should actually improve as the load combining and store combining improves, but they may fundamentally be highlighting some missing combines for select in addition to exercising the specific added logic to load analysis. llvm-svn: 222739
*	Bug 21610: Canonicalize min/max fcmp selects to use ordered comparisons	Matt Arsenault	2014-11-24	1	-1/+30
\| \| \| \|	llvm-svn: 222705
*	[asan/coverage] change the way asan coverage instrumentation is done: ↵	Kostya Serebryany	2014-11-24	1	-10/+7
\| \| \| \| \| \|	instead of setting the guard to 1 in the generated code, pass the pointer to guard to __sanitizer_cov and set it there. No user-visible functionality change expected llvm-svn: 222675
*	InstCombine: Don't create an unused instruction	David Majnemer	2014-11-24	1	-2/+1
\| \| \| \| \| \| \| \| \| \|	We would create an instruction but not insert it. Leaving the unused instruction uninserted would lead to a verification failure. This fixes PR21653. llvm-svn: 222659
*	InstCombine: Don't assume DataLayout is always available	David Majnemer	2014-11-24	1	-1/+2
\| \| \| \| \| \| \| \| \|	We tried to get the result of DataLayout::getLargestLegalIntTypeSize but we didn't have a DataLayout. This resulted in opt crashing. This fixes PR21651. llvm-svn: 222645
*	Masked Vector Load and Store Intrinsics.	Elena Demikhovsky	2014-11-23	1	-15/+83
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduced new target-independent intrinsics in order to support masked vector loads and stores. The loop vectorizer optimizes loops containing conditional memory accesses by generating these intrinsics for existing targets AVX2 and AVX-512. The vectorizer asks the target about availability of masked vector loads and stores. Added SDNodes for masked operations and lowering patterns for X86 code generator. Examples: <16 x i32> @llvm.masked.load.v16i32(i8* %addr, <16 x i32> %passthru, i32 4 /* align */, <16 x i1> %mask) declare void @llvm.masked.store.v8f64(i8* %addr, <8 x double> %value, i32 4, <8 x i1> %mask) Scalarizer for other targets (not AVX2/AVX-512) will be done in a separate patch. http://reviews.llvm.org/D6191 llvm-svn: 222632
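To make the lane-wise meaning of the two intrinsics concrete, here is a scalar model. It is only a sketch of the semantics, not the scalarizer the message defers to a separate patch, and `masked_load` and `masked_store` are hypothetical helper names. Lanes whose mask bit is off take the pass-through value on a load and leave memory untouched on a store; the alignment operand is not modeled.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Scalar model of a masked vector load: enabled lanes read memory,
// disabled lanes keep the corresponding pass-through element.
template <typename T, std::size_t N>
std::array<T, N> masked_load(const T* addr,
                             const std::array<T, N>& passthru,
                             const std::array<bool, N>& mask) {
    std::array<T, N> result = passthru;
    for (std::size_t i = 0; i < N; ++i) {
        if (mask[i])
            result[i] = addr[i];  // memory is only touched for enabled lanes
    }
    return result;
}

// Scalar model of a masked vector store: only enabled lanes are written.
template <typename T, std::size_t N>
void masked_store(T* addr,
                  const std::array<T, N>& value,
                  const std::array<bool, N>& mask) {
    for (std::size_t i = 0; i < N; ++i) {
        if (mask[i])
            addr[i] = value[i];
    }
}
```

Because disabled lanes never dereference `addr`, a loop with a conditional memory access can be vectorized without reading or writing elements the scalar loop would have skipped.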
*	InstCombine: Propagate exact for (sdiv X, Pow2) -> (udiv X, Pow2)	David Majnemer	2014-11-22	1	-2/+4
\| \| \| \|	llvm-svn: 222625
*	InstCombine: Propagate exact for (sdiv X, Y) -> (udiv X, Y)	David Majnemer	2014-11-22	1	-1/+3
\| \| \| \|	llvm-svn: 222624
*	InstCombine: Propagate exact for (sdiv -X, C) -> (sdiv X, -C)	David Majnemer	2014-11-22	1	-4/+6
\| \| \| \|	llvm-svn: 222623
*	Tidied up target triple OS detection. NFC	Simon Pilgrim	2014-11-22	1	-2/+2
\| \| \| \| \| \|	Use Triple::isOS*() helper functions where possible. llvm-svn: 222622
*	InstCombine: Propagate exact in (udiv (lshr X,C1),C2) -> (udiv x,C1<<C2)	David Majnemer	2014-11-22	1	-2/+7
\| \| \| \|	llvm-svn: 222620
*	InstCombine: Propagate NSW/NUW for X*(1<<Y) -> X<<Y	David Majnemer	2014-11-22	1	-4/+17
\| \| \| \|	llvm-svn: 222613
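The NUW half of this fold can be sanity-checked on 8-bit values. The sketch below is an illustration under stated assumptions, not the InstCombine code: it models only the unsigned case (NSW is not covered), restricts Y to 0..7 so that 1<<Y fits in 8 bits, and uses hypothetical helper names. It checks that X*(1<<Y) and X<<Y agree modulo 256 and that the multiply wraps exactly when the shift drops set bits, which is why the no-unsigned-wrap flag can carry over.

```cpp
#include <cassert>
#include <cstdint>

// True if x * (1 << y) does not fit in 8 unsigned bits.
bool mul_wraps(std::uint8_t x, unsigned y) {
    return static_cast<unsigned>(x) * (1u << y) > 0xFFu;
}

// True if the 8-bit shift x << y shifts out at least one set bit.
bool shl_drops_bits(std::uint8_t x, unsigned y) {
    const std::uint8_t shifted =
        static_cast<std::uint8_t>(static_cast<unsigned>(x) << y);
    return static_cast<std::uint8_t>(shifted >> y) != x;
}

// Exhaustive check over all 8-bit x and shift amounts 0..7.
void check_nuw_transfers() {
    for (unsigned y = 0; y < 8; ++y) {
        for (unsigned v = 0; v < 256; ++v) {
            const std::uint8_t x = static_cast<std::uint8_t>(v);
            const std::uint8_t mul8 = static_cast<std::uint8_t>(v * (1u << y));
            const std::uint8_t shl8 = static_cast<std::uint8_t>(v << y);
            assert(mul8 == shl8);                            // same result
            assert(mul_wraps(x, y) == shl_drops_bits(x, y)); // same wrap condition
        }
    }
}
```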
*	InstCombine: Propagate NSW for -X * -Y -> X * Y	David Majnemer	2014-11-22	1	-3/+10
\| \| \| \|	llvm-svn: 222612
*	InstCombine: Silence a parenthesis warning	David Majnemer	2014-11-22	1	-1/+1
\| \| \| \|	llvm-svn: 222609
*	InstCombine: Preserve nsw when folding X*(2^C) -> X << C	David Majnemer	2014-11-22	1	-0/+2
\| \| \| \|	llvm-svn: 222606
*	InstCombine: Preserve nsw/nuw for ((X << C2)*C1) -> (X * (C1 << C2))	David Majnemer	2014-11-22	1	-3/+12
\| \| \| \|	llvm-svn: 222605
*	InstCombine: Preserve nsw for (mul %V, -1) -> (sub 0, %V)	David Majnemer	2014-11-22	1	-2/+7
\| \| \| \|	llvm-svn: 222604
*	[InstCombine] Re-commit of r218721 (Optimize icmp-select-icmp sequence)	Gerolf Hoflehner	2014-11-21	3	-10/+168
\| \| \| \| \| \| \|	Fixes the self-host fail. Note that this commit activates dominator analysis in the combiner by default (like the original commit did). llvm-svn: 222590
*	[asan] remove old experimental code	Kostya Serebryany	2014-11-21	1	-38/+0
\| \| \| \|	llvm-svn: 222586
*	[asan] add statistic counter to dynamic alloca instrumentation	Kostya Serebryany	2014-11-21	1	-0/+3
\| \| \| \|	llvm-svn: 222573
*	Disable header duplication at -Oz in loop-rotate pass.	Roman Divacky	2014-11-21	1	-1/+2
\| \| \| \|	llvm-svn: 222562
*	[asan] Add new hidden compile-time flag asan-instrument-allocas to sanitize ↵	Yury Gribov	2014-11-21	1	-6/+227
\| \| \| \| \| \| \| \|	variable-sized dynamic allocas. Patch by Max Ostapenko. Reviewed at http://reviews.llvm.org/D6055 llvm-svn: 222519
*	This Reassociate change unintentionally slipped in r222499	David Majnemer	2014-11-21	1	-7/+0
\| \| \| \|	llvm-svn: 222500
*	SROA: The alloca type isn't a candidate promotion type for vectors	David Majnemer	2014-11-21	2	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \|	The alloca's type is irrelevant, only those types which are used in a load or store of the exact size of the slice should be considered. This manifested as an assertion failure when we compared the various types: we had a size mismatch. This fixes PR21480. llvm-svn: 222499
*	SimplifyCFG: Refactor GatherConstantCompares() result in a struct	Mehdi Amini	2014-11-20	1	-133/+152
\| \| \| \| \| \| \| \| \| \|	Code seems cleaner and easier to understand this way. This is basically r222416, after fixes for MSVC's lack of standard support, and a few cleanups (got rid of a warning). Thanks Nakamura Takumi and Nico Weber for the MSVC fixes. llvm-svn: 222472
*	Fix a trip-count overflow issue in LoopUnroll.	Michael Zolotukhin	2014-11-20	1	-2/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Currently LoopUnroll generates a prologue loop before the main loop body to execute the first N%UnrollFactor iterations. This loop is also used if the trip-count can overflow, which is determined by a runtime check. However, we've been mistakenly optimizing this loop to linear code for UnrollFactor = 2, not taking into account that it also serves as a safe version of the loop if its trip-count overflows. llvm-svn: 222451
*	Revert r222416, r222422, r222426: the former revision had problems and ↵	Timur Iskhodzhanov	2014-11-20	1	-150/+136
\| \| \| \| \| \|	fixing them introduced bugs llvm-svn: 222428
*	Fix a typo	Timur Iskhodzhanov	2014-11-20	1	-1/+1
\| \| \| \|	llvm-svn: 222426