bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[LoopUtils,LV] Propagate fast-math flags on generated FCmp instructions	James Molloy	2015-09-21	2	-2/+11
\| \| \| \| \| \| \| \| \|	We're currently losing any fast-math flags when synthesizing fcmps for min/max reductions. In LV, make sure we copy over the scalar inst's flags. In LoopUtils, we know we only ever match patterns with hasUnsafeAlgebra, so apply that to any synthesized ops. llvm-svn: 248201
*	[FunctionAttrs] Extract a helper function for the core logic used to	Chandler Carruth	2015-09-21	1	-90/+117
\| \| \| \| \| \| \| \|	evaluate whether 'readonly' or 'readnone' apply to a given function. This both reduces indentation and will make it easy to share the logic with a new pass manager implementation. llvm-svn: 248181
*	add ShouldChangeType() variant that takes bitwidths	Sanjay Patel	2015-09-21	2	-6/+16
\| \| \| \| \| \|	This is more efficient for cases like D12965 where we already have widths. llvm-svn: 248170
*	don't repeat function names in comments; NFC	Sanjay Patel	2015-09-21	1	-62/+57
\| \| \| \|	llvm-svn: 248166
*	[IndVars] Use C++11 style field initialization; NFCI.	Sanjoy Das	2015-09-20	1	-14/+7
\| \| \| \|	llvm-svn: 248131
*	[IndVars] Don't add a level of indentation for namespace {. NFC.	Sanjoy Das	2015-09-20	1	-77/+77
\| \| \| \| \| \|	Whitespace-only change. llvm-svn: 248130
*	[IndVars] Don't repeat function names in comment; NFC.	Sanjoy Das	2015-09-20	1	-65/+62
\| \| \| \| \| \|	Only changes comments. llvm-svn: 248112
*	[IndVars] Fix a bug in r248045.	Sanjoy Das	2015-09-20	1	-14/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Because -indvars widens induction variables through arithmetic, `NeverNegative` cannot be a property of the `WidenIV` (a `WidenIV` manages information for all transitive uses of an IV being widened, including uses of `-1 * IV`). Instead it must live on `NarrowIVDefUse` which manages information for a specific def-use edge in the transitive use list of an induction variable. This change also adds a test case that demonstrates the problem with r248045. llvm-svn: 248107
*	[InstCombine] Use SimplifyDemandedVectorEltsLow helper function. NFCI.	Simon Pilgrim	2015-09-19	1	-17/+8
\| \| \| \| \| \|	Use the SimplifyDemandedVectorEltsLow helper function introduced in D12680. llvm-svn: 248089
*	[InstCombine] FoldICmpCstShrCst failed for ashr when comparing against -1	David Majnemer	2015-09-19	1	-1/+1
\| \| \| \| \| \| \| \| \|	(icmp eq (ashr C1, %V) -1) may have multiple answers if C1 is not a power of two and has the sign bit set. This fixes PR24873. llvm-svn: 248074
*	[InstCombine] FoldICmpCstShrCst didn't handle icmps of -1 in the ashr case ↵	David Majnemer	2015-09-19	1	-6/+10
\| \| \| \| \| \|	correctly llvm-svn: 248073
*	[IndVars] Widen more comparisons for non-negative induction vars	Sanjoy Das	2015-09-18	1	-3/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If an induction variable is provably non-negative, its sign extension is equal to its zero extension. This means narrow uses like icmp slt iNarrow %indvar, %rhs can be widened into icmp slt iWide zext(%indvar), sext(%rhs) Reviewers: atrick, mcrosier, hfinkel Subscribers: hfinkel, reames, llvm-commits Differential Revision: http://reviews.llvm.org/D12745 llvm-svn: 248045
*	Clean up: Refactoring the hardcoded value of 6 for ↵	Larisse Voufo	2015-09-18	2	-6/+8
\| \| \| \| \| \|	FindAvailableLoadedValue()'s parameter MaxInstsToScan. (Complete version of r247497. See D12886) llvm-svn: 248022
*	gvn small fix	Piotr Padlewski	2015-09-17	1	-3/+1
\| \| \| \| \| \|	http://reviews.llvm.org/D12928 llvm-svn: 247935
*	[InstCombine] Added vector demanded bits support for SSE4A EXTRQ/INSERTQ ↵	Simon Pilgrim	2015-09-17	2	-1/+83
\| \| \| \| \| \| \| \| \| \|	instructions The SSE4A instructions EXTRQ/INSERTQ only use the lower 64-bits (or less) for many of their input vector operands and all of them have undefined upper 64-bits results. Differential Revision: http://reviews.llvm.org/D12680 llvm-svn: 247934
*	[InstCombine] Optimize icmp slt signum(x), 1 --> icmp slt x, 1	Sanjoy Das	2015-09-16	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: `signum(x)` is sometimes implemented as `(x >> 63) \| (-x >>> 63)` (for an `i64` `x`). This change adds a matcher for that pattern, and an instcombine rule to optimize `signum(x) s< 1`. Later, we can also consider optimizing: icmp slt signum(x), 0 --> icmp slt x, 0 icmp sle signum(x), 1 --> true etc. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12703 llvm-svn: 247846
*	don't repeat function names in comments; NFC	Sanjay Patel	2015-09-16	1	-29/+24
\| \| \| \|	llvm-svn: 247813
*	[sanitizer] Add MSan support for AArch64	Adhemerval Zanella	2015-09-16	1	-0/+34
\| \| \| \| \| \| \| \| \|	This patch adds support for msan on aarch64-linux for both 39 and 42-bit VMA. The support is enabled by defining the SANITIZER_AARCH64_VMA compiler flag to either 39 or 42 at build time for both clang/llvm and compiler-rt. The default VMA is 39 bits. llvm-svn: 247807
*	Test commit: Fixed a few typos in the comments.	David L Kreitzer	2015-09-16	1	-6/+6
\| \| \| \|	llvm-svn: 247793
*	[Unroll] Fix a bug in UnrolledInstAnalyzer::visitLoad.	Michael Zolotukhin	2015-09-16	1	-1/+1
\| \| \| \| \| \| \| \|	We only checked that a global is initialized with constants, which is incorrect. We should be checking that GlobalVariable is a constant, not just initialized with it. llvm-svn: 247769
*	[IndVars] Fix PR24783.	Sanjoy Das	2015-09-15	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In `IndVarSimplify::ExpandSCEVIfNeeded`, `SCEVExpander::findExistingExpansion` may return an `llvm::Value` that differs in type from the SCEV it was asked to find an expansion for (but computes the same value). In such cases, we fall back on `expandCodeFor`; and rely on LLVM to CSE the two equivalent expressions (different only by a no-op cast) into a single computation. I tried a few other approaches to fixing PR24783, all of which turned out to be more complex than this current version: 1. Move the `ExpandSCEVIfNeeded` logic into `expandCodeFor`. This got problematic because currently we do not pass in the `Loop *` into `expandCodeFor`. Changing the interface to do this is a more invasive change, and really does not make much semantic sense unless the SCEV being passed in is an add recurrence. There is also the problem of `expandCodeFor` being used in places other than `indvars` -- there may be performance / correctness issues elsewhere if `expandCodeFor` is moved from always generating IR from scratch to cache-like model. 2. Have `findExistingExpansion` only return expression with the correct type. This would make `isHighCostExpansionHelper` and thus `isHighCostExpansion` more conservative than necessary. 3. Insert casts on the value returned by `findExistingExpansion` if needed using `InsertNoopCastOfTo`. This is complicated because `InsertNoopCastOfTo` depends on internal state of its `SCEVExpander` (specifically `Builder.GetInserPoint()`), and this may not be set up when `ExpandSCEVIfNeeded` is called. 4. Manually insert casts on the value returned by `findExistingExpansion` if needed using `InsertNoopCastOfTo` via `CastInst::Create`. This is probably workable, but figuring out the location where the cast instruction needs to be inserted has enough edge cases (arguments, constants, invokes, LCSSA must be preserved) makes me feel what I have right now is simplest solution. llvm-svn: 247749
*	[IndVars] Rename variable; NFC.	Sanjoy Das	2015-09-15	1	-2/+2
\| \| \| \|	llvm-svn: 247748
*	[ASan] Don't instrument globals in .preinit_array/.init_array/.fini_array	Alexey Samsonov	2015-09-15	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \|	These sections contain pointers to function that should be invoked during startup/shutdown by __libc_csu_init and __libc_csu_fini. Instrumenting these globals will append redzone to them, which will be filled with zeroes. This will cause null pointer dereference at runtime. Merge ASan regression tests for globals that should be ignored by instrumentation pass. llvm-svn: 247734
*	Revert "Clean up: Refactoring the hardcoded value of 6 for ↵	Larisse Voufo	2015-09-15	2	-8/+6
\| \| \| \| \| \|	FindAvailableLoadedValue()'s parameter MaxInstsToScan." for preliminary community discussion (See. D12886) llvm-svn: 247716
*	Broaden optimization of fcmp ([us]itofp x, constant) by instcombine.	Arch D. Robison	2015-09-15	1	-14/+23
\| \| \| \| \| \| \| \| \| \|	The patch extends the optimization to cases where the constant's magnitude is so small or large that the rounding of the conversion is irrelevant. The "so small" case includes negative zero. Differential review: http://reviews.llvm.org/D11210 llvm-svn: 247708
*	[CorrelatedValuePropagation] Infer nonnull attributes	Igor Laevsky	2015-09-15	1	-0/+31
\| \| \| \| \| \| \| \| \|	LazuValueInfo can prove that value is nonnull based on the context information. Make use of this ability to infer nonnull attributes for the call arguments. Differential Revision: http://reviews.llvm.org/D12836 llvm-svn: 247707
*	[NaryReassociate] Add support for Mul instructions	Marcello Maggioni	2015-09-15	1	-24/+77
\| \| \| \| \| \| \| \| \|	This patch extends the current pass by handling Mul instructions as well. Patch by: Volkan Keles (vkeles@apple.com) llvm-svn: 247705
*	more space; NFC	Sanjay Patel	2015-09-15	1	-0/+1
\| \| \| \|	llvm-svn: 247699
*	[GlobalsAA] Disable globals-aa by default	James Molloy	2015-09-15	1	-1/+1
\| \| \| \| \| \|	Several issues have been found with it - disabling in the meantime. llvm-svn: 247674
*	[PlaceSafepoints] Make the width of a counted loop settable.	Sanjoy Das	2015-09-15	1	-18/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change lets a `PlaceSafepoints` client change how wide the trip count of a loop has to be for the loop to be considerd "counted", via `CountedLoopTripWidth`. It also removes the boolean `SkipCounted` flag and the `upperTripBound` constant -- we can get the old behavior of `SkipCounted` == `false` by setting `CountedLoopTripWidth` to `13` (2 ^ 13 == 8192). Reviewers: reames Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D12789 llvm-svn: 247656
*	[opaque pointer types] Switch a few cases of getElementType over, since I ↵	David Blaikie	2015-09-14	3	-19/+16
\| \| \| \| \| \|	had them lying around anyway llvm-svn: 247610
*	[InstCombineCalls] Use isKnownNonNullAt() to check nullness of passing ↵	Chen Li	2015-09-14	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	arguments at callsite Summary: This patch replaces isKnownNonNull() with isKnownNonNullAt() when checking nullness of passing arguments at callsite. In this way it can handle cases where the argument does not have nonnull attribute but has a dominating null check from the CFG. It also adds assertions in isKnownNonNull() and isKnownNonNullFromDominatingCondition() to make sure the value checked is pointer type (as defined in LLVM document). These assertions might trip failures in things which are not covered under llvm/test, but fixes should be pretty obvious. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12779 llvm-svn: 247587
*	Revert "[opaque pointer type] Pass GlobalAlias the actual pointer type ↵	David Blaikie	2015-09-14	3	-12/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	rather than decomposing it into pointee type + address space" This was a flawed change - it just caused the getElementType call to be deferred until later, when we really need to remove it. Now that the IR for GlobalAliases has been updated, the root cause is addressed that way instead and this change is no longer needed (and in fact gets in the way - because we want to pass the pointee type directly down further). Follow up patches to push this through GlobalValue, bitcode format, etc, will come along soon. This reverts commit 236160. llvm-svn: 247585
*	[MergeFuncs] Fix bug in merging GetElementPointers	JF Bastien	2015-09-14	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	GetElementPointers must have the first argument's type compared for structural equivalence. Previously the code erroneously compared the pointer's type, but this code was dead because all pointer types (of the same address space) are the same. The pointee must be compared instead (using the type stored in the GEP, not from the pointer type which will be erased anyway). Author: jrkoenig Reviewers: dschuff, nlewycky, jfb Subscribers: nlewycky, llvm-commits Differential revision: http://reviews.llvm.org/D12820 llvm-svn: 247570
*	[FunctionAttrs] Move the malloc-like test to a static helper function	Chandler Carruth	2015-09-13	1	-4/+3
\| \| \| \| \| \| \|	that could be used from a new pass manager. This one makes particular sense as a static helper as it doesn't even need TLI. llvm-svn: 247525
*	[FunctionAttrs] Factor the logic to test for a known non-null return out	Chandler Carruth	2015-09-13	1	-7/+10
\| \| \| \| \| \| \| \| \| \|	of a method and into a re-usable static helper. We can potentially use this function from the implementation of a new pass manager oriented version of the pass. Also add some better documentation of exactly what the semantic model of this routine is (it isn't trivial) and use a more modern naming convention for it. llvm-svn: 247524
*	[FunctionAttrs] Make the per-function attribute inference a boring	Chandler Carruth	2015-09-13	1	-4/+3
\| \| \| \| \| \| \| \| \|	static function rather than a method. It just needed access to TargetLibraryInfo, and this way it can be easily reused between the current FunctionAttrs implementation and any port for the new pass manager. llvm-svn: 247522
*	[FunctionAttrs] Collect utility functions as static helpers rather than	Chandler Carruth	2015-09-13	1	-57/+52
\| \| \| \| \| \| \| \|	methods. They don't need anything from the class anyways. Also, collect the declarations into the private section of the pass. llvm-svn: 247521
*	Clean up doxygen comments in FunctionAttrs, promoting some non-doxygen	Chandler Carruth	2015-09-13	1	-34/+21
\| \| \| \| \| \| \| \|	comments, deleting duplicate comments, moving comments to consistently live on the definition since these are all really internal routines, etc. NFC. llvm-svn: 247520
*	Do some spring cleaning on FunctionAttrs.cpp with clang-format prior to	Chandler Carruth	2015-09-13	1	-340/+299
\| \| \| \| \| \|	other refactorings and cleanups here. llvm-svn: 247519
*	Fixed unused variable warning.	Simon Pilgrim	2015-09-12	1	-2/+2
\| \| \| \|	llvm-svn: 247505
*	[InstCombine] CVTPH2PS Vector Demanded Elements + Constant Folding	Simon Pilgrim	2015-09-12	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \|	Improved InstCombine support for CVTPH2PS (F16C half 2 float conversion): <4 x float> @llvm.x86.vcvtph2ps.128(<8 x i16>) - only uses the bottom 4 i16 elements for the conversion. Added constant folding support. Differential Revision: http://reviews.llvm.org/D12731 llvm-svn: 247504
*	[PM] Port SROA to the new pass manager.	Chandler Carruth	2015-09-12	2	-408/+343
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In some ways this is a very boring port to the new pass manager as there are no interesting analyses or dependencies or other oddities. However, this does introduce the first good example of a transformation pass with non-trivial state porting to the new pass manager. I've tried to carve out patterns here to replicate elsewhere, and would appreciate comments on whether folks like these patterns: - A common need in the new pass manager is to effectively lift the pass class and some of its state into a public header file. Prior to this, LLVM used anonymous namespaces to provide "module private" types and utilities, but that doesn't scale to cases where a public header file is needed and the new pass manager will exacerbate that. The pattern I've adopted here is to use the namespace-cased-name of the core pass (what would be a module if we had them) as a module-private namespace. Then utility and other code can be declared and defined in this namespace. At some point in the future, we could even have (conditionally compiled) code that used modules features when available to do the same basic thing. - I've split the actual pass run method in two in order to expose a private method usable by the old pass manager to wrap the new class with a minimum of duplicated code. I actually looked at a bunch of ways to automate or generate these, but they are all quite terrible IMO. The fundamental need is to extract the set of analyses which need to cross this interface boundary, and that will end up being too unpredictable to effectively encapsulate IMO. This is also a relatively small amount of boiler plate that will live a relatively short time, so I'm not too worried about the fact that it is boiler plate. The rest of the patch is totally boring but results in a massive diff (sorry). It just moves code around and removes or adds qualifiers to reflect the new name and nesting structure. Differential Revision: http://reviews.llvm.org/D12773 llvm-svn: 247501
*	Clean up: Refactoring the hardcoded value of 6 for ↵	Larisse Voufo	2015-09-12	2	-6/+8
\| \| \| \| \| \|	FindAvailableLoadedValue()'s parameter MaxInstsToScan. llvm-svn: 247497
*	Fix typos.	Bruce Mitchener	2015-09-12	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	Summary: This fixes a variety of typos in docs, code and headers. Subscribers: jholewinski, sanjoy, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D12626 llvm-svn: 247495
*	typo; NFC	Sanjay Patel	2015-09-11	1	-1/+1
\| \| \| \|	llvm-svn: 247454
*	Revert "[InstCombineCalls] Use isKnownNonNullAt() to check nullness of ↵	Mehdi Amini	2015-09-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	passing arguments at callsite" This reverts commit r247356. Breaks test/Transforms/InstCombine/pr8547.ll with: Wrong types for attribute: byval inalloca nest noalias nocapture nonnull readnone readonly sret dereferenceable(1) dereferenceable_or_null(1) %call = call i32 (i8, ...) @printf(i8 getelementptr inbounds ([10 x i8], [10 x i8]* @.str, i64 0, i64 0), i32 nonnull %conv2) #0 LLVM ERROR: Broken function found, compilation aborted! From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 247371
*	[InstCombineCalls] Use isKnownNonNullAt() to check nullness of passing ↵	Chen Li	2015-09-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	arguments at callsite Summary: This patch replaces isKnownNonNull() with isKnownNonNullAt() when checking nullness of passing arguments at callsite. In this way it can handle cases where the argument does not have nonnull attribute but has a dominating null check from the CFG. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12779 llvm-svn: 247356
*	[InstCombineCalls] Use isKnownNonNullAt() to check nullness of gc.relocate ↵	Chen Li	2015-09-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	return value Summary: This patch replaces isKnownNonNull() with isKnownNonNullAt() when checking nullness of gc.relocate return value. In this way it can handle cases where the relocated value does not have nonnull attribute but has a dominating null check from the CFG. Reviewers: reames Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D12772 llvm-svn: 247353
*	Remove gcc warning when comparing an unsigned var for >= 0	Filipe Cabecinhas	2015-09-10	1	-1/+1
\| \| \| \|	llvm-svn: 247352