bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[DAGCombiner] Improve FMA support for interpolation patterns	Simon Pilgrim	2015-09-21	1	-0/+89
\| \| \| \| \| \| \| \| \| \|	This patch adds support for combining patterns such as (FMUL(FADD(1.0, x), y)) and (FMUL(FSUB(x, 1.0), y)) to their FMA equivalents. This is useful in particular for linear interpolation cases such as (FADD(FMUL(x, t), FMUL(y, FSUB(1.0, t)))) Differential Revision: http://reviews.llvm.org/D13003 llvm-svn: 248210
*	[ARM] Do not scale vext with a factor	Jeroen Ketema	2015-09-21	1	-9/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	The vext pseudo-instruction takes the number of elements that need to be extracted, not the number of bytes. Hence, use the number of elements directly instead of scaling them with a factor. Reviewers: Silviu Baranga, James Molloy (not reflected in the differential revision) Differential Revision: http://reviews.llvm.org/D12974 llvm-svn: 248208
*	[DAGCombiner] Tidy up FMA combine helpers. NFCI.	Simon Pilgrim	2015-09-21	1	-25/+21
\| \| \| \| \| \|	Based on feedback for D13003. llvm-svn: 248206
*	[LoopUtils,LV] Propagate fast-math flags on generated FCmp instructions	James Molloy	2015-09-21	2	-2/+11
\| \| \| \| \| \| \| \| \|	We're currently losing any fast-math flags when synthesizing fcmps for min/max reductions. In LV, make sure we copy over the scalar inst's flags. In LoopUtils, we know we only ever match patterns with hasUnsafeAlgebra, so apply that to any synthesized ops. llvm-svn: 248201
*	Remove roundingMode argument in APFloat::mod	Stephen Canon	2015-09-21	4	-7/+6
\| \| \| \| \| \|	Because mod is always exact, this function should have never taken a rounding mode argument. The actual implementation still has issues, which I'll look at resolving in a subsequent patch. llvm-svn: 248195
*	Fix accidentally committed debug printing	Matt Arsenault	2015-09-21	1	-14/+1
\| \| \| \|	llvm-svn: 248190
*	[DivergenceAnalysis] Separated definition of class into header.	Marcello Maggioni	2015-09-21	1	-54/+25
\| \| \| \| \| \| \| \| \| \| \| \|	The definition of the DivergenceAnalysis pass was in a CPP file and wasn't accessible to users of the analysis to get it through "getAnalysis<>()". This patch extracts the definition into a separate header that can be used by users of the analysis to fetch the results. Patch by Volkan Keles (vkeles@apple.com) llvm-svn: 248186
*	SelectionDAG: Use InsertNode for EntryNode	Matthias Braun	2015-09-21	1	-2/+2
\| \| \| \| \| \|	This fixes problems where two nodes have persistent debug id 0 assigned. llvm-svn: 248182
*	[FunctionAttrs] Extract a helper function for the core logic used to	Chandler Carruth	2015-09-21	1	-90/+117
\| \| \| \| \| \| \| \|	evaluate whether 'readonly' or 'readnone' apply to a given function. This both reduces indentation and will make it easy to share the logic with a new pass manager implementation. llvm-svn: 248181
*	[SystemZ] Fix expansion of ISD::FPOW and ISD::FSINCOS	Ulrich Weigand	2015-09-21	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	The ISD::FPOW and ISD::FSINCOS opcodes default to Legal, but there is no legal instruction for those on SystemZ. This could cause LLVM internal errors. Fixed by setting the operation action to Expand for those opcodes. Also added test cases for all other LLVM IR intrinsics that should generate a library call. (Those already work correctly since the default operation action is fine.) llvm-svn: 248180
*	Revert "[ARM] Handle +t2dsp feature as an ArchExtKind in ARMTargetParser.def"	James Molloy	2015-09-21	2	-5/+2
\| \| \| \| \| \| \| \|	This was committed without the code review (http://reviews.llvm.org/D12937) being approved. This reverts commit r248152. llvm-svn: 248174
*	AMDGPU: Move copy handling under switch like other instructions	Matt Arsenault	2015-09-21	1	-5/+10
\| \| \| \|	llvm-svn: 248172
*	add ShouldChangeType() variant that takes bitwidths	Sanjay Patel	2015-09-21	2	-6/+16
\| \| \| \| \| \|	This is more efficient for cases like D12965 where we already have widths. llvm-svn: 248170
*	DAGCombiner: Replace store of FP constant after attemping store merges	Matt Arsenault	2015-09-21	1	-10/+10
\| \| \| \| \| \| \| \| \|	If storing multiple FP constants, some subset of the stores would be replaced with integers due to visit order, so MergeConsecutiveStores would only partially merge these. llvm-svn: 248169
*	Factor replacement of stores of FP constants into new function	Matt Arsenault	2015-09-21	1	-72/+104
\| \| \| \|	llvm-svn: 248168
*	don't repeat function names in comments; NFC	Sanjay Patel	2015-09-21	1	-62/+57
\| \| \| \|	llvm-svn: 248166
*	[Machine Combiner] Refactor machine reassociation code to be target-independent.	Chad Rosier	2015-09-21	7	-511/+237
\| \| \| \| \| \| \| \| \| \|	No functional change intended. Patch by Haicheng Wu <haicheng@codeaurora.org>! http://reviews.llvm.org/D12887 PR24522 llvm-svn: 248164
*	[ARM] Handle +t2dsp feature as an ArchExtKind in ARMTargetParser.def	Artyom Skrobov	2015-09-21	2	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, the availability of DSP instructions (ACLE 6.4.7) is handled in a hand-rolled tricky condition block in tools/clang/lib/Basic/Targets.cpp, with a FIXME: attached. This patch changes the handling of +t2dsp to be in line with other architecture extensions. Following review comments, also updating the description of FeatureDSPThumb2 in ARM.td. Differential Revision: http://reviews.llvm.org/D12937 llvm-svn: 248152
*	[X86][AVX512] add masked version for RSQRT14 & RCP14 Scalar FP	Asaf Badouh	2015-09-21	4	-46/+39
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D12524 llvm-svn: 248147
*	[mips] Allow constant expressions in second argument of .cpsetup.	Daniel Sanders	2015-09-21	2	-8/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Also tightened up the test and made a trivial fix to prevent double-newline after emitting .cpsetup directives. Reviewers: vkalintiris Subscribers: seanbruno, emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D12956 llvm-svn: 248143
*	Use makeArrayRef or None to avoid unnecessarily mentioning the ArrayRef type ↵	Craig Topper	2015-09-21	5	-18/+18
\| \| \| \| \| \|	extra times. NFC llvm-svn: 248140
*	Don't pass StringRefs around by const reference. Pass by value instead per ↵	Craig Topper	2015-09-21	2	-6/+6
\| \| \| \| \| \|	coding standards. NFC llvm-svn: 248136
*	Cleanup places that passed SMLoc by const reference to pass it by value ↵	Craig Topper	2015-09-20	9	-20/+15
\| \| \| \| \| \|	instead. NFC llvm-svn: 248135
*	[IndVars] Use C++11 style field initialization; NFCI.	Sanjoy Das	2015-09-20	1	-14/+7
\| \| \| \|	llvm-svn: 248131
*	[IndVars] Don't add a level of indentation for namespace {. NFC.	Sanjoy Das	2015-09-20	1	-77/+77
\| \| \| \| \| \|	Whitespace-only change. llvm-svn: 248130
*	AVX512: Implemented encoding and intrinsics for vcmpss/sd.	Igor Breger	2015-09-20	4	-42/+129
\| \| \| \| \| \| \| \|	Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D12593 llvm-svn: 248121
*	[X86][AVX512] extend support in Scalar conversion	Asaf Badouh	2015-09-20	3	-127/+231
\| \| \| \| \| \| \| \| \| \|	add scalar FP to Int conversion with truncation intrinsics add scalar conversion FP32 from/to FP64 intrinsics add rounding mode and SAE mode encoding for these intrinsics Differential Revision: http://reviews.llvm.org/D12665 llvm-svn: 248117
*	AVX512: vsqrtss/sd encoding and intrinsics implementation.	Igor Breger	2015-09-20	3	-93/+63
\| \| \| \| \| \| \| \|	Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D12102 llvm-svn: 248116
*	[X86][AVX512DQ] Add fpclass instruction	Asaf Badouh	2015-09-20	5	-2/+119
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D12931 llvm-svn: 248115
*	[X86] Fix sitofp and uitofp instruction matching failures with long double ↵	Michael Kuperstein	2015-09-20	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \|	and avx512 The operation action for i32 and i64 cannot be set to legal, as long double needs custom lowering. Patch by: mitch.l.bodart@intel.com Differential Revision: http://reviews.llvm.org/D12372 llvm-svn: 248114
*	AVX512: Implemented intrinsics for vshuff32x4, vshuff64x2, vshufi64x2, ↵	Igor Breger	2015-09-20	1	-0/+16
\| \| \| \| \| \| \| \| \| \|	vshufi32x4 Added tests for intrinsics. Differential Revision: http://reviews.llvm.org/D12525 llvm-svn: 248113
*	[IndVars] Don't repeat function names in comment; NFC.	Sanjoy Das	2015-09-20	1	-65/+62
\| \| \| \| \| \|	Only changes comments. llvm-svn: 248112
*	AVX512: Implement instructions encoding, lowering and intrinsics	Igor Breger	2015-09-20	4	-65/+137
\| \| \| \| \| \| \| \| \|	vinserti64x4, vinserti64x2, vinserti32x8, vinserti32x4, vinsertf64x4, vinsertf64x2, vinsertf32x8, vinsertf32x4 Added tests for encoding, lowering and intrinsics. Differential Revision: http://reviews.llvm.org/D11893 llvm-svn: 248111
*	ARM: cleanup formatting	Saleem Abdulrasool	2015-09-20	1	-2/+2
\| \| \| \| \| \|	clang-format a line which was poorly formatted. NFC. llvm-svn: 248110
*	[IndVars] Fix a bug in r248045.	Sanjoy Das	2015-09-20	1	-14/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Because -indvars widens induction variables through arithmetic, `NeverNegative` cannot be a property of the `WidenIV` (a `WidenIV` manages information for all transitive uses of an IV being widened, including uses of `-1 * IV`). Instead it must live on `NarrowIVDefUse` which manages information for a specific def-use edge in the transitive use list of an induction variable. This change also adds a test case that demonstrates the problem with r248045. llvm-svn: 248107
*	[X86][SSE] Vectorize CTTZ + CTTZ_ZERO_UNDEF	Simon Pilgrim	2015-09-19	1	-4/+57
\| \| \| \| \| \| \| \| \| \|	Now that we have fast vector CTPOP implementations we can use this to speed up vector CTTZ using the pattern (cttz(x) = ctpop((x & -x) - 1)) Additionally, for AVX512CD that provides lzcnt instructions we can use the pattern (cttz_undef(x) = (width - 1) - ctlz(x & -x)) Differential Revision: http://reviews.llvm.org/D12663 llvm-svn: 248091
*	[InstCombine] Use SimplifyDemandedVectorEltsLow helper function. NFCI.	Simon Pilgrim	2015-09-19	1	-17/+8
\| \| \| \| \| \|	Use the SimplifyDemandedVectorEltsLow helper function introduced in D12680. llvm-svn: 248089
*	AMDGPU: Remove dead code	Matt Arsenault	2015-09-19	5	-18/+2
\| \| \| \| \| \| \|	getCFGStructurizerRegClass is not used for SI, so move it into R600 specific stuff. llvm-svn: 248087
*	NFC: Fix indentation and add braces to clarify nested of else-statement.	Bob Wilson	2015-09-19	1	-2/+3
\| \| \| \|	llvm-svn: 248086
*	[PrologEpilogInserter] Minor refactoring.	Maksim Panchenko	2015-09-19	1	-1/+1
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D12924 llvm-svn: 248084
*	Test commit. Fix comment. NFC.	Maksim Panchenko	2015-09-19	1	-1/+1
\| \| \| \|	llvm-svn: 248082
*	[InstCombine] FoldICmpCstShrCst failed for ashr when comparing against -1	David Majnemer	2015-09-19	1	-1/+1
\| \| \| \| \| \| \| \| \|	(icmp eq (ashr C1, %V) -1) may have multiple answers if C1 is not a power of two and has the sign bit set. This fixes PR24873. llvm-svn: 248074
*	[InstCombine] FoldICmpCstShrCst didn't handle icmps of -1 in the ashr case ↵	David Majnemer	2015-09-19	1	-6/+10
\| \| \| \| \| \|	correctly llvm-svn: 248073
*	[IndVars] Widen more comparisons for non-negative induction vars	Sanjoy Das	2015-09-18	1	-3/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If an induction variable is provably non-negative, its sign extension is equal to its zero extension. This means narrow uses like icmp slt iNarrow %indvar, %rhs can be widened into icmp slt iWide zext(%indvar), sext(%rhs) Reviewers: atrick, mcrosier, hfinkel Subscribers: hfinkel, reames, llvm-commits Differential Revision: http://reviews.llvm.org/D12745 llvm-svn: 248045
*	Update edge weights properly when merging blocks in if-conversion.	Cong Hou	2015-09-18	1	-6/+70
\| \| \| \| \| \| \| \|	In if-conversion, there is a utility function MergeBlocks() that is used to merge blocks. However, when new edges are built in this function the edge weight is either not provided or not updated properly, leading to a modified CFG with incorrect edge weights. This patch corrects this issue. Differential Revision: http://reviews.llvm.org/D12513 llvm-svn: 248030
*	Limit the range of processors supported by ARM fast isel to v6 or	Eric Christopher	2015-09-18	1	-0/+4
\| \| \| \| \| \| \| \|	later as that's all that is tested right now. Fixes PR24858. llvm-svn: 248027
*	Clean up: Refactoring the hardcoded value of 6 for ↵	Larisse Voufo	2015-09-18	4	-8/+23
\| \| \| \| \| \|	FindAvailableLoadedValue()'s parameter MaxInstsToScan. (Complete version of r247497. See D12886) llvm-svn: 248022
*	Make MachineScheduler debug output less confusing.	James Y Knight	2015-09-18	2	-6/+31
\| \| \| \| \| \|	At least...a little bit. llvm-svn: 248020
*	Scaling up values in ARMBaseInstrInfo::isProfitableToIfCvt() before they are ↵	Cong Hou	2015-09-18	1	-10/+17
\| \| \| \| \| \| \| \| \| \|	scaled by a probability to avoid precision issue. In ARMBaseInstrInfo::isProfitableToIfCvt(), there is a simple cost model in which the number of cycles is scaled by a probability to estimate the cost. However, when the number of cycles is small (which is usually the case), there is a precision issue after the computation. To avoid this issue, this patch scales those cycles by 1024 (chosen to make the multiplication a litter faster) before they are scaled by the probability. Other variables are also scaled up for the final comparison. Differential Revision: http://reviews.llvm.org/D12742 llvm-svn: 248018
*	SelectionDAGDumper: Leave out the <multiple use> markers	Matthias Braun	2015-09-18	1	-3/+0
\| \| \| \| \| \| \| \| \|	They mostly clutter the output while it is still possible to see which node has multiple users without them. Differential Revision: http://reviews.llvm.org/D12569 llvm-svn: 248013