bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Remove a variable from r206192 that is only used in an assert.	Kaelyn Takata	2014-04-14	1	-2/+2
\| \| \| \|	llvm-svn: 206195
*	Fix a bug in which BranchProbabilityInfo wasn't setting branch weights of ↵	Akira Hatanaka	2014-04-14	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	basic blocks inside loops correctly. Previously, BranchProbabilityInfo::calcLoopBranchHeuristics would determine the weights of basic blocks inside loops even when it didn't have enough information to estimate the branch probabilities correctly. This patch fixes the function to exit early if it doesn't see any exit edges or back edges and let the later heuristics determine the weights. This fixes PR18705 and <rdar://problem/15991090>. Differential Revision: http://reviews.llvm.org/D3363 llvm-svn: 206194
*	Fix up MCFixup::getAccessVariant to handle unary expressions.	Kaelyn Takata	2014-04-14	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This allows correct relocations to be generated for a symbolic address that is being adjusted by a negative constant. Since r204294, such expressions have triggered undefined behavior when LLVM was built without assertions. Credit goes to Rafael for this patch; I'm submitting it on his behalf as he is on vacation this week. llvm-svn: 206192
*	[mips] Fix fcopysign for MIPS-IV and add the test.	Daniel Sanders	2014-04-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This was another incorrect use of hasMips64() vs isGP64bit(). Depends on D3344 Reviewers: matheusalmeida, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3347 llvm-svn: 206187
*	[mips] Fix more incorrect uses of HasMips64 and isMips64()	Daniel Sanders	2014-04-14	4	-13/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: - Conditional moves acting on 64-bit GPR's should require MIPS-IV rather than MIPS64 - ISD::MUL, and ISD::MULH[US] should be lowered on all 64-bit ISA's Patch by David Chisnall His work was sponsored by: DARPA, AFRL I've added additional testcases to cover as much of the codegen changes affecting MIPS-IV as I can. Where I've been unable to find an existing MIPS64 testcase that can be re-used for MIPS-IV (mainly tests covering ISD::GlobalAddress and similar), I at least agree that MIPS-IV should behave like MIPS64. Further testcases that are fixed by this patch will follow in my next commit. The testcases from that commit that fail for MIPS-IV without this patch are: LLVM :: CodeGen/Mips/2010-07-20-Switch.ll LLVM :: CodeGen/Mips/cmov.ll LLVM :: CodeGen/Mips/eh-dwarf-cfa.ll LLVM :: CodeGen/Mips/largeimmprinting.ll LLVM :: CodeGen/Mips/longbranch.ll LLVM :: CodeGen/Mips/mips64-f128.ll LLVM :: CodeGen/Mips/mips64directive.ll LLVM :: CodeGen/Mips/mips64ext.ll LLVM :: CodeGen/Mips/mips64fpldst.ll LLVM :: CodeGen/Mips/mips64intldst.ll LLVM :: CodeGen/Mips/mips64load-store-left-right.ll LLVM :: CodeGen/Mips/sint-fp-store_pattern.ll Reviewers: dsanders Reviewed By: dsanders CC: matheusalmeida Differential Revision: http://reviews.llvm.org/D3343 llvm-svn: 206183
*	Teach llvm-lto to respect the given RelocModel.	James Molloy	2014-04-14	1	-1/+5
\| \| \| \| \| \|	Patch by Nick Tomlinson! llvm-svn: 206177
*	ARM64: remove buggy REV16 pattern.	Tim Northover	2014-04-14	1	-2/+1
\| \| \| \| \| \|	The 32-bit pattern is still valid: 0123 -> 3210 -> 1032. llvm-svn: 206172
*	AArch64/ARM64: enable directcond.ll test on ARM64.	Tim Northover	2014-04-14	1	-0/+2
\| \| \| \| \| \| \|	Code change is because optimizeCompareInstr didn't know how to pull the condition code out of FCSEL instructions. llvm-svn: 206171
*	ARM64: add patterns for csXYZ with reversed operands.	Tim Northover	2014-04-14	1	-0/+13
\| \| \| \| \| \| \|	AArch64 tests for this, and it's obviously a good idea. Have to invert the condition code, of course. llvm-svn: 206170
*	ARM64: add support for AArch64's addsub_ext.ll	Tim Northover	2014-04-14	1	-1/+1
\| \| \| \| \| \| \| \| \|	There was one definite issue in ARM64 (the off-by-1 check for whether a shift could be folded in) and one difference that is probably correct: ARM64 didn't fold nodes with multiple uses into the arithmetic operations unless optimising for code size. llvm-svn: 206168
*	ARM64: optimise (cmp x, (sub 0, y)) to (cmn x, y).	Tim Northover	2014-04-14	1	-11/+30
\| \| \| \| \| \| \|	This transformation is only valid when being used for an EQ or NE comparison since the flags change otherwise. llvm-svn: 206167
*	[XCore] Don't create invalid MKMSK instructions inside loadImmediate().	Richard Osborne	2014-04-14	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously loadImmediate() would produce MKMSK instructions with invalid immediate values such as mkmsk r0, 9. Fix this by checking the mask size is valid. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3289 llvm-svn: 206163
*	Whitespace.	NAKAMURA Takumi	2014-04-14	1	-1/+0
\| \| \| \|	llvm-svn: 206154
*	Revert r206045, "Fix shift by constants for vector."	NAKAMURA Takumi	2014-04-14	2	-23/+13
\| \| \| \| \| \|	It broke some builders, at least, i686. llvm-svn: 206153
*	[Allocator] Hoist the external helper function into a namespace scope	Chandler Carruth	2014-04-14	1	-0/+4
\| \| \| \| \| \| \|	declaration. GCC 4.7 appears to get hopelessly confused by declaring this function within a member function of a class template. Go figure. llvm-svn: 206152
*	Don't assert in BasicTTI::getMemoryOpCost for non-simple types	Hal Finkel	2014-04-14	1	-6/+8
\| \| \| \| \| \| \| \|	BasicTTI::getMemoryOpCost must explicitly check for non-simple types; setting AllowUnknown=true with TLI->getSimpleValueType is not sufficient because, for example, non-power-of-two vector types return non-simple EVTs (not MVT::Other). llvm-svn: 206150
*	[Allocator] Make the underlying allocator a template instead of an	Chandler Carruth	2014-04-14	2	-26/+11
\| \| \| \| \| \| \| \| \| \| \| \|	abstract interface. The only user of this functionality is the JIT memory manager and it is quite happy to have a custom type here. This removes a virtual function call and a lot of unnecessary abstraction from the common case where this is just a very thin vaneer around a call to malloc. Hopefully still no functionality changed here. =] llvm-svn: 206149
*	[Allocator] Switch the BumpPtrAllocator to use a vector of pointers to	Chandler Carruth	2014-04-14	2	-31/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	slabs rather than embedding a singly linked list in the slabs themselves. This has a few advantages: - Better utilization of the slab's memory by not wasting 16-bytes at the front. - Simpler allocation strategy by not having a struct packed at the front. - Avoids paging every allocated slab in just to traverse them for deallocating or dumping stats. The latter is the really nice part. Folks have complained from time to time bitterly that tearing down a BumpPtrAllocator, even if it doesn't run any destructors, pages in all of the memory allocated. Now it won't. =] Also resolves a FIXME with the scaling of the slab sizes. The scaling now disregards specially sized slabs for allocations larger than the threshold. llvm-svn: 206147
*	Use APInt arithmetic, fixed typo. Thanks to Benjamin Kramer for noticing that.	Serge Pavlov	2014-04-14	1	-2/+2
\| \| \| \|	llvm-svn: 206144
*	[C++11] More 'nullptr' conversion. In some cases just using a boolean check ↵	Craig Topper	2014-04-14	93	-1017/+1025
\| \| \| \| \| \|	instead of comparing to nullptr. llvm-svn: 206142
*	[PowerPC] [Constant Hoisting] Enable constant hoisting on PPC	Hal Finkel	2014-04-13	1	-0/+147
\| \| \| \| \| \| \| \| \| \|	Implements the various TTI functions to enable constant hoisting on PPC. The only significant test-suite change is this: MultiSource/Benchmarks/VersaBench/bmm/bmm - 20% speedup (which essentially reverses the slowdown from r206120). llvm-svn: 206141
*	MC: check machine magic when applying offset adjustments	Saleem Abdulrasool	2014-04-13	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The values for the relocation type can (and do) overlap across various architectures. When performing an adjustment of the emitted relocation in the final object file, check that the file magic matches the target for which the relocation type is valid (e.g. a I386 relocation is only applied to an X86 object file, and an AMD64 relocation is only applied to an X86_64 object file). This was noticed while adding support for ARM WinCOFF object file emission. A test case for this is not really possible as the values for REL32 do not overlap on I386 and AMD64, which is why this was never noticed in practice. The ARM WinCOFF emission is not yet ready to merge into the tree. llvm-svn: 206138
*	Recognize test for overflow in integer multiplication.	Serge Pavlov	2014-04-13	1	-0/+240
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If multiplication involves zero-extended arguments and the result is compared as in the patterns: %mul32 = trunc i64 %mul64 to i32 %zext = zext i32 %mul32 to i64 %overflow = icmp ne i64 %mul64, %zext or %overflow = icmp ugt i64 %mul64 , 0xffffffff then the multiplication may be replaced by call to umul.with.overflow. This change fixes PR4917 and PR4918. Differential Revision: http://llvm-reviews.chandlerc.com/D2814 llvm-svn: 206137
*	[PowerPC] Fix rlwimi isel when mask is not constant	Hal Finkel	2014-04-13	1	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We had been using the known-zero values of the operand of the or to construct the mask for an rlwimi; this is not quite correct, but fine when the mask is constant. When the mask is constant, then the known zeros of the operand must be a superset of the zeros in the mask. However, when the mask is not a constant, then there might be bits in the operand that are not known to be zero that, at runtime, might be zero in the mask. Therefore, we check that any bits not known to be zero are known to be one in the mask. Otherwise, we can't fold the mask with the or and shift. This was revealed as a miscompile of MultiSource/Benchmarks/BitBench/drop3/drop3 when I started experimenting with constant hoisting. llvm-svn: 206136
*	Fix instruction debug info location during legalization	David Blaikie	2014-04-13	2	-16/+13
\| \| \| \| \| \| \| \| \| \| \| \|	I found this from a particular GDB test suite case of inlining (something similar is provided as a test case) but came across a few other related cases (other callers of the same functions, and one other instance of the same coding mistake in a separate function). I'm not sure what the best way to test this is (let alone to cover the other cases I discovered), so hopefully this sufficies - open to ideas. llvm-svn: 206130
*	[C++11] More 'nullptr' conversion or in some cases just using a boolean ↵	Craig Topper	2014-04-13	27	-131/+135
\| \| \| \| \| \|	check instead of comparing to nullptr. llvm-svn: 206129
*	[X86] unique_ptr'ify one of X86GenericDisassembler's members.	Lang Hames	2014-04-13	2	-14/+10
\| \| \| \|	llvm-svn: 206127
*	[PowerPC] Implement some additional TLI callbacks	Hal Finkel	2014-04-12	2	-0/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add implementations of: bool isLegalICmpImmediate(int64_t Imm) const bool isLegalAddImmediate(int64_t Imm) const bool isTruncateFree(Type Ty1, Type Ty2) const bool isTruncateFree(EVT VT1, EVT VT2) const bool shouldConvertConstantLoadToIntImm(const APInt &Imm, Type *Ty) const Unfortunately, this regresses counter-register-based loop formation because some of the loops now end up in forms were SE cannot compute loop counts. However, nevertheless, the test-suite results favor committing: SingleSource/Benchmarks/BenchmarkGame/puzzle: 26% speedup MultiSource/Benchmarks/FreeBench/analyzer/analyzer: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan: 20% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/trisolv/trisolv: 19% speedup SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gesummv/gesummv: 15% speedup MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2: 2% speedup MultiSource/Benchmarks/VersaBench/bmm/bmm: 26% slowdown llvm-svn: 206120
*	Spell the specialization namespace correctly.	Benjamin Kramer	2014-04-12	2	-2/+6
\| \| \| \| \| \|	Not sure why clang didn't diagnose this (GCC does). llvm-svn: 206117
*	Make helper static and place random global into the llvm namespace.	Benjamin Kramer	2014-04-12	4	-10/+9
\| \| \| \|	llvm-svn: 206116
*	Retire llvm::array_endof in favor of non-member std::end.	Benjamin Kramer	2014-04-12	6	-17/+17
\| \| \| \| \| \|	While there make array_lengthof constexpr if we have support for it. llvm-svn: 206112
*	Move MDBuilder's methods out of line.	Benjamin Kramer	2014-04-12	2	-2/+142
\| \| \| \| \| \| \|	Making them inline was a historical accident, they're neither hot nor templated. llvm-svn: 206109
*	PR13337: Omit DW_TAG_restrict_type when compiling for DWARF2	David Blaikie	2014-04-12	1	-0/+4
\| \| \| \| \| \| \|	DWARF3 introduced DW_TAG_restrict_type, so avoid using it in prior versions. llvm-svn: 206105
*	Revert "Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions,"	Adrian Prantl	2014-04-12	1	-14/+9
\| \| \| \| \| \| \|	This reverts commit 206096 while I investigate why this broke the gdb buildbot. llvm-svn: 206103
*	[ARM64] Never hoist the shift value of a shift instruction.	Juergen Ributzka	2014-04-12	1	-3/+7
\| \| \| \| \| \| \|	There is no need to check if we want to hoist the immediate value of an shift instruction. Simply return TCC_Free right away. llvm-svn: 206101
*	[ARM64] Fix the cost model for cheap large constants.	Juergen Ributzka	2014-04-12	1	-5/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Originally the cost model would give up for large constants and just return the maximum cost. This is not what we want for constant hoisting, because some of these constants are large in bitwidth, but are still cheap to materialize. This commit fixes the cost model to either return TCC_Free if the cost cannot be determined, or accurately calculate the cost even for large constants (bitwidth > 128). This fixes <rdar://problem/16591573>. llvm-svn: 206100
*	Use dwarf::Tag rather than unsigned for DIE::Tag to make debugging easier.	David Blaikie	2014-04-12	4	-8/+13
\| \| \| \| \| \| \| \| \| \| \|	Nice to be able to just print out the Tag and have the debugger print dwarf::DW_TAG_subprogram or whatever, rather than an int. It's a bit finicky (for example DIDescriptor::getTag still returns unsigned) because some places still handle real dwarf tags + our fake tags (one day we'll remove the fake tags, hopefully). llvm-svn: 206098
*	Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions,	Adrian Prantl	2014-04-12	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \|	therefore, their declaration cannot have one DW_AT_linkage_name. The specific instances however can and should have that attribute. This patch reorders the code in DwarfUnit::getOrCreateSubprogramDIE() to emit linkage names for C/Dtors. rdar://problem/16362674. llvm-svn: 206096
*	X86: Remove TargetMachine CPU auto-detection.	Jim Grosbach	2014-04-12	2	-285/+15
\| \| \| \| \| \| \| \|	This logic is properly in the realm of whatever is creating the TargetMachine. This makes plain 'llc foo.ll' consistent across heterogenous machines. llvm-svn: 206094
*	Reenable use of TBAA during CodeGen	Hal Finkel	2014-04-12	2	-14/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We had disabled use of TBAA during CodeGen (even when otherwise using AA) because the ptrtoint/inttoptr used by CGP for address sinking caused BasicAA to miss basic type punning that it should catch (and, thus, we'd fail to override TBAA when we should). However, when AA is in use during CodeGen, CGP now uses normal GEPs and bitcasts, instead of ptrtoint/inttoptr, when doing address sinking. As a result, BasicAA should be able to make us do the right thing in the face of type-punning, and it seems safe to enable use of TBAA again. self-hosting seems fine on PPC64/Linux on the P7, with TBAA enabled and -misched=shuffle. Note: We still don't update TBAA when merging stack slots, although because BasicAA should now catch all such cases, this is no longer a blocking issue. Nevertheless, I plan to commit code to deal with this properly in the near future. llvm-svn: 206093
*	Add the ability to use GEPs for address sinking in CGP	Hal Finkel	2014-04-12	1	-0/+126
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current memory-instruction optimization logic in CGP, which sinks parts of the address computation that can be adsorbed by the addressing mode, does this by explicitly converting the relevant part of the address computation into IR-level integer operations (making use of ptrtoint and inttoptr). For most targets this is currently not a problem, but for targets wishing to make use of IR-level aliasing analysis during CodeGen, the use of ptrtoint/inttoptr is a problem for two reasons: 1. BasicAA becomes less powerful in the face of the ptrtoint/inttoptr 2. In cases where type-punning was used, and BasicAA was used to override TBAA, BasicAA may no longer do so. (this had forced us to disable all use of TBAA in CodeGen; something which we can now enable again) This (use of GEPs instead of ptrtoint/inttoptr) is not currently enabled by default (except for those targets that use AA during CodeGen), and so aside from some PowerPC subtargets and SystemZ, there should be no change in behavior. We may be able to switch completely away from the ptrtoint/inttoptr sinking on all targets, but further testing is required. I've doubled-up on a number of existing tests that are sensitive to the address sinking behavior (including some store-merging tests that are sensitive to the order of the resulting ADD operations at the SDAG level). llvm-svn: 206092
*	[AArch64] Implement the isLegalAddressingMode and getScalingFactorCost APIs.	Chad Rosier	2014-04-12	2	-0/+77
\| \| \| \|	llvm-svn: 206089
*	blockfreq: Rename BlockFrequencyImpl to BlockFrequencyInfoImpl	Duncan P. N. Exon Smith	2014-04-11	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	This is a shared implementation class for BlockFrequencyInfo and MachineBlockFrequencyInfo, not for BlockFrequency, a related (but distinct) class. No functionality change. <rdar://problem/14292693> llvm-svn: 206083
*	blockfreq: Use getSuccessorIndex()	Duncan P. N. Exon Smith	2014-04-11	1	-5/+3
\| \| \| \| \| \| \| \|	No functionality change. <rdar://problem/14292693> llvm-svn: 206082
*	Pull out a named variable for the cached section names to aid readability.	David Blaikie	2014-04-11	1	-6/+8
\| \| \| \| \| \|	Based on a code review suggestion from Eric Christopher in r205990 llvm-svn: 206080
*	Add ARM64 CLS patterns	Louis Gerbarg	2014-04-11	1	-0/+6
\| \| \| \| \| \| \| \| \|	This patch adds patterns to generate the cls instruction ARM64. Includes tests for 64 bit and 32 bit operands. rdar://15611957 llvm-svn: 206079
*	Format fixes for r205990	David Blaikie	2014-04-11	1	-5/+12
\| \| \| \|	llvm-svn: 206078
*	[RegAllocGreedy][Last Chance Recoloring] Change the name of the exhaustive ↵	Quentin Colombet	2014-04-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	search option. fexhaustive-register-search => exhaustive-register-search 'f' is a Clang thing! This is related to PR18747. llvm-svn: 206075
*	[RegAllocGreedy][Last Chance Recoloring] Addition of	Quentin Colombet	2014-04-11	1	-6/+14
\| \| \| \| \| \| \| \| \| \| \|	-fexhaustive-register-search option to allow an exhaustive search during last chance recoloring. This is related to PR18747 Patch by MAYUR PANDEY <mayur.p@samsung.com>. llvm-svn: 206072
*	R600: Check if a sextload should be used for parameter loads.	Matt Arsenault	2014-04-11	3	-14/+20
\| \| \| \| \| \| \| \| \| \|	Through some oddity where truncate (sextload x) isn't folded into an anyextload for vectors, the sextload remains if the vector isn't immediately scalarized. This keeps the expected zextload instructions in the kernel-args test when small type vectors aren't scalarized. llvm-svn: 206070