This reverts commit r209638 because it broke self-hosting on ppc64/Linux: the Clang-compiled TableGen would segfault because it jumped to an invalid address from within _ZNK4llvm17ManagedStaticBase21RegisterManagedStaticEPFPvvEPFvS1_E, which is within the command-line parameter registration process.
llvm-svn: 209745
In PPCISelLowering.cpp, PPCTargetLowering::LowerBUILD_VECTOR() contains an optimization for certain patterns that generates one or two vector splats followed by a vector add or subtract. This operation is represented by a VADD_SPLAT node in the selection DAG. Prior to this patch, it was possible for the VADD_SPLAT to be assigned the wrong data type, causing incorrect code generation. This patch corrects the problem.
Specifically, the code previously assigned the value type of the BUILD_VECTOR node to the newly generated VADD_SPLAT node. This is correct much of the time, but not always: the call to isConstantSplat() may return a SplatBitSize that is not the same as the number of bits in the element type of the original vector. The correct type to assign is a vector type whose element bit size equals SplatBitSize.
The included test case shows an example of this, where the BUILD_VECTOR node has type v16i8. The vector to be built is {0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16}. isConstantSplat detects that we can generate a splat of 16 for type v8i16, which is the type we must assign to the VADD_SPLAT node. If we do not, we generate a vspltisb of 8 and a vaddubm, which produces the incorrect result {16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16}. The correct code generation is a vspltish of 8 and a vadduhm.
This patch also corrects code generation for CodeGen/PowerPC/2008-07-10-SplatMiscompile.ll, which had been marked as an XFAIL, so the XFAIL has been removed from that test case.
llvm-svn: 209662
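As an aside (not part of the commit message), the arithmetic behind the example above can be modeled on plain arrays; this sketch just contrasts byte-wise and halfword-wise splat-and-add for the big-endian byte pattern {0, 16, ...}:

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  uint8_t bytewise[16];
  uint8_t halfwordwise[16];

  // Wrong lowering (vspltisb 8 + vaddubm): 8 + 8 in every byte lane,
  // so every byte of the result is 16.
  for (int i = 0; i < 16; ++i)
    bytewise[i] = uint8_t(8 + 8);

  // Correct lowering (vspltish 8 + vadduhm): 8 + 8 in every 16-bit lane,
  // so each halfword is 16, i.e. big-endian bytes {0, 16}.
  for (int i = 0; i < 8; ++i) {
    uint16_t h = uint16_t(8 + 8);
    halfwordwise[2 * i] = uint8_t(h >> 8);
    halfwordwise[2 * i + 1] = uint8_t(h & 0xff);
  }

  // Left column: all 16. Right column: 0, 16, 0, 16, ...
  for (int i = 0; i < 16; ++i)
    std::printf("%2u %2u\n", unsigned(bytewise[i]), unsigned(halfwordwise[i]));
  return 0;
}
```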
This seems to match what gcc does for ppc and what every other llvm backend does.
llvm-svn: 209638
llvm-svn: 209377
on PPC.
llvm-svn: 209376
This required updating the generated functions and TD file accordingly to be pointers rather than const references.
llvm-svn: 209375
a subtarget hook to enable. Unconditionally add to the pass pipeline for targets that might want to use it. No functional change.
llvm-svn: 209340
The SplitIndexingFromLoad changes exposed a latent isel bug in the PowerPC64 backend: we matched an immediate offset with STWX8 even though that instruction only supports a register offset. The culprit is the complex-pattern predicate SelectAddrIdx, which decides that if the offset is not an ISD::Constant it must be a register.
Many thanks to Bill Schmidt for testing this.
llvm-svn: 209219
bswap not.
- On ARM/ARM64 we get a vrev because the shuffle matching code is really smart. We still unroll anything that's not v4i32 though.
- On X86 we get a pshufb with SSSE3. Required more cleverness in isShuffleMaskLegal.
- On PPC we get a vperm for v8i16 and v4i32. v2i64 is unrolled.
llvm-svn: 209123
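For context (not from the commit), the reason a single shuffle suffices: byte-swapping every 32-bit lane of a 16-byte vector is exactly one fixed byte permutation, which is what a vperm/pshufb/vrev lowering applies. A scalar model of that mask:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Byte-swap each 32-bit lane of a 16-byte vector by applying one fixed byte
// permutation -- the same mask a vperm/pshufb-style lowering would use.
static const uint8_t kBswap32Mask[16] = {3,  2,  1,  0,  7,  6,  5,  4,
                                         11, 10, 9,  8,  15, 14, 13, 12};

void bswap_v4i32(const uint8_t in[16], uint8_t out[16]) {
  for (int i = 0; i < 16; ++i)
    out[i] = in[kBswap32Mask[i]];  // one shuffle, no per-element scalar bswap
}

int main() {
  uint32_t lanes[4] = {0x01020304u, 0xAABBCCDDu, 0x00000001u, 0xFF000000u};
  uint8_t in[16], out[16];
  std::memcpy(in, lanes, sizeof in);
  bswap_v4i32(in, out);

  uint32_t swapped[4];
  std::memcpy(swapped, out, sizeof swapped);
  for (int i = 0; i < 4; ++i)
    assert(swapped[i] == __builtin_bswap32(lanes[i]));  // GCC/Clang builtin
  return 0;
}
```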
This is mostly a mechanical change, converting all the call sites to the newer chained-function construction pattern. It removes the horrible 15-parameter constructor for the CallLoweringInfo in favour of setting properties of the call via chained functions. No functional change beyond the removal of the old constructors is intended.
llvm-svn: 209082
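A generic sketch of the chained-setter style the commit describes; the names below are made up and are not the actual CallLoweringInfo interface:

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical builder: each property of the "call" is set by a named,
// chainable member function instead of one positional constructor argument.
struct CallInfoSketch {
  std::string Callee;
  std::vector<int> Args;
  bool IsTailCall = false;

  CallInfoSketch &setCallee(std::string Name) {
    Callee = std::move(Name);
    return *this;
  }
  CallInfoSketch &setArgs(std::vector<int> A) {
    Args = std::move(A);
    return *this;
  }
  CallInfoSketch &setTailCall(bool V = true) {
    IsTailCall = V;
    return *this;
  }
};

int main() {
  CallInfoSketch CLI;
  CLI.setCallee("memcpy").setArgs({1, 2, 3}).setTailCall();  // reads like named parameters
  return CLI.IsTailCall ? 0 : 1;
}
```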
llvm-svn: 209048
llvm-svn: 209040
inappropriate since it lost its Mask parameter in r154011.
llvm-svn: 208811
llvm-svn: 208743
member variable and sink the initialization of crbits into the subtarget feature reset code.
No functional change, but this refactor will be used in a future commit.
llvm-svn: 208726
Support for the intrinsics that read from and write to global named registers is added for r1, r2 and r13 (depending on the subtarget).
llvm-svn: 208509
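As an illustration of how such intrinsics are typically reached from source (an assumption, not something stated in the commit), a GNU global register variable bound to one of these registers compiles down to a named-register read:

```cpp
#include <cstdio>

// GNU extension: bind a variable to a named hardware register. On PowerPC,
// r1 is the stack pointer; reading it here relies on the kind of named-register
// intrinsic support described above. Illustrative only -- this needs GCC or
// Clang targeting PowerPC and is not portable C++.
register void *stack_pointer asm("r1");

int main() {
  std::printf("current stack pointer: %p\n", stack_pointer);
  return 0;
}
```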
The counter-loops formation pass needs to know what operations might be function calls (because they can't appear in counter-based loops). On PPC32, 128-bit shifts might be runtime calls (even though you can't use __int128 on PPC32, it seems that SROA might form them).
Fixes PR19709.
llvm-svn: 208501
We were already always passing true; this just removes the option.
llvm-svn: 208205
The fix itself is fairly simple: move getAccessVariant to MCValue so that we replace the old weak-expression evaluation with the far more general EvaluateAsRelocatable.
This then requires that EvaluateAsRelocatable stop when it finds a non-trivial reference kind, and that in turn requires the ELF writer to look harder for weak references.
Last but not least, this found a case where we were being bug-for-bug compatible with gas and accepting invalid input. I reported pr19647 to track it.
llvm-svn: 207920
introduced most of these recently.
llvm-svn: 207616
anything. In some cases, remove them altogether if there are no callers either.
llvm-svn: 207610
'final' and leave 'virtual' on some methods that are marked virtual without overriding anything and have no obvious overrides themselves. PowerPC edition.
llvm-svn: 207504
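A tiny, generic illustration of the cleanup pattern being applied here (made-up class names, not the PowerPC code):

```cpp
// After the cleanup, overriding methods say 'override' (and 'final' where no
// further override should exist) and drop the redundant leading 'virtual'.
struct Base {
  virtual ~Base() = default;
  virtual unsigned cost(unsigned N) const { return N; }
};

struct Derived final : Base {
  // 'override' both documents and checks that this really overrides Base::cost.
  unsigned cost(unsigned N) const override { return 2 * N; }
};

int main() {
  Derived D;
  const Base &B = D;
  return B.cost(1) == 2 ? 0 : 1;  // dynamic dispatch is unchanged by the cleanup
}
```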
opcode so there's no reason to use the target namespace for it rather than TargetOpcode.
llvm-svn: 207475
llvm-svn: 207473
llvm-svn: 207397
llvm-svn: 207394
llvm-svn: 207377
llvm-svn: 207374
and size.
llvm-svn: 207329
llvm-svn: 207327
llvm-svn: 207197
This is similar to the 'tail' marker, except that it guarantees that tail call optimization will occur. It also comes with conservative IR verification rules that ensure that tail call optimization is possible.
Reviewers: nicholas
Differential Revision: http://llvm-reviews.chandlerc.com/D3240
llvm-svn: 207143
I discovered this const-hole while attempting to coalesce the Symbol and SymbolMap data structures. There are some pending issues with that, but I figured this change was easy to flush early.
llvm-svn: 207124
For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly.
Patch by Yuri Gorshenin.
llvm-svn: 206971
definition below all of the header #include lines, lib/Target/... edition.
llvm-svn: 206842
system headers above the includes of generated '.inc' files that actually contain code. In a few targets this was already done pretty consistently, but it wasn't done *really* consistently anywhere. It is strictly cleaner IMO and necessary in a bunch of places where the DEBUG_TYPE is referenced from the generated code. Consistency with the necessary places trumps. Hopefully the build bots are OK with the movement of intrin.h...
llvm-svn: 206838
behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO, as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects.
This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro *after* header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that requirement. By defining DEBUG_TYPE after the includes, two things are fixed:
- Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape.
- We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant.
Where necessary to support headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough.
The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward.
llvm-svn: 206822
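A minimal sketch of the recommended layout described above, for a hypothetical .cpp file (the pass name is invented for illustration):

```cpp
#include "llvm/Support/Debug.h"        // DEBUG(...) and llvm::dbgs()
#include "llvm/Support/raw_ostream.h"
// ... all other headers, including any generated '.inc' files, go above ...

// Defined only after the includes, so no header sees or redefines it.
#define DEBUG_TYPE "example-pass"

void runExample() {
  // Emitted only in asserts builds under -debug or -debug-only=example-pass.
  DEBUG(llvm::dbgs() << "running the example transformation\n");
}
// (A header that provides inline code would #undef DEBUG_TYPE afterward so
// the macro does not escape into files that include it.)
```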
its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* or a Value* is strongly encouraged to use a MachinePointerInfo instead.
llvm-svn: 206255
This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction.) The MCContext member can be used by MCDisassembler subclasses to construct constant or target-specific MCExprs.
This patch updates the disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances).
llvm-svn: 206241
Implements the various TTI functions to enable constant hoisting on PPC. The only significant test-suite change is this:
MultiSource/Benchmarks/VersaBench/bmm/bmm - 20% speedup (which essentially reverses the slowdown from r206120).
llvm-svn: 206141
We had been using the known-zero values of the operand of the or to construct the mask for an rlwimi; this is not quite correct, but is fine when the mask is constant. When the mask is constant, the known zeros of the operand must be a superset of the zeros in the mask. However, when the mask is not a constant, there might be bits in the operand that are not known to be zero and that, at runtime, might be zero in the mask. Therefore, we check that any bits not known to be zero *are* known to be one in the mask. Otherwise, we can't fold the mask with the or and shift.
This was revealed as a miscompile of MultiSource/Benchmarks/BitBench/drop3/drop3 when I started experimenting with constant hoisting.
llvm-svn: 206136
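The bit-level condition described above can be written down directly. This is a stand-alone illustration of the check, not the actual PPCISelLowering.cpp code:

```cpp
#include <cassert>
#include <cstdint>

// A bit position is foldable only if it is known zero in the OR operand or
// known one in the mask; a position that is neither could let an unknown
// operand bit leak through a mask bit that turns out to be zero at runtime.
bool maskCoversUnknownBits(uint32_t OpKnownZero, uint32_t MaskKnownOne) {
  uint32_t MaybeNonZero = ~OpKnownZero;        // operand bits that might be set
  return (MaybeNonZero & ~MaskKnownOne) == 0;  // all such bits forced to one in the mask
}

int main() {
  // Every possibly-set operand bit is covered by a known-one mask bit: OK to fold.
  assert(maskCoversUnknownBits(/*OpKnownZero=*/0xFFFF0000u,
                               /*MaskKnownOne=*/0x0000FFFFu));
  // Low byte of the operand is unknown and the mask may be zero there: reject.
  assert(!maskCoversUnknownBits(/*OpKnownZero=*/0xFFFF0000u,
                                /*MaskKnownOne=*/0x0000FF00u));
  return 0;
}
```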
Add implementations of:
  bool isLegalICmpImmediate(int64_t Imm) const
  bool isLegalAddImmediate(int64_t Imm) const
  bool isTruncateFree(Type *Ty1, Type *Ty2) const
  bool isTruncateFree(EVT VT1, EVT VT2) const
  bool shouldConvertConstantLoadToIntImm(const APInt &Imm, Type *Ty) const
Unfortunately, this regresses counter-register-based loop formation because some of the loops now end up in forms where SE cannot compute loop counts. Nevertheless, the test-suite results favor committing:
SingleSource/Benchmarks/BenchmarkGame/puzzle: 26% speedup
MultiSource/Benchmarks/FreeBench/analyzer/analyzer: 21% speedup
MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan: 20% speedup
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/trisolv/trisolv: 19% speedup
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gesummv/gesummv: 15% speedup
MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2: 2% speedup
MultiSource/Benchmarks/VersaBench/bmm/bmm: 26% slowdown
llvm-svn: 206120
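As a rough sketch of what one of these hooks can look like, assuming a compare instruction with a signed 16-bit immediate field (an assumption for illustration; this is not the committed PPC implementation):

```cpp
#include <cstdint>

// Hypothetical target hook: report an integer-compare immediate as "legal"
// when it fits a signed 16-bit field, so constant hoisting leaves it inline
// instead of materializing it in a register.
bool isLegalICmpImmediate(int64_t Imm) {
  return Imm >= INT16_MIN && Imm <= INT16_MAX;
}

int main() {
  return (isLegalICmpImmediate(42) && !isLegalICmpImmediate(1 << 20)) ? 0 : 1;
}
```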
llvm-svn: 205961
PPC::isVSLDOIShuffleMask should return -1, not false, when the shuffle predicate should be false.
Noticed by inspection; no test case (yet).
llvm-svn: 205787
Fix "error: private field 'TM' is not used [-Werror,-Wunused-private-field]"
llvm-svn: 205660
This provides more realistic costs for the insert/extractelement instructions (which are load/store pairs), accounts for the cheap unaligned Altivec load sequence, and for unaligned VSX load/stores.
Bad news:
MultiSource/Applications/sgefa/sgefa - 35% slowdown (this will require more investigation)
SingleSource/Benchmarks/McGill/queens - 20% slowdown (we no longer vectorize this, but it was a constant store that was scalarized)
MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 - 2% slowdown
Good news:
SingleSource/Benchmarks/Shootout/ary3 - 54% speedup
SingleSource/Benchmarks/Shootout-C++/ary - 40% speedup
MultiSource/Benchmarks/Ptrdist/ks/ks - 35% speedup
MultiSource/Benchmarks/FreeBench/neural/neural - 30% speedup
MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt - 20% speedup
Unfortunately, estimating the costs of the stack-based scalarization sequences is hard, and adjusting these costs is like a game of whac-a-mole :( I'll revisit this again after we have better codegen for vector extloads and truncstores and unaligned load/stores.
llvm-svn: 205658
Remove the declaration of an unimplemented function.
llvm-svn: 205657
gcc inline asm supports specifying "cc" as a clobber of all condition registers. Add just enough modeling of the full register to make this work.
Fixed PR19326.
llvm-svn: 205630
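For reference, a minimal example of the source construct being modeled (assumes GCC/Clang inline-asm syntax on a PowerPC target; not taken from the commit):

```cpp
// "add." is the record form of add: it writes the result and also updates
// condition-register field cr0, so the statement declares a "cc" clobber to
// tell the compiler that the condition registers are modified.
long add_and_record(long a, long b) {
  long r;
  asm("add. %0, %1, %2"
      : "=r"(r)
      : "r"(a), "r"(b)
      : "cc");  // clobber covering all condition registers
  return r;
}

int main() { return add_and_record(2, -2) == 0 ? 0 : 1; }
```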
llvm-svn: 205610
PPCTTI::getMemoryOpCost will now make use of BasicTTI::getMemoryOpCost to calculate the base cost of the memory access, and then adjust on top of that.
There is no functionality change from this modification, but it will become important so that PPCTTI can take advantage of scalarization information for which BasicTTI::getMemoryOpCost will account in the near future.
llvm-svn: 205476
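A schematic of the layering described above, using made-up types and a toy cost function rather than the real TTI interfaces:

```cpp
// Hypothetical stand-in for the generic (BasicTTI-like) cost model.
struct GenericCostModel {
  virtual ~GenericCostModel() = default;
  virtual unsigned getMemoryOpCost(unsigned Bytes, unsigned Align) const {
    // Toy base cost: pretend wider-than-aligned accesses get split up.
    return Bytes > Align ? Bytes / Align : 1;
  }
};

// Hypothetical target cost model: start from the generic base cost, then
// layer target-specific adjustments on top of it.
struct TargetCostModel : GenericCostModel {
  unsigned getMemoryOpCost(unsigned Bytes, unsigned Align) const override {
    unsigned Cost = GenericCostModel::getMemoryOpCost(Bytes, Align);
    if (Align < 4)
      Cost += 2;  // example unaligned-access penalty
    return Cost;
  }
};

int main() {
  TargetCostModel TCM;
  return TCM.getMemoryOpCost(/*Bytes=*/16, /*Align=*/2) > 0 ? 0 : 1;
}
```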