llvm-svn: 155746
On x86-32, structure return via sret lets the callee pop the hidden
pointer argument off the stack, which the caller then re-pushes.
However, if the calling convention is fastcc, then a register is used
instead, and the caller should not adjust the stack. This is implemented
with a check of IsTailCallConvention in X86TargetLowering::LowerCall,
but is now also checked properly in X86FastISel::DoSelectCall.
llvm-svn: 155745
Previously, ARMConstantIslandPass would conservatively compute the
address of an aligned basic block as:
  RoundUpToAlignment(Offset + UnknownPadding)
This worked fine for the layout algorithm itself, but it could fool the
verify() function because it accounts for alignment padding twice: once
when adding the worst-case UnknownPadding, and again by rounding up the
fictional block offset. This meant that when optimizeThumb2Instructions
would shrink an instruction, the conservative distance estimate could
grow. That shouldn't be possible, since the worst-case alignment padding
was already included.
This patch drops the use of RoundUpToAlignment, and depends only on
worst case padding to compute conservative block offsets. This has the
weird effect that the computed offset for an aligned block may not be
aligned.
The important difference is that shrinking an instruction can never
cause the estimated distance between two instructions to grow. The
estimated distance is always larger than the real distance that only the
assembler knows.
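To illustrate, a minimal sketch of the two offset computations
(hypothetical code, not the pass's actual implementation; names mirror
the description above):

    // Old: adds worst-case padding and then rounds up again, counting
    // the alignment padding twice:
    //   return RoundUpToAlignment(Offset + UnknownPadding, Align);
    // New: add the worst-case padding exactly once. The result may not
    // itself be aligned, but it can never grow when an instruction
    // shrinks.
    unsigned conservativeOffset(unsigned Offset, unsigned UnknownPadding) {
      return Offset + UnknownPadding;
    }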
<rdar://problem/11339352>
llvm-svn: 155744
This definitely caused a regression with ARM -mno-thumb.
llvm-svn: 155743
vector elements.
llvm-svn: 155742
x == -y --> x+y == 0
x != -y --> x+y != 0
On x86, the generated code goes from
   negl    %esi
   cmpl    %esi, %edi
   je      .LBB0_2
to
   addl    %esi, %edi
   je      .L4
This case is correctly handled for ARM with "cmn".
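As a source-level illustration, a function like this (hypothetical
example, not from the patch) exercises the combine:

    // The equality test against a negation becomes an add followed by
    // a compare against zero.
    bool isNegated(int x, int y) {
      return x == -y;  // -> (x + y) == 0
    }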
Patch by Manman Ren.
rdar://11245199
PR12545
llvm-svn: 155739
llvm-svn: 155735
llvm-svn: 155733
Target specific types should not be vectorized. As a practical matter,
these types are already register matched (at least in the x86 case),
and codegen does not always work correctly (at least in the ppc case,
and this is not worth fixing because ppc_fp128 is currently broken and
will probably go away soon).
llvm-svn: 155729
llvm-svn: 155727
llvm-svn: 155725
<rdar://problem/11325085>.
llvm-svn: 155724
The limit is set to an arbitrary recursion depth of 1000 to avoid stack
overflow issues. <rdar://problem/11286839>.
llvm-svn: 155722
properly with how the code handles all-undef PHI nodes.
llvm-svn: 155721
llvm-svn: 155720
pre-pentiumpro architectures.
* Model FPSW (the FPU status word) as a register.
* Add ISel patterns for the FUCOM*, FNSTSW and SAHF instructions.
* During Legalize/Lowering, build a node sequence to transfer the
  comparison result from FPSW into EFLAGS. If you're wondering about the
  right-shift: that's an implicit sub-register extraction (%ax -> %ah),
  which is handled later on by the instruction selector.
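A rough sketch of the shape of that node sequence (the X86ISD opcode
names below are placeholders for illustration, not the patch's actual
node names):

    // FUCOM compares ST(0)/ST(1); FNSTSW copies FPSW into %ax.
    SDValue FpCmp = DAG.getNode(X86ISD::FCMP_FNSTSW, dl, MVT::i16, LHS, RHS);
    // The right-shift nominates %ah; isel turns it into a sub-register
    // extraction rather than a real shift.
    SDValue Ah = DAG.getNode(ISD::SRL, dl, MVT::i16, FpCmp,
                             DAG.getConstant(8, MVT::i8));
    // SAHF moves %ah into EFLAGS, where ordinary branches can use it.
    SDValue Flags = DAG.getNode(X86ISD::SAHF, dl, MVT::i32,
                                DAG.getNode(ISD::TRUNCATE, dl, MVT::i8, Ah));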
Fixes PR6679. Patch by Christoph Erhardt!
llvm-svn: 155704
llvm-svn: 155701
the mask operand in the MCInst.
llvm-svn: 155700
vectors"
It broke stage2 build. stage1/clang sometimes crashed.
llvm-svn: 155699
llvm-svn: 155698
llvm-svn: 155686
instructions.
- However, it does support dmb, dsb, isb, mrs, and msr.
rdar://11331541
llvm-svn: 155685
instead of getAggregateElement. This has the advantage of being
more consistent and allowing higher-level constant folding to
proceed even if an inner extract element cannot be folded.
Make ConstantFoldInstruction call ConstantFoldConstantExpression
on the instruction's operands, making it more consistent with 
ConstantFoldConstantExpression itself. This makes sure that
ConstantExprs get TargetData-aware folding before being handed
off as operands for further folding.
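As a rough sketch of the operands-first shape this creates (hypothetical
helper, not the patch itself):

    // Fold each operand with ConstantFoldConstantExpression before
    // folding the expression itself, so TargetData-aware folds see
    // already-folded operands.
    static Constant *foldOperandsFirst(ConstantExpr *CE,
                                       const TargetData *TD) {
      SmallVector<Constant*, 8> Ops;
      for (unsigned i = 0, e = CE->getNumOperands(); i != e; ++i) {
        Constant *Op = cast<Constant>(CE->getOperand(i));
        if (ConstantExpr *OpCE = dyn_cast<ConstantExpr>(Op))
          if (Constant *Folded = ConstantFoldConstantExpression(OpCE, TD))
            Op = Folded;
        Ops.push_back(Op);
      }
      return ConstantFoldInstOperands(CE->getOpcode(), CE->getType(), Ops, TD);
    }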
This causes more expressions to be folded, but due to a known
shortcoming in constant folding, this currently has the side effect
of stripping a few more nuw and inbounds flags in the non-targetdata
side of constant-fold-gep.ll. This is mostly harmless.
This fixes rdar://11324230.
llvm-svn: 155682
The required checks are moved to ChainInstruction() itself and the
policy decisions are moved to IVChain::isProfitableInc().
Also cache the ExprBase in IVChain to avoid frequent recomputations.
No functional change intended.
llvm-svn: 155676
No functional change intended.
llvm-svn: 155675
(x & y) | (x ^ y) -> x | y
(x & y) + (x ^ y) -> x | y
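These hold because x & y has exactly the bits common to both operands
and x ^ y has exactly the bits where they differ; the two are bitwise
disjoint, so OR and ADD agree. A trivial illustration (hypothetical
example, not from the patch):

    // Both forms compute x | y, since the AND and XOR results never
    // share a set bit (so no carries occur in the ADD case).
    unsigned viaOr (unsigned x, unsigned y) { return (x & y) | (x ^ y); }
    unsigned viaAdd(unsigned x, unsigned y) { return (x & y) + (x ^ y); }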
Patch by Manman Ren.
rdar://10770603
llvm-svn: 155674
DAGCombine strangeness may result in multiple loads from the same
offset. They both may try to glue themselves to another load. We could
insist that the redundant loads glue themselves to each other, but the
better fix is to bail out from bad gluing at the time we detect it.
Fixes rdar://11314175: BuildSchedUnits assert.
llvm-svn: 155668
The base address for the PC-relative load is Align(PC,4), so it's the
address of the word containing the 16-bit instruction, not the address
of the instruction itself. Ugh.
rdar://11314619
llvm-svn: 155659
the FeatureLeaForSP feature bit when llvm auto-detects Intel Atom.
Patch by Andy Zhang.
llvm-svn: 155655
'REPLACEMENT CHARACTER' (U+FFFD) when getAsInteger fails.
llvm-svn: 155653
On some cores it's a bad idea for performance to mix VFP and NEON
instructions, and since these patterns are NEON anyway, the NEON load
should be used.
llvm-svn: 155630
llvm-svn: 155626
corei7-avx, core-avx-i, and core-avx2 cpu names.
llvm-svn: 155618
elements to minimize the number of multiplies required to compute the
final result. This uses a heuristic to attempt to form near-optimal
binary exponentiation-style multiply chains. While there are some cases
it misses, it seems to do at least a decent job on a very diverse range
of inputs.
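The underlying idea is ordinary binary exponentiation: square repeatedly
and multiply in one factor per set bit of the exponent. A standalone
sketch of the idea (illustrative only, not the patch's code):

    // x^13 = x^8 * x^4 * x^1: one squaring per bit position, plus one
    // extra multiply per set bit of n.
    double powBySquaring(double x, unsigned n) {
      double result = 1.0;
      while (n) {
        if (n & 1)
          result *= x;  // fold in the current power of two
        x *= x;         // square for the next bit
        n >>= 1;
      }
      return result;
    }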
Initial benchmarks show no interesting regressions, and an 8%
improvement on SPASS. Let me know if any other interesting results (in
either direction) crop up!
Credit to Richard Smith for the core algorithm, and helping code the
patch itself.
llvm-svn: 155616
the feature set of v7a. This comes about if the user specifies something like
-arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as
uxtab in this case.
rdar://11318438
llvm-svn: 155601
MDNodeOperand value.
llvm-svn: 155599
llvm-svn: 155567
llvm-svn: 155566
right-shifted by #32. These are stored as shifts by #0 in the MCInst (in
the ARM encoding, an immediate shift amount of zero for these operations
denotes a shift of 32) and correctly marshalled when transforming from
or to assembly representation.
llvm-svn: 155565
Cross-class joins have been normal and fully supported for a while now.
With TableGen generating the getMatchingSuperRegClass() hook, they are
unlikely to cause problems again.
llvm-svn: 155552
Remove the heuristic for disabling cross-class joins. The greedy
register allocator can handle the narrow register classes, and when it
splits a live range, it can pick a larger register class.
Benchmarks were unaffected by this change.
<rdar://problem/11302212>
llvm-svn: 155551
only targets that want the function get it. This prevents other targets
from getting an unused-function warning.
llvm-svn: 155538
ZERO_EXTEND/ANY_EXTEND combine. These will be converted to
target-specific nodes during lowering. This is more consistent with
other code.
llvm-svn: 155537
in poor taste.
Talking through some alternate solutions with Chandler.
llvm-svn: 155530
llvm-svn: 155522
of a precise count. Also, move RRInfo's Partial field into PtrState,
now that it won't increase the size.
llvm-svn: 155513
These lists exclude invoke unwind edges and loop backedges, which are
being ignored. This makes it easier to ignore them consistently.
llvm-svn: 155500
When an instruction match is found, but the subtarget features it
requires are not available (a missing floating-point unit, or Thumb vs.
ARM mode, for example), issue a diagnostic that identifies what the
feature mismatch is.
rdar://11257547
llvm-svn: 155499
llvm-svn: 155486
Fix PR12592. Patch by Matt Pharr.
llvm-svn: 155480