bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Avoid using lossy load / stores for memcpy / memset expansion. e.g.	Evan Cheng	2012-12-12	1	-15/+21
\| \| \| \| \| \|	f64 load / store on non-SSE2 x86 targets. llvm-svn: 169944
*	Replace TargetLowering::isIntImmLegal() with	Evan Cheng	2012-12-11	1	-1/+5
\| \| \| \| \| \| \| \| \|	ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929
*	Revert EVT->MVT changes, r169836-169851, due to buildbot failures.	Patrik Hagglund	2012-12-11	15	-125/+115
\| \| \| \|	llvm-svn: 169854
*	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs,	Patrik Hagglund	2012-12-11	2	-13/+12
\| \| \| \| \| \|	instead of EVTs. llvm-svn: 169851
*	Change TargetLowering::getTypeForExtArgOrReturn to take and return	Patrik Hagglund	2012-12-11	1	-1/+2
\| \| \| \| \| \| \| \|	MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850
*	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT,	Patrik Hagglund	2012-12-11	2	-14/+19
\| \| \| \| \| \|	from EVT. llvm-svn: 169849
*	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of	Patrik Hagglund	2012-12-11	5	-18/+18
\| \| \| \| \| \|	EVTs. llvm-svn: 169848
*	Change TargetLowering::TransformToType to contain MVTs, instead of	Patrik Hagglund	2012-12-11	1	-4/+4
\| \| \| \| \| \|	EVTs. llvm-svn: 169847
*	Change TargetLowering::findRepresentativeClass to take an MVT, instead	Patrik Hagglund	2012-12-11	1	-2/+2
\| \| \| \| \| \|	of EVT. llvm-svn: 169845
*	Change TargetLowering::getTypeToPromoteTo to take and return MVTs,	Patrik Hagglund	2012-12-11	2	-8/+8
\| \| \| \| \| \|	instead of EVTs. llvm-svn: 169844
*	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT.	Patrik Hagglund	2012-12-11	2	-12/+15
\| \| \| \|	llvm-svn: 169843
*	Change TargetLowering::getCondCodeAction to take an MVT, instead of	Patrik Hagglund	2012-12-11	2	-4/+4
\| \| \| \| \| \|	EVT. llvm-svn: 169842
*	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs.	Patrik Hagglund	2012-12-11	2	-3/+4
\| \| \| \|	llvm-svn: 169841
*	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT.	Patrik Hagglund	2012-12-11	1	-1/+1
\| \| \| \|	llvm-svn: 169840
*	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT.	Patrik Hagglund	2012-12-11	1	-1/+1
\| \| \| \|	llvm-svn: 169839
*	Change TargetLowering::getRepRegClassFor to take an MVT, instead of	Patrik Hagglund	2012-12-11	3	-11/+11
\| \| \| \| \| \| \| \|	EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838
*	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT.	Patrik Hagglund	2012-12-11	6	-27/+28
\| \| \| \| \| \| \| \| \|	Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837
*	Fix a miscompile in the DAG combiner. Previously, we would incorrectly	Chandler Carruth	2012-12-11	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	try to reduce the width of this load, and would end up transforming: (truncate (lshr (sextload i48 <ptr> as i64), 32) to i32) to (truncate (zextload i32 <ptr+4> as i64) to i32) We lost the sext attached to the load while building the narrower i32 load, and replaced it with a zext because lshr always zext's the results. Instead, bail out of this combine when there is a conflict between a sextload and a zext narrowing. The rest of the DAG combiner still optimize the code down to the proper single instruction: movswl 6(...),%eax Which is exactly what we wanted. Previously we read past the end and missed the sign extension: movl 6(...), %eax llvm-svn: 169802
*	Fall back to the selection dag isel to select tail calls.	Chad Rosier	2012-12-11	2	-10/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This shouldn't affect codegen for -O0 compiles as tail call markers are not emitted in unoptimized compiles. Testing with the external/internal nightly test suite reveals no change in compile time performance. Testing with -O1, -O2 and -O3 with fast-isel enabled did not cause any compile-time or execution-time failures. All tests were performed on my x86 machine. I'll monitor our arm testers to ensure no regressions occur there. In an upcoming clang patch I will be marking the objc_autoreleaseReturnValue and objc_retainAutoreleaseReturnValue as tail calls unconditionally. While it's theoretically true that this is just an optimization, it's an optimization that we very much want to happen even at -O0, or else ARC applications become substantially harder to debug. Part of rdar://12553082 llvm-svn: 169796
*	Some enhancements for memcpy / memset inline expansion.	Evan Cheng	2012-12-10	1	-18/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1. Teach it to use overlapping unaligned load / store to copy / set the trailing bytes. e.g. On 86, use two pairs of movups / movaps for 17 - 31 byte copies. 2. Use f64 for memcpy / memset on targets where i64 is not legal but f64 is. e.g. x86 and ARM. 3. When memcpy from a constant string, do not replace the load with a constant if it's not possible to materialize an integer immediate with a single instruction (required a new target hook: TLI.isIntImmLegal()). 4. Use unaligned load / stores more aggressively if target hooks indicates they are "fast". 5. Update ARM target hooks to use unaligned load / stores. e.g. vld1.8 / vst1.8. Also increase the threshold to something reasonable (8 for memset, 4 pairs for memcpy). This significantly improves Dhrystone, up to 50% on ARM iOS devices. rdar://12760078 llvm-svn: 169791
*	Fix a coding style nit.	Eric Christopher	2012-12-10	1	-2/+2
\| \| \| \|	llvm-svn: 169776
*	LegalizeDAG: Allow type promotion of scalar loads	Tom Stellard	2012-12-10	1	-3/+2
\| \| \| \|	llvm-svn: 169773
*	LegalizeDAG: Allow type promotion for scalar stores	Tom Stellard	2012-12-10	1	-3/+4
\| \| \| \|	llvm-svn: 169772
*	Teach DAG combine to handle vector add/sub with vectors of all 0s.	Craig Topper	2012-12-10	1	-0/+10
\| \| \| \|	llvm-svn: 169727
*	Remove extra blank line.	Craig Topper	2012-12-09	1	-1/+0
\| \| \| \|	llvm-svn: 169692
*	Teach DAG combine to handle vector logical operations with vectors of all 1s ↵	Craig Topper	2012-12-08	1	-0/+30
\| \| \| \| \| \|	or all 0s. These cases can show up when vectors are split for legalizing. Fix some tests that were dependent on these cases not being combined. llvm-svn: 169684
*	Replace r169459 with something safer. Rather than having computeMaskedBits to	Evan Cheng	2012-12-06	3	-28/+10
\| \| \| \| \| \| \| \| \| \|	understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536
*	Fix a bug in the code that merges consecutive stores. Previously we did not	Nadav Rotem	2012-12-06	1	-10/+14
\| \| \| \| \| \| \|	check if loads that happen in between stores alias with the first store in the chain, only with the second store onwards. llvm-svn: 169516
*	Let targets provide hooks that compute known zero and ones for any_extend	Evan Cheng	2012-12-06	2	-7/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459
*	Sort includes for all of the .h files under the 'lib' tree. These were	Chandler Carruth	2012-12-04	4	-7/+7
\| \| \| \| \| \| \| \| \| \|	missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224
*	Simplify code. No functionality change.	Jakub Staszak	2012-12-04	1	-3/+1
\| \| \| \|	llvm-svn: 169198
*	Use dyn_cast instead of isa and cast. No functionality change.	Jakub Staszak	2012-12-04	1	-4/+4
\| \| \| \|	llvm-svn: 169196
*	Use the new script to sort the includes of every file under lib.	Chandler Carruth	2012-12-03	17	-171/+171
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131
*	Allow merging multiple store sequences on the same chain.	Nadav Rotem	2012-12-02	1	-2/+15
\| \| \| \|	llvm-svn: 169111
*	Cleanup recent addition of DAGTypeLegalizer::SplitVecOp_VSELECT	Justin Holewinski	2012-11-29	1	-35/+31
\| \| \| \|	llvm-svn: 168932
*	Teach the legalizer how to handle operands for VSELECT nodes	Justin Holewinski	2012-11-29	2	-1/+60
\| \| \| \| \| \| \|	If we need to split the operand of a VSELECT, it must be the mask operand. We split the entire VSELECT operand with EXTRACT_SUBVECTOR. llvm-svn: 168883
*	Allow targets to prefer TypeSplitVector over TypePromoteInteger when ↵	Justin Holewinski	2012-11-29	1	-1/+1
\| \| \| \| \| \| \| \|	computing the legalization method for vectors For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>. llvm-svn: 168882
*	When combining consecutive stores allow loads in between the stores, if the ↵	Nadav Rotem	2012-11-29	1	-3/+61
\| \| \| \| \| \|	loads do not alias. llvm-svn: 168832
*	Refactor to make helper method static.	Craig Topper	2012-11-25	2	-29/+14
\| \| \| \|	llvm-svn: 168557
*	Remove duplicate check of LimitFloatPrecision. It was already checked ↵	Craig Topper	2012-11-25	1	-1/+1
\| \| \| \| \| \|	earlier before IsExp10 could be set to true. llvm-svn: 168553
*	Factor common code out of individual if blocks into common tail.	Craig Topper	2012-11-25	1	-24/+12
\| \| \| \|	llvm-svn: 168551
*	Remove redundant calls to getCurDebugLoc in visitIntrinsicCall. It's already ↵	Craig Topper	2012-11-24	1	-7/+4
\| \| \| \| \| \|	called at the start of the function and captured in a local variable. llvm-svn: 168548
*	Refactor a bit to make some helper methods static.	Craig Topper	2012-11-24	2	-39/+20
\| \| \| \|	llvm-svn: 168546
*	Factor some common code out of individual if blocks.	Craig Topper	2012-11-24	1	-52/+27
\| \| \| \|	llvm-svn: 168538
*	Refactor a bit to make some helper functions static.	Craig Topper	2012-11-23	2	-54/+24
\| \| \| \|	llvm-svn: 168524
*	Cleanup: Simplify loop end logic in computeRegisterProperties().	Patrik Hägglund	2012-11-23	1	-5/+4
\| \| \| \|	llvm-svn: 168507
*	llvm.fmuladd.* lowering should be checking isOperationLegalOrCustom, rather than	Lang Hames	2012-11-22	1	-1/+1
\| \| \| \| \| \|	isOperationLegal. Thanks to Craig Topper for pointing this out. llvm-svn: 168485
*	Mark FP_EXTEND form v2f32 to v2f64 as "expand" for ARM NEON. Patch by Pete ↵	Eli Friedman	2012-11-17	1	-0/+1
\| \| \| \| \| \|	Couperus. llvm-svn: 168240
*	Remove conditions from 'else if' that were guaranteed by preceding 'if'.	Craig Topper	2012-11-16	1	-12/+12
\| \| \| \|	llvm-svn: 168191
*	Factor out the final FADD that's common to multiple code paths in the ↵	Craig Topper	2012-11-16	1	-45/+30
\| \| \| \| \| \|	visitLog* functions. llvm-svn: 168183