bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors.	Benjamin Kramer	2013-02-04	1	-44/+115
	This required disabling a PowerPC optimization that did the following:
	input:
	  x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16>
	lowered to:
	  tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8>
	  x = ADD tmp, tmp
	The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now.
	Fix some trivially foldable X86 tests too.
	llvm-svn: 174325
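The folding described in this message can be illustrated with a toy model. This is a minimal sketch, not the SelectionDAG API: `ConstVec` and `foldVectorAdd` are hypothetical names, and a constant BUILD_VECTOR is assumed to be just a list of 32-bit lane values.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a constant BUILD_VECTOR of i32 lanes.
using ConstVec = std::vector<std::uint32_t>;

// Fold ADD of two constant vectors lane by lane.
// Unsigned arithmetic gives the wrap-around behaviour of an i32 add.
ConstVec foldVectorAdd(const ConstVec &a, const ConstVec &b) {
  assert(a.size() == b.size() && "both operands must have the same lane count");
  ConstVec result;
  result.reserve(a.size());
  for (std::size_t i = 0; i < a.size(); ++i)
    result.push_back(a[i] + b[i]);
  return result;
}
```

With `tmp = {8, 8, 8, 8}`, `foldVectorAdd(tmp, tmp)` yields `{16, 16, 16, 16}`, which is the constant the PowerPC lowering started from. This is why the lowering and the new fold undo each other.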
*	rdar://13126763	Shuxin Yang	2013-02-02	1	-13/+20
	Fix a bug in DAGCombine. The symptom is mistakenly optimizing expression "x + x*x" into "x * 3.0".
	llvm-svn: 174239
*	Correct indentation for dumping LexicalScope.	Manman Ren	2013-02-02	1	-8/+6
	llvm-svn: 174237
*	[Dwarf] avoid emitting multiple AT_const_value for static members.	Manman Ren	2013-02-01	1	-3/+9
	Testing case is reduced from MultiSource/BenchMarks/Prolangs-C++/deriv1.
	rdar://problem/13071590
	llvm-svn: 174235
*	Fix errant fallthrough in the generation of the lifetime markers.	Nadav Rotem	2013-02-01	1	-0/+1
	Found by Alexander Kornienko.
	llvm-svn: 174207
*	Use a continue to simplify loop and reduce indentation. No functional change.	Chad Rosier	2013-02-01	1	-24/+25
	llvm-svn: 174198
*	Add braces, so my head doesn't explode.	Chad Rosier	2013-01-31	1	-1/+2
	llvm-svn: 174088
*	When lowering memcpys to loads and stores, make sure we don't promote alignments	Lang Hames	2013-01-31	1	-0/+9
	past the natural stack alignment.
	llvm-svn: 174085
*	[Dwarf] early exit to avoid creating dangling DIEs	Manman Ren	2013-01-31	1	-1/+6
	We used to create children DIEs for a scope, then check whether ScopeDIE is null. If ScopeDIE is null, the children DIEs will be dangling. Other DIEs can link to those dangling DIEs, which are not emitted at all, causing dwarf error.
	The current testing case is 4k lines, from MultiSource/BenchMark/McCat/09-vor.
	rdar://problem/13071959
	llvm-svn: 174084
*	[PEI] Pass the frame index operand number to the eliminateFrameIndex function.	Chad Rosier	2013-01-31	2	-3/+17
	Each target implementation was needlessly recomputing the index.
	Part of rdar://13076458
	llvm-svn: 174083
*	Add a special handling case for untyped CopyFromReg node in GetCostForDef() of ScheduleDAGRRList	Weiming Zhao	2013-01-29	1	-1/+11
	llvm-svn: 173833
*	Support artificial parameters in function types.	David Blaikie	2013-01-29	1	-0/+2
	Provides the functionality for Clang change r172911 - I just had this still lying around.
	llvm-svn: 173820
*	Fixing warnings revealed by gcc release build	Edwin Vane	2013-01-29	1	-3/+2
	Fixed set-but-not-used warnings.
	Reviewer: gribozavr
	llvm-svn: 173810
*	MIsched: cleanup code. Use isBoundaryNode().	Andrew Trick	2013-01-29	1	-2/+4
	llvm-svn: 173775
*	Teach SDISel to combine fsin / fcos into a fsincos node if the following	Evan Cheng	2013-01-29	3	-9/+138
	conditions are met:
	1. They share the same operand and are in the same BB.
	2. Both outputs are used.
	3. The target has a native instruction that maps to ISD::FSINCOS node or the target provides a sincos library call.
	Implemented the generic optimization in sdisel and enabled it for Mac OSX. Also added an additional optimization for x86_64 Mac OSX by using an alternative entry point __sincos_stret which returns the two results in xmm0 / xmm1.
	rdar://13087969
	PR13204
	llvm-svn: 173755
*	This patch addresses bug 15031.	Bill Schmidt	2013-01-28	2	-9/+26
	The common code in the post-RA scheduler to break anti-dependencies on the critical path contained a flaw. In the reported case, an anti-dependency between the overlapping registers %X4 and %R4 exists:
	  %X29<def> = OR8 %X4, %X4
	  %R4<def>, %X3<def,dead,tied3> = LBZU 1, %X3<kill,tied1>
	The unpatched code breaks the dependency by replacing %R4 and its uses with %R3, the first register on the available list. However, %R3 and %X3 overlap, so this creates two overlapping definitions on the same instruction.
	The fix is straightforward, preventing selection of a register that overlaps any other defined register on the same instruction.
	The test case is reduced from the bug report, and verifies that we no longer produce "lbzu 3, 1(3)" when breaking this anti-dependency.
	llvm-svn: 173706
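The selection rule in this fix can be sketched in isolation. This is a minimal sketch under stated assumptions, not the post-RA scheduler's code: registers are plain strings, and `OverlapTable` and `pickReplacement` are hypothetical names standing in for the target's register-alias information and the anti-dependency breaker's candidate search.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <utility>
#include <vector>

using Reg = std::string;

// Hypothetical alias table recording which register pairs overlap.
struct OverlapTable {
  std::set<std::pair<Reg, Reg>> pairs;

  void add(const Reg &a, const Reg &b) {
    assert(a != b && "a register trivially overlaps itself");
    pairs.insert(std::make_pair(a, b));
    pairs.insert(std::make_pair(b, a));
  }

  bool overlaps(const Reg &a, const Reg &b) const {
    return a == b || pairs.count(std::make_pair(a, b)) > 0;
  }
};

// Return the first available register that does not overlap any other
// register defined by the same instruction; empty string if none qualifies.
Reg pickReplacement(const std::vector<Reg> &available,
                    const std::vector<Reg> &otherDefs,
                    const OverlapTable &table) {
  for (const Reg &candidate : available) {
    bool clash = false;
    for (const Reg &def : otherDefs) {
      if (table.overlaps(candidate, def)) {
        clash = true;
        break;
      }
    }
    if (!clash)
      return candidate;
  }
  return Reg();
}
```

In the reported case the instruction also defines X3. With an available list of R3, R5 and a table in which R3 overlaps X3, the sketch skips R3 and returns R5 instead of creating two overlapping definitions.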
*	Fix comment.	Eric Christopher	2013-01-28	1	-1/+1
	llvm-svn: 173698
*	Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation for refactoring the ARC Optimizer.	Michael Gottesman	2013-01-28	1	-1/+1
	llvm-svn: 173647
*	Legalizer: Reword comment again, per Duncan's suggestion.	Benjamin Kramer	2013-01-27	1	-3/+2
	llvm-svn: 173625
*	Legalizer: Add an assert and tweak a comment to clarify the assumptions this code makes.	Benjamin Kramer	2013-01-27	1	-1/+5
	llvm-svn: 173620
*	When the legalizer is splitting vector shifts, the result may not have the right shift amount type.	Benjamin Kramer	2013-01-27	1	-2/+9
	Fix that by adding a cast to the shift expander. This came up with vector shifts on sse-less X86 CPUs.
	   <2 x i64>       = shl <2 x i64> <2 x i64>
	-> i64,i64         = shl i64 i64; shl i64 i64
	-> i32,i32,i32,i32 = shl_parts i32 i32 i64; shl_parts i32 i32 i64
	Now we cast the last two i64s to the right type. Fixes the crash in PR14668.
	llvm-svn: 173615
*	Use const reference instead of vector copying.	Jakub Staszak	2013-01-25	1	-1/+2
	llvm-svn: 173497
*	This patch aims to reduce compile time in LegalizeTypes by using SmallDenseMap,	Preston Gurd	2013-01-25	2	-9/+9
	with an initial number of elements, instead of DenseMap, which has zero initial elements, in order to avoid the copying of elements when the size changes and to avoid allocating space every time LegalizeTypes is run. This patch will not affect the memory footprint, because DenseMap will increase the element size to 64 when the first element is added.
	Patch by Wan Xiaofei.
	llvm-svn: 173448
*	MIsched: Print block name. No functionality.	Andrew Trick	2013-01-25	1	-1/+2
	llvm-svn: 173433
*	MachineScheduler support for viewGraph.	Andrew Trick	2013-01-25	1	-1/+88
	llvm-svn: 173432
*	ScheduleDAG: colorize the DOT graph and improve formatting.	Andrew Trick	2013-01-25	3	-2/+10
	llvm-svn: 173431
*	ScheduleDAG: Added isBoundaryNode to conveniently detect a common corner case.	Andrew Trick	2013-01-25	1	-7/+19
	This fixes DAG subtree analysis at the boundary.
	llvm-svn: 173427
*	SchedDFS: Complete support for nested subtrees.	Andrew Trick	2013-01-25	1	-33/+74
	Maintain separate per-node and per-tree book-keeping. Track all instructions above a DAG node including nested subtrees. Separately track instructions within a subtree. Record subtree parents.
	llvm-svn: 173426
*	MIsched: Improve the interface to SchedDFS analysis (subtrees).	Andrew Trick	2013-01-25	2	-34/+42
	Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state.
	llvm-svn: 173425
*	SchedDFS: Initial support for nested subtrees.	Andrew Trick	2013-01-25	1	-37/+73
	This is mostly refactoring, along with adding an instruction count within the subtrees and ensuring we only look at data edges.
	llvm-svn: 173420
*	MISched: Add SchedDFSResult to ScheduleDAGMI to formalize the	Andrew Trick	2013-01-25	1	-25/+55
	interface and allow other strategies to select it.
	llvm-svn: 173413
*	SchedDFS: Refactor and tweak the subtree selection criteria.	Andrew Trick	2013-01-25	1	-24/+32
	For sanity, create a root when NumDataSuccs >= 4. Splitting large subtrees will no longer be detrimental after my next checkin to handle nested tree. A magic number of 4 is fine because single subtrees seldom rejoin more than this. It makes subtrees easier to visualize and heuristics more sane.
	llvm-svn: 173399
*	Avoid creating duplicate CFG edges in the IfConversion pass.	Jakob Stoklund Olesen	2013-01-24	1	-1/+1
	Patch by Stefan Hepp.
	llvm-svn: 173395
*	MachineScheduler: enable biasCriticalPath for all DAGs.	Andrew Trick	2013-01-24	1	-0/+4
	llvm-svn: 173318
*	MIsched: Added biasCriticalPath.	Andrew Trick	2013-01-24	1	-0/+15
	Allow schedulers to order DAG edges by critical path. This makes DFS-based heuristics more stable and effective.
	llvm-svn: 173317
*	Add the heuristic to differentiate SSPStrong from SSPRequired.	Bill Wendling	2013-01-23	1	-23/+103
	The requirements of the strong heuristic are:
	* A Protector is required for functions which contain an array, regardless of type or length.
	* A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting.
	* A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.)
	llvm-svn: 173231
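The three requirements above can be read as one predicate over a function's locals. This is a minimal sketch under stated assumptions, not the StackProtector pass itself: `Type`, `Local`, `containsArray`, and `needsStrongProtector` are hypothetical names, and whether an address is "exposed" is assumed to have been computed elsewhere.

```cpp
#include <cassert>
#include <vector>

// Hypothetical toy type model: a scalar, an array, or a struct/union
// whose members are themselves types (nesting may be arbitrarily deep).
struct Type {
  enum Kind { Scalar, Array, Struct } kind;
  std::vector<Type> members; // only meaningful for Struct
};

// True if the type is an array or contains one at any nesting depth.
bool containsArray(const Type &t) {
  assert((t.kind == Type::Struct || t.members.empty()) &&
         "only struct/union types carry members in this model");
  if (t.kind == Type::Array)
    return true;
  for (const Type &member : t.members)
    if (containsArray(member))
      return true;
  return false;
}

// Hypothetical description of one stack-based variable.
struct Local {
  Type type;
  bool addressExposed; // e.g. address stored or passed as an argument
};

// A protector is required if any local is or contains an array,
// or if any local's address is exposed.
bool needsStrongProtector(const std::vector<Local> &locals) {
  for (const Local &local : locals)
    if (containsArray(local.type) || local.addressExposed)
      return true;
  return false;
}
```

A function with a single scalar local whose address is never taken gets no protector in this model. Wrapping an array inside a struct inside another struct still triggers one, matching the "no limit to the depth of nesting" note.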
*	Add the IR attribute 'sspstrong'.	Bill Wendling	2013-01-23	1	-0/+6
	SSPStrong applies a heuristic to insert stack protectors in these situations:
	* A Protector is required for functions which contain an array, regardless of type or length.
	* A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting.
	* A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.)
	This patch implements the SSPStrong attribute to be equivalent to SSPRequired. This will change in a subsequent patch.
	llvm-svn: 173230
*	Make APFloat constructor require explicit semantics.	Tim Northover	2013-01-22	5	-42/+29
	Previously we tried to infer it from the bit width size, with an added IsIEEE argument for the PPC/IEEE 128-bit case, which had a default value. This default value allowed bugs to creep in, where it was inappropriate.
	llvm-svn: 173138
*	Introduce a new data structure, the SparseMultiSet, and changes to the MI scheduler to use it.	Michael Ilseman	2013-01-21	1	-45/+33
	A SparseMultiSet adds multiset behavior to SparseSet, while retaining SparseSet's desirable properties. Essentially, SparseMultiSet provides multiset behavior by storing its dense data in doubly linked lists that are inlined into the dense vector. This allows it to provide good data locality as well as vector-like constant-time clear() and fast constant time find(), insert(), and erase(). It also allows SparseMultiSet to have a builtin recycler rather than keeping SparseSet's behavior of always swapping upon removal, which allows it to preserve more iterators. It's often a better alternative to a SparseSet of a growable container or vector-of-vector.
	llvm-svn: 173064
*	Revert 172708.	Nadav Rotem	2013-01-20	2	-14/+8
	The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends. This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical. Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume that there is only one SEXT node. The AVX mask optimization is one example. Additionally this optimization does not update the cost model.
	llvm-svn: 172968
*	The last of PR14471 - emission of constant floats	David Blaikie	2013-01-20	2	-4/+19
	llvm-svn: 172941
*	Split out DW_OP_addr for the split debug info DWARF5 proposal.	Eric Christopher	2013-01-18	2	-6/+23
	llvm-svn: 172857
*	Use AttributeSet accessor methods instead of Attribute accessor methods.	Bill Wendling	2013-01-18	1	-4/+4
	Further encapsulation of the Attribute object. Don't allow direct access to the Attribute object as an aggregate.
	llvm-svn: 172853
*	Remove unused parameter. Also use the AttributeSet query methods instead of the Attribute query methods.	Bill Wendling	2013-01-18	2	-9/+9
	llvm-svn: 172852
*	[MC/Mach-O] Implement integrated assembler support for linker options.	Daniel Dunbar	2013-01-18	1	-7/+26
	- Also, fixup syntax errors in LangRef and missing newline in the MCAsmStreamer.
	llvm-svn: 172837
*	Optimization for the following SIGN_EXTEND pairs:	Elena Demikhovsky	2013-01-17	2	-8/+14
	v8i8 -> v8i64, v8i8 -> v8i32, v4i8 -> v4i64, v4i16 -> v4i64 for AVX and AVX2.
	Bug 14865.
	llvm-svn: 172708
*	Fix the assembly and disassembly of DW_FORM_sec_offset. Found this by	Eric Christopher	2013-01-17	2	-4/+7
	changing both the string of the dwo_name to be correct and the type of the statement list. Testcases all around.
	llvm-svn: 172699
*	Add the DW_AT_GNU_addr_base for the skeleton cu. Add support for	Eric Christopher	2013-01-17	2	-1/+7
	emitting the dwarf32 version of DW_FORM_sec_offset and correct disassembler support.
	llvm-svn: 172698
*	Move MachineTraceMetrics.h into include/llvm/CodeGen.	Jakob Stoklund Olesen	2013-01-17	4	-354/+3
	Let targets use it.
	llvm-svn: 172688
*	Provide a place for targets to insert ILP optimization passes.	Jakob Stoklund Olesen	2013-01-17	1	-4/+6
	Move the early if-conversion pass into this group.
	ILP optimizations usually need to find the right balance between register pressure and ILP using the MachineTraceMetrics analysis to identify critical paths and estimate other costs. Such passes should run together so they can share dominator tree and loop info analyses.
	Besides if-conversion, future passes to run here could include expression height reduction and ARM's MLxExpansion pass.
	llvm-svn: 172687