bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Rip out support for 'llvm.noinline'. This thing has a strange history...	Chandler Carruth	2012-03-16	1	-45/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It was added in 2007 as the first cut at supporting no-inline attributes, but we didn't have function attributes of any form at the time. However, it was added without any mention in the LangRef or other documentation. Later on, in 2008, Devang added function notes for 'inline=never' and then turned them into proper function attributes. From that point onward, as far as I can tell, the world moved on, and no one has touched 'llvm.noinline' in any meaningful way since. Its time has now come. We have had better mechanisms for doing this for a long time, all the frontends I'm aware of use them, and this is just holding back progress. Given that it was never a documented feature of the IR, I've provided no auto-upgrade support. If people know of real, in-the-wild bitcode that relies on this, yell at me and I'll add it, but I seriously doubt anyone cares. llvm-svn: 152904
*	Start removing the use of an ad-hoc 'never inline' set and instead	Chandler Carruth	2012-03-16	4	-42/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	directly query the function information which this set was representing. This simplifies the interface of the inline cost analysis, and makes the always-inline pass significantly more efficient. Previously, always-inline would first make a single set of every function in the module except those marked with the always-inline attribute. It would then query this set at every call site to see if the function was a member of the set, and if so, refuse to inline it. This is quite wasteful. Instead, simply check the function attribute directly when looking at the callsite. The normal inliner also had similar redundancy. It added every function in the module with the noinline attribute to its set to ignore, even though inside the cost analysis function we already tested the noinline attribute and produced the same result. The only tricky part of removing this is that we have to be able to correctly remove only the functions inlined by the always-inline pass when finalizing, which requires a bit of a hack. Still, much less of a hack than the set of all non-always-inline functions was. While I was touching this function, I switched a heavy-weight set to a vector with sort+unique. The algorithm already had a two-phase insert and removal pattern, we were just needlessly paying the uniquing cost on every insert. This probably speeds up some compiles by a small amount (-O0 compiles with lots of always-inline, so potentially heavy libc++ users), but I've not tried to measure it. I believe there is no functional change here, but yell if you spot one. None are intended. Finally, the direction this is going in is to greatly simplify the inline cost query interface so that we can replace its implementation with a much more clever one. Along the way, all the APIs get simplified, so it seems incrementally good. llvm-svn: 152903
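The switch from a set to a vector with sort+unique described above can be sketched as follows. This is only an illustration of the general pattern, not the code from this commit; the function name `collectUnique` and the use of `int` elements are assumptions made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical helper (not from the LLVM sources): gather values with cheap
// appends, then pay the uniquing cost once instead of on every insert.
std::vector<int> collectUnique(const std::vector<int> &inserts) {
  std::vector<int> items;
  items.reserve(inserts.size());

  // Phase 1: insert. Duplicates are allowed here, so each append is O(1).
  for (int v : inserts)
    items.push_back(v);

  // Phase 2: sort, then drop adjacent duplicates in a single pass.
  std::sort(items.begin(), items.end());
  items.erase(std::unique(items.begin(), items.end()), items.end());

  // Sanity check: no two neighbouring elements are equal any more.
  assert(std::adjacent_find(items.begin(), items.end()) == items.end());
  return items;
}
```

This only pays off when, as the message says, all inserts happen before any lookups or removals; a container that is queried between inserts would still need a set-like structure.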
*	Pull the implementation of the code metrics out of the inline cost	Chandler Carruth	2012-03-16	3	-158/+177
\| \| \| \| \| \| \| \| \| \| \|	analysis implementation. The header was already separated. Also cleanup all the comments in the header to follow a nice modern doxygen form. There is still plenty of cruft here, but some of that will fall out in subsequent refactorings and this was an easy step in the right direction. No functionality changed here. llvm-svn: 152898
*	misched: add DAG edges from vreg defs to ExitSU.	Andrew Trick	2012-03-16	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	These edges are not really necessary, but it is consistent with the way we currently create physreg edges. Scheduler heuristics that expect a DAG edge to the block terminator could benefit from this change. Although in the future I hope we have a better mechanism for modeling latency across scheduling regions. llvm-svn: 152895
*	LSR fix: Add isSimplifiedLoopNest to IVUsers analysis.	Andrew Trick	2012-03-16	4	-17/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Only record IVUsers that are dominated by simplified loop headers. Otherwise SCEVExpander will crash while looking for a preheader. I previously tried to work around this in LSR itself, but that was insufficient. This way, LSR can continue to run if some uses are not in simple loops, as long as we don't attempt to analyze those users. Fixes <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 152892
*	Revert r152705, which reapplied r152486 as this appears to be causing failures	Chad Rosier	2012-03-16	1	-128/+34
\| \| \| \| \| \| \| \| \| \| \|	on our internal nightly testers. So, basically revert r152486 again. Abbreviated original commit message: Implement a more intelligent way of spilling uses across an invoke boundary. It looks as if Chandler's inlining work, r152737, exposed an issue. llvm-svn: 152887
*	In InstCombiner::visitOr, make sure we reverse the operand swap used for ↵	Eli Friedman	2012-03-16	1	-1/+7
\| \| \| \| \| \|	checking for or-of-xor operations after those checks; a later check expects that any constant will be in Op1. PR12234. llvm-svn: 152884
*	ARM optional operand on MRC/MCR assembly instructions.	Jim Grosbach	2012-03-16	2	-0/+24
\| \| \| \| \| \|	rdar://11058464 llvm-svn: 152883
*	ARM vmrs system registers mvfr0 and mvfr1 handling.	Jim Grosbach	2012-03-16	2	-0/+6
\| \| \| \| \| \|	rdar://11058464 llvm-svn: 152881
*	Do the right thing on NULL uint64 fields.	Eric Christopher	2012-03-16	1	-1/+1
\| \| \| \| \| \| \| \|	Patch by Clemens Hammacher! Fixes PR12243 llvm-svn: 152880
*	Revert r152613 (and r152614), "Inline the d'tor and add an anchor instead." ↵	NAKAMURA Takumi	2012-03-16	1	-1/+1
\| \| \| \| \| \| \| \| \|	for workaround of g++-4.4's miscompilation. It caused MSP430DAGToDAGISel::SelectIndexedBinOp() to be miscompiled. When two ReplaceUses()'s are expanded as inline, vtable in base class is stored to latter (ISelUpdater)ISU. llvm-svn: 152877
*	For types with a parent of the compile unit make sure and emit	Eric Christopher	2012-03-15	1	-5/+4
\| \| \| \| \| \| \| \|	the DECL information. rdar://10855921 llvm-svn: 152876
*	Remove inadvertent commit.	Jim Grosbach	2012-03-15	1	-23/+0
\| \| \| \|	llvm-svn: 152870
*	[fast-isel] Address Eli's comments for r152847. Specifically, add a test case	Chad Rosier	2012-03-15	1	-9/+8
\| \| \| \| \| \| \|	and still allow immediate encoding, just not with cmn. rdar://11038907 llvm-svn: 152869
*	[fast-isel] Don't try to encode LONG_MIN using cmn instructions.	Chad Rosier	2012-03-15	1	-5/+9
\| \| \| \| \| \|	rdar://11038907 llvm-svn: 152847
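One plausible reading of the LONG_MIN problem: a compare against a negative immediate can be rewritten as a cmn against the negated immediate, but the most negative value has no representable negation. The sketch below assumes a 32-bit signed immediate; `canNegateForCmn` is a hypothetical helper, not the fast-isel code touched by this commit.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical check (not LLVM's implementation): decide whether a negative
// compare immediate can be negated for use with a cmn-style instruction.
// On success, 'negated' receives the positive immediate.
bool canNegateForCmn(std::int32_t imm, std::int32_t &negated) {
  // Non-negative immediates do not need the cmn form at all.
  if (imm >= 0)
    return false;

  // The most negative value has no positive counterpart in 32 bits, so
  // negating it would overflow. Reject it instead of negating.
  if (imm == std::numeric_limits<std::int32_t>::min())
    return false;

  negated = -imm;
  return true;
}
```

A caller would fall back to materializing the constant in a register (or to another encoding) whenever this check returns false.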
*	ARM case-insensitive checking for APSR_nzcv.	Jim Grosbach	2012-03-15	3	-4/+29
\| \| \| \| \| \|	rdar://11056591 llvm-svn: 152846
*	We actually handle AllocaInst via getRegForValue below just fine.	Eric Christopher	2012-03-15	1	-1/+1
\| \| \| \| \| \|	Part of rdar://8905263 llvm-svn: 152845
*	Add some debugging output into fast isel as well.	Eric Christopher	2012-03-15	1	-2/+6
\| \| \| \|	llvm-svn: 152844
*	Add another debug statement.	Eric Christopher	2012-03-15	1	-1/+3
\| \| \| \|	llvm-svn: 152843
*	Tabs.	Eric Christopher	2012-03-15	1	-3/+3
\| \| \| \|	llvm-svn: 152842
*	Typo.	Eric Christopher	2012-03-15	1	-1/+1
\| \| \| \|	llvm-svn: 152841
*	ARM aliases for pre-unified syntax fcmpz[sd] mnemonics.	Jim Grosbach	2012-03-15	2	-1/+6
\| \| \| \| \| \|	rdar://11056647 llvm-svn: 152834
*	Type sizes and fields offsets inside structs are unsigned. This is a highly	Duncan Sands	2012-03-15	1	-4/+2
\| \| \| \| \| \| \|	theoretical fix since it only matters for types with >= 2^63 bits (!) and also only matters if pointers have more than 64 bits, which is not supported anyway. llvm-svn: 152831
*	Use vmov.f32 to materialize f32 consts on ARM. This relaxes constraints on	Lang Hames	2012-03-15	2	-0/+26
\| \| \| \| \| \| \|	register allocation by allowing all 32 D-registers to be used. Patch by Cameron Zwarich. llvm-svn: 152824
*	Fix VCVT decoding (between floating-point and fixed-point, Floating-point). ↵	Kristof Beyls	2012-03-15	1	-16/+38
\| \| \| \| \| \|	Patch by Richard Barton. llvm-svn: 152814
*	Fix bug found by warning.	Michael J. Spencer	2012-03-15	1	-1/+2
\| \| \| \|	llvm-svn: 152812
*	Short term fix for pr12270 before we change dominates to handle unreachable	Rafael Espindola	2012-03-15	1	-29/+33
\| \| \| \| \| \| \|	code. While here, reduce indentation. llvm-svn: 152803
*	Use an iterator instead of calling .size() on the worklist every time, which ↵	Bill Wendling	2012-03-15	1	-2/+2
\| \| \| \| \| \|	is wasteful. llvm-svn: 152794
*	Implement relocation-overflow behavior for PE/COFF.	Michael J. Spencer	2012-03-15	1	-2/+29
\| \| \| \| \| \| \| \| \|	This needs a test, but it will take some time to figure out the best way to get an input that will produce > 2^16 relocs. Patch by Graydon Hoare! llvm-svn: 152787
*	When optimizing certain BUILD_VECTOR nodes into other BUILD_VECTOR nodes, ↵	Nadav Rotem	2012-03-15	1	-0/+4
\| \| \| \| \| \|	add the new node into the work list because there is a potential for further optimizations. llvm-svn: 152784
*	Revert the removal of DW_AT_MIPS_linkage_name when we aren't putting	Eric Christopher	2012-03-15	1	-5/+8
\| \| \| \| \| \| \| \| \|	out the DW_AT_name. Older gdbs unfortunately still use it to disambiguate member functions in templated classes (gdb.cp/templates.exp). rdar://11043421 (which is now deferred for a bit) llvm-svn: 152782
*	Add a xform to the DAG combiner.	Bill Wendling	2012-03-15	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \|	Transform: (fsub x, (fadd x, y)) -> (fneg y) and (fsub x, (fadd y, x)) -> (fneg y) if 'unsafe math' is specified. <rdar://problem/7540295> llvm-svn: 152777
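The 'unsafe math' condition matters because the two forms are not equivalent under strict IEEE rounding. The following sketch, with hypothetical function names, shows an input where x - (x + y) and -y differ; it assumes IEEE 754 double precision and a build without fast-math flags.

```cpp
// Strict evaluation of x - (x + y). The volatile temporary forces the sum to
// be rounded to double, so excess-precision evaluation cannot hide the effect.
double strictForm(double x, double y) {
  volatile double sum = x + y;
  return x - sum;
}

// The result the transform produces: (fneg y).
double foldedForm(double /*x*/, double y) {
  return -y;
}
```

With x = 1e17 and y = 1.0, the sum rounds back to 1e17 (neighbouring doubles are 16 apart at that magnitude), so the strict form yields 0.0 while the folded form yields -1.0. For small inputs such as x = 4.0 and y = 1.0 both forms agree.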
*	Remove the basic inliner. This was added in 2007, and hasn't really	Chandler Carruth	2012-03-15	2	-183/+0
\| \| \| \| \| \| \|	changed since. No one was using it. It is yet another consumer of the InlineCost interface that I'd like to change. llvm-svn: 152769
*	Make the swap code here a bit more obvious what it's doing... We're	Chandler Carruth	2012-03-15	1	-1/+1
\| \| \| \| \| \| \|	essentially sorting the pair's arguments. I'd love to actually call sort here, but I'm just not that crazy. ;] llvm-svn: 152764
*	Don't assume that the arguments are processed in some particular order.	Chandler Carruth	2012-03-15	1	-2/+4
\| \| \| \| \| \| \|	This appears to not be the case with dragonegg at least in some contexts. Hopefully will fix the bootstrap assert failure there. llvm-svn: 152763
*	[avx] Add patterns for VINSERTF128rm.	Chad Rosier	2012-03-15	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This results in sequences such as "vmovaps -96(%rbx), %xmm1; vinsertf128 $1, %xmm1, %ymm0, %ymm0" being combined into "vinsertf128 $1, -96(%rbx), %ymm0, %ymm0". rdar://10643481 llvm-svn: 152762
*	This pass didn't want the inline cost per-se, it just wants generic code	Chandler Carruth	2012-03-15	1	-1/+1
\| \| \| \| \| \|	metrics. llvm-svn: 152760
*	Remove all remnants of partial specialization in the cost computation	Chandler Carruth	2012-03-15	1	-69/+0
\| \| \| \| \| \|	side of things. This is all dead code. llvm-svn: 152759
*	Fixed a transform crash when setting a negative size value for memset. ↵	Aaron Ballman	2012-03-15	1	-2/+6
\| \| \| \| \| \|	Fixes PR12202. llvm-svn: 152756
*	[tsan] use FunctionBlackList	Kostya Serebryany	2012-03-14	1	-0/+9
\| \| \| \|	llvm-svn: 152755
*	[asan] rename class BlackList to FunctionBlackList and move it into a ↵	Kostya Serebryany	2012-03-14	4	-70/+120
\| \| \| \| \| \|	separate file -- we will need the same functionality in ThreadSanitizer llvm-svn: 152753
*	Extend the inline cost calculation to account for bonuses due to	Chandler Carruth	2012-03-14	2	-12/+92
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due to the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752
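The two idioms contrasted in this message look roughly like the sketch below. All function names are hypothetical and the code only illustrates the source-level pattern, not the inline cost analysis itself.

```cpp
#include <cstddef>

// C-style idiom: the element count is passed directly, so a constant count
// is visible as a constant argument at the call site.
int sumCounted(const int *data, std::size_t n) {
  int total = 0;
  for (std::size_t i = 0; i < n; ++i)
    total += data[i];
  return total;
}

// C++-style idiom: the count is implicit in (end - begin). Neither argument
// is a constant on its own, but the pair has a constant offset when the
// caller passes (p, p + N).
int sumRange(const int *begin, const int *end) {
  int total = 0;
  for (const int *it = begin; it != end; ++it)
    total += *it;
  return total;
}

// Hypothetical call site: 'values + 4' is a constant offset from 'values',
// so the trip count becomes known if sumRange is inlined here.
int sumFixed() {
  static const int values[4] = {1, 2, 3, 4};
  return sumRange(values, values + 4);
}
```

In `sumFixed`, the subtraction `end - begin` folds to 4 after inlining, which is the kind of simplification the bonus described above is meant to anticipate.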
*	When an invoke is marked with metadata indicating its unwind edge	Dan Gohman	2012-03-14	1	-1/+2
\| \| \| \| \| \| \|	should be ignored by ARC optimization, don't insert new ARC runtime calls in the unwind destination. llvm-svn: 152748
*	Change where we enable the heuristic that delays inlining into functions	Chandler Carruth	2012-03-14	1	-7/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	which are small enough to themselves be inlined. Delaying in this manner can be harmful if the function is ineligible for inlining in some (or many) contexts as it pessimizes the code of the function itself in the event that inlining does not eventually happen. Previously the check was written to only do this delaying of inlining for static functions in the hope that they could be entirely deleted and in the knowledge that all callers of static functions will have the opportunity to inline if it is in fact profitable. However, with C++ we get two other important sources of functions where the definition is always available for inlining: inline functions and templated functions. This patch generalizes the inliner to allow linkonce-ODR (the linkage such C++ routines receive) to also qualify for this delay-based inlining. Benchmarking across a range of large real-world applications shows roughly 2% size increase across the board, but an average speedup of about 0.5%. Some benchmarks improved over 2%, and the 'clang' binary itself (when bootstrapped with this feature) shows a 1% -O0 performance improvement when run over all Sema, Lex, and Parse source code smashed into a single file. A clean re-build of Clang+LLVM with a bootstrapped Clang shows approximately 2% improvement, but that measurement is often noisy. llvm-svn: 152737
*	Silence operator precedence warnings.	Benjamin Kramer	2012-03-14	1	-3/+3
\| \| \| \|	llvm-svn: 152711
*	Refactor the inline cost bonus calculation for constants to use	Chandler Carruth	2012-03-14	1	-20/+26
\| \| \| \| \| \| \| \|	a worklist rather than a recursive call. No functionality changed. llvm-svn: 152706
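Replacing a recursive walk with an explicit worklist generally follows the shape sketched below. The `Node` type, the `bonus` field, and `totalBonus` are assumptions invented for this illustration; they do not correspond to the actual inline cost data structures.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical graph node: a local bonus plus the indices of its users.
struct Node {
  int bonus;
  std::vector<std::size_t> users;
};

// Sum the bonus of every node reachable from 'root' through user edges.
// An explicit stack replaces recursion, so depth is bounded by heap memory
// rather than by the call stack. 'root' is assumed to be a valid index.
int totalBonus(const std::vector<Node> &graph, std::size_t root) {
  std::vector<bool> visited(graph.size(), false);
  std::vector<std::size_t> worklist;
  worklist.push_back(root);

  int total = 0;
  while (!worklist.empty()) {
    std::size_t current = worklist.back();
    worklist.pop_back();

    // A node can be pushed more than once; count it only the first time.
    if (visited[current])
      continue;
    visited[current] = true;

    total += graph[current].bonus;
    for (std::size_t user : graph[current].users)
      worklist.push_back(user);
  }
  return total;
}
```

The visiting order differs from the recursive version, which is harmless for an order-independent sum like this one but would matter if the per-node work depended on traversal order.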
*	Reapply r152486 with a fix for the nightly testers.	Bill Wendling	2012-03-14	1	-34/+128
\| \| \| \| \| \| \| \| \|	There were cases where a value could be used and it's both crossing an invoke and NOT crossing an invoke. This could happen in the landing pads. In that case, we will demote the value to the stack like we did before. <rdar://problem/10609139> llvm-svn: 152705
*	Insert the debugging instructions in one fell swoop so that it doesn't call the	Bill Wendling	2012-03-14	1	-7/+8
\| \| \| \| \| \| \|	expensive "getFirstTerminator" call. This reduces the time of compilation in PR12258 from >10 minutes to < 10 seconds. llvm-svn: 152704
*	misched: implemented a framework for top-down or bottom-up scheduling.	Andrew Trick	2012-03-14	3	-103/+275
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	New flags: -misched-topdown, -misched-bottomup. They can be used with the default scheduler or with -misched=shuffle. Without either topdown/bottomup flag -misched=shuffle now alternates scheduling direction. LiveIntervals update is unimplemented with bottom-up scheduling, so only -misched-topdown currently works. Capped the ScheduleDAG hierarchy with a concrete ScheduleDAGMI class. ScheduleDAGMI is aware of the top and bottom of the unscheduled zone within the current region. Scheduling policy can be plugged into the ScheduleDAGMI driver by implementing MachineSchedStrategy. ConvergingScheduler is now the default scheduling algorithm. It exercises the new driver but still does no reordering. llvm-svn: 152700
*	misched comments	Andrew Trick	2012-03-14	1	-0/+15
\| \| \| \|	llvm-svn: 152699