bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	Materialize GA addresses with movw + movt pairs for Darwin in PIC mode. e.g.	Evan Cheng	2011-01-17	15	-81/+244
	    movw    r0, :lower16:(L_foo$non_lazy_ptr-(LPC0_0+4))
	    movt    r0, :upper16:(L_foo$non_lazy_ptr-(LPC0_0+4))
	LPC0_0:
	    add     r0, pc, r0

	It's not yet enabled by default as some tests are failing. I suspect bugs in downstream tools.
	llvm-svn: 123619
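As a rough illustration of what the :lower16:/:upper16: pair in the message above computes, the sketch below splits a 32-bit PC-relative offset into two 16-bit immediates and recombines them the way movw followed by movt would. The function names and sample addresses are hypothetical; the default bias of 4 simply mirrors the LPC0_0+4 term in the commit message. This is not LLVM code.

```python
MASK32 = 0xFFFFFFFF

def split_halves(value):
    """Return (lower16, upper16) of a 32-bit value."""
    value &= MASK32
    return value & 0xFFFF, value >> 16

def movw(imm16):
    """movw writes the low half of the register and zeroes the top half."""
    return imm16 & 0xFFFF

def movt(reg, imm16):
    """movt replaces the top half of the register and keeps the low half."""
    return (reg & 0xFFFF) | ((imm16 & 0xFFFF) << 16)

def materialize(target_addr, pc_label_addr, pc_bias=4):
    """Model the movw/movt/add sequence for a PC-relative address.

    pc_bias is an assumption taken from the '+4' in the commit message.
    """
    offset = (target_addr - (pc_label_addr + pc_bias)) & MASK32
    lo, hi = split_halves(offset)
    r0 = movw(lo)
    r0 = movt(r0, hi)
    pc = pc_label_addr + pc_bias  # value the 'add' reads from pc
    return (pc + r0) & MASK32     # add r0, pc, r0
```

Because the offset is taken modulo 2^32, the same sequence works whether the target lies before or after the label.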
*	Roll out r123609 due to failures on the llvm-x86_64-linux-checks bot.	Cameron Zwarich	2011-01-17	3	-121/+60
	llvm-svn: 123618
*	Eliminate the use of dominance frontiers in PromoteMemToReg. In addition to	Cameron Zwarich	2011-01-17	3	-60/+121
	eliminating a potentially quadratic data structure, this also gives a 17% speedup when running -scalarrepl on test-suite + SPEC2000 + SPEC2006. My initial experiment gave a greater speedup, around 25%, but I moved the dominator tree level computation from dominator tree construction to PromoteMemToReg.

	Since this approach to computing IDFs has a much lower overhead than the old code using precomputed DFs, it is worth looking at using this new code for the second scalarrepl pass as well.
	llvm-svn: 123609
*	UnRevert "Revert "Archive: Replace all internal uses of PathV1 with PathV2. ↵	Michael J. Spencer	2011-01-16	1	-36/+36
	The external API still uses PathV1.""
	llvm-svn: 123605
*	Fix rename.	Michael J. Spencer	2011-01-16	1	-2/+11
	llvm-svn: 123604
*	Provide instruction sizes for ARMv5 variants of MUL instructions.	Anton Korobeynikov	2011-01-16	1	-29/+30
	This fixes PR8987.
	llvm-svn: 123598
*	Update README.txt to remove the DAE enhancement.	Anders Carlsson	2011-01-16	1	-23/+0
	llvm-svn: 123597
*	Teach DAE to look for functions whose arguments are unused, and change all ↵	Anders Carlsson	2011-01-16	1	-1/+61
	callers to pass in an undefvalue instead.
	llvm-svn: 123596
*	UnRevert "Revert the archive part of "Support/PathV2: Add identify_magic.""	Michael J. Spencer	2011-01-16	2	-6/+7
	This reverts commit dd103021a889a986a181ce36ed7b0e8dc9b645e1.
	llvm-svn: 123595
*	Revert the archive part of "Support/PathV2: Add identify_magic."	Michael J. Spencer	2011-01-16	2	-7/+6
	llvm-svn: 123593
*	tidy up a comment, as suggested by duncan	Chris Lattner	2011-01-16	1	-2/+2
	llvm-svn: 123590
*	Only put unnamed_addr constants in mergeable sections. Fixes PR8297.	Rafael Espindola	2011-01-16	1	-1/+1
	llvm-svn: 123585
*	Don't merge two constants if we care about the address of both.	Rafael Espindola	2011-01-16	1	-22/+38
	This fixes the original testcase in PR8927. It also causes a clang binary built with a patched clang to increase in size by 0.21%. We can probably get some of the size back by writing a pass that detects that a global never has its pointer compared and adds unnamed_addr to it (maybe extend global opt). It is also possible that there are some other cases clang could add unnamed_addr to. I will investigate extending globalopt next.
	llvm-svn: 123584
*	Simplify the construction and destruction of Uses. Simplify	Jay Foad	2011-01-16	2	-24/+15
	User::dropHungOffUses().
	llvm-svn: 123580
*	fix PR8514, a bug where the "heroic" transformation of shift/and	Chris Lattner	2011-01-16	1	-13/+9
	into and/shift would cause nodes to move around and a dangling pointer to happen. The code tried to avoid this with a HandleSDNode, but got the details wrong.
	llvm-svn: 123578
*	Move the implementation of the User class into a new source file,	Jay Foad	2011-01-16	4	-83/+89
	User.cpp.
	llvm-svn: 123575
*	fix PR8932, a case where arg promotion could infinitely promote.	Chris Lattner	2011-01-16	1	-24/+51
	llvm-svn: 123574
*	simplify a little	Chris Lattner	2011-01-16	1	-7/+3
	llvm-svn: 123573
*	add some commentary	Chris Lattner	2011-01-16	1	-1/+14
	llvm-svn: 123572
*	if an alloca is only ever accessed as a unit, and is accessed with ↵	Chris Lattner	2011-01-16	1	-3/+33
	load/store instructions, then don't try to decimate it into its individual pieces. This will just make a mess of the IR and is pointless if none of the elements are individually accessed. This was generating really terrible code for std::bitset (PR8980) because it happens to be lowered by clang as an {[8 x i8]} structure instead of {i64}.

	The testcase now is optimized to:

	define i64 @test2(i64 %X) {
	  br label %L2

	L2:                                               ; preds = %0
	  ret i64 %X
	}

	before we generated:

	define i64 @test2(i64 %X) {
	  %sroa.store.elt = lshr i64 %X, 56
	  %1 = trunc i64 %sroa.store.elt to i8
	  %sroa.store.elt8 = lshr i64 %X, 48
	  %2 = trunc i64 %sroa.store.elt8 to i8
	  %sroa.store.elt9 = lshr i64 %X, 40
	  %3 = trunc i64 %sroa.store.elt9 to i8
	  %sroa.store.elt10 = lshr i64 %X, 32
	  %4 = trunc i64 %sroa.store.elt10 to i8
	  %sroa.store.elt11 = lshr i64 %X, 24
	  %5 = trunc i64 %sroa.store.elt11 to i8
	  %sroa.store.elt12 = lshr i64 %X, 16
	  %6 = trunc i64 %sroa.store.elt12 to i8
	  %sroa.store.elt13 = lshr i64 %X, 8
	  %7 = trunc i64 %sroa.store.elt13 to i8
	  %8 = trunc i64 %X to i8
	  br label %L2

	L2:                                               ; preds = %0
	  %9 = zext i8 %1 to i64
	  %10 = shl i64 %9, 56
	  %11 = zext i8 %2 to i64
	  %12 = shl i64 %11, 48
	  %13 = or i64 %12, %10
	  %14 = zext i8 %3 to i64
	  %15 = shl i64 %14, 40
	  %16 = or i64 %15, %13
	  %17 = zext i8 %4 to i64
	  %18 = shl i64 %17, 32
	  %19 = or i64 %18, %16
	  %20 = zext i8 %5 to i64
	  %21 = shl i64 %20, 24
	  %22 = or i64 %21, %19
	  %23 = zext i8 %6 to i64
	  %24 = shl i64 %23, 16
	  %25 = or i64 %24, %22
	  %26 = zext i8 %7 to i64
	  %27 = shl i64 %26, 8
	  %28 = or i64 %27, %25
	  %29 = zext i8 %8 to i64
	  %30 = or i64 %29, %28
	  ret i64 %30
	}

	In this case, instcombine was able to eliminate the nonsense, but in PR8980 enough PHIs are in play that instcombine backs off. It's better to not generate this stuff in the first place.
	llvm-svn: 123571
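To see why the longer IR quoted in the message above is pure overhead, the following sketch models it on plain integers: the first function mirrors the lshr/trunc chain that splits a 64-bit value into eight bytes, and the second mirrors the zext/shl/or chain that glues them back together. The function names are hypothetical and this is only an arithmetic model, not the scalarrepl pass itself.

```python
SHIFTS = range(56, -8, -8)  # 56, 48, ..., 8, 0, as in the IR

def split_into_bytes(x):
    """Mirror of the lshr + trunc sequence: one i8 per shift amount."""
    return [(x >> shift) & 0xFF for shift in SHIFTS]

def reassemble(parts):
    """Mirror of the zext + shl + or sequence in block L2."""
    result = 0
    for shift, byte in zip(SHIFTS, parts):
        result |= byte << shift
    return result
```

The round trip returns its input for any 64-bit value, which is the identity the optimized `ret i64 %X` version expresses directly.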
*	Use an irbuilder to get some trivial constant folding when doing a store	Chris Lattner	2011-01-16	1	-21/+17
	of a constant.
	llvm-svn: 123570
*	remove a dead check, this was needed before we had an explicit veto on uses ↵	Chris Lattner	2011-01-16	1	-5/+0
	of phis.
	llvm-svn: 123569
*	enhance FoldOpIntoPhi in instcombine to try harder when a phi has	Chris Lattner	2011-01-16	2	-3/+20
	multiple uses. In some cases, all the uses are the same operation, so instcombine can go ahead and promote the phi. In the testcase this pushes an add out of the loop.
	llvm-svn: 123568
*	Spill R4 if it's going to be used to restore SP from FP.	Evan Cheng	2011-01-16	1	-4/+12
	llvm-svn: 123567
*	remove the AllowAggressive argument to FoldOpIntoPhi. It is forced to false ↵	Chris Lattner	2011-01-16	3	-14/+6
	in the first line of the function because it isn't a good idea, even for compares.
	llvm-svn: 123566
*	more cleanups: use the IR builder.	Chris Lattner	2011-01-16	1	-38/+39
	llvm-svn: 123565
*	tidy up code.	Chris Lattner	2011-01-16	1	-16/+20
	llvm-svn: 123564
*	Improve the safety of my globalopt enhancement by ensuring that the bitcast	Owen Anderson	2011-01-16	1	-12/+22
	of the stored value to the new store type is always. Also, add a testcase.
	llvm-svn: 123563
*	fix PR8983, a broken assertion.	Chris Lattner	2011-01-16	1	-1/+1
	llvm-svn: 123562
*	Implement AnalyzeBranch in Sparc Backend.	Venkatraman Govindaraju	2011-01-16	2	-7/+199
	llvm-svn: 123561
*	fix PR8981, a crash trying to form a conditional inc with a floating point ↵	Chris Lattner	2011-01-16	1	-1/+2
	compare.
	llvm-svn: 123560
*	reapply my fix for PR8961 with a tweak to properly handle	Chris Lattner	2011-01-16	2	-7/+13
	multi-instruction sequences like calls. Many thanks to Jakob for finding a testcase.
	llvm-svn: 123559
*	simplify this code, it is still broken but will follow up on llvm-commits.	Chris Lattner	2011-01-16	1	-15/+5
	llvm-svn: 123558
*	Revert "Archive: Replace all internal uses of PathV1 with PathV2. The ↵	Michael J. Spencer	2011-01-16	1	-36/+36
	external API still uses PathV1."
	llvm-svn: 123557
*	Simplify a README.txt entry significantly to expose the core issue.	Chandler Carruth	2011-01-16	1	-28/+26
	llvm-svn: 123556
*	remove the partial specialization pass. It is unmaintained and has bugs.	Chris Lattner	2011-01-16	3	-230/+0
	llvm-svn: 123554
*	Archive: Replace all internal uses of PathV1 with PathV2. The external API ↵	Michael J. Spencer	2011-01-15	1	-36/+36
	still uses PathV1.
	llvm-svn: 123551
*	Add an assert so we don't silently miscompile ctpop for bit widths > 128.	Benjamin Kramer	2011-01-15	1	-0/+4
	llvm-svn: 123549
*	Support/PathV2: Add identify_magic.	Michael J. Spencer	2011-01-15	4	-40/+40
	llvm-svn: 123548
*	Reimplement CTPOP legalization with the "best" algorithm from	Benjamin Kramer	2011-01-15	1	-18/+45
	http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel

	In a silly microbenchmark on a 65 nm core2 this is 1.5x faster than the old code in 32 bit mode and about 2x faster in 64 bit mode. It's also a lot shorter, especially when counting 64 bit population on a 32 bit target.

	I hope this is fast enough to replace Kernighan-style counting loops even when the input is rather sparse.
	llvm-svn: 123547
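For readers unfamiliar with the two techniques contrasted in the message above, here is a minimal sketch of the parallel bit count described on the linked bithacks page next to a Kernighan-style loop. The function names and the `bits` parameter are assumptions made for illustration; this is not the LLVM legalization code, only the underlying arithmetic.

```python
def popcount_parallel(v, bits=32):
    """Branch-free population count for a `bits`-wide unsigned value."""
    if bits not in (8, 16, 32, 64, 128):
        # Per-byte sums are collected in one byte, so wider values would overflow.
        raise ValueError("bits must be 8, 16, 32, 64 or 128")
    mask = (1 << bits) - 1
    v &= mask
    m1 = mask // 3     # 0x5555...: every other bit
    m2 = mask // 5     # 0x3333...: pairs of bits
    m4 = mask // 17    # 0x0f0f...: low nibble of each byte
    h01 = mask // 255  # 0x0101...: one in each byte
    v = v - ((v >> 1) & m1)           # 2-bit sums
    v = (v & m2) + ((v >> 2) & m2)    # 4-bit sums
    v = (v + (v >> 4)) & m4           # 8-bit sums
    # Multiplying by h01 accumulates all byte sums into the top byte.
    return ((v * h01) & mask) >> (bits - 8)

def popcount_kernighan(v):
    """Loop that clears the lowest set bit once per iteration."""
    count = 0
    while v:
        v &= v - 1
        count += 1
    return count
```

The loop runs once per set bit, so it is cheap for sparse inputs, while the parallel version does a fixed amount of work regardless of the input.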
*	Support/PathV2: Implement has_magic in terms of get_magic.	Michael J. Spencer	2011-01-15	1	-26/+8
	llvm-svn: 123545
*	Support/PathV2: Implement get_magic.	Michael J. Spencer	2011-01-15	3	-0/+74
	llvm-svn: 123544
*	Add missing whitespace.	Nick Lewycky	2011-01-15	1	-2/+2
	llvm-svn: 123543
*	Make constmerge a two-pass algorithm so that it won't miss merging	Nick Lewycky	2011-01-15	1	-4/+34
	opportunities. Fixes PR8978.
	llvm-svn: 123541
*	Try to unbreak selfhost.	Benjamin Kramer	2011-01-15	1	-0/+1
	llvm-svn: 123537
*	Add a cache that protects mergefunc's internals from more surprises in DenseSet.	Nick Lewycky	2011-01-15	1	-5/+27
	Also, replace tabs with spaces. Yes, it's 2011.
	llvm-svn: 123535
*	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves	Nick Lewycky	2011-01-15	1	-5/+27
	half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block.

	Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test.
	llvm-svn: 123533
*	Allow unnamed_addr on declarations.	Rafael Espindola	2011-01-15	3	-12/+7
	llvm-svn: 123529
*	temporarily revert r123526. While working on a follow-on patch I	Chris Lattner	2011-01-15	1	-3/+0
	realize that ConstantFoldTerminator doesn't preserve dominfo.
	llvm-svn: 123527
*	fix rdar://8785296 - -fcatch-undefined-behavior generates inefficient code	Chris Lattner	2011-01-15	1	-0/+3
	The basic issue is that isel (very reasonably!) expects conditional branches to be folded, so CGP leaving around a bunch of dead computation feeding conditional branches isn't such a good idea. Just fold branches on constants into unconditional branches.
	llvm-svn: 123526