determining which bits are demanded by
a comparison against a constant.
llvm-svn: 123203
intrinsics element dependencies. Reviewed by Nick.
llvm-svn: 123161
llvm-svn: 123149
back to life.
llvm-svn: 123146
buildbot stability.
llvm-svn: 123144
without informing memdep. This could cause nondeterministic weirdness
based on where instructions happen to get allocated, and will hopefully
breathe some life into some broken testers.
llvm-svn: 123124
llvm-svn: 123121
llvm-svn: 123117
that have the bit set.
llvm-svn: 123104
updating memdep when fusing stores together. This fixes the crash optimizing
the bullet benchmark.
llvm-svn: 123091
llvm-svn: 123090
larger memsets. Among other things, this fixes rdar://8760394 and
allows us to handle "Example 2" from http://blog.regehr.org/archives/320,
compiling it into a single 4096-byte memset:
_mad_synth_mute: ## @mad_synth_mute
## BB#0: ## %entry
pushq %rax
movl $4096, %esi ## imm = 0x1000
callq ___bzero
popq %rax
ret
llvm-svn: 123089
P and P+1 are relative to the same base pointer.
llvm-svn: 123087
memset into a single larger memset.
llvm-svn: 123086
Split memset formation logic out into its own
"tryMergingIntoMemset" helper function.
llvm-svn: 123081
to be foldable into an uncond branch. When this happens, we can make a
much simpler CFG for the loop, which is important for nested loop cases
where we want the outer loop to be aggressively optimized.
Handle this case more aggressively. For example, previously on
phi-duplicate.ll we would get this:
define void @test(i32 %N, double* %G) nounwind ssp {
entry:
%cmp1 = icmp slt i64 1, 1000
br i1 %cmp1, label %bb.nph, label %for.end
bb.nph: ; preds = %entry
br label %for.body
for.body: ; preds = %bb.nph, %for.cond
%j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
%arrayidx = getelementptr inbounds double* %G, i64 %j.02
%tmp3 = load double* %arrayidx
%sub = sub i64 %j.02, 1
%arrayidx6 = getelementptr inbounds double* %G, i64 %sub
%tmp7 = load double* %arrayidx6
%add = fadd double %tmp3, %tmp7
%arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
store double %add, double* %arrayidx10
%inc = add nsw i64 %j.02, 1
br label %for.cond
for.cond: ; preds = %for.body
%cmp = icmp slt i64 %inc, 1000
br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge
for.cond.for.end_crit_edge: ; preds = %for.cond
br label %for.end
for.end: ; preds = %for.cond.for.end_crit_edge, %entry
ret void
}
Now we get the much nicer:
define void @test(i32 %N, double* %G) nounwind ssp {
entry:
br label %for.body
for.body: ; preds = %entry, %for.body
%j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
%arrayidx = getelementptr inbounds double* %G, i64 %j.01
%tmp3 = load double* %arrayidx
%sub = sub i64 %j.01, 1
%arrayidx6 = getelementptr inbounds double* %G, i64 %sub
%tmp7 = load double* %arrayidx6
%add = fadd double %tmp3, %tmp7
%arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
store double %add, double* %arrayidx10
%inc = add nsw i64 %j.01, 1
%cmp = icmp slt i64 %inc, 1000
br i1 %cmp, label %for.body, label %for.end
for.end: ; preds = %for.body
ret void
}
With all of these recent changes, we are now able to compile:
void foo(char *X) {
for (int i = 0; i != 100; ++i)
for (int j = 0; j != 100; ++j)
X[j+i*100] = 0;
}
into a single memset of 10000 bytes. This series of changes
should also be helpful for other nested loop scenarios as well.
llvm-svn: 123079
moving the OrigHeader block anymore: we just merge it away anyway so
its code layout doesn't matter.
llvm-svn: 123077
that it was leaving in loops after rotation (between the original latch
block and the original header).
With this change, it is possible for rotated loops to have just a single
basic block, which is useful.
llvm-svn: 123075
loop info.
llvm-svn: 123074
llvm-svn: 123073
1. Rip out LoopRotate's domfrontier updating code. It isn't
needed now that LICM doesn't use DF and it is super complex
and gross.
2. Make DomTree updating code a lot simpler and faster. The
old loop over all the blocks was just to find a block??
3. Change the code that inserts the new preheader to just use
SplitCriticalEdge instead of doing an overcomplex
reimplementation of it.
No behavior change, except for the name of the inserted preheader.
llvm-svn: 123072
llvm-svn: 123071
and latch blocks. Reorder entry conditions to make the pass faster
and more logical.
llvm-svn: 123069
llvm-svn: 123068
that are just passed to one function.
llvm-svn: 123067
to violate LCSSA form
llvm-svn: 123066
llvm-svn: 123065
they already do). This removes two dominator recomputations prior to isel,
which is a 1% improvement in total llc time for 403.gcc.
The only potentially suspect thing is making GCStrategy recompute dominators if
it used a custom lowering strategy.
llvm-svn: 123064
top of subloop headers, as the phi uses logically occur outside of the subloop.
llvm-svn: 123062
llvm-svn: 123061
them into the loop preheader, eliminating silly instructions like
"icmp i32 0, 100" in fixed tripcount loops. This also better exposes the
bigger problem with loop rotate that I'd like to fix: once this has been
folded, the duplicated conditional branch *often* turns into an uncond branch.
Not aggressively handling this is pessimizing later loop optimizations
somethin' fierce by making "dominates all exit blocks" checks fail.
llvm-svn: 123060
1. Take a flags argument instead of a bool. This makes
it more clear to the reader what it is used for.
2. Add a flag that says that "remapping a value not in the
map is ok".
3. Reimplement MapValue to share a bunch of code and be a lot
more efficient. For lookup failures, don't drop null values
into the map.
4. Using the new flag a bunch of code can vaporize in LinkModules
and LoopUnswitch, kill it.
No functionality change.
llvm-svn: 123058
map from ValueMapper.h (giving us access to its utilities)
and add a fastpath in the loop rotation code, avoiding expensive
SSA updater manipulation for values with nothing to update.
llvm-svn: 123057
X = sext x; x >s c ? X : C+1 --> X = sext x; X <s C+1 ? C+1 : X
X = sext x; x <s c ? X : C-1 --> X = sext x; X >s C-1 ? C-1 : X
X = zext x; x >u c ? X : C+1 --> X = zext x; X <u C+1 ? C+1 : X
X = zext x; x <u c ? X : C-1 --> X = zext x; X >u C-1 ? C-1 : X
X = sext x; x >u c ? X : C+1 --> X = sext x; X <u C+1 ? C+1 : X
X = sext x; x <u c ? X : C-1 --> X = sext x; X >u C-1 ? C-1 : X
Instead of calculating this with mixed types, promote all to the
larger type. This enables scalar evolution to analyze this
expression. PR8866
llvm-svn: 123034
llvm-svn: 123033
additional notes.
llvm-svn: 123030
llvm-svn: 123025
length are equal.
This happens when we take the (non-constant) length from a malloc.
llvm-svn: 122961
with the size passed to malloc.
llvm-svn: 122959
llvm-svn: 122958
OptimizeInst() so that they can be used on a worklist instruction.
llvm-svn: 122945
llvm-svn: 122944
into a separate function, so that it can be called from a loop using a worklist
rather than a loop traversing a whole basic block.
llvm-svn: 122943
Simplify RALinScan::DowngradeRegister with TRI::getOverlaps while we are there.
llvm-svn: 122940
worklist, the key will need to become std::pair<BasicBlock*, Value*>.
llvm-svn: 122932
llvm-svn: 122891
regressing code quality.
llvm-svn: 122887
llvm-svn: 122876
step is to only process instructions in subloops if they have been modified by
an earlier simplification.
llvm-svn: 122869
skipping them, but it should probably use a worklist and only revisit those
instructions in subloops that have actually changed. It should probably also
use a worklist after the first iteration like instsimplify now does. Regardless,
it's only 0.3% of opt -O2 time on 403.gcc if it replaces the instcombine placed
in the middle of the loop passes.
llvm-svn: 122868