bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert "Re-apply fixed version of 108749, correctly conditionalizing the new	Daniel Dunbar	2010-07-29	1	-15/+5
\| \| \| \| \| \| \|	sections on", this change uncovered a possible linker bug which resulted in the wrong messages getting dispatched. Backing this out while we investigate... llvm-svn: 109817
*	Wire up sema checking for __builtin_arm_usat and __builtin_arm_ssat immediates.	Nate Begeman	2010-07-29	1	-1/+1
\| \| \| \|	llvm-svn: 109814
*	move the last hunk of getCoerceResult into the place	Chris Lattner	2010-07-29	2	-34/+29
\| \| \| \| \| \|	that needs it and remove getCoerceResult. llvm-svn: 109807
*	now that direct and coerce are merged, getCoerceResult gets simpler.	Chris Lattner	2010-07-29	1	-5/+0
\| \| \| \|	llvm-svn: 109805
*	now that GetSSETypeAtOffset handles passing SSE class values as	Chris Lattner	2010-07-29	1	-14/+0
\| \| \| \| \| \|	float, the special case hack in getCoerceResult can go away. llvm-svn: 109804
*	Change the name to something less terrible; suggestion by Doug. No ↵	Argyrios Kyrtzidis	2010-07-29	1	-1/+1
\| \| \| \| \| \|	functionality change. llvm-svn: 109797
*	Implement the clang-side of detection for when to pass as	Chris Lattner	2010-07-29	1	-3/+39
\| \| \| \| \| \| \| \|	<2 x float> instead of double. This works but can't be turned on until I teach codegen to pass <2 x float> as one XMM register instead of two. llvm-svn: 109790
*	Look at me, I can count!	Chris Lattner	2010-07-29	1	-1/+1
\| \| \| \|	llvm-svn: 109786
*	Merge PCHWriterDecl.cpp's isRequiredDecl and ↵	Argyrios Kyrtzidis	2010-07-29	2	-163/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CodeGenModule::MayDeferGeneration into a new function, DeclIsRequiredFunctionOrFileScopedVar. This is essentially a CodeGen predicate that is also needed by the PCH mechanism to determine whether a decl needs to be deserialized during PCH loading for codegen purposes. Since this logic is shared by CodeGen and the PCH mechanism, move it to the ASTContext, thus CodeGenModule's GetLinkageForFunction/GetLinkageForVariable and the GVALinkage enum is moved out of CodeGen. This fixes current (and avoids future) codegen-from-PCH bugs. llvm-svn: 109784
*	fix rdar://8251384, another case where we could access beyond the	Chris Lattner	2010-07-29	1	-21/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	end of a struct. This improves the case when the struct being passed contains 3 floats, either due to a struct or array of 3 things. Before we'd generate this IR for the testcase: define float @bar(double %X.coerce0, double %X.coerce1) nounwind { entry: %X = alloca %struct.foof, align 8 ; <%struct.foof> [#uses=2] %0 = bitcast %struct.foof %X to %1* ; <%1> [#uses=2] %1 = getelementptr %1 %0, i32 0, i32 0 ; <double> [#uses=1] store double %X.coerce0, double %1 %2 = getelementptr %1* %0, i32 0, i32 1 ; <double> [#uses=1] store double %X.coerce1, double %2 %tmp = getelementptr inbounds %struct.foof* %X, i32 0, i32 2 ; <float> [#uses=1] %tmp1 = load float %tmp ; <float> [#uses=1] ret float %tmp1 } which compiled (with optimization) to: _bar: ## @bar ## BB#0: ## %entry movd %xmm1, %rax movd %eax, %xmm0 ret Now we produce: define float @bar(double %X.coerce0, float %X.coerce1) nounwind { entry: %X = alloca %struct.foof, align 8 ; <%struct.foof> [#uses=2] %0 = bitcast %struct.foof %X to %0* ; <%0> [#uses=2] %1 = getelementptr %0 %0, i32 0, i32 0 ; <double> [#uses=1] store double %X.coerce0, double %1 %2 = getelementptr %0* %0, i32 0, i32 1 ; <float> [#uses=1] store float %X.coerce1, float %2 %tmp = getelementptr inbounds %struct.foof* %X, i32 0, i32 2 ; <float> [#uses=1] %tmp1 = load float %tmp ; <float> [#uses=1] ret float %tmp1 } and: _bar: ## @bar ## BB#0: ## %entry movaps %xmm1, %xmm0 ret llvm-svn: 109776
*	start setting up infrastructure for passing multi-floats	Chris Lattner	2010-07-29	1	-15/+33
\| \| \| \| \| \| \|	as <2 x float> instead of as double. The backend isn't ready yet, but infrastructure in the frontend can come up. llvm-svn: 109768
*	rename Get8ByteTypeAtOffset -> GetINTEGERTypeAtOffset to	Chris Lattner	2010-07-29	1	-17/+18
\| \| \| \| \| \| \|	make it clear that this function should only return a type that the codegen will classify the same as an INTEGER type. llvm-svn: 109763
*	handle a case where we could access off the end of a function	Chris Lattner	2010-07-29	1	-9/+6
\| \| \| \| \| \|	that Eli pointed out, rdar://8249586 llvm-svn: 109762
*	fix PR7742 / rdar://8250764, a miscompilation of struct	Chris Lattner	2010-07-29	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \|	return where the struct has a base but no fields. This was because the x86-64 abi logic was checking the wrong predicate in one place. This was introduced in r91874, which was a fix for PR5831, which lacked a CHECK line, so I verified and added it. llvm-svn: 109759
*	This is a little bit far, but optimize cases like:	Chris Lattner	2010-07-29	1	-3/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	struct a { struct c { double x; int y; } x[1]; }; void foo(struct a A) { } into: define void @foo(double %A.coerce0, i32 %A.coerce1) nounwind { entry: %A = alloca %struct.a, align 8 ; <%struct.a> [#uses=1] %0 = bitcast %struct.a %A to %struct.c* ; <%struct.c> [#uses=2] %1 = getelementptr %struct.c %0, i32 0, i32 0 ; <double> [#uses=1] store double %A.coerce0, double %1 %2 = getelementptr %struct.c* %0, i32 0, i32 1 ; <i32> [#uses=1] store i32 %A.coerce1, i32 %2 instead of: define void @foo(double %A.coerce0, i64 %A.coerce1) nounwind { entry: %A = alloca %struct.a, align 8 ; <%struct.a> [#uses=1] %0 = bitcast %struct.a %A to %0* ; <%0> [#uses=2] %1 = getelementptr %0 %0, i32 0, i32 0 ; <double> [#uses=1] store double %A.coerce0, double %1 %2 = getelementptr %0* %0, i32 0, i32 1 ; <i64> [#uses=1] store i64 %A.coerce1, i64 %2 I only do this now because I never want to look at this code again :) llvm-svn: 109738
*	implement a todo: pass a eight-byte that consists of a	Chris Lattner	2010-07-29	1	-6/+92
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	small integer + padding as that small integer. On code like: struct c { double x; int y; }; void bar(struct c C) { } This means that we compile to: define void @bar(double %C.coerce0, i32 %C.coerce1) nounwind { entry: %C = alloca %struct.c, align 8 ; <%struct.c> [#uses=2] %0 = getelementptr %struct.c %C, i32 0, i32 0 ; <double> [#uses=1] store double %C.coerce0, double %0 %1 = getelementptr %struct.c* %C, i32 0, i32 1 ; <i32> [#uses=1] store i32 %C.coerce1, i32 %1 instead of: define void @bar(double %C.coerce0, i64 %C.coerce1) nounwind { entry: %C = alloca %struct.c, align 8 ; <%struct.c> [#uses=3] %0 = bitcast %struct.c %C to %0* ; <%0> [#uses=2] %1 = getelementptr %0 %0, i32 0, i32 0 ; <double> [#uses=1] store double %C.coerce0, double %1 %2 = getelementptr %0* %0, i32 0, i32 1 ; <i64> [#uses=1] store i64 %C.coerce1, i64 %2 which gives SRoA heartburn. This implements rdar://5711709, a nice low number :) llvm-svn: 109737
*	fix a builder, why didn't clang++ catch this?	Chris Lattner	2010-07-29	2	-2/+3
\| \| \| \|	llvm-svn: 109735
*	Kill off the 'coerce' ABI passing form. Now 'direct' and 'extend' always	Chris Lattner	2010-07-29	4	-194/+178
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	have a "coerce to" type which often matches the default lowering of Clang type to LLVM IR type, but the coerce case can be handled by making them not be the same. This simplifies things and fixes issues where X86-64 abi lowering would return coerce after making preferred types exactly match up. This caused us to compile: typedef float v4f32 __attribute__((__vector_size__(16))); v4f32 foo(v4f32 X) { return X+X; } into this code at -O0: define <4 x float> @foo(<4 x float> %X.coerce) nounwind { entry: %retval = alloca <4 x float>, align 16 ; <<4 x float>> [#uses=2] %coerce = alloca <4 x float>, align 16 ; <<4 x float>> [#uses=2] %X.addr = alloca <4 x float>, align 16 ; <<4 x float>> [#uses=3] store <4 x float> %X.coerce, <4 x float> %coerce %X = load <4 x float>* %coerce ; <<4 x float>> [#uses=1] store <4 x float> %X, <4 x float>* %X.addr %tmp = load <4 x float>* %X.addr ; <<4 x float>> [#uses=1] %tmp1 = load <4 x float>* %X.addr ; <<4 x float>> [#uses=1] %add = fadd <4 x float> %tmp, %tmp1 ; <<4 x float>> [#uses=1] store <4 x float> %add, <4 x float>* %retval %0 = load <4 x float>* %retval ; <<4 x float>> [#uses=1] ret <4 x float> %0 } Now we get: define <4 x float> @foo(<4 x float> %X) nounwind { entry: %X.addr = alloca <4 x float>, align 16 ; <<4 x float>> [#uses=3] store <4 x float> %X, <4 x float> %X.addr %tmp = load <4 x float>* %X.addr ; <<4 x float>> [#uses=1] %tmp1 = load <4 x float>* %X.addr ; <<4 x float>> [#uses=1] %add = fadd <4 x float> %tmp, %tmp1 ; <<4 x float>> [#uses=1] ret <4 x float> %add } This implements rdar://8248065 llvm-svn: 109733
*	ignore structs that wrap vectors in IR, the abstraction shouldn't add penalty.	Chris Lattner	2010-07-29	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before we'd compile the example into something like: %coerce.dive2 = getelementptr %struct.v4f32wrapper* %retval, i32 0, i32 0 ; <<4 x float>> [#uses=1] %1 = bitcast <4 x float> %coerce.dive2 to <2 x double>* ; <<2 x double>> [#uses=1] %2 = load <2 x double> %1, align 1 ; <<2 x double>> [#uses=1] ret <2 x double> %2 Now we produce: %coerce.dive2 = getelementptr %struct.v4f32wrapper* %retval, i32 0, i32 0 ; <<4 x float>> [#uses=1] %0 = load <4 x float> %coerce.dive2, align 1 ; <<4 x float>> [#uses=1] ret <4 x float> %0 llvm-svn: 109732
*	move the 'pretty 16-byte vector' inferring code up to be shared	Chris Lattner	2010-07-29	1	-14/+24
\| \| \| \| \| \|	with return values, improving stuff that returns __m128 etc. llvm-svn: 109731
*	simplify code by eliminating a premature optimization.	Chris Lattner	2010-07-29	1	-30/+11
\| \| \| \|	llvm-svn: 109730
*	now that we have CGT around, we can start using preferred types	Chris Lattner	2010-07-29	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for return values too. Instead of compiling something like: struct foo { int X; float Y; }; struct foo test(struct foo P) { return P; } to: %1 = type { i64, i64 } define %1 @test(%struct.foo* %P) nounwind { entry: %retval = alloca %struct.foo, align 8 ; <%struct.foo> [#uses=2] %P.addr = alloca %struct.foo, align 8 ; <%struct.foo*> [#uses=2] store %struct.foo %P, %struct.foo %P.addr %tmp = load %struct.foo %P.addr ; <%struct.foo> [#uses=1] %tmp1 = bitcast %struct.foo %retval to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.foo %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 16, i32 8, i1 false) %0 = bitcast %struct.foo* %retval to %1* ; <%1> [#uses=1] %1 = load %1 %0, align 1 ; <%1> [#uses=1] ret %1 %1 } We now get the result more type safe, with: define %struct.foo @test(%struct.foo* %P) nounwind { entry: %retval = alloca %struct.foo, align 8 ; <%struct.foo> [#uses=2] %P.addr = alloca %struct.foo, align 8 ; <%struct.foo*> [#uses=2] store %struct.foo %P, %struct.foo %P.addr %tmp = load %struct.foo %P.addr ; <%struct.foo> [#uses=1] %tmp1 = bitcast %struct.foo %retval to i8* ; <i8> [#uses=1] %tmp2 = bitcast %struct.foo %tmp to i8* ; <i8> [#uses=1] call void @llvm.memcpy.p0i8.p0i8.i64(i8 %tmp1, i8* %tmp2, i64 16, i32 8, i1 false) %0 = load %struct.foo* %retval ; <%struct.foo> [#uses=1] ret %struct.foo %0 } That memcpy is completely terrible, but I don't know how to fix it. llvm-svn: 109729
*	sink preferred type stuff lower. It's possible that this might	Chris Lattner	2010-07-29	1	-27/+33
\| \| \| \| \| \| \|	improve codegen for vaarg or something, because its codepath is getting preferred types now. llvm-svn: 109728
*	dissolve some more complexity: make the x86-64 abi lowering code	Chris Lattner	2010-07-29	3	-57/+13
\| \| \| \| \| \| \|	compute its own preferred types instead of having CGT compute them then pass them (circuituously) down into ABIInfo. llvm-svn: 109726
*	simplify Get8ByteTypeAtOffset by making it a member of X86_64ABIInfo	Chris Lattner	2010-07-29	1	-24/+20
\| \| \| \|	llvm-svn: 109724
*	now that ABIInfo depends on CGT, it has trivial access to such	Chris Lattner	2010-07-29	3	-216/+160
\| \| \| \| \| \| \|	things as TargetData, ASTContext, LLVMContext etc. Stop passing them through so many APIs. llvm-svn: 109723
*	cave in to reality and make ABIInfo depend on CodeGenTypes.	Chris Lattner	2010-07-29	3	-68/+105
\| \| \| \| \| \|	This will simplify a bunch of code, coming up next. llvm-svn: 109722
*	pass argument vectors in a type that corresponds to the user type if	Chris Lattner	2010-07-28	1	-2/+14
\| \| \| \| \| \| \| \|	possible. This improves the example to pass <4 x float> instead of <2 x double> but we still get awful code, and still don't get the return value right. llvm-svn: 109700
*	tidy up	Chris Lattner	2010-07-28	2	-13/+10
\| \| \| \|	llvm-svn: 109699
*	Override selected builtin names (e.g. "long int" instead of "long") to match ↵	Devang Patel	2010-07-28	1	-3/+13
\| \| \| \| \| \|	names used by gcc in debug info. This makes gdb testsuite happy. llvm-svn: 109694
*	fix some break statements to be formatted more consistently,	Chris Lattner	2010-07-28	1	-16/+7
\| \| \| \| \| \|	remove some now-dead code. llvm-svn: 109690
*	use Get8ByteTypeAtOffset for the return value path as well so we	Chris Lattner	2010-07-28	1	-60/+63
\| \| \| \| \| \|	don't get errors similar to PR7714 on the return path. llvm-svn: 109689
*	refactor the autosizing code, eliminating duplication	Chris Lattner	2010-07-28	1	-58/+53
\| \| \| \| \| \|	and making Get8ByteTypeAtOffset always succeed and documented. llvm-svn: 109685
*	fix PR7714 by not referencing off the end of a struct when passed by value in	Chris Lattner	2010-07-28	1	-7/+39
\| \| \| \| \| \| \|	x86-64 abi. This also improves codegen as well. Some refactoring is needed of this code. llvm-svn: 109681
*	Fix flags in global block descriptor when	Fariborz Jahanian	2010-07-28	1	-15/+22
\| \| \| \| \| \| \|	block returns structs. Fies radar 8241648. Executable test added to llvm test suite. llvm-svn: 109620
*	some cleanups and get alignments correct for various coerce cases.	Chris Lattner	2010-07-28	1	-9/+16
\| \| \| \|	llvm-svn: 109607
*	we are not supposed to create an improper callsite using a CallInstr; leave ↵	Gabor Greif	2010-07-28	1	-0/+1
\| \| \| \| \| \|	a fixme mentioning the simplification when CallSite can clone itself llvm-svn: 109575
*	construct debug info for "id" by hand.	Devang Patel	2010-07-28	1	-3/+35
\| \| \| \| \| \|	Tested by mi1-var-obj.exp in gdb testsuite. llvm-svn: 109571
*	When creating a jump destination, its scope should be the scope of the	John McCall	2010-07-28	2	-1/+6
\| \| \| \| \| \| \| \| \|	enclosing normal cleanup, not the top of the EH stack. I'm really surprised this hasn't been causing more problems. Fixes rdar://problem/8231514. llvm-svn: 109569
*	Revert r109546, it broke linux build.	Argyrios Kyrtzidis	2010-07-27	1	-60/+38
\| \| \| \|	llvm-svn: 109550
*	Merge PCHWriterDecl.cpp's isRequiredDecl and ↵	Argyrios Kyrtzidis	2010-07-27	1	-38/+60
\| \| \| \| \| \| \| \| \| \| \| \|	CodeGenModule::MayDeferGeneration into a new function, DeclIsRequiredFunctionOrFileScopedVar. This function is part of the public CodeGen interface since it's essentially a CodeGen predicate that is also needed by the PCH mechanism to determine whether a decl needs to be deserialized during PCH loading for codegen purposes. This fixes current (and avoids future) codegen-from-PCH bugs. llvm-svn: 109546
*	Always use current working directory for DW_AT_comp_dir.	Devang Patel	2010-07-27	2	-14/+1
\| \| \| \|	llvm-svn: 109535
*	Reapply 109303.	Devang Patel	2010-07-27	2	-14/+40
\| \| \| \|	llvm-svn: 109507
*	2nd argument of __builtin_expect must be evaluated	Fariborz Jahanian	2010-07-26	1	-1/+4
\| \| \| \| \| \| \|	if it hs side-effect to matchgcc's behaviour. Addresses radar 8172109. llvm-svn: 109467
*	Test for the presence of EH branch-throughs instead of normal branch-throughs.	John McCall	2010-07-26	1	-1/+1
\| \| \| \| \| \|	I knew this code duplication would bite me. llvm-svn: 109463
*	Revert 109303.	Devang Patel	2010-07-26	2	-40/+14
\| \| \| \|	llvm-svn: 109426
*	Remove the vast majority of the Destroy methods from the AST library,	Douglas Gregor	2010-07-25	2	-11/+0
\| \| \| \| \| \|	since we aren't going to be calling them ever. llvm-svn: 109377
*	Mangle enum constant expressions. Fixes rdar://problem/8204122	John McCall	2010-07-24	1	-0/+6
\| \| \| \|	llvm-svn: 109315
*	Untangle filename/dirname confusion. Store constructed strings on the side. ↵	Devang Patel	2010-07-24	2	-14/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Avoid use of Path.makeAbsolute(). DW_TAG_compile_unit uses two attributes DW_AT_name and DW_AT_comp_dir. Their expected values are: $ clang foo.c -g DW_AT_name - foo.c DW_AT_comp_dir - `pwd` $ clang one/two/foo.c -g DW_AT_name - one/two/foo.c DW_AT_comp_dir - `pwd` $ clang /tmp/one/foo.c -g DW_AT_name - /tmp/one/foo.c DW_AT_comp_dir - empty llvm-svn: 109303
*	Support catching Objective C pointers in C++ under the non-fragile NeXT runtime.	John McCall	2010-07-24	4	-24/+56
\| \| \| \| \| \|	Diagnose attempts to do this under the GNU or fragile NeXT runtimes. llvm-svn: 109298