|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| ... |  | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | - BlockDeclRefExprs always store VarDecls
  - BDREs no longer store copy expressions
  - BlockDecls now store a list of captured variables, information about
    how they're captured, and a copy expression if necessary
    
With that in hand, change IR generation to use the captures data in       
blocks instead of walking the block independently.        
Additionally, optimize block layout by emitting fields in descending
alignment order, with a heuristic for filling in words when alignment
of the end of the block header is insufficient for the most aligned
field.
llvm-svn: 125005 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | fixing a crash which probably nobody was ever going to see.  In doing so,
fix a horrendous number of problems with the conditional-cleanups code.
Also, make conditional cleanups re-use the cleanup's activation variable,
which avoids some unfortunate repetitiveness.
llvm-svn: 124481 | 
| | 
| 
| 
| 
| 
| 
| | referenced in the block-literal initializer
of that variable. // rdar://8893785
llvm-svn: 124332 | 
| | 
| 
| 
| 
| 
| | ASTContext::toCharUnitsFromBits() when converting from bit sizes to char units.
llvm-svn: 123720 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | process, perform a number of refactorings:
- Move MiscNameMangler member functions to MangleContext
- Remove GlobalDecl dependency from MangleContext
- Make MangleContext abstract and move Itanium/Microsoft functionality
  to their own classes/files
- Implement ASTContext::createMangleContext and have CodeGen use it
No (intended) functionality change.
llvm-svn: 123386 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | in asm statements:
register int foo asm("rdi");
asm("..." : ... "r" (foo) ...
We also only accept these variables if the constraint in the asm statement is "r".
This fixes most of PR3933.
llvm-svn: 122643 | 
| | 
| 
| 
| | llvm-svn: 122640 | 
| | 
| 
| 
| | llvm-svn: 122634 | 
| | 
| 
| 
| 
| 
| 
| 
| | objects imported into blocks. //rdar://8594790.
Will have a test case coming (as well as one
sent to llvm test suite).
llvm-svn: 120713 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | when an initializer is variable (I handled the constant case in a previous
patch).  This has three pieces:
1. Enhance AggValueSlot to have a 'isZeroed' bit to tell CGExprAgg that
   the memory being stored into has previously been memset to zero.
2. Teach CGExprAgg to not emit stores of zero to isZeroed memory.
3. Teach CodeGenFunction::EmitAggExpr to scan initializers to determine
   whether they are profitable to emit a memset + inividual stores vs
   stores for everything.
The heuristic used is that a global has to be more than 16 bytes and
has to be 3/4 zero to be candidate for this xform.  The two testcases
are illustrative of the scenarios this catches.  We now codegen test9 into:
 call void @llvm.memset.p0i8.i64(i8* %0, i8 0, i64 400, i32 4, i1 false)
 %.array = getelementptr inbounds [100 x i32]* %Arr, i32 0, i32 0
 %tmp = load i32* %X.addr, align 4
 store i32 %tmp, i32* %.array
and test10 into:
  call void @llvm.memset.p0i8.i64(i8* %0, i8 0, i64 392, i32 8, i1 false)
  %tmp = getelementptr inbounds %struct.b* %S, i32 0, i32 0
  %tmp1 = getelementptr inbounds %struct.a* %tmp, i32 0, i32 0
  %tmp2 = load i32* %X.addr, align 4
  store i32 %tmp2, i32* %tmp1, align 4
  %tmp5 = getelementptr inbounds %struct.b* %S, i32 0, i32 3
  %tmp10 = getelementptr inbounds %struct.a* %tmp5, i32 0, i32 4
  %tmp11 = load i32* %X.addr, align 4
  store i32 %tmp11, i32* %tmp10, align 4
Previously we produced 99 stores of zero for test9 and also tons for test10.
This xforms should substantially speed up -O0 builds when it kicks in as well
as reducing code size and optimizer heartburn on insane cases.  This resolves
PR279.
llvm-svn: 120692 | 
| | 
| 
| 
| | llvm-svn: 120688 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | a global is larger than 32 bytes and has fewer than 6 non-zero values in the
initializer.  Previously we'd turn something like this:
char test8(int X) {
  char str[10000] = "abc";
into a 10K global variable which we then memcpy'd from.  Now we generate:
  %str = alloca [10000 x i8], align 16
  %tmp = getelementptr inbounds [10000 x i8]* %str, i64 0, i64 0
  call void @llvm.memset.p0i8.i64(i8* %tmp, i8 0, i64 10000, i32 16, i1 false)
  store i8 97, i8* %tmp, align 16
  %0 = getelementptr [10000 x i8]* %str, i64 0, i64 1
  store i8 98, i8* %0, align 1
  %1 = getelementptr [10000 x i8]* %str, i64 0, i64 2
  store i8 99, i8* %1, align 2
Which is much smaller in space and also likely faster.
This is part of PR279
llvm-svn: 120645 | 
| | 
| 
| 
| 
| 
| 
| 
| | http://llvm.org/bugs/show_bug.cgi?id=279#c3 better.
No functionality change yet.
llvm-svn: 120530 | 
| | 
| 
| 
| 
| 
| | using ConstantInt::get.
llvm-svn: 120527 | 
| | 
| 
| 
| 
| 
| 
| | literal declaration caused crash in CodeGen.
This patch fixes it. pr8707
llvm-svn: 120486 | 
| | 
| 
| 
| | llvm-svn: 119924 | 
| | 
| 
| 
| 
| 
| | about a __block cxx object.
llvm-svn: 119411 | 
| | 
| 
| 
| | llvm-svn: 119174 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | data members by delaying the emission of the initializer until after
linkage and visibility have been set on the global.  Also, don't
emit a guard unless the variable actually ends up with vague linkage,
and don't use thread-safe statics in any case.
llvm-svn: 118336 | 
| | 
| 
| 
| 
| 
| | visibility of their function.
llvm-svn: 118065 | 
| | 
| 
| 
| 
| 
| | ambiguous context.
llvm-svn: 116567 | 
| | 
| 
| 
| 
| 
| | deal with the consequences.  Fixes rdar://problem/8224178.
llvm-svn: 115816 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | slot.  The easiest way to do that was to bundle up the information
we care about for aggregate slots into a new structure which demands
that its creators at least consider the question.
I could probably be convinced that the ObjC 'needs GC' bit should
be rolled into this structure.
Implement generalized copy elision.  The main obstacle here is that
IR-generation must be much more careful about making sure that exactly
llvm-svn: 113962 | 
| | 
| 
| 
| 
| 
| | Itanium guards and use a slightly different compiled-in API.
llvm-svn: 113330 | 
| | 
| 
| 
| 
| 
| 
| 
| | block-literal initializer expression causes IRgen to crash.
This patch fixes by saving it in StaticLocalDecl map
already used for such purposes. (radar 8390455).
llvm-svn: 113307 | 
| | 
| 
| 
| 
| 
| 
| | value is used. This matches with non-block variable
use of bool type. (Fixes radar 8390062).
llvm-svn: 113027 | 
| | 
| 
| 
| 
| 
| 
| | initializing;  it at best matches the element type of the variable
it's initializing.  Fixes PR8073.
llvm-svn: 112992 | 
| | 
| 
| 
| 
| 
| | projects.
llvm-svn: 112922 | 
| | 
| 
| 
| 
| 
| | a 'bool' byref variable in memory. Fixes radar 8382559.
llvm-svn: 112835 | 
| | 
| 
| 
| 
| 
| | This lets us remove Sema.h's dependency on Expr.h and Decl.h.
llvm-svn: 112156 | 
| | 
| 
| 
| 
| 
| 
| 
| | update callers as best I can.
 - This is a work in progress, our alignment handling is very horrible / sketchy -- I am just aiming for monotonic improvement.
 - Serious review appreciated.
llvm-svn: 111707 | 
| | 
| 
| 
| | llvm-svn: 111705 | 
| | 
| 
| 
| 
| 
| | initializations now.
llvm-svn: 110063 | 
| | 
| 
| 
| 
| 
| | failures.  There's a radar tracking this.
llvm-svn: 109170 | 
| | 
| 
| 
| 
| 
| | commits.
llvm-svn: 109000 | 
| | 
| 
| 
| 
| 
| 
| | the block-release unwind cleanup:  we're never going to test it if we don't turn
it on.
llvm-svn: 108992 | 
| | 
| 
| 
| 
| 
| 
| | the order they appeared in the translation unit.  If they get emitted, put them
in their proper order.  Fixes rdar://problem/7458115
llvm-svn: 108477 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | mostly in avoiding unnecessary work at compile time but also in producing more
sensible block orderings.
Move the destructor cleanups for local variables over to use lazy cleanups.
Eventually all cleanups will do this;  for now we have some awkward code
duplication.
Tell IR generation just to never produce landing pads in -fno-exceptions.
This is a much more comprehensive solution to a problem which previously was
half-solved by checks in most cleanup-generation spots.
llvm-svn: 108270 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | self-host.  Hopefully these results hold up on different platforms.  
I tried to keep the GNU ObjC runtime happy, but it's hard for me to test.
Reimplement how clang generates IR for exceptions.  Instead of creating new
invoke destinations which sequentially chain to the previous destination,
push a more semantic representation of *why* we need the cleanup/catch/filter
behavior, then collect that information into a single landing pad upon request.
Also reorganizes how normal cleanups (i.e. cleanups triggered by non-exceptional
control flow) are generated, since it's actually fairly closely tied in with
the former.  Remove the need to track which cleanup scope a block is associated
with.
Document a lot of previously poorly-understood (by me, at least) behavior.
The new framework implements the Horrible Hack (tm), which requires every
landing pad to have a catch-all so that inlining will work.  Clang no longer
requires the Horrible Hack just to make exceptions flow correctly within
a function, however.  The HH is an unfortunate requirement of LLVM's EH IR.
llvm-svn: 107631 | 
| | 
| 
| 
| 
| 
| | CXXRecordDecl::getDestructor(); no functionality change.
llvm-svn: 107394 | 
| | 
| 
| 
| 
| 
| 
| 
| | have CGF create and make accessible standard int32,int64 and 
intptr types.  This fixes a ton of 80 column violations 
introduced by LLVMContextification and cleans up stuff a lot.
llvm-svn: 106977 | 
| | 
| 
| 
| | llvm-svn: 106949 | 
| | 
| 
| 
| | llvm-svn: 106549 | 
| | 
| 
| 
| 
| 
| | dependency edge was reversed such that CodeGen depends on Frontend.
llvm-svn: 106065 | 
| | 
| 
| 
| | llvm-svn: 105525 | 
| | 
| 
| 
| 
| 
| 
| 
| | The macros required for DeclNodes use have changed to match the use of
StmtNodes. The FooFirst enumerator constants have been named firstFoo
to match usage elsewhere.
llvm-svn: 105165 | 
| | 
| 
| 
| | llvm-svn: 104916 | 
| | 
| 
| 
| 
| 
| | variable in a local function. Fixes pr7101.
llvm-svn: 104743 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | variables should have that linkage.  Otherwise, its static local
variables should have internal linkage.  To avoid computing this excessively,
set a function's linkage before we emit code for it.
Previously we were assigning weak linkage to the static variables of
static inline functions in C++, with predictably terrible results.  This
fixes that and also gives better linkage than 'weak' when merging is required.
llvm-svn: 104581 | 
| | 
| 
| 
| 
| 
| 
| | function does not return. Thanks to Eli for pointing out this corner
case.
llvm-svn: 103941 |