|  | Commit message (Collapse) | Author | Age | Files | Lines | 
|---|
| ... |  | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | conversion open the door for many nasty implicit conversion issues, and
can be easily solved by initializing with (V.begin(), V.end()) when 
needed.
This patch includes many small cleanups for sdisel also.
llvm-svn: 50340 | 
| | 
| 
| 
| 
| 
| | heap thrash on tiny (usually single-element) vectors.
llvm-svn: 50335 | 
| | 
| 
| 
| | llvm-svn: 50330 | 
| | 
| 
| 
| | llvm-svn: 50329 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | When choosing between constraints with multiple options,
like "ir", test to see if we can use the 'i' constraint and
go with that if possible.  This produces more optimal ASM in
all cases (sparing a register and an instruction to load it),
and fixes inline asm like this:
void test () {
  asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14));
}
Previously we would dump "42" into a memory location (which
is ok for the 'm' constraint) which would cause a problem
because the 'c' modifier is not valid on memory operands.
Isn't it great how inline asm turns 'missed optimization'
into 'compile failed'??
Incidentally, this was the todo in 
PowerPC/2007-04-24-InlineAsm-I-Modifier.ll
Please do NOT pull this into Tak.
llvm-svn: 50315 | 
| | 
| 
| 
| | llvm-svn: 50314 | 
| | 
| 
| 
| | llvm-svn: 50313 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | to the block that defines their operands. This doesn't work in the
case that the operand is an invoke, because invoke is a terminator
and must be the last instruction in a block.
Replace it with support in SelectionDAGISel for copying struct values
into sequences of virtual registers.
llvm-svn: 50279 | 
| | 
| 
| 
| | llvm-svn: 50181 | 
| | 
| 
| 
| | llvm-svn: 50180 | 
| | 
| 
| 
| | llvm-svn: 49715 | 
| | 
| 
| 
| | llvm-svn: 49652 | 
| | 
| 
| 
| 
| 
| | parts. Fixes PR1643
llvm-svn: 49611 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | on any current target and aren't optimized in DAGCombiner. Instead
of using intermediate nodes, expand the operations, choosing between
simple loads/stores, target-specific code, and library calls,
immediately.
Previously, the code to emit optimized code for these operations
was only used at initial SelectionDAG construction time; now it is
used at all times. This fixes some cases where rep;movs was being
used for small copies where simple loads/stores would be better.
This also cleans up code that checks for alignments less than 4;
let the targets make that decision instead of doing it in
target-independent code. This allows x86 to use rep;movs in
low-alignment cases.
Also, this fixes a bug that resulted in the use of rep;stos for
memsets of 0 with non-constant memory size when the alignment was
at least 4. It's better to use the library in this case, which
can be significantly faster when the size is large.
This also preserves more SourceValue information when memory
intrinsics are lowered into simple loads/stores.
llvm-svn: 49572 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | before an invoke.  Failure to do this causes references in
the landing pad to variables that were not set.  Fixes
g++.dg/eh/delayslot1.C
g++.dg/eh/fp-regs.C
g++.old-deja/g++.brendan/eh1.C
llvm-svn: 49243 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | review feedback.
-enable-eh is still accepted but doesn't do anything.
EH intrinsics use Dwarf EH if the target supports that,
and are handled by LowerInvoke otherwise.
The separation of the EH table and frame move data is,
I think, logically figured out, but either one still
causes full EH info to be generated (not sure how to
split the metadata correctly).
MachineModuleInfo::needsFrameInfo is no longer used and
is removed.
llvm-svn: 49064 | 
| | 
| 
| 
| | llvm-svn: 49046 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | not marked nounwind, or for all functions when -enable-eh
is set, provided the target supports Dwarf EH.
llvm-gcc generates nounwind in the right places; other FEs
will need to do so also.  Given such a FE, -enable-eh should
no longer be needed.
llvm-svn: 49006 | 
| | 
| 
| 
| 
| 
| | patch by David Chisnall.
llvm-svn: 48963 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | nodes. This doesn't currently have much impact the generated code, but it
does produce simpler-looking SelectionDAGs, and consequently
simpler-looking ScheduleDAGs, because there are fewer spurious
dependencies.
In particular, CopyValueToVirtualRegister now uses the entry node as the
input chain dependency for new CopyToReg nodes instead of calling getRoot
and depending on the most recent memory reference.
Also, rename UnorderedChains to PendingExports and pull it up from being
a local variable in SelectionDAGISel::BuildSelectionDAG to being a
member variable of SelectionDAGISel, so that it doesn't have to be
passed around to all the places that need it.
llvm-svn: 48893 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | flags.  This is needed by the new legalize types
infrastructure which wants to expand the 64 bit
constants previously used to hold the flags on
32 bit machines.  There are two functional changes:
(1) in LowerArguments, if a parameter has the zext
attribute set then that is marked in the flags;
before it was being ignored; (2) PPC had some bogus
code for handling two word arguments when using the
ELF 32 ABI, which was hard to convert because of
the bogusness.  As suggested by the original author
(Nicolas Geoffray), I've disabled it for the moment.
Tested with "make check" and the Ada ACATS testsuite.
llvm-svn: 48640 | 
| | 
| 
| 
| 
| 
| | table for nounwind calls.
llvm-svn: 48373 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | getCopyToParts problem was noticed by the new
LegalizeTypes infrastructure.  In order to avoid
this kind of thing in the future I've added a
check that EXTRACT_ELEMENT is only used with
integers.  Once LegalizeTypes is up and running
most likely BUILD_PAIR and EXTRACT_ELEMENT can
be removed, in favour of using apints instead.
llvm-svn: 48294 | 
| | 
| 
| 
| | llvm-svn: 48244 | 
| | 
| 
| 
| 
| 
| 
| 
| | return ValueType can depend its operands' ValueType.
This is a cosmetic change, no functionality impacted.
llvm-svn: 48145 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | field to 32 bits, thus enabling correct handling of ByVal
structs bigger than 0x1ffff.  Abstract interface a bit.
Fixes gcc.c-torture/execute/pr23135.c and 
gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing
on ppc32, quietly producing wrong code on x86-32.)
llvm-svn: 48122 | 
| | 
| 
| 
| | llvm-svn: 48117 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | they are produced by calls (which are known exact) and by cross block copies
which are known to be produced by extends.
This improves:
define double @test2() {
	%tmp85 = call double asm sideeffect "fld0", "={st(0)}"()
	ret double %tmp85
}
from:
_test2:
	subl	$20, %esp
	# InlineAsm Start
	fld0
	# InlineAsm End
	fstpl	8(%esp)
	movsd	8(%esp), %xmm0
	movsd	%xmm0, (%esp)
	fldl	(%esp)
	addl	$20, %esp
	#FP_REG_KILL
	ret
to:
_test2:
	# InlineAsm Start
	fld0
	# InlineAsm End
	#FP_REG_KILL
	ret
by avoiding a f64 <-> f80 trip
llvm-svn: 48108 | 
| | 
| 
| 
| | llvm-svn: 48097 | 
| | 
| 
| 
| 
| 
| | and prefetchnta instructions.
llvm-svn: 48042 | 
| | 
| 
| 
| 
| 
| | an APInt into a uint64_t to call getConstant.
llvm-svn: 47742 | 
| | 
| 
| 
| | llvm-svn: 47587 | 
| | 
| 
| 
| 
| 
| | a SignBitIsZero function to simplify a common use case.
llvm-svn: 47561 | 
| | 
| 
| 
| 
| 
| | the way through.  It is now used for codegen.
llvm-svn: 47484 | 
| | 
| 
| 
| 
| 
| 
| | early clobbers if the clobber list contains a *register* not some thing
like {memory}, {dirflag} etc.
llvm-svn: 47457 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | any, we force sdisel to do all regalloc for an asm.  This
leads to gross but correct codegen.
This fixes the rest of PR2078.
llvm-svn: 47454 | 
| | 
| 
| 
| 
| 
| | failing on archs that haven't implemented them yet
llvm-svn: 47430 | 
| | 
| 
| 
| 
| 
| | Eli Friedman.  This implements CodeGen/Generic/2008-02-20-MatchingMem.ll.
llvm-svn: 47428 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| | inline asms.
Fix PR2078 by marking aliases of registers used when a register is 
marked used.  This prevents EAX from being allocated when AX is listed
in the clobber set for the asm.
llvm-svn: 47426 | 
| | 
| 
| 
| | llvm-svn: 47388 | 
| | 
| 
| 
| | llvm-svn: 47368 | 
| | 
| 
| 
| | llvm-svn: 47348 | 
| | 
| 
| 
| | llvm-svn: 47204 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | the return value is zero-extended if it isn't
sign-extended.  It may also be any-extended.
Also, if a floating point value was returned
in a larger floating point type, pass 1 as the
second operand to FP_ROUND, which tells it
that all the precision is in the original type.
I think this is right but I could be wrong.
Finally, when doing libcalls, set isZExt on
a parameter if it is "unsigned".  Currently
isSExt is set when signed, and nothing is
set otherwise.  This should be right for all
calls to standard library routines.
llvm-svn: 47122 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | node as soon as we create it in SDISel.  Previously we would lower it in
legalize.  The problem with this is that it only exposes the argument
loads implied by FORMAL_ARGUMENTs after legalize, so that only dag combine 2
can hack on them.  This causes us to miss some optimizations because 
datatype expansion also happens here.
Exposing the loads early allows us to do optimizations on them.  For example
we now compile arg-cast.ll to:
_foo:
	movl	$2147483647, %eax
	andl	8(%esp), %eax
	ret
where we previously produced:
_foo:
	subl	$12, %esp
	movsd	16(%esp), %xmm0
	movsd	%xmm0, (%esp)
	movl	$2147483647, %eax
	andl	4(%esp), %eax
	addl	$12, %esp
	ret
It might also make sense to do this for ISD::CALL nodes, which have implicit
stores on many targets.
llvm-svn: 47054 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| | handle arbitrary precision integers and any number
of parts.  For example, on a 32 bit machine an i50
corresponds to two i32 parts.  getCopyToParts will
extend the i50 to an i64 then write half of the i64
to each part; getCopyFromParts will combine the two
i32 parts into an i64 then truncate the result to
i50.
llvm-svn: 47024 | 
| | 
| 
| 
| 
| 
| 
| | in preparation for apint support.  These changes are
intended to have no functional effect.
llvm-svn: 46967 | 
| | 
| 
| 
| | llvm-svn: 46930 | 
| | 
| 
| 
| 
| 
| 
| 
| 
| | information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc.
Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes.
For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time.
llvm-svn: 46659 | 
| | 
| 
| 
| 
| 
| | llvm.dbg.func.start implies a stoppoint is set. SelectionDAGISel records a new source line but does not create a ISD::LABEL node for this special stoppoint. Asm printer will magically print this label. This ensures nothing is emitted before.
llvm-svn: 46635 |