| Commit message (Collapse) | Author | Age | Files | Lines |
... | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This allows us to compile:
#include <emmintrin.h>
typedef __m128i VSInt16;
typedef short vSInt16 __attribute__ ((__vector_size__ (16)));
VSInt16 t3() {
return (VSInt16)((vSInt16)_mm_set1_epi16(6518));
}
into:
_t3:
movaps LCPI1_0, %xmm0
ret
instead of:
_t3:
movl $6518, %eax
movd %eax, %xmm0
pextrw $0, %xmm0, %eax
xorps %xmm0, %xmm0
pinsrw $0, %eax, %xmm0
punpcklwd %xmm0, %xmm0
pshufd $0, %xmm0, %xmm0
ret
llvm-svn: 44856
|
|
|
|
|
|
|
| |
of source/dest elements changes. This implements
test/Transforms/InstCombine/bitcast-vector-fold.ll
llvm-svn: 44855
|
|
|
|
|
|
| |
Support libraries separately into their own module.
llvm-svn: 44852
|
|
|
|
| |
llvm-svn: 44850
|
|
|
|
| |
llvm-svn: 44849
|
|
|
|
| |
llvm-svn: 44838
|
|
|
|
| |
llvm-svn: 44837
|
|
|
|
|
|
|
|
|
| |
possible before resorting to pextrw and pinsrw.
- Better codegen for v4i32 shuffles masquerading as v8i16 or v16i8 shuffles.
- Improves (i16 extract_vector_element 0) codegen by recognizing
(i32 extract_vector_element 0) does not require a pextrw.
llvm-svn: 44836
|
|
|
|
| |
llvm-svn: 44835
|
|
|
|
|
|
|
| |
per-function collector model. Collector is now the factory for
CollectorMetadata, so the latter may be subclassed.
llvm-svn: 44827
|
|
|
|
|
|
|
|
| |
don't have to #include config.h in it. #including config.h breaks
other projects that have their own autoconf stuff and try to #include
the llvm headers. One obscure example is llvm-gcc.
llvm-svn: 44825
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Thompson. Usage should be something like this:
open Llvm
open Llvm_bitreader
match read_bitcode_file fn with
| Bitreader_failure msg ->
prerr_endline msg
| Bitreader_success m ->
...;
dispose_module m
Compile with: ocamlc llvm.cma llvm_bitreader.cma
ocamlopt llvm.cmxa llvm_bitreader.cmxa
llvm-svn: 44824
|
|
|
|
| |
llvm-svn: 44820
|
|
|
|
| |
llvm-svn: 44819
|
|
|
|
|
|
| |
Use it for Darwin.
llvm-svn: 44818
|
|
|
|
|
|
|
|
| |
Reimplement the xform in Analysis/ConstantFolding.cpp where we can use
targetdata to validate that it is safe. While I'm in there, fix some const
correctness issues and generalize the interface to the "operand folder".
llvm-svn: 44817
|
|
|
|
|
|
|
|
|
|
|
| |
These should probably be something like:
CFI(".cfi_def_cfa_offset 16\n")
where CFI is defined to a noop on darwin and other platforms
that don't support those directives.
llvm-svn: 44803
|
|
|
|
|
|
| |
on functions as it calculates them.
llvm-svn: 44802
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
using the minimum possible number of bytes. For little
endian targets run on little endian machines, apints are
stored in memory from LSB to MSB as before. For big endian
targets on big endian machines they are stored from MSB to
LSB which wasn't always the case before (if the target and
host endianness doesn't match values are stored according
to the host's endianness). Doing this requires knowing the
endianness of the host, which is determined when configuring -
thanks go to Anton for this. Only having access to little
endian machines I was unable to properly test the big endian
part, which is also the most complicated...
llvm-svn: 44796
|
|
|
|
|
|
| |
All bad stuff from SSE version is implicitely inherited :)
llvm-svn: 44794
|
|
|
|
|
|
| |
broken, because doesn't mark xmm regs properly
llvm-svn: 44793
|
|
|
|
|
|
|
| |
This will allow us (theoretically) to unwind through JITer.
The code wasn't verified, so I'm pretty sure offsets are wrong :)
llvm-svn: 44792
|
|
|
|
|
|
| |
get it to compile.
llvm-svn: 44791
|
|
|
|
|
|
|
|
| |
sense of
how the CodeGen machinery works.
llvm-svn: 44786
|
|
|
|
|
|
| |
impact the value of fall-through choices.
llvm-svn: 44785
|
|
|
|
| |
llvm-svn: 44775
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
methods are new to Function:
bool hasCollector() const;
const std::string &getCollector() const;
void setCollector(const std::string &);
void clearCollector();
The assembly representation is as such:
define void @f() gc "shadow-stack" { ...
The implementation uses an on-the-side table to map Functions to
collector names, such that there is no overhead. A StringPool is
further used to unique collector names, which are extremely
likely to be unique per process.
llvm-svn: 44769
|
|
|
|
|
|
|
| |
_sabre_: it has a major problem: by the time ~Value is run, all of the "parts" of the derived classes have been destroyed
_sabre_: the vtable lives to fight another day
llvm-svn: 44760
|
|
|
|
| |
llvm-svn: 44756
|
|
|
|
|
|
| |
2007-11-19-InlineAsm.ll
llvm-svn: 44755
|
|
|
|
| |
llvm-svn: 44747
|
|
|
|
|
|
| |
knows the vector is not pow2
llvm-svn: 44740
|
|
|
|
| |
llvm-svn: 44733
|
|
|
|
| |
llvm-svn: 44728
|
|
|
|
| |
llvm-svn: 44727
|
|
|
|
|
|
| |
that LegalizeDAG does.
llvm-svn: 44726
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
%f8 = type <8 x float>
define void @test_f8(%f8* %P, %f8* %Q, %f8* %S) {
%p = load %f8* %P ; <%f8> [#uses=1]
%q = load %f8* %Q ; <%f8> [#uses=1]
%R = add %f8 %p, %q ; <%f8> [#uses=1]
store %f8 %R, %f8* %S
ret void
}
into:
_test_f8:
movaps 16(%rdi), %xmm0
addps 16(%rsi), %xmm0
movaps (%rdi), %xmm1
addps (%rsi), %xmm1
movaps %xmm0, 16(%rdx)
movaps %xmm1, (%rdx)
ret
llvm-svn: 44725
|
|
|
|
| |
llvm-svn: 44724
|
|
|
|
| |
llvm-svn: 44723
|
|
|
|
| |
llvm-svn: 44722
|
|
|
|
| |
llvm-svn: 44720
|
|
|
|
| |
llvm-svn: 44719
|
|
|
|
| |
llvm-svn: 44718
|
|
|
|
| |
llvm-svn: 44717
|
|
|
|
| |
llvm-svn: 44716
|
|
|
|
| |
llvm-svn: 44715
|
|
|
|
|
|
| |
Leave it visibility hidden, but not in an anon namespace.
llvm-svn: 44714
|
|
|
|
| |
llvm-svn: 44710
|
|
|
|
| |
llvm-svn: 44707
|
|
|
|
| |
llvm-svn: 44705
|