diff options
author | Filipe Cabecinhas <me@filcab.net> | 2014-05-19 19:45:57 +0000 |
---|---|---|
committer | Filipe Cabecinhas <me@filcab.net> | 2014-05-19 19:45:57 +0000 |
commit | dc9210276637bf67091ff8885f0c2c0e08fadbcb (patch) | |
tree | 84c9e171289e944f35f147ef6666ebeb2f76e963 /llvm/test/CodeGen/X86/fold-load-vec.ll | |
parent | 9f7d14756df13b907e29b61160b1cb6981013bb3 (diff) | |
download | bcm5719-llvm-dc9210276637bf67091ff8885f0c2c0e08fadbcb.tar.gz bcm5719-llvm-dc9210276637bf67091ff8885f0c2c0e08fadbcb.zip |
Added more insertps optimizations
Summary:
When inserting an element that's coming from a vector load or a broadcast
of a vector (or scalar) load, combine the load into the insertps
instruction.
Added PerformINSERTPSCombine for the case where we need to fix the load
(load of a vector + insertps with a non-zero CountS).
Added patterns for the broadcasts.
Also added tests for SSE4.1, AVX, and AVX2.
Reviewers: delena, nadav, craig.topper
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D3581
llvm-svn: 209156
Diffstat (limited to 'llvm/test/CodeGen/X86/fold-load-vec.ll')
-rw-r--r-- | llvm/test/CodeGen/X86/fold-load-vec.ll | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/llvm/test/CodeGen/X86/fold-load-vec.ll b/llvm/test/CodeGen/X86/fold-load-vec.ll index e85d8f78c05..96c5be4f752 100644 --- a/llvm/test/CodeGen/X86/fold-load-vec.ll +++ b/llvm/test/CodeGen/X86/fold-load-vec.ll @@ -5,7 +5,7 @@ ; loads from m32. define void @sample_test(<4 x float>* %source, <2 x float>* %dest) nounwind { ; CHECK: sample_test -; CHECK: movaps +; CHECK-NOT: movaps ; CHECK: insertps entry: %source.addr = alloca <4 x float>*, align 8 |