diff options
| author | Michael Gottesman <mgottesman@apple.com> | 2012-10-16 21:29:38 +0000 |
|---|---|---|
| committer | Michael Gottesman <mgottesman@apple.com> | 2012-10-16 21:29:38 +0000 |
| commit | 02a1141e5a36928f7a0ac9d2e0ee44dca3be3a01 (patch) | |
| tree | 0f25b56d0218a647f87ffb1bf12285ca610f7108 /llvm/test/Transforms/InstCombine/obfuscated_splat.ll | |
| parent | 430e540b6b602430490d87001cc386101172ccf4 (diff) | |
| download | bcm5719-llvm-02a1141e5a36928f7a0ac9d2e0ee44dca3be3a01.tar.gz bcm5719-llvm-02a1141e5a36928f7a0ac9d2e0ee44dca3be3a01.zip | |
[InstCombine] Teach InstCombine how to handle an obfuscated splat.
An obfuscated splat is where the frontend poorly generates code for a splat
using several different shuffles to create the splat, i.e.,
%A = load <4 x float>* %in_ptr, align 16
%B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
%C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef>
%D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4>
llvm-svn: 166061
Diffstat (limited to 'llvm/test/Transforms/InstCombine/obfuscated_splat.ll')
| -rw-r--r-- | llvm/test/Transforms/InstCombine/obfuscated_splat.ll | 11 |
1 files changed, 11 insertions, 0 deletions
diff --git a/llvm/test/Transforms/InstCombine/obfuscated_splat.ll b/llvm/test/Transforms/InstCombine/obfuscated_splat.ll new file mode 100644 index 00000000000..c25dade168a --- /dev/null +++ b/llvm/test/Transforms/InstCombine/obfuscated_splat.ll @@ -0,0 +1,11 @@ +; RUN: opt -instcombine -S %s | FileCheck %s + +define void @test(<4 x float> *%in_ptr, <4 x float> *%out_ptr) { + %A = load <4 x float>* %in_ptr, align 16 + %B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef> + %C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef> + %D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4> +; CHECK: %D = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> zeroinitializer + store <4 x float> %D, <4 x float> *%out_ptr + ret void +} |

