[X86][AVX] Add support for shuffle decoding of vperm2f128/vperm2i128 with zero'd lanes

The vperm2f128/vperm2i128 shuffle mask decoding was not attempting to deal with shuffles that give zero lanes. This patch fixes this so that the assembly printer can provide shuffle comments. As this decoder is also used in X86ISelLowering for shuffle combining, I've added an early-out to match existing behaviour. The hope is that we can add zero support in the future, this would allow other ops' decodes (e.g. insertps) to be combined as well. Differential Revision: http://reviews.llvm.org/D10593 llvm-svn: 241516
author: Simon Pilgrim <llvm-dev@redking.me.uk> 2015-07-06 22:46:46 +0000
committer: Simon Pilgrim <llvm-dev@redking.me.uk> 2015-07-06 22:46:46 +0000
commit: 40343e6b3a875544b650c08f56ded583e66d0963 (patch)
tree: c0bb37bfa7d19ee36574d0e532ddc122994e6ce3 /llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp
parent: 681a56ac58681d78fbe273306240ff1e93fc57e9 (diff)
download: bcm5719-llvm-40343e6b3a875544b650c08f56ded583e66d0963.tar.gz
bcm5719-llvm-40343e6b3a875544b650c08f56ded583e66d0963.zip
1 files changed, 3 insertions, 5 deletions
diff --git a/llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp b/llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp
index 9777c0d85e9..b1c69e779f3 100644
--- a/llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp
+++ b/llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp
@@ -255,15 +255,13 @@ void DecodeUNPCKLMask(MVT VT, SmallVectorImpl<int> &ShuffleMask) {
 
 void DecodeVPERM2X128Mask(MVT VT, unsigned Imm,
                           SmallVectorImpl<int> &ShuffleMask) {
-  if (Imm & 0x88)
-    return; // Not a shuffle
-
   unsigned HalfSize = VT.getVectorNumElements() / 2;
 
   for (unsigned l = 0; l != 2; ++l) {
-    unsigned HalfBegin = ((Imm >> (l * 4)) & 0x3) * HalfSize;
+    unsigned HalfMask = Imm >> (l * 4);
+    unsigned HalfBegin = (HalfMask & 0x3) * HalfSize;
     for (unsigned i = HalfBegin, e = HalfBegin + HalfSize; i != e; ++i)
-      ShuffleMask.push_back(i);
+      ShuffleMask.push_back(HalfMask & 8 ? SM_SentinelZero : i);
   }
 }
author	Simon Pilgrim <llvm-dev@redking.me.uk>	2015-07-06 22:46:46 +0000
committer	Simon Pilgrim <llvm-dev@redking.me.uk>	2015-07-06 22:46:46 +0000
commit	40343e6b3a875544b650c08f56ded583e66d0963 (patch)
tree	c0bb37bfa7d19ee36574d0e532ddc122994e6ce3 /llvm/lib/Target/X86/Utils/X86ShuffleDecode.cpp
parent	681a56ac58681d78fbe273306240ff1e93fc57e9 (diff)
download	bcm5719-llvm-40343e6b3a875544b650c08f56ded583e66d0963.tar.gz bcm5719-llvm-40343e6b3a875544b650c08f56ded583e66d0963.zip