[X86][SSE] Lower 128-bit vectors to SIGN/ZERO_EXTEND_VECTOR_IN_REG ops

As described on PR31712, we miss a variety of legalization combines because we lower these to X86ISD::VSEXT/VZEXT despite them having the same functionality. This patch makes 128-bit (SSE41) SIGN/ZERO_EXTEND_VECTOR_IN_REG ops legal, adds the necessary tablegen plumbing and uses a helper 'getExtendInVec' to decide when to use SIGN/ZERO_EXTEND_VECTOR_IN_REG or VSEXT/VZEXT. We're missing a couple of shuffle combines that will be added in a future patch for review. Later patches can then support the AVX2 cases as a mixture of SIGN/ZERO_EXTEND and SIGN/ZERO_EXTEND_VECTOR_IN_REG, and then finally deal with the AVX512 cases. Differential Revision: https://reviews.llvm.org/D30549 llvm-svn: 296985
author: Simon Pilgrim <llvm-dev@redking.me.uk> 2017-03-05 09:57:20 +0000
committer: Simon Pilgrim <llvm-dev@redking.me.uk> 2017-03-05 09:57:20 +0000
commit: 9f5c251d574c766e08d332ee7f3cefb88bc72438 (patch)
tree: 639925e45dd7230eae23fb350e5736bd218f949a /llvm/lib/CodeGen
parent: 4bc8292a46f2618eca5b8a08b290c6a4f6418e93 (diff)
download: bcm5719-llvm-9f5c251d574c766e08d332ee7f3cefb88bc72438.tar.gz
bcm5719-llvm-9f5c251d574c766e08d332ee7f3cefb88bc72438.zip
1 files changed, 16 insertions, 0 deletions
diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
index 90a9b4d7c66..d43e7ffab39 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
@@ -2419,6 +2419,20 @@ void SelectionDAG::computeKnownBits(SDValue Op, APInt &KnownZero,
     }
     break;
   }
+  case ISD::ZERO_EXTEND_VECTOR_INREG: {
+    EVT InVT = Op.getOperand(0).getValueType();
+    unsigned InBits = InVT.getScalarSizeInBits();
+    APInt NewBits   = APInt::getHighBitsSet(BitWidth, BitWidth - InBits);
+    KnownZero = KnownZero.trunc(InBits);
+    KnownOne = KnownOne.trunc(InBits);
+    computeKnownBits(Op.getOperand(0), KnownZero, KnownOne,
+                     DemandedElts.zext(InVT.getVectorNumElements()),
+                     Depth + 1);
+    KnownZero = KnownZero.zext(BitWidth);
+    KnownOne = KnownOne.zext(BitWidth);
+    KnownZero |= NewBits;
+    break;
+  }
   case ISD::ZERO_EXTEND: {
     EVT InVT = Op.getOperand(0).getValueType();
     unsigned InBits = InVT.getScalarSizeInBits();
@@ -2432,6 +2446,7 @@ void SelectionDAG::computeKnownBits(SDValue Op, APInt &KnownZero,
     KnownZero |= NewBits;
     break;
   }
+  // TODO ISD::SIGN_EXTEND_VECTOR_INREG
   case ISD::SIGN_EXTEND: {
     EVT InVT = Op.getOperand(0).getValueType();
     unsigned InBits = InVT.getScalarSizeInBits();
@@ -2859,6 +2874,7 @@ unsigned SelectionDAG::ComputeNumSignBits(SDValue Op, unsigned Depth) const {
   }
 
   case ISD::SIGN_EXTEND:
+  case ISD::SIGN_EXTEND_VECTOR_INREG:
     Tmp = VTBits - Op.getOperand(0).getScalarValueSizeInBits();
     return ComputeNumSignBits(Op.getOperand(0), Depth+1) + Tmp;
author	Simon Pilgrim <llvm-dev@redking.me.uk>	2017-03-05 09:57:20 +0000
committer	Simon Pilgrim <llvm-dev@redking.me.uk>	2017-03-05 09:57:20 +0000
commit	9f5c251d574c766e08d332ee7f3cefb88bc72438 (patch)
tree	639925e45dd7230eae23fb350e5736bd218f949a /llvm/lib/CodeGen
parent	4bc8292a46f2618eca5b8a08b290c6a4f6418e93 (diff)
download	bcm5719-llvm-9f5c251d574c766e08d332ee7f3cefb88bc72438.tar.gz bcm5719-llvm-9f5c251d574c766e08d332ee7f3cefb88bc72438.zip