[x86] try to form broadcast before widening shuffle elements

I noticed that we weren't generating broadcasts as much I thought we would with D54271, and this is part of the problem. Widening the shuffle elements means adding bitcasts and hiding the relationship between a splatted scalar and the vector. If we can form a broadcast, do that before going through the rest of the shuffle lowering because broadcasts should be cheap and can often be load-folded. Differential Revision: https://reviews.llvm.org/D54280 llvm-svn: 346498
author: Sanjay Patel <spatel@rotateright.com> 2018-11-09 14:54:58 +0000
committer: Sanjay Patel <spatel@rotateright.com> 2018-11-09 14:54:58 +0000
commit: fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c (patch)
tree: 39bb5fc1709ffac75ae39d85bfb017d52080e721 /llvm/lib/Target
parent: 1cc2d0b9fb9fc966fab5fb0575479aa888b2ed57 (diff)
download: bcm5719-llvm-fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c.tar.gz
bcm5719-llvm-fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c.zip
1 files changed, 8 insertions, 0 deletions
diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 44fd25bf08e..6c844fa6d8f 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -15627,6 +15627,14 @@ static SDValue lowerVectorShuffle(SDValue Op, const X86Subtarget &Subtarget,
   SmallVector<int, 16> WidenedMask;
   if (VT.getScalarSizeInBits() < 64 && !Is1BitVector &&
       canWidenShuffleElements(ZeroableMask, WidenedMask)) {
+    // Shuffle mask widening should not interfere with a broadcast opportunity
+    // by obfuscating the operands with bitcasts.
+    // TODO: Avoid lowering directly from this top-level function: make this
+    // a query (canLowerAsBroadcast) and defer lowering to the type-based calls.
+    if (SDValue Broadcast =
+            lowerVectorShuffleAsBroadcast(DL, VT, V1, V2, Mask, Subtarget, DAG))
+      return Broadcast;
+
     MVT NewEltVT = VT.isFloatingPoint()
                        ? MVT::getFloatingPointVT(VT.getScalarSizeInBits() * 2)
                        : MVT::getIntegerVT(VT.getScalarSizeInBits() * 2);
author	Sanjay Patel <spatel@rotateright.com>	2018-11-09 14:54:58 +0000
committer	Sanjay Patel <spatel@rotateright.com>	2018-11-09 14:54:58 +0000
commit	fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c (patch)
tree	39bb5fc1709ffac75ae39d85bfb017d52080e721 /llvm/lib/Target
parent	1cc2d0b9fb9fc966fab5fb0575479aa888b2ed57 (diff)
download	bcm5719-llvm-fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c.tar.gz bcm5719-llvm-fa1c0fe478dbcc2276d7cb58fe2adaf2fee9097c.zip