[DagCombiner][X86] Simplify a ConcatVectors of a scalar_to_vector with undef.

This patch introduces a new DAGCombiner rule to simplify concat_vectors nodes: concat_vectors( bitcast (scalar_to_vector %A), UNDEF) --> bitcast (scalar_to_vector %A) This patch only partially addresses PR39257. In particular, it is enough to fix one of the two problematic cases mentioned in PR39257. However, it is not enough to fix the original test case posted by Craig; that particular case would probably require a more complicated approach (and knowledge about used bits). Before this patch, we used to generate the following code for function PR39257 (-mtriple=x86_64 , -mattr=+avx): vmovsd (%rdi), %xmm0 # xmm0 = mem[0],zero vxorps %xmm1, %xmm1, %xmm1 vblendps $3, %xmm0, %xmm1, %xmm0 # xmm0 = xmm0[0,1],xmm1[2,3] vmovaps %ymm0, (%rsi) vzeroupper retq Now we generate this: vmovsd (%rdi), %xmm0 # xmm0 = mem[0],zero vmovaps %ymm0, (%rsi) vzeroupper retq As a side note: that VZEROUPPER is completely redundant... I guess the vzeroupper insertion pass doesn't realize that the definition of %xmm0 from vmovsd is already zeroing the upper half of %ymm0. Note that on %-mcpu=btver2, we don't get that vzeroupper because pass vzeroupper insertion %pass is disabled. Differential Revision: https://reviews.llvm.org/D55274 llvm-svn: 348522
author: Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net> 2018-12-06 19:55:38 +0000
committer: Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net> 2018-12-06 19:55:38 +0000
commit: 52a2bac583f366570e604a155ef7c6f12e342b0e (patch)
tree: e9b1bce79daecc7f4e7bb8080a8c7c61147a4cd9 /llvm/lib/CodeGen
parent: 653e8ba79a0089a8055b5dee538b293b0019d1e4 (diff)
download: bcm5719-llvm-52a2bac583f366570e604a155ef7c6f12e342b0e.tar.gz
bcm5719-llvm-52a2bac583f366570e604a155ef7c6f12e342b0e.zip
1 files changed, 12 insertions, 4 deletions
diff --git a/llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp b/llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
index 166577e5faf..d9834cf298c 100644
--- a/llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
@@ -16518,11 +16518,19 @@ SDValue DAGCombiner::visitCONCAT_VECTORS(SDNode *N) {
     SDValue In = N->getOperand(0);
     assert(In.getValueType().isVector() && "Must concat vectors");
 
-    // Transform: concat_vectors(scalar, undef) -> scalar_to_vector(sclr).
-    if (In->getOpcode() == ISD::BITCAST &&
-        !In->getOperand(0).getValueType().isVector()) {
-      SDValue Scalar = In->getOperand(0);
+    SDValue Scalar = peekThroughOneUseBitcasts(In);
 
+    // concat_vectors(scalar_to_vector(scalar), undef) ->
+    //     scalar_to_vector(scalar)
+    if (!LegalOperations && Scalar.getOpcode() == ISD::SCALAR_TO_VECTOR &&
+         Scalar.hasOneUse()) {
+      EVT SVT = Scalar.getValueType().getVectorElementType();
+      if (SVT == Scalar.getOperand(0).getValueType())
+        Scalar = Scalar.getOperand(0);
+    }
+
+    // concat_vectors(scalar, undef) -> scalar_to_vector(scalar)
+    if (!Scalar.getValueType().isVector()) {
       // If the bitcast type isn't legal, it might be a trunc of a legal type;
       // look through the trunc so we can still do the transform:
       //   concat_vectors(trunc(scalar), undef) -> scalar_to_vector(scalar)
author	Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net>	2018-12-06 19:55:38 +0000
committer	Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net>	2018-12-06 19:55:38 +0000
commit	52a2bac583f366570e604a155ef7c6f12e342b0e (patch)
tree	e9b1bce79daecc7f4e7bb8080a8c7c61147a4cd9 /llvm/lib/CodeGen
parent	653e8ba79a0089a8055b5dee538b293b0019d1e4 (diff)
download	bcm5719-llvm-52a2bac583f366570e604a155ef7c6f12e342b0e.tar.gz bcm5719-llvm-52a2bac583f366570e604a155ef7c6f12e342b0e.zip