DAGCombiner: Combine extract_vector_elt from build_vector

This basic combine was surprisingly missing. AMDGPU legalizes many operations in terms of 32-bit vector components, so not doing this results in many extra copies and subregister extracts that need to be cleaned up later. InstCombine already does this for the hasOneUse case. The target hook is to fix a handful of tests which break (e.g. ARM/vmov.ll) which turn from a vector materialize repeated immediate instruction to a constant vector load with more scalar copies from it. llvm-svn: 250129
author: Matt Arsenault <Matthew.Arsenault@amd.com> 2015-10-12 23:59:50 +0000
committer: Matt Arsenault <Matthew.Arsenault@amd.com> 2015-10-12 23:59:50 +0000
commit: 61dc235f20fa7822478b64554473cf917f67e11e (patch)
tree: b5fc7efe09b03ecb8d24766bedc2acbc5dd72d59 /llvm/lib/Target/AMDGPU
parent: 82c00bee3cb011d0cb0fe3f70e3b1549f1d43086 (diff)
download: bcm5719-llvm-61dc235f20fa7822478b64554473cf917f67e11e.tar.gz
bcm5719-llvm-61dc235f20fa7822478b64554473cf917f67e11e.zip
2 files changed, 13 insertions, 0 deletions
diff --git a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
index 8300431ec01..a8af7ec75f0 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
@@ -533,6 +533,18 @@ bool AMDGPUTargetLowering:: storeOfVectorConstantIsCheap(EVT MemVT,
   return true;
 }
 
+bool AMDGPUTargetLowering::aggressivelyPreferBuildVectorSources(EVT VecVT) const {
+  // There are few operations which truly have vector input operands. Any vector
+  // operation is going to involve operations on each component, and a
+  // build_vector will be a copy per element, so it always makes sense to use a
+  // build_vector input in place of the extracted element to avoid a copy into a
+  // super register.
+  //
+  // We should probably only do this if all users are extracts only, but this
+  // should be the common case.
+  return true;
+}
+
 bool AMDGPUTargetLowering::isTruncateFree(EVT Source, EVT Dest) const {
   // Truncate is just accessing a subregister.
   return Dest.bitsLT(Source) && (Dest.getSizeInBits() % 32 == 0);
diff --git a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.h b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.h
index 3f5b1f59e06..1e060c4d708 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.h
+++ b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.h
@@ -138,6 +138,7 @@ public:
   bool storeOfVectorConstantIsCheap(EVT MemVT,
                                     unsigned NumElem,
                                     unsigned AS) const override;
+  bool aggressivelyPreferBuildVectorSources(EVT VecVT) const override;
   bool isCheapToSpeculateCttz() const override;
   bool isCheapToSpeculateCtlz() const override;
author	Matt Arsenault <Matthew.Arsenault@amd.com>	2015-10-12 23:59:50 +0000
committer	Matt Arsenault <Matthew.Arsenault@amd.com>	2015-10-12 23:59:50 +0000
commit	61dc235f20fa7822478b64554473cf917f67e11e (patch)
tree	b5fc7efe09b03ecb8d24766bedc2acbc5dd72d59 /llvm/lib/Target/AMDGPU
parent	82c00bee3cb011d0cb0fe3f70e3b1549f1d43086 (diff)
download	bcm5719-llvm-61dc235f20fa7822478b64554473cf917f67e11e.tar.gz bcm5719-llvm-61dc235f20fa7822478b64554473cf917f67e11e.zip