AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit

Summary: For GL_ARB_compute_shader we need to support workgroup sizes of at least 1024. However, if we want to allow large workgroup sizes, we may need to use less registers, as we have to run more waves per SIMD. This patch adds an attribute to specify the maximum work group size the compiled program needs to support. It defaults, to 256, as that has no wave restrictions. Reducing the number of registers available is done similarly to how the registers were reserved for chips with the sgpr init bug. Reviewers: mareko, arsenm, tstellarAMD, nhaehnle Subscribers: FireBurn, kerberizer, llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D18340 Patch By: Bas Nieuwenhuizen llvm-svn: 266337
author: Tom Stellard <thomas.stellard@amd.com> 2016-04-14 16:27:07 +0000
committer: Tom Stellard <thomas.stellard@amd.com> 2016-04-14 16:27:07 +0000
commit: 79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa (patch)
tree: d7ad0eed911b428e15f871225d742595de9420b8 /llvm/lib/Target/AMDGPU/Utils
parent: f110f8f9f7f46c668e03f4808e03aa54c2157269 (diff)
download: bcm5719-llvm-79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa.tar.gz
bcm5719-llvm-79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa.zip
2 files changed, 5 insertions, 0 deletions
diff --git a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
index 5e3498f86d6..52f764876f3 100644
--- a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
+++ b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
@@ -124,6 +124,10 @@ static unsigned getIntegerAttribute(const Function &F, const char *Name,
   return Result;
 }
 
+unsigned getMaximumWorkGroupSize(const Function &F) {
+  return getIntegerAttribute(F, "amdgpu-max-work-group-size", 256);
+}
+
 unsigned getInitialPSInputAddr(const Function &F) {
   return getIntegerAttribute(F, "InitialPSInputAddr", 0);
 }
diff --git a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
index d229dec8036..4feb57c9e34 100644
--- a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
+++ b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
@@ -45,6 +45,7 @@ bool isGroupSegment(const GlobalValue *GV);
 bool isGlobalSegment(const GlobalValue *GV);
 bool isReadOnlySegment(const GlobalValue *GV);
 
+unsigned getMaximumWorkGroupSize(const Function &F);
 unsigned getInitialPSInputAddr(const Function &F);
 
 bool isShader(CallingConv::ID cc);
author	Tom Stellard <thomas.stellard@amd.com>	2016-04-14 16:27:07 +0000
committer	Tom Stellard <thomas.stellard@amd.com>	2016-04-14 16:27:07 +0000
commit	79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa (patch)
tree	d7ad0eed911b428e15f871225d742595de9420b8 /llvm/lib/Target/AMDGPU/Utils
parent	f110f8f9f7f46c668e03f4808e03aa54c2157269 (diff)
download	bcm5719-llvm-79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa.tar.gz bcm5719-llvm-79a1fd718c71b4480dc9f00e8e77f4408ec9e6fa.zip