author Artem Belevich <tra@google.com> 2018-04-10 18:38:22 +0000
committer Artem Belevich <tra@google.com> 2018-04-10 18:38:22 +0000
commit dde3dc27ee71f12eb145ce54158779ab4ddc38ed
tree eaccb2951afc4f620ca671f5b3cc153ef8909325 /clang/include
parent 5da361a0b0f5e361c8285acdf9c9cd5d417416f9
[CUDA] Added --[no-]cuda-include-ptx=sm_XX|all option.
Currently we always include PTX in the fatbin along with the GPU code. This roughly doubles the size of the GPU binary we need to carry in the executable. These options allow control over the inclusion of PTX in the GPU binary.

This patch does not change the defaults, though we may consider making no-PTX the default in the future.

Differential Revision: https://reviews.llvm.org/D45495
llvm-svn: 329737
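To illustrate how the two options might interact, here is a minimal Python sketch of one plausible resolution rule. It is not the driver's code: the helper name `include_ptx` is hypothetical, and the rule that later flags override earlier ones for a matching architecture (or for `all`) is an assumption, since this part of the patch only declares the options. The default of including PTX follows the commit message above.

```python
# Hypothetical model of resolving --[no-]cuda-include-ptx= for one GPU arch.
# Assumption: flags are processed in command-line order and the last flag
# that names this arch (or 'all') decides the outcome.

def include_ptx(gpu_arch, flags, default=True):
    """Return True if PTX for gpu_arch would be kept in the fatbin."""
    include = default  # the patch keeps the existing default: PTX is included
    for flag in flags:
        for prefix, value in (
            ("--no-cuda-include-ptx=", False),
            ("--cuda-include-ptx=", True),
        ):
            if flag.startswith(prefix):
                arch = flag[len(prefix):]
                # 'all' applies to every architecture; otherwise match exactly.
                if arch == "all" or arch == gpu_arch:
                    include = value
                break
        # Flags that are not PTX-inclusion options are ignored here.
    return include


if __name__ == "__main__":
    flags = ["--no-cuda-include-ptx=all", "--cuda-include-ptx=sm_60"]
    for arch in ("sm_35", "sm_60"):
        print(arch, include_ptx(arch, flags))
```

Under these assumptions, disabling PTX for `all` and then re-enabling it for `sm_60` would keep PTX only for `sm_60`, which is the kind of size/compatibility trade-off the commit message describes.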
Diffstat (limited to 'clang/include')
-rw-r--r-- clang/include/clang/Driver/Options.td | 4
1 files changed, 4 insertions, 0 deletions
diff --git a/clang/include/clang/Driver/Options.td b/clang/include/clang/Driver/Options.td
index 58b5341b348..a08003b75a0 100644
--- a/clang/include/clang/Driver/Options.td
+++ b/clang/include/clang/Driver/Options.td
@@ -546,6 +546,10 @@ def cuda_host_only : Flag<["--"], "cuda-host-only">,
def cuda_compile_host_device : Flag<["--"], "cuda-compile-host-device">,
HelpText<"Compile CUDA code for both host and device (default). Has no "
"effect on non-CUDA compilations.">;
+def cuda_include_ptx_EQ : Joined<["--"], "cuda-include-ptx=">, Flags<[DriverOption]>,
+ HelpText<"Include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;
+def no_cuda_include_ptx_EQ : Joined<["--"], "no-cuda-include-ptx=">, Flags<[DriverOption]>,
+ HelpText<"Do not include PTX for the following GPU architecture (e.g. sm_35) or 'all'. May be specified more than once.">;
def cuda_gpu_arch_EQ : Joined<["--"], "cuda-gpu-arch=">, Flags<[DriverOption]>,
HelpText<"CUDA GPU architecture (e.g. sm_35). May be specified more than once.">;
def no_cuda_gpu_arch_EQ : Joined<["--"], "no-cuda-gpu-arch=">, Flags<[DriverOption]>,