author | Artem Belevich <tra@google.com> | 2018-04-10 18:38:22 +0000 |
---|---|---|
committer | Artem Belevich <tra@google.com> | 2018-04-10 18:38:22 +0000 |
commit | dde3dc27ee71f12eb145ce54158779ab4ddc38ed (patch) | |
tree | eaccb2951afc4f620ca671f5b3cc153ef8909325 /clang/docs | |
parent | 5da361a0b0f5e361c8285acdf9c9cd5d417416f9 (diff) | |
[CUDA] Added --[no-]cuda-include-ptx=sm_XX|all option.
Currently we always include PTX in the fatbin along
with the GPU code. This roughly doubles the size of the GPU binary
we need to carry in the executable. These options allow control
over the inclusion of PTX in the GPU binary.
This patch does not change the defaults, though we may consider
making no-PTX the default in the future.
Differential Revision: https://reviews.llvm.org/D45495
llvm-svn: 329737
Diffstat (limited to 'clang/docs')
-rw-r--r-- | clang/docs/ClangCommandLineReference.rst | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/clang/docs/ClangCommandLineReference.rst b/clang/docs/ClangCommandLineReference.rst
index 8e9ef84c99c..67eb56a4cef 100644
--- a/clang/docs/ClangCommandLineReference.rst
+++ b/clang/docs/ClangCommandLineReference.rst
@@ -144,6 +144,10 @@ Compile CUDA code for device only
 
 CUDA GPU architecture (e.g. sm\_35). May be specified more than once.
 
+.. option:: --cuda-include-ptx=<arg>, --no-cuda-include-ptx=<arg>
+
+Include (or not) PTX along with CUDA GPU binary for the given architecture (e.g. sm\_35). Argument may be 'all'. The option may be specified more than once. Default: --cuda-include-ptx=all
+
 .. option:: --cuda-host-only
 
 Compile CUDA code for host only. Has no effect on non-CUDA compilations.
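The documentation added above says the option may be repeated and may take 'all', but the text shown here does not spell out how repeated options combine. The following is a minimal Python sketch of one plausible reading, assuming that an option applies to an architecture when it names that architecture or 'all', and that a later applicable option overrides an earlier one. The function name `should_include_ptx` and the list-of-strings input are hypothetical and are not taken from the clang driver.

```python
INCLUDE = "--cuda-include-ptx="
EXCLUDE = "--no-cuda-include-ptx="


def should_include_ptx(argv, gpu_arch):
    """Decide whether PTX for gpu_arch is kept, under the assumed rule.

    argv is a list of command-line strings in the order given;
    unrelated arguments are ignored.
    """
    include = True  # documented default: --cuda-include-ptx=all
    for arg in argv:
        if arg.startswith(INCLUDE):
            value, wanted = arg[len(INCLUDE):], True
        elif arg.startswith(EXCLUDE):
            value, wanted = arg[len(EXCLUDE):], False
        else:
            continue
        # Assumption: 'all' matches every architecture, and the last
        # matching option on the command line wins.
        if value in ("all", gpu_arch):
            include = wanted
    return include


if __name__ == "__main__":
    # Drop PTX everywhere, then re-enable it for sm_60 only.
    argv = ["--no-cuda-include-ptx=all", "--cuda-include-ptx=sm_60"]
    for arch in ("sm_35", "sm_60"):
        print(arch, should_include_ptx(argv, arch))
```

Under this assumed rule, passing `--no-cuda-include-ptx=all` followed by `--cuda-include-ptx=sm_60` would keep PTX only for sm_60, which is one way to trade binary size against forward compatibility on newer GPUs.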