summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorArtem Belevich <tra@google.com>2017-01-04 18:39:29 +0000
committerArtem Belevich <tra@google.com>2017-01-04 18:39:29 +0000
commit60f25f70c8621da4e8694789f6fb6301e07d54a6 (patch)
treeb1af00fd44981eed6e4c1f5a83eb0f2650dad7e6
parent8d29e62de730b631a8dfcaf56240a859ed30db51 (diff)
downloadbcm5719-llvm-60f25f70c8621da4e8694789f6fb6301e07d54a6.tar.gz
bcm5719-llvm-60f25f70c8621da4e8694789f6fb6301e07d54a6.zip
[CUDA] Pre-include sm_60 and sm_61 headers.
CUDA-8.0 comes with new headers which nvcc pre-includes via cuda_runtime.h Clang now makes them available as well. Differential Revision: https://reviews.llvm.org/D28301 llvm-svn: 290982
-rw-r--r--clang/lib/Headers/__clang_cuda_runtime_wrapper.h5
1 files changed, 5 insertions, 0 deletions
diff --git a/clang/lib/Headers/__clang_cuda_runtime_wrapper.h b/clang/lib/Headers/__clang_cuda_runtime_wrapper.h
index 205e15b40b5..931d44b6965 100644
--- a/clang/lib/Headers/__clang_cuda_runtime_wrapper.h
+++ b/clang/lib/Headers/__clang_cuda_runtime_wrapper.h
@@ -232,6 +232,11 @@ static inline __device__ void __brkpt(int __c) { __brkpt(); }
// hardware, seems to generate faster machine code because ptxas can more easily
// reason about our code.
+#if CUDA_VERSION >= 8000
+#include "sm_60_atomic_functions.hpp"
+#include "sm_61_intrinsics.hpp"
+#endif
+
#undef __MATH_FUNCTIONS_HPP__
// math_functions.hpp defines ::signbit as a __host__ __device__ function. This
OpenPOWER on IntegriCloud