diff options
| author | Gheorghe-Teodor Bercea <gheorghe-teod.bercea@ibm.com> | 2017-08-07 15:39:11 +0000 |
|---|---|---|
| committer | Gheorghe-Teodor Bercea <gheorghe-teod.bercea@ibm.com> | 2017-08-07 15:39:11 +0000 |
| commit | 47e0cf378c793b00207998a0537c2fd75bd1ec74 (patch) | |
| tree | 3f667d95b0f1bf3a9a5009c9c8a568374d48cfb2 /llvm/lib/CodeGen | |
| parent | 02d9945e6f11959887e2eefbbb5a635ff5087dbe (diff) | |
| download | bcm5719-llvm-47e0cf378c793b00207998a0537c2fd75bd1ec74.tar.gz bcm5719-llvm-47e0cf378c793b00207998a0537c2fd75bd1ec74.zip | |
[OpenMP] Add flag for specifying the target device architecture for OpenMP device offloading
Summary:
OpenMP has the ability to offload target regions to devices which may have different architectures.
A new -fopenmp-target-arch flag is introduced to specify the device architecture.
In this patch I use the new flag to specify the compute capability of the underlying NVIDIA architecture for the OpenMP offloading CUDA tool chain.
Only a host-offloading test is provided since full device offloading capability will only be available when [[ https://reviews.llvm.org/D29654 | D29654 ]] lands.
Reviewers: hfinkel, Hahnfeld, carlo.bertolli, caomhin, ABataev
Reviewed By: hfinkel
Subscribers: guansong, cfe-commits
Tags: #openmp
Differential Revision: https://reviews.llvm.org/D34784
llvm-svn: 310263
Diffstat (limited to 'llvm/lib/CodeGen')
0 files changed, 0 insertions, 0 deletions

