summaryrefslogtreecommitdiffstats
path: root/clang/test/CodeGen/builtins-nvptx-sm_70.cu
Commit message (Collapse)AuthorAgeFilesLines
* [NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma ↵Artem Belevich2018-04-181-94/+395
| | | | | | | | | | instructions. The new instructions were added added for sm_70+ GPUs in CUDA-9.1. Differential Revision: https://reviews.llvm.org/D45068 llvm-svn: 330296
* [NVPTX] Make tensor shape part of WMMA intrinsic's name.Artem Belevich2018-03-211-44/+44
| | | | | | | | | | This is needed for the upcoming implementation of the new 8x32x16 and 32x8x16 variants of WMMA instructions introduced in CUDA 9.1. Differential Revision: https://reviews.llvm.org/D44719 llvm-svn: 328158
* [CUDA] Added __hmma_m16n16k16_* builtins to support mma instructions on sm_70Artem Belevich2017-10-121-0/+166
Differential Revision: https://reviews.llvm.org/D38742 llvm-svn: 315624
OpenPOWER on IntegriCloud