path: root/parallel-libs/streamexecutor/examples/HostSaxpy.cpp
Commit message | Author | Age | Files | Lines
* [SE] Remove StreamExecutor | Jason Henline | 2016-10-25 | 1 | -94/+0

  Summary: The project has been renamed to Acxxel, so this old directory needs to be deleted.

  Reviewers: jlebar, jprice
  Subscribers: beanz, mgorny, parallel_libs-commits, modocache
  Differential Revision: https://reviews.llvm.org/D25964
  llvm-svn: 285115
* [SE] Pack global dev handle addresses | Jason Henline | 2016-09-13 | 1 | -2/+2

  Summary: We were packing global device memory handles in `PackedKernelArgumentArray`, but as I was implementing the CUDA platform, I realized that CUDA wants the address of the handle, not the handle itself. So this patch switches to packing the address of the handle.

  Reviewers: jlebar
  Subscribers: jprice, jlebar, parallel_libs-commits
  Differential Revision: https://reviews.llvm.org/D24528
  llvm-svn: 281424
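The change described in this commit can be illustrated with a minimal sketch. It is not the StreamExecutor code: `DeviceHandle`, `PackedArgs`, `packHandleAddress`, and `handleAt` are hypothetical names invented for illustration, and the real `PackedKernelArgumentArray` interface is not reproduced here. The sketch only shows the idea of storing a pointer to the handle rather than the handle value, which matches how CUDA's driver launch call takes an array of pointers to each kernel argument.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an opaque global device memory handle.
using DeviceHandle = void *;

// Hypothetical packed-argument container: one address and one size per
// kernel argument. This is an illustration, not the library's type.
struct PackedArgs {
  std::vector<const void *> Addresses;
  std::vector<std::size_t> Sizes;
};

// Pack the ADDRESS of the handle (not the handle value itself). The caller
// must keep Handle alive until the kernel launch has consumed the arguments.
void packHandleAddress(PackedArgs &Args, const DeviceHandle &Handle) {
  Args.Addresses.push_back(&Handle);
  Args.Sizes.push_back(sizeof(DeviceHandle));
}

// Read a packed handle back by dereferencing the stored address.
DeviceHandle handleAt(const PackedArgs &Args, std::size_t I) {
  assert(I < Args.Addresses.size() && "argument index out of range");
  return *static_cast<const DeviceHandle *>(Args.Addresses[I]);
}
```

Because only the address is stored, the handle object has to outlive the packed array; packing the address of a temporary would leave a dangling pointer.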
* [SE] Platforms return Device values | Jason Henline | 2016-09-13 | 1 | -7/+7

  Summary: Platforms were returning Device pointers, but a Device is now basically just a pointer to an underlying PlatformDevice, so we will now just pass it around as a value.

  Reviewers: jlebar
  Subscribers: jprice, jlebar, parallel_libs-commits
  Differential Revision: https://reviews.llvm.org/D24537
  llvm-svn: 281422
* [SE] Host platform implementation | Jason Henline | 2016-09-13 | 1 | -0/+94

  Summary: This implementation does not currently support multiple concurrent streams, and it won't allow kernels to be launched with grids larger than one block or blocks larger than one thread. These limitations could be removed in the future by launching new threads on the host, but that is not done in this implementation.

  Reviewers: jlebar
  Subscribers: beanz, mgorny, jprice, parallel_libs-commits
  Differential Revision: https://reviews.llvm.org/D24473
  llvm-svn: 281377
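To make the one-block, one-thread limitation concrete, here is a rough sketch of what a saxpy "kernel" looks like when a single host thread does all the work. This is not the content of `HostSaxpy.cpp`; `saxpyHost` is a hypothetical function, and the sketch assumes the host kernel is an ordinary C++ function that loops over every element itself.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical host "kernel" computing Y = A * X + Y. With a grid of one
// block and a block of one thread, there is no per-thread index to split
// the work, so the single thread iterates over all N elements.
void saxpyHost(float A, const float *X, float *Y, std::size_t N) {
  assert((N == 0 || (X != nullptr && Y != nullptr)) && "null input array");
  for (std::size_t I = 0; I < N; ++I)
    Y[I] = A * X[I] + Y[I];
}
```

Lifting the limitation, as the summary suggests, would amount to running this loop body on several host threads, each covering its own range of indices.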