Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | [NVPTX] Fix the codegen for llvm.round. | Bixia Zheng | 2019-04-01 | 1 | -3/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | Summary: Previously, we translate llvm.round to PTX cvt.rni, which rounds to the even interger when the source is equidistant between two integers. This is not correct as llvm.round should round away from zero. This change replaces llvm.round with a round away from zero implementation through target specific custom lowering. Modify a few affected tests to not check for cvt.rni. Instead, we check for the use of a few constants used in implementing round. We are also adding CUDA runnable tests to check for the values produced by llvm.round to test-suites/External/CUDA. Reviewers: tra Subscribers: jholewinski, sanjoy, jlebar, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59947 llvm-svn: 357407 | ||||
* | [NVPTX] Add codegen tests for llvm.fma. | Justin Lebar | 2017-01-15 | 1 | -0/+25 |
| | | | | llvm-svn: 292070 | ||||
* | [NVPTX] Implement llvm.fabs.f32, llvm.max.f32, etc. | Justin Lebar | 2016-09-09 | 1 | -0/+261 |
Summary: Previously these only worked via NVPTX-specific intrinsics. This change will allow us to convert these target-specific intrinsics into the general LLVM versions, allowing existing LLVM passes to reason about their behavior. It also gets us some minor codegen improvements as-is, from situations where we canonicalize code into one of these llvm intrinsics. Reviewers: majnemer Subscribers: llvm-commits, jholewinski, tra Differential Revision: https://reviews.llvm.org/D24300 llvm-svn: 281092 |