summaryrefslogtreecommitdiffstats
path: root/llvm/test/CodeGen/NVPTX/ldg-invariant.ll
Commit message (Collapse)AuthorAgeFilesLines
* [NVPTX] Fixed vectorized LDG for f16.Artem Belevich2018-04-061-0/+45
| | | | | | | | | v2f16 is a special case in NVPTX. v4f16 may be loaded as a pair of v2f16 and that was not previously handled correctly by tryLDGLDU() Differential Revision: https://reviews.llvm.org/D45339 llvm-svn: 329456
* [NVPTX] Add tests that invariant vector loads get lowered to ld.global.nc.Justin Lebar2017-02-041-0/+24
| | | | llvm-svn: 294082
* [NVPTX] Use ldg for explicitly invariant loads.Justin Lebar2016-09-111-0/+27
Summary: With this change (plus some changes to prevent !invariant from being clobbered within llvm), clang will be able to model the __ldg CUDA builtin as an invariant load, rather than as a target-specific llvm intrinsic. This will let the optimizer play with these loads -- specifically, we should be able to vectorize them in the load-store vectorizer. Reviewers: tra Subscribers: jholewinski, hfinkel, llvm-commits, chandlerc Differential Revision: https://reviews.llvm.org/D23477 llvm-svn: 281152
OpenPOWER on IntegriCloud