path: root/llvm/test/CodeGen/AMDGPU/vector-extract-insert.ll
Commit message | Author | Age | Files | Lines
* [AMDGPU] Convert insert_vector_elt into set of selects | Stanislav Mekhanoshin | 2018-11-19 | 1 | -1/+4
  This allows to avoid scratch use or indirect VGPR addressing for small vectors.
  Differential Revision: https://reviews.llvm.org/D54606
  llvm-svn: 347231
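The idea behind this change can be modeled outside the compiler. The sketch below is a plain-Python illustration, not the AMDGPU backend code: the function name `insert_element_as_selects` is hypothetical, and a Python list stands in for a small vector. It shows how an insert at a runtime index can be expressed as one compare-and-select per lane, so no indexed memory or register access is needed.

```python
def insert_element_as_selects(vec, value, index):
    """Model a dynamic-index vector insert as a set of per-lane selects.

    Hypothetical helper for illustration only. Each lane is computed as
    select(index == lane_number, value, old_lane).
    """
    # In this model an out-of-range index simply leaves the vector unchanged;
    # that is a property of the sketch, not a statement about LLVM semantics.
    return [value if index == i else lane for i, lane in enumerate(vec)]
```

For a four-element vector this amounts to four compares and four selects, which is presumably why the commit restricts the transformation to small vectors.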
* [AMDGPU] combine extractelement into several selects | Stanislav Mekhanoshin | 2018-11-13 | 1 | -1/+3
  An extractelement with non-constant index will be lowered either to scratch or movrel loop in most cases. This patch converts such instruction into a set of selects if vector size is not too big.
  Differential Revision: https://reviews.llvm.org/D54351
  llvm-svn: 346800
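As a rough illustration of what "a set of selects" means for an extract, the following Python sketch models the result as a chain of compare-and-select steps over the lanes. The function name `extract_element_as_selects` is hypothetical and the code is a model of the idea under that assumption, not the patch itself.

```python
def extract_element_as_selects(vec, index):
    """Model a dynamic-index vector extract as a chain of selects.

    Hypothetical helper for illustration only. Starts from lane 0 and, for
    every further lane i, computes select(index == i, vec[i], result).
    """
    result = vec[0]
    for i in range(1, len(vec)):
        # Keep the previous result unless the runtime index names this lane.
        result = vec[i] if index == i else result
    # In this model an out-of-range index falls through to lane 0.
    return result
```

The chain has one select per lane after the first, so its cost grows linearly with the vector length, which fits the commit's restriction to vectors that are "not too big".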
* AMDGPU: Cleanup subtarget features | Matt Arsenault | 2017-08-07 | 1 | -2/+2
  Try to avoid mutually exclusive features. Don't use a real default GPU, and use a fake "generic". The goal is to make it easier to see which set of features are incompatible between feature strings.
  Most of the test changes are due to random scheduling changes from not having a default fullspeed model.
  llvm-svn: 310258
* AMDGPU: Mark all unspecified CC functions in tests as amdgpu_kernel | Matt Arsenault | 2017-03-21 | 1 | -4/+4
  Currently the default C calling convention functions are treated the same as compute kernels. Make this explicit so the default calling convention can be changed to a non-kernel.
  Converted with perl -pi -e 's/define void/define amdgpu_kernel void/' on the relevant test directories (and undoing in one place that actually wanted a non-kernel).
  llvm-svn: 298444
* Reapply r274829 with fix for FP vectors | Matt Arsenault | 2016-07-08 | 1 | -0/+18
  llvm-svn: 274937
* DAGCombiner: Fold away vector extract of insert with the same index | Matt Arsenault | 2016-07-05 | 1 | -0/+66
  This only really matters when the index is non-constant since the constant case already gets taken care of by other combines.
  llvm-svn: 274569
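The fold described here can be pictured on a toy expression tree. The sketch below is a Python model under stated assumptions, not DAGCombiner code: the `Insert` and `Extract` node classes and the `fold_extract_of_insert` function are hypothetical names, and indices are symbolic values compared for equality.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    """Toy node: vec with `value` written at position `index`."""
    vec: object
    value: object
    index: object


@dataclass(frozen=True)
class Extract:
    """Toy node: read position `index` of `vec`."""
    vec: object
    index: object


def fold_extract_of_insert(node):
    """Rewrite extract(insert(v, x, i), i) to x; leave anything else alone."""
    if (
        isinstance(node, Extract)
        and isinstance(node.vec, Insert)
        and node.vec.index == node.index
    ):
        # Same index on both nodes: the extract reads exactly what was inserted.
        return node.vec.value
    return node
```

Because the match only requires the two indices to be the same value, it applies even when that value is not a compile-time constant, which is the case the commit message singles out.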