summaryrefslogtreecommitdiffstats
path: root/llvm/test/CodeGen/AMDGPU/reduction.ll
Commit message (Collapse)AuthorAgeFilesLines
* DAG: Change behavior of fminnum/fmaxnum nodesMatt Arsenault2018-10-221-21/+91
| | | | | | | | | | | Introduce new versions that follow the IEEE semantics to help with legalization that may need quieted inputs. There are some regressions from inserting unnecessary canonicalizes when these are matched from fast math fcmp + select which should be fixed in a future commit. llvm-svn: 344914
* AMDGPU: Restrict extract_vector_elt combine to loadsMatt Arsenault2018-08-311-2/+2
| | | | | | | | | | | The intention is to enable the extract_vector_elt load combine, and doing this for other operations interferes with more useful optimizations on vectors. Handle any type of load since in principle we should do the same combine for the various load intrinsics. llvm-svn: 341219
* AMDGPU: Improve extract_vector_elt reduction combineMatt Arsenault2018-08-151-27/+124
| | | | | | | | | | | Handle fmul, fsub and preserve flags. Also really test minnum/maxnum reductions. The existing tests were only checking from minnum/maxnum matched from a fast math compare and select which is not the same. llvm-svn: 339820
* AMDGPU: Make v2i16/v2f16 legal on VIMatt Arsenault2018-05-221-22/+22
| | | | | | | | | | | | This usually results in better code. Fixes using inline asm with short2, and also fixes having a different ABI for function parameters between VI and gfx9. Partially cleans up the mess used for lowering of the d16 operations. Making v4f16 legal will help clean this up more, but this requires additional work. llvm-svn: 332953
* [AMDGPU] Support horizontal vectorization of min/max.Farhana Aleen2018-05-091-16/+300
| | | | | | | | | | | | Author: FarhanaAleen Reviewed By: rampitec Subscribers: AMDGPU Differential Revision: https://reviews.llvm.org/D46604 llvm-svn: 331920
* [AMDGPU] Support horizontal vectorization.Farhana Aleen2018-05-011-0/+128
Author: FarhanaAleen Reviewed By: rampitec, arsenm Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D46213 llvm-svn: 331313
OpenPOWER on IntegriCloud