path: root/llvm/test/CodeGen/AMDGPU/fminnum.ll
Commit message | Author | Age | Files | Lines
* DAG: Change behavior of fminnum/fmaxnum nodes | Matt Arsenault | 2018-10-22 | 1 | -25/+52
  Introduce new versions that follow the IEEE semantics to help with legalization that may need quieted inputs.
  There are some regressions from inserting unnecessary canonicalizes when these are matched from fast math fcmp + select which should be fixed in a future commit.
  llvm-svn: 344914
* AMDGPU: Split amdgcn/r600 fminnum/fmaxnum tests | Matt Arsenault | 2018-07-31 | 1 | -218/+128
  R600 breaks on too many things to usefully test changes with ieee_mode on vs. off.
  llvm-svn: 338435
* AMDGPU: Stop wasting argument registers with v3i32/v3f32 | Matt Arsenault | 2018-07-28 | 1 | -0/+11
  SelectionDAGBuilder widens v3i32/v3f32 arguments to v4i32/v4f32 which consume an additional register. In addition to wasting argument space, this produces extra instructions since now it appears the 4th vector component has a meaningful value to most combines.
  llvm-svn: 338197
* AMDGPU: Mark all unspecified CC functions in tests as amdgpu_kernel | Matt Arsenault | 2017-03-21 | 1 | -17/+17
  Currently the default C calling convention functions are treated the same as compute kernels. Make this explicit so the default calling convention can be changed to a non-kernel.
  Converted with perl -pi -e 's/define void/define amdgpu_kernel void/' on the relevant test directories (and undoing in one place that actually wanted a non-kernel).
  llvm-svn: 298444
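  For readers without perl at hand, the substitution quoted in the message above can be sketched in Python. This is only an illustration of what the one-liner does to a line of IR; the helper name `mark_kernels` and the sample function `@test_fmin_f32` are hypothetical and do not come from the commit.

```python
import re


def mark_kernels(ir_text: str) -> str:
    """Apply the same textual rewrite as the perl one-liner in the commit message.

    Every occurrence of 'define void' becomes 'define amdgpu_kernel void'.
    Functions with other return types or an explicit calling convention
    do not contain that exact text and are left alone.
    """
    return re.sub(r"define void", "define amdgpu_kernel void", ir_text)


# Hypothetical test-file line, used only to show the effect of the rewrite.
sample = "define void @test_fmin_f32(float addrspace(1)* %out) {\n"
print(mark_kernels(sample), end="")
```

  Like the perl command, this is a blind text replacement, which is why the message notes that one place had to be reverted by hand.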
* Enable FeatureFlatForGlobal on Volcanic Islands | Matt Arsenault | 2017-01-24 | 1 | -1/+1
  This switches to the workaround that HSA defaults to for the mesa path.
  This should be applied to the 4.0 branch.
  Patch by Vedran Miletić <vedran@miletic.net>
  llvm-svn: 292982
* AMDGPU: Support commuting with immediate in src0 | Matt Arsenault | 2016-09-08 | 1 | -2/+2
  llvm-svn: 280970
* AMDGPU: Materialize sign bits with bfrev | Matt Arsenault | 2016-03-11 | 1 | -4/+4
  If a constant is the same as the reverse of an inline immediate, this is 4 bytes smaller than having to embed a 32-bit literal.
  llvm-svn: 263201
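  The condition described in the message above can be sketched as a small check. This is a rough illustration, not the backend's actual code: the helper names are hypothetical, and the sketch assumes the integer inline immediates are -16 through 64 and ignores the floating-point inline constants.

```python
def bitreverse32(x: int) -> int:
    """Reverse the order of the 32 bits of x."""
    x &= 0xFFFFFFFF
    return int(f"{x:032b}"[::-1], 2)


def is_inline_int(value: int) -> bool:
    """Assumption: integer inline immediates cover -16..64 (floats ignored)."""
    value &= 0xFFFFFFFF
    signed = value - (1 << 32) if value & 0x80000000 else value
    return -16 <= signed <= 64


def can_use_bfrev(constant: int) -> bool:
    """True when the constant itself needs a 32-bit literal but its
    bit-reversed form is an inline immediate, the case the commit targets."""
    return not is_inline_int(constant) and is_inline_int(bitreverse32(constant))


# The sign-bit mask is the bit reverse of 1, so it qualifies.
print(can_use_bfrev(0x80000000))
```

  Under these assumptions the sign-bit constant 0x80000000 passes the check because its reverse is 1, while a constant that is already an inline immediate does not need the trick.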
* R600 -> AMDGPU rename | Tom Stellard | 2015-06-13 | 1 | -0/+281
  llvm-svn: 239657