summaryrefslogtreecommitdiffstats
path: root/libclc/amdgpu/lib/math
Commit message (Collapse)AuthorAgeFilesLines
* Add initial support for half precision builtinsJan Vesely2018-05-172-0/+12
| | | | | | | | | | | | | | v2: fix fmax implementation use consistent checks for __CLC_FP_SIZE add missing TODOs fix whitespace in definitions.h v3: undef ZERO in modf.inc Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> reviewer: Jeroen Ketema <j.ketema@xs4all.nl> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 332677
* amdgpu/half_recip: Switch implementation to native_recipJan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325061
* amdgpu/half_log2: Switch implementation to native_log2Jan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325060
* amdgpu/half_log10: Switch implementation to native_log10Jan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325059
* amdgpu/half_log: Switch implementation to native_logJan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325058
* amdgpu/half_exp2: Switch implementation to native_exp2Jan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325057
* amdgpu/half_exp10: Switch implementation to native_exp10Jan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325056
* amdgpu/half_exp: Switch implementation to native_expJan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325055
* amdgpu/half_sqrt: Switch implementation to native_sqrtJan Vesely2018-02-131-0/+6
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325054
* amdgpu/half_rsqrt: Switch implementation to native_rsqrtJan Vesely2018-02-132-0/+17
| | | | | | Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325053
* amdgpu: Add workaround for unimplemented llvm.exp intrinsicJan Vesely2017-11-102-0/+8
| | | | | | Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 317935
* math: Implement native_log10Jan Vesely2017-10-252-0/+8
| | | | | | | | | | | | Use llvm instrinsic by default Provide amdgpu workaround v2: drop old amd copyrights Reviewer: Aaron Watry Reviewed-by: Vedran Miletić <vedran@miletic.net> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 316588
* amdgpu/math: Don't use llvm instrinsic for native_logJan Vesely2017-10-252-0/+8
| | | | | | | | | | | | | AMDGPU targets don't have insturction for it, so it'll be expanded to C * log2 anyway. v2: use native_log2 instead of the more precise sw implementation v3: move to amdgpu v4: drop old AMD copyright Reviewer: Aaron Watry Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 316587
* Do not include clc_nextafter header globallyJan Vesely2017-10-081-0/+1
| | | | | | | | Drop unused clc/math/clc_nextafter.h header Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315190
* Replace nextafter implementationMatt Arsenault2016-09-081-0/+5
| | | | | | This one passes conformance. llvm-svn: 280961
* amdgpu: Use right builtn for rsqMatt Arsenault2016-07-191-1/+6
| | | | | | | The r600 path has never actually worked sinced double is not implemented there. llvm-svn: 276009
* Replace llvm.AMDGPU.ldexp with llvm.amdgcn.ldexpMatt Arsenault2016-07-181-47/+0
| | | | | | | It didn't really work on r600 to begin with, which should get its own intrinsic. llvm-svn: 275813
* Split sources for amdgcn and r600Matt Arsenault2016-02-133-0/+110
Most files remain in a common amdgpu directory. Also switches barriers to to use convergent, and use llvm.amdgcn.s.barrier. This now requires 3.9/trunk to build amdgcn. llvm-svn: 260777
OpenPOWER on IntegriCloud