path: root/libclc
Commit message | Author | Age | Files | Lines
...
* maxmag: Condition variable needs to be the same bitwidth as operands | Jan Vesely | 2018-03-06 | 2 | -2/+21
  No changes wrt CTS
  Reviewed-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 326817
* Move cl_khr_fp64 extension enablement to gentype include lists | Jan Vesely | 2018-03-06 | 26 | -87/+7
  This will make adding cl_khr_fp16 support easier
  Reviewed-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 326816
* utils: Adapt to llvm r325155 | Jan Vesely | 2018-02-23 | 1 | -0/+4
  r325155 ("Pass a reference to a module to the bitcode writer.") changed bit writer interface from pointer to reference
  Reviewer: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325867
* amdgcn: Fix build after GDS/const AS swap in r325030 | Jan Vesely | 2018-02-23 | 17 | -14/+72
  Acked-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325866
* amdgcn: Fix datalayout after addition of 32bit const AS in r324747 | Jan Vesely | 2018-02-23 | 4 | -4/+4
  Acked-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325865
* r600: Fix datalayout after clang r324101 | Jan Vesely | 2018-02-23 | 16 | -4/+109
  r324101 switched around AS numbering
  Acked-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325864
* amdgcn: Fix datalayout after clang r324101 | Jan Vesely | 2018-02-23 | 18 | -5/+153
  r324101 switched around AS numbering
  Acked-by: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325863
* amdgpu/half_recip: Switch implementation to native_recip | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325061
* amdgpu/half_log2: Switch implementation to native_log2 | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325060
* amdgpu/half_log10: Switch implementation to native_log10 | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325059
* amdgpu/half_log: Switch implementation to native_log | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325058
* amdgpu/half_exp2: Switch implementation to native_exp2 | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325057
* amdgpu/half_exp10: Switch implementation to native_exp10 | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325056
* amdgpu/half_exp: Switch implementation to native_exp | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325055
* amdgpu/half_sqrt: Switch implementation to native_sqrt | Jan Vesely | 2018-02-13 | 2 | -0/+7
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325054
* amdgpu/half_rsqrt: Switch implementation to native_rsqrt | Jan Vesely | 2018-02-13 | 3 | -0/+18
  Reviewer: Tom Stellard <tstellar@redhat.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 325053
* Add vstore_half_rte implementation | Jan Vesely | 2018-02-06 | 2 | -1/+46
  Passes CTS on carrizo
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324376
* Add vstore_half_rtp implementation | Jan Vesely | 2018-02-06 | 2 | -1/+12
  Passes CTS on carrizo
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324375
* Add vstore_half_rtn implementation | Jan Vesely | 2018-02-06 | 2 | -1/+43
  Passes CTS on carrizo
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324374
* Add vstore_half_rtz implementation | Jan Vesely | 2018-02-06 | 2 | -1/+36
  Passes CTS on carrizo
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324373
* vstore_half: Consolidate declarations | Jan Vesely | 2018-02-06 | 1 | -39/+26
  Add support for rounding suffix
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324372
* vstore_half: Add support for custom rounding functions | Jan Vesely | 2018-02-06 | 1 | -23/+39
  Add another layer of indirection
  This will be used for specific rounding modes
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324371
* vstore_half: Make sure the helper function is always inline | Jan Vesely | 2018-02-06 | 1 | -1/+1
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 324370
* half_powr: Implement using powr | Jan Vesely | 2018-02-01 | 4 | -0/+15
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 323942
* math.h: Use logical operations instead of bit operations for readability | Jan Vesely | 2018-01-31 | 1 | -1/+1
  Trivial.
  Reported-by: Roman Lebedev <lebedev.ri@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 323920
* math.h: Set HAVE_HW_FMA32 based on compiler provided macro | Jan Vesely | 2018-01-29 | 1 | -0/+5
  Fixes sin/cos piglits on non-FMA capable asics.
  Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=35983
  Reviewer: Tom Stellard
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 323677
* tanpi: Port from amd_builtins | Jan Vesely | 2018-01-19 | 6 | -0/+168
  Passes piglit on turks and carrizo.
  Passes CTS on carrizo.
  Acked-By: Aaron Watry <awatry@gmail.com>
  Tested-By: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322980
* tan: Port from amd_builtins | Jan Vesely | 2018-01-19 | 9 | -21/+164
  v2: fixup constant precision
  Passes piglit on turks and carrizo.
  Passes CTS on carrizo
  Fixes half_tan to pass CTS on carrizo
  Acked-By: Aaron Watry <awatry@gmail.com>
  Tested-By: Aaron Watry <awatry@gmail.com>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322979
* half_divide: Implement using x/y | Jan Vesely | 2018-01-18 | 5 | -0/+27
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322899
* half_tan: Implement using tan | Jan Vesely | 2018-01-18 | 4 | -0/+17
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322898
* half_sin: Implement using sin | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322897
* half_recip: Implement using 1/x | Jan Vesely | 2018-01-18 | 4 | -0/+21
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322896
* half_log2: Implement using log2 | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322895
* half_log10: Implement using log10 | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322894
* half_log: Implement using log | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322893
* half_exp10: Implement using exp10 | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322892
* half_exp2: Implement using exp2 | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322891
* half_exp: Implement using exp | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322890
* half_cos: Implement using cos | Jan Vesely | 2018-01-18 | 4 | -0/+17
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322889
* half_sqrt: Cleanup implementation | Jan Vesely | 2018-01-18 | 3 | -51/+2
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322888
* half_rsqrt: Cleanup implementation | Jan Vesely | 2018-01-18 | 4 | -51/+11
  Passes CTS on carrizo
  v2: Use full precision implementation
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322887
* rootn: Port from amd_builtins | Jan Vesely | 2018-01-17 | 9 | -0/+399
  Passes piglit on turks and carrizo
  fp64 passes CTS on carrizo
  v2: fix formatting; check fp32 denormal support at runtime
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322763
* powr: Port from amd_builtins | Jan Vesely | 2018-01-17 | 6 | -0/+411
  Passes piglit on turks and carrizo
  fp64 passes CTS on carrizo
  v2: fix formatting; check fp32 denormal support at runtime
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322762
* pown: Port from amd_builtins | Jan Vesely | 2018-01-17 | 8 | -31/+395
  Passes piglit on turks and carrizo
  fp64 passes CTS on carrizo
  v2: fix formatting; check fp32 denormal support at runtime
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322761
* pow: Port from amd_builtins | Jan Vesely | 2018-01-17 | 8 | -6/+1096
  Passes piglit on turks and carrizo
  fp64 passes CTS on carrizo
  v2: fix formatting; check fp32 denormal support at runtime
  Reviewer: Jeroen Ketema <j.ketema@xs4all.nl>
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 322760
* configure.py: Add gfx900 (Vega, Raven) | Vedran Miletic | 2017-11-27 | 1 | -2/+2
  Sort amdgcn-- and amdgcn--amdhsa in a consistent way.
  llvm-svn: 319017
* math: Implement minmag | Jan Vesely | 2017-11-15 | 5 | -0/+17
  Reviewer: Aaron Watry
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 318265
* math: Implement maxmag | Jan Vesely | 2017-11-15 | 5 | -0/+17
  Reviewer: Aaron Watry
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 318264
* native_powr: Switch implementation to native_exp2 and native_log2 | Jan Vesely | 2017-11-14 | 4 | -1/+18
  v2: don't use assume; check only for x<0, the other conditions are handled transparently
  v3: don't check inputs at all, nan propagation works as expected
  Reviewer: Jeroen Ketema
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 318204
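  Note on the native_powr entry above: the v3 remark (no input checks, NaN propagates) can be illustrated with a minimal host-side sketch. This is not the libclc source. The function name `powr_via_exp2_log2` is hypothetical, and the branch that emulates a log2 returning NaN for negative input and -inf for zero is an assumption made only so that plain Python behaves like a non-raising floating-point log2.

```python
import math


def powr_via_exp2_log2(x: float, y: float) -> float:
    """Illustrative only: powr(x, y) computed as exp2(y * log2(x))."""
    # Assumption: emulate a log2 that yields NaN for negative input and
    # -inf for zero, instead of raising as Python's math.log2 does.
    if math.isnan(x) or x < 0.0:
        log2_x = math.nan
    elif x == 0.0:
        log2_x = -math.inf
    else:
        log2_x = math.log2(x)
    try:
        # exp2 step: a NaN exponent yields a NaN result, so no explicit
        # check on x is needed before this point.
        return math.pow(2.0, y * log2_x)
    except OverflowError:
        # Python raises on overflow; map that to +inf for this sketch.
        return math.inf
```

  With this sketch, `powr_via_exp2_log2(2.0, 3.0)` evaluates to 8.0, and a negative base such as `powr_via_exp2_log2(-1.0, 2.0)` returns NaN purely through propagation from the log2 step.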
* native_divide: provide function implementation instead of macro | Jan Vesely | 2017-11-13 | 5 | -1/+17
  Reviewer: Jeroen Ketema
  Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
  llvm-svn: 318067