bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	integer/add_sat: Use clang builtin instead of llvm asm	Jan Vesely	2017-10-02	1	-2/+0
\| \| \| \| \| \| \|	reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314702
*	integer/clz: Use clang builtin instead of llvm asm	Jan Vesely	2017-10-02	1	-2/+0
\| \| \| \| \| \| \| \| \|	The generated llvm IR mostly identical. char/uchar case is a bit worse. reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314701
*	Rework atomic ops to use clang builtins rather than llvm asm	Jan Vesely	2017-09-25	1	-1/+8
\| \| \| \| \| \| \|	reviewer: Aaron Watry Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314112
*	Implement cl_khr_int64_extended_atomics builtins	Jan Vesely	2017-09-20	1	-0/+5
\| \| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 313811
*	Implement cl_khr_int64_base_atomics builtins	Jan Vesely	2017-09-20	1	-0/+6
\| \| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 313810
*	vstore: Cleanup and add vstore(half)	Jan Vesely	2017-09-08	1	-1/+0
\| \| \| \| \| \| \| \| \| \|	Add missing undefs Make helpers amdgpu specific (NVPTX uses different numbering for private AS) Use clang builtins on clang >= 6 Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tstellar@redhat.com> llvm-svn: 312838
*	relational: Implement shuffle2 builtin	Aaron Watry	2017-09-02	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was added in CL 1.1 Tested with a Radeon HD 7850 (Pitcairn) using the CL CTS via: test_conformance/relationals/test_relationals shuffle_built_in_dual_input v2: Add half support to shuffle2 Move shuffle2 to misc/ Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 312404
*	relational: Implement shuffle builtin	Aaron Watry	2017-09-02	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was added in CL 1.1 Tested with a Radeon HD 7850 (Pitcairn) using the CL CTS via: test_conformance/relationals/test_relationals shuffle_built_in v2: Add half-precision support to shuffle when available. Move to misc/ and add section 6.12.12 to clc.h Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 312403
*	math: Implement sinh function	Jan Vesely	2017-02-25	1	-0/+1
\| \| \| \| \| \|	mostly copied form amd_builtins llvm-svn: 296233
*	math: Add logb builtin	Aaron Watry	2017-01-18	1	-0/+1
\| \| \| \| \| \| \| \| \|	Ported from the amd-builtins branch. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 292335
*	math: Add expm1 builtin function	Aaron Watry	2017-01-18	1	-0/+1
\| \| \| \| \| \| \| \| \|	Ported from the amd-builtins branch. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 292334
*	Provide vstore_half helper to workaround clc restrictions	Jan Vesely	2016-09-21	1	-0/+1
\| \| \| \| \| \|	clang won't accept half precision loads and stores without cl_khr_fp16 since r281904 llvm-svn: 282106
*	math: Implement tgamma	Aaron Watry	2016-09-15	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281566
*	math: Implement lgamma	Aaron Watry	2016-09-15	1	-0/+1
\| \| \| \| \| \| \| \|	Just use lgamma_r and ignore the value returned in the second argument Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281565
*	math: Implement lgamma_r	Aaron Watry	2016-09-15	1	-0/+1
\| \| \| \| \| \| \| \| \|	Ported from the amd-builtins branch, which is itself based on the Sun Microsystems implementation. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281564
*	Implement cbrt builtin	Tom Stellard	2016-07-22	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 276497
*	Implement cosh builtin	Tom Stellard	2016-07-22	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 276496
*	math: Add erf ported from amd-builtins	Jan Vesely	2016-05-06	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The scalar float/double function bodies are a direct copy/paste, aside from the removed (optional) code in float function body that requires subnormals. reviewers: jvesely Patch by: Vedran Miletić <rivanvx@gmail.com> llvm-svn: 268766
*	math: Add fdim implementation	Aaron Watry	2016-05-06	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	Based on the amd-builtin, but explicitly vectorized for all sizes (not just float4), and includes a vectorized double implementation. Passes piglit (float) tests on pitcairn. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 268708
*	math: Add ilogb ported from amd-builtins	Aaron Watry	2016-02-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The scalar float/double function bodies are a direct copy/paste with usage of the CLC wrappers to vectorize them. This commit also adds in the FP_ILOGB0 and FP_ILOGBNAN macros which are equal to the results of ilogb(0.0f) and ilogb(float nan) respectively. v2: Add FP_ILOGB0 and FP_ILOGBNAN definitions Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> v1 Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 261639
*	math: Add frexp ported from amd-builtins	Aaron Watry	2016-02-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The float implementation is almost a direct port from the amd-builtins, but instead of just having a scalar and float4 implementation, it has a scalar and arbitrary width vector implementation. The double scalar is also a direct port from AMD's builtin release. The double vector implementation copies the logic in the float vector implementation using the values from the double scalar version. Both have been tested in piglit using tests sent to that project's mailing list. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 260114
*	Implement modf math builtin	Tom Stellard	2016-01-27	1	-0/+1
\| \| \| \| \| \| \| \|	V2: use the reference implementation as suggested by Matt Arsenault Patch By: Pavel Ondračka llvm-svn: 258933
*	Implement tanh builtin	Niels Ole Salscheider	2015-09-29	1	-0/+1
\| \| \| \| \| \|	This is a port from the AMD builtin library. llvm-svn: 248780
*	Add image attribute getter builtins	Tom Stellard	2015-09-21	1	-0/+1
\| \| \| \| \| \| \| \| \|	Added get_image_* OpenCL builtins to the headers. Added implementation to the r600 target. Patch by: Zoltan Gilian llvm-svn: 248159
*	Fix double implementation of log	Tom Stellard	2015-07-24	1	-0/+1
\| \| \| \| \| \|	We need to use M_LOG2E instead of M_LOG2E_F. llvm-svn: 243132
*	Implement accurate log2 function	Tom Stellard	2015-07-24	1	-0/+1
\| \| \| \| \| \| \| \| \|	Use the implementation was ported from the AMD builtin library rather than LLVM Intrinsics. This has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 243131
*	Use llvm intrinsics for native_log and native_log2	Tom Stellard	2015-07-24	1	-0/+2
\| \| \| \|	llvm-svn: 243130
*	Fix implementation of sqrt v2	Tom Stellard	2015-07-10	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	Passing values less than 0 to the llvm.sqrt() intrinsic results in undefined behavior, so we need to check the input and return NaN if is is less than 0. v2: - Fix build failures. llvm-svn: 241906
*	Implement exp2 using OpenCL C rather than using an intrinsic	Tom Stellard	2015-05-13	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Not all targets support the intrinsic, so it's better to have a generic implementation which does not use it. This exp2 implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 237228
*	Implement atan2pi builtin	Tom Stellard	2015-05-12	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 237138
*	Implement fast_normalize builtin v4	Tom Stellard	2015-05-09	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. v2: - Remove f suffix from constant in double implementations. - Consolidate implementations using the .cl/.inc approach. v3: - Use __CLC_FPSIZE instead of __CLC_FP{32,64} v4 (Jan Vesely): - Limit to single precision. llvm-svn: 236920
*	Implement half_rsqrt builtin v3	Tom Stellard	2015-05-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	This is a generic implementation which just calls rsqrt. Targets should override this if they want a faster implementation. v2: - Alphabettize SOURCES v3 (Jan Vesely): Limit to single precision types. llvm-svn: 236915
*	Move ldexp soft implementation to a separate file	Jan Vesely	2015-05-06	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 236648
*	Implement sinpi builtin	Jan Vesely	2015-05-06	1	-0/+1
\| \| \| \| \| \| \| \|	Ported from AMD builtin library, passes piglit on Turks. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 236647
*	math: Add ldexp implementation	Tom Stellard	2015-05-06	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Tom Stellard: - Add denormal handling. - Share vectorization code with r600 implementation. Patch By: Aaron Watry llvm-svn: 236639
*	Implement fract builtin	Tom Stellard	2015-04-23	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 235620
*	configure: Add --enable-runtime-subnormal option	Tom Stellard	2015-04-20	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This makes it possible for runtime implementations to disable subnormal handling at runtime. When this flag is enabled, decisions about how to handle subnormals in the library will be controlled by an external variable called __CLC_SUBNORMAL_DISABLE. Function implementations should use these new helpers for querying subnormal support: __clc_fp16_subnormals_supported(); __clc_fp32_subnormals_supported(); __clc_fp64_subnormals_supported(); In order for the library to link correctly with this feature, users will be required to either: 1. Insert this variable into the module (if using the LLVM/Clang C++/C APIs). 2. Pass either subnormal_disable.bc or subnormal_use_default.bc to the linker. These files are distributed with liblclc and installed to $(installdir). e.g.: llvm-link -o kernel-out.bc kernel.bc builtins-nosubnormal.bc subnormal_disable.bc or llvm-link -o kernel-out.bc kernel.bc builtins-nosubnormal.bc subnormal_use_default.bc If you do not supply the --enable-runtime-subnormal then the library behaves the same as it did before this commit. In addition to these changes, the patch adds helper functions that should be used when implementing library functions that need special handling for denormals: __clc_fp16_subnormals_supported(); __clc_fp32_subnormals_supported(); __clc_fp64_subnormals_supported(); llvm-svn: 235329
*	Implement atanh builtin	Tom Stellard	2015-04-07	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 234324
*	Implement acosh builtin	Tom Stellard	2015-04-07	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 234323
*	Implement atanpi builtin	Tom Stellard	2015-04-02	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 233928
*	Implement asinpi builtin	Tom Stellard	2015-04-02	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 233927
*	Implement asinh builtin	Tom Stellard	2015-04-02	1	-0/+2
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 233926
*	Implement acospi builtin	Tom Stellard	2015-04-02	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 233925
*	Implement fast_distance builtin	Tom Stellard	2015-03-23	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 232978
*	Implement fast_length builtin	Tom Stellard	2015-03-23	1	-0/+1
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 232977
*	Implement half_sqrt builtin v2	Tom Stellard	2015-03-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	This is a generic implementation which just calls sqrt. Targets should override this if they want a faster implementation. v2: - Alphabetize SOURCES llvm-svn: 232965
*	Implement distance builtin v2	Tom Stellard	2015-03-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. v2: - Remove unnecessary copyright. llvm-svn: 232964
*	math: Implement erfc	Aaron Watry	2015-03-18	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 232674
*	Fix bitselect for float/double types v2	Tom Stellard	2015-03-05	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We need to reinterpret float/double types as uint/ulong in order to perform the bitwise operations. This has been tested with piglit, OpenCV, and the ocl conformance tests. v2: - Use vector operations rather than splitting vectors into scalar components. Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 231373
*	Move mix from math to common	Aaron Watry	2015-03-03	1	-1/+1
\| \| \| \| \| \| \| \|	It has been part of the common functions since 1.0 Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 231137