bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fix build since r286752.	Tom Stellard	2016-11-14	1	-1/+2
\| \| \| \|	llvm-svn: 286839
*	Fix build since llvm r286566 and require at least llvm 4.0	Tom Stellard	2016-11-11	2	-3/+4
\| \| \| \|	llvm-svn: 286634
*	Provide vstore_half helper to workaround clc restrictions	Jan Vesely	2016-09-21	4	-26/+75
\| \| \| \| \| \|	clang won't accept half precision loads and stores without cl_khr_fp16 since r281904 llvm-svn: 282106
*	configure: Add amdgcn-mesa-mesa3d target	Tom Stellard	2016-09-16	1	-1/+5
\| \| \| \|	llvm-svn: 281793
*	amdgcn-amdhsa: Add get_num_groups implementation	Tom Stellard	2016-09-16	3	-0/+14
\| \| \| \|	llvm-svn: 281792
*	amdgcn-amdhsa: Add get_global_size() implementation	Tom Stellard	2016-09-16	2	-0/+40
\| \| \| \|	llvm-svn: 281791
*	math: Implement tgamma	Aaron Watry	2016-09-15	5	-0/+77
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281566
*	math: Implement lgamma	Aaron Watry	2016-09-15	5	-0/+49
\| \| \| \| \| \| \| \|	Just use lgamma_r and ignore the value returned in the second argument Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281565
*	math: Implement lgamma_r	Aaron Watry	2016-09-15	6	-0/+518
\| \| \| \| \| \| \| \| \|	Ported from the amd-builtins branch, which is itself based on the Sun Microsystems implementation. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281564
*	Add ADDR_SPACE parameter to _CLC_V_V_VP_VECTORIZE	Aaron Watry	2016-09-15	1	-12/+27
\| \| \| \| \| \| \| \| \| \| \|	This macro is currently unused, but I plan to use it shortly. The previous form did casts of pointers without an address space, which doesn't work so well for CL 1.x. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 281563
*	Replace nextafter implementation	Matt Arsenault	2016-09-08	2	-28/+29
\| \| \| \| \| \|	This one passes conformance. llvm-svn: 280961
*	Avoid ambiguity in calling atom_add functions.	Jan Vesely	2016-09-07	4	-4/+4
\| \| \| \| \| \| \| \| \| \|	clang (since r280553) allows pointer casts in function overloads, so we need to disambiguate the second argument. clang might be smarter about overloads in the future see https://reviews.llvm.org/D24113, but let's be safe in libclc anyway. llvm-svn: 280871
*	configure.py: Add polaris10 and polaris11	Niels Ole Salscheider	2016-08-30	1	-2/+2
\| \| \| \|	llvm-svn: 280121
*	amdgcn: Fix return type of get_num_groups	Matt Arsenault	2016-08-25	5	-2/+24
\| \| \| \|	llvm-svn: 279723
*	Strip opencl.ocl.version metadata	Matt Arsenault	2016-08-25	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	This should be uniqued when linking, but right now it creates a lot of metadata spam listing the same version. This should also probably be reporting the compiled version of the user program, which may differ from the library. Currently the library IR files report 1.0 while 1.1/1.2 are the default for user programs. llvm-svn: 279692
*	amdgcn: Also correct get_local_size type for HSA	Matt Arsenault	2016-08-24	1	-5/+8
\| \| \| \|	llvm-svn: 279656
*	amdgcn: Fix return type for get_global_size	Matt Arsenault	2016-08-24	5	-2/+24
\| \| \| \|	llvm-svn: 279644
*	amdgpu: Fix default case value for get_local_size	Matt Arsenault	2016-08-20	2	-2/+2
\| \| \| \|	llvm-svn: 279359
*	amdgcn: Fix get_local_size IR return type	Matt Arsenault	2016-08-20	5	-5/+27
\| \| \| \|	llvm-svn: 279350
*	amdgcn: Correct return types to be size_t	Matt Arsenault	2016-08-19	3	-3/+3
\| \| \| \|	llvm-svn: 279343
*	Implement vstore_half{,n}	Jan Vesely	2016-08-17	3	-19/+68
\| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 278962
*	Make min follow the OCL 1.0 specs	Jan Vesely	2016-07-25	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	OpenCL 1.0: "Returns y if y < x, otherwise it returns x. If x and y are infinite or NaN, the return values are undefined." OpenCL 1.1+: "Returns y if y < x, otherwise it returns x. If x or y are infinite or NaN, the return values are undefined." The 1.0 version is stricter so use that one. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276704
*	Implement cbrt builtin	Tom Stellard	2016-07-22	7	-0/+869
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 276497
*	Implement cosh builtin	Tom Stellard	2016-07-22	7	-0/+370
\| \| \| \| \| \| \|	This implementation was ported from the AMD builtin library and has been tested with piglit, OpenCV, and the ocl conformance tests. llvm-svn: 276496
*	geometric/floatn.inc: Add vec8 and vec16 types	Tom Stellard	2016-07-22	1	-0/+16
\| \| \| \|	llvm-svn: 276495
*	AMDGPU: Implement get_global_offset builtin	Jan Vesely	2016-07-22	9	-1/+33
\| \| \| \| \| \| \| \| \| \| \| \| \|	Also fix get_global_id to consider offset No idea how to add this for ptx, so they are stuck with the old get_global_id implementation. v2: split to a separate patch v3: Switch R600 to use implictarg.ptr Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276443
*	AMDGPU: Use clang intrinsics for workitem builtins	Jan Vesely	2016-07-22	14	-136/+71
\| \| \| \| \| \| \| \| \| \| \|	v2: split into 2 patches use clang builtins for other intrinsics as well v3: Fix warnings Switch r600 to use implictarg.ptr Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276442
*	ptx: Fix builtin names after clang r274770	Jan Vesely	2016-07-22	5	-13/+13
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Acked-By: Aaron Watry <awatry@gmail.com> llvm-svn: 276423
*	amdgpu: Use right builtn for rsq	Matt Arsenault	2016-07-19	1	-1/+6
\| \| \| \| \| \| \|	The r600 path has never actually worked sinced double is not implemented there. llvm-svn: 276009
*	R600: Use new barrier intrinsic	Matt Arsenault	2016-07-18	1	-4/+3
\| \| \| \|	llvm-svn: 275874
*	Replace llvm.AMDGPU.ldexp with llvm.amdgcn.ldexp	Matt Arsenault	2016-07-18	3	-3/+3
\| \| \| \| \| \| \|	It didn't really work on r600 to begin with, which should get its own intrinsic. llvm-svn: 275813
*	configure: Remove device specific defines	Jan Vesely	2016-06-17	1	-25/+11
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 273044
*	nvptx: Drop feature defines.	Jan Vesely	2016-06-17	1	-6/+4
\| \| \| \| \| \| \| \|	This is now handled by clang Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 273043
*	64 bit integers are legal in full profile without an extension	Jan Vesely	2016-06-17	2	-6/+12
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 273042
*	math: Use single precision fmax in sp path	Jan Vesely	2016-05-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Fixes fdim piglit on Turks v2: use CL fmax instead of __builtin Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom.stellard@amd.com> llvm-svn: 269807
*	math: Add erf ported from amd-builtins	Jan Vesely	2016-05-06	4	-0/+413
\| \| \| \| \| \| \| \| \| \| \| \|	The scalar float/double function bodies are a direct copy/paste, aside from the removed (optional) code in float function body that requires subnormals. reviewers: jvesely Patch by: Vedran Miletić <rivanvx@gmail.com> llvm-svn: 268766
*	math: Add fdim implementation	Aaron Watry	2016-05-06	6	-0/+86
\| \| \| \| \| \| \| \| \| \| \|	Based on the amd-builtin, but explicitly vectorized for all sizes (not just float4), and includes a vectorized double implementation. Passes piglit (float) tests on pitcairn. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 268708
*	prepare-builtins: Remove call to getGlobalContext()	Tom Stellard	2016-04-15	1	-1/+1
\| \| \| \| \| \| \| \|	This function has been removed from LLVM. Patch By: Laurent Carlier llvm-svn: 266430
*	[AMDGPU] Implement get_local_size for amdgcn--amdhsa triple	Konstantin Zhuravlyov	2016-04-07	5	-1/+41
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D18284 llvm-svn: 265713
*	Update copyright year to 2016.	Paul Robinson	2016-03-30	1	-1/+1
\| \| \| \|	llvm-svn: 264949
*	math: Fix ilogb(double) return type	Aaron Watry	2016-02-24	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 261714
*	math: Add ilogb ported from amd-builtins	Aaron Watry	2016-02-23	6	-0/+68
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The scalar float/double function bodies are a direct copy/paste with usage of the CLC wrappers to vectorize them. This commit also adds in the FP_ILOGB0 and FP_ILOGBNAN macros which are equal to the results of ilogb(0.0f) and ilogb(float nan) respectively. v2: Add FP_ILOGB0 and FP_ILOGBNAN definitions Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> v1 Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 261639
*	Add .gitignore for build directories	Matt Arsenault	2016-02-17	1	-0/+13
\| \| \| \|	llvm-svn: 261043
*	amdgcn: Use new workitem intrinsics	Matt Arsenault	2016-02-17	9	-38/+124
\| \| \| \|	llvm-svn: 261042
*	Update page to list supported targets	Matt Arsenault	2016-02-13	1	-2/+2
\| \| \| \|	llvm-svn: 260778
*	Split sources for amdgcn and r600	Matt Arsenault	2016-02-13	34	-38/+75
\| \| \| \| \| \| \| \| \| \| \|	Most files remain in a common amdgpu directory. Also switches barriers to to use convergent, and use llvm.amdgcn.s.barrier. This now requires 3.9/trunk to build amdgcn. llvm-svn: 260777
*	configure: Remove llvm 3.6 defines	Jan Vesely	2016-02-09	1	-3/+3
\| \| \| \| \| \| \| \|	we require llvm 3.7 reviewer: tstellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 260304
*	configure: Remove cl_khr_fp64 for device that don't support doubles	Jan Vesely	2016-02-09	1	-5/+5
\| \| \| \| \| \| \| \| \|	Also remove definitions if provided by clang (3.7+) This halves the size of builtin.opt.{cedar,barts}.bc reviewer: tstellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 260303
*	configure: Introduce per device defines	Jan Vesely	2016-02-09	1	-11/+24
\| \| \| \| \| \| \| \| \| \| \|	Make cl_khr_fp64 define per-device. This patch does not change the generated Makefile (for llvm 3.6, 3.7) v2: Make the device defines per LLVM version, 'all' for all versions reviewer: tstellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 260302
*	math: Fix log2 vectorization on non-fp64 hw	Jan Vesely	2016-02-09	1	-0/+2
\| \| \| \| \| \|	reviewer: tstellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 260301