bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	r600: Remove empty OVERRIDES file	Jan Vesely	2018-11-27	1	-0/+0
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 347666
*	r600: Add datalayout to image builtin implementation	Jan Vesely	2018-11-10	3	-0/+6
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 346597
*	r600: Convert barrier to clc	Jan Vesely	2018-11-04	12	-35/+10
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 346078
*	r600: Convert get_num_groups to clc	Jan Vesely	2018-11-04	12	-49/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 346077
*	r600: Convert get_global_size to clc	Jan Vesely	2018-11-04	12	-49/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 346076
*	r600: Convert get_local_size to clc	Jan Vesely	2018-11-04	12	-49/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 346075
*	r600/fmin: Flush denormals before calling builtin.	Jan Vesely	2018-06-07	2	-0/+31
\| \| \| \| \| \| \| \| \|	Same reason as amdgcn. Fixes fmin, minmag CTS on turks. Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 334228
*	r600/fmax: Flush denormals before calling builtin.	Jan Vesely	2018-06-07	2	-0/+30
\| \| \| \| \| \| \| \| \|	Same reason as amdgcn. Fixes fmax, maxmag CTS on turks. Reviewer: Tom Stellard <tstellar@redhat.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 334227
*	r600: Update datalayout after LLVM r328656	Jan Vesely	2018-04-05	4	-4/+4
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 329291
*	r600: Fix datalayout after clang r324101	Jan Vesely	2018-02-23	16	-4/+109
\| \| \| \| \| \| \| \|	r324101 switched around AS numbering Acked-by: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325864
*	r600: Add missing datalayout to .ll files	Jan Vesely	2017-10-20	4	-0/+8
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Acked-by: Aaron Watry <awatry@gmail.com> llvm-svn: 316238
*	Make image builtins r600/llvm-3.9 only	Jan Vesely	2017-10-10	16	-0/+356
\| \| \| \| \| \| \| \| \| \|	The implementation uses r600-specific intrinsics. LLVM-4 switched to _ro_t and _rw_t image types. Portions of the code can be moved back as more targets/llvm versions add image support. Reviewer: Aaron Watry Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315341
*	Let get_work_dim take exactly 0 arguments	Jeroen Ketema	2017-10-01	1	-1/+1
\| \| \| \| \|	Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314634
*	r600: Cleanup barrier implementation.	Jan Vesely	2017-09-04	1	-26/+5
\| \| \| \| \| \| \| \| \|	We don't have memory fences for r600, so just call group barrier directly. Make sure that barrier is called even with 0 flags. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 312492
*	amdgcn: Fix return type of get_num_groups	Matt Arsenault	2016-08-25	2	-0/+19
\| \| \| \|	llvm-svn: 279723
*	amdgcn: Fix return type for get_global_size	Matt Arsenault	2016-08-24	2	-0/+19
\| \| \| \|	llvm-svn: 279644
*	amdgpu: Fix default case value for get_local_size	Matt Arsenault	2016-08-20	1	-1/+1
\| \| \| \|	llvm-svn: 279359
*	amdgcn: Fix get_local_size IR return type	Matt Arsenault	2016-08-20	2	-0/+19
\| \| \| \|	llvm-svn: 279350
*	AMDGPU: Implement get_global_offset builtin	Jan Vesely	2016-07-22	2	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	Also fix get_global_id to consider offset. No idea how to add this for ptx, so they are stuck with the old get_global_id implementation. v2: split to a separate patch. v3: Switch R600 to use implicitarg.ptr. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276443
*	AMDGPU: Use clang intrinsics for workitem builtins	Jan Vesely	2016-07-22	6	-62/+34
\| \| \| \| \| \| \| \| \| \| \|	v2: split into 2 patches, use clang builtins for other intrinsics as well. v3: Fix warnings. Switch r600 to use implicitarg.ptr. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276442
*	R600: Use new barrier intrinsic	Matt Arsenault	2016-07-18	1	-4/+3
\| \| \| \|	llvm-svn: 275874
*	amdgcn: Use new workitem intrinsics	Matt Arsenault	2016-02-17	3	-0/+62
\| \| \| \|	llvm-svn: 261042
*	Split sources for amdgcn and r600	Matt Arsenault	2016-02-13	28	-649/+11
\| \| \| \| \| \| \| \| \| \| \|	Most files remain in a common amdgpu directory. Also switches barriers to use convergent, and use llvm.amdgcn.s.barrier. This now requires 3.9/trunk to build amdgcn. llvm-svn: 260777
*	r600: Add image writing builtins.	Tom Stellard	2015-09-21	5	-0/+83
\| \| \| \| \| \|	Patch by: Zoltan Gilian llvm-svn: 248161
*	r600: Add image reading builtins.	Tom Stellard	2015-09-21	5	-0/+110
\| \| \| \| \| \|	Patch by: Zoltan Gilian llvm-svn: 248160
*	Add image attribute getter builtins	Tom Stellard	2015-09-21	7	-0/+153
\| \| \| \| \| \| \| \| \|	Added get_image_* OpenCL builtins to the headers. Added implementation to the r600 target. Patch by: Zoltan Gilian llvm-svn: 248159
*	R600: Implement accurate double precision sqrt v2	Tom Stellard	2015-07-10	2	-0/+60
\| \| \| \| \| \| \|	v2: - Use same implementation for R600 and gcn. llvm-svn: 241907
*	r600: Use __clc_ldexp on asics that don't implement the instruction	Jan Vesely	2015-05-06	1	-1/+10
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 236649
*	math: Add ldexp implementation	Tom Stellard	2015-05-06	2	-30/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Tom Stellard: - Add denormal handling. - Share vectorization code with r600 implementation. Patch By: Aaron Watry llvm-svn: 236639
*	Implement ldexp for R600/SI	Tom Stellard	2015-05-06	3	-0/+68
\| \| \| \|	llvm-svn: 236638
*	r600: get_work_dim: Update metadata syntax for LLVM 3.6	Tom Stellard	2014-12-31	1	-1/+1
\| \| \| \|	llvm-svn: 225042
*	r600: Fix get_work_dim range metadata	Jan Vesely	2014-10-22	1	-1/+1
\| \| \| \| \| \|	Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 220388
*	r600: Use llvm intrinsic to read work dimension information	Jan Vesely	2014-10-15	2	-0/+9
\| \| \| \| \| \| \| \| \| \|	v2: Fix function declaration. Add range metadata to r600 implementation. v3: change prefix to AMDGPU. Reviewed-by: Tom Stellard <tom@stellard.net> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 219793
*	R600: Map Address spaces for atomic_cmpxchg	Aaron Watry	2014-09-16	1	-0/+19
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217925
*	R600: Map address spaces for atomic_xchg	Aaron Watry	2014-09-16	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217924
*	R600: Map address spaces for atomic_min	Aaron Watry	2014-09-16	1	-0/+10
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217923
*	R600: Map address spaces for atomic_xor	Aaron Watry	2014-09-16	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217922
*	R600: Map addr spaces and use atomic_max	Aaron Watry	2014-09-16	1	-5/+16
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217921
*	R600: Map address spaces for atomic_or	Aaron Watry	2014-09-16	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217920
*	R600: Map atomic_and address spaces	Aaron Watry	2014-09-16	1	-0/+1
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217919
*	vload/vstore: Use casts instead of scalarizing everything in CLC version	Aaron Watry	2014-08-20	3	-189/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This generates bitcode which is indistinguishable from what was hand-written for int32 types in v[load\|store]_impl.ll. v4: Use vec2+scalar for vec3 load/stores to prevent corruption (per Tom) v3: Also remove unused generic/lib/shared/v[load\|store]_impl.ll v2: (Per Matt Arsenault) Fix alignment issues with vector load stores Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: Matt Arsenault <Matthew.Arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 216069
*	Move clcmacro.h to avoid cluttering user namespace v2	Jeroen Ketema	2014-06-24	1	-0/+1
\| \| \| \| \| \| \| \| \|	v2: - use quotes instead of <> - add include to r600/lib/math/nextafter.c changed Reviewed-by: Tom Stellard <tom@stellard.net> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 211576
*	R600: Set the noduplicate attribute on barrier() intrinsics	Tom Stellard	2013-10-31	3	-19/+30
\| \| \| \| \| \| \| \|	This will prevent LLVM optimization passes from creating illegal uses of the barrier() intrinsic (e.g. calling barrier() from a conditional that is not executed by all threads). llvm-svn: 193753
*	Implement nextafter() builtin	Tom Stellard	2013-10-10	2	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are two implementations of nextafter(): 1. Using clang's __builtin_nextafter. Clang replaces this builtin with a call to nextafter which is part of libm. Therefore, this implementation will only work for targets with an implementation of libm (e.g. most CPU targets). 2. The other implementation is written in OpenCL C. This function is known internally as __clc_nextafter and can be used by targets that don't have access to libm. llvm-svn: 192383
*	Add atomic_sub and atomic_dec builtin functions	Aaron Watry	2013-09-06	1	-0/+1
\| \| \| \| \|	Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 190201
*	Add atomic_inc and atomic_add builtins	Aaron Watry	2013-09-05	2	-0/+21
\| \| \| \| \|	Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 190058
*	Enable assembly vload3 int/uint constant/global for R600	Aaron Watry	2013-08-12	1	-16/+2
\| \| \| \| \| \| \| \|	It's supported by the R600 LLVM back-end now, at least for evergreen. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188180
*	Add vload* for addrspace(2) and use as constant load for R600	Aaron Watry	2013-08-12	1	-2/+8
\| \| \| \| \| \|	Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188179
*	Added get_num_groups	Aaron Watry	2013-07-24	2	-0/+19
\| \| \| \| \| \| \| \| \|	The get_num_groups function was missing for r600g. I did the same thing as the other workitem functions. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 187059
*	Fix and re-enable R600 vload/vstore assembly	Aaron Watry	2013-07-16	3	-0/+198
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The assembly optimizations were making unsafe assumptions about which address spaces had which identifiers. Also, fix vload/vstore with 64-bit pointers. This was broken previously on Radeon SI. This version still only has assembly versions of int/uint 2/4/8/16 for global loads and stores on R600, but it does it in a way that would be very easily extended to private/local/constant and could also be handled easily on other architectures. v2: 1) Leave v[load\|store]_impl.ll in generic/lib 2) Remove vload_if.ll and vstore_if.ll interfaces 3) Fix address+offset calculations 4) Remove offset from assembly arg list llvm-svn: 186416
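
The "Implement nextafter() builtin" entry above (llvm-svn: 192383) distinguishes a libm-backed implementation from one written in OpenCL C for targets without libm. The following is a minimal sketch of how the second kind can work, written in plain C rather than OpenCL C so it can be compiled anywhere. It is not the libclc source: the name `sketch_nextafterf` and the bit-manipulation approach are assumptions made for illustration, and it covers single precision only.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical illustration, not the libclc __clc_nextafter code.
 * Returns the next representable float after x in the direction of y,
 * using only integer operations on the IEEE 754 bit pattern (no libm). */
static float sketch_nextafterf(float x, float y)
{
    uint32_t ux, uy;

    if (x != x || y != y)      /* either operand is NaN: propagate it */
        return x + y;
    if (x == y)                /* nothing to step over; also covers +0 == -0 */
        return y;

    memcpy(&ux, &x, sizeof ux);
    memcpy(&uy, &y, sizeof uy);

    if ((ux & 0x7fffffffu) == 0u) {
        /* x is +0 or -0: step to the smallest subnormal with y's sign */
        ux = (uy & 0x80000000u) | 1u;
    } else if ((x < y) == (x > 0.0f)) {
        /* moving away from zero: the magnitude grows by one ulp */
        ux += 1u;
    } else {
        /* moving toward zero: the magnitude shrinks by one ulp */
        ux -= 1u;
    }

    memcpy(&x, &ux, sizeof x);
    return x;
}
```

The sketch relies on the fact that, for finite floats of the same sign, adjacent values have adjacent bit patterns, so a step of one ulp is an integer increment or decrement. It assumes an IEEE 754 binary32 `float` and that subnormals are not flushed to zero.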