bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	amdgcn: Consolidate atomic minmax helpers	Jan Vesely	2018-11-27	11	-57/+4
\| \| \| \| \| \| \| \|	Removes most overrides Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewer: Aaron Watry llvm-svn: 347665
*	amdgcn: Move __clc_amdgcn_s_waitcnt definition to clc file	Jan Vesely	2018-11-04	4	-15/+1
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-By: Aaron Watry <awatry@gmail.com> llvm-svn: 346082
*	amdgcn: Convert get_num_groups to clc	Jan Vesely	2018-11-04	13	-75/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-By: Aaron Watry <awatry@gmail.com> llvm-svn: 346081
*	amdgcn: Convert get_global_size to clc	Jan Vesely	2018-11-04	13	-75/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-By: Aaron Watry <awatry@gmail.com> llvm-svn: 346080
*	amdgcn: Convert get_local_size to clc	Jan Vesely	2018-11-04	13	-75/+16
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-By: Aaron Watry <awatry@gmail.com> llvm-svn: 346079
*	amdgcn: Use __constant AS for amdgcn builtins.	Jan Vesely	2018-08-03	2	-2/+6
\| \| \| \| \| \| \| \|	Fixes build after clang r338707. Reviewer: Matthew.Arsenault@amd.com Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 338898
*	Add initial support for half precision builtins	Jan Vesely	2018-05-17	2	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	v2: fix fmax implementation use consistent checks for __CLC_FP_SIZE add missing TODOs fix whitespace in definitions.h v3: undef ZERO in modf.inc Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> reviewer: Jeroen Ketema <j.ketema@xs4all.nl> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 332677
*	amdgcn/fmin: Fix typos that reduced precision	Jan Vesely	2018-04-17	1	-3/+3
\| \| \| \| \| \| \| \| \|	Not sure how these sneaked in. Fixes fminD and few other tests(fractD, cosD) on carrizo Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 330198
*	amdgcn: Update datalayout after LLVM r328656	Jan Vesely	2018-04-05	4	-4/+4
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 329290
*	amdgcn/fmax: fcanonicalize operands	Jan Vesely	2018-03-08	2	-0/+32
\| \| \| \| \| \| \| \| \|	v_max instruction needs canonicalized operands. Passes CTS on carrizo Reviewer: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 327076
*	amdgcn/fmin: fcanonicalize operands	Jan Vesely	2018-03-08	2	-0/+32
\| \| \| \| \| \| \| \| \|	v_min instruction needs canonicalized operands. Passes CTS on carrizo Reviewer: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 327075
*	amdgcn,popcount: Workaround broken llvm.ctpop intrinsic on some GCN ASICs	Jan Vesely	2018-03-08	3	-0/+24
\| \| \| \| \| \| \| \| \| \|	This is only really needed for VI+ ASICs. However, llvm would cast the value to i32 for older asics anyway. The proper fix is in LLVM-7 (r326535). Fixes CTS popcount on carrizo. Reviewer: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 327044
*	amdgcn: Fix build after GDS/const AS swap in r325030	Jan Vesely	2018-02-23	6	-10/+20
\| \| \| \| \| \|	Acked-by: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325866
*	amdgcn: Fix datalayout after addition of 32bit const AS in r324747	Jan Vesely	2018-02-23	4	-4/+4
\| \| \| \| \| \|	Acked-by: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325865
*	amdgcn: Fix datalayout after clang r324101	Jan Vesely	2018-02-23	17	-5/+150
\| \| \| \| \| \| \| \|	r324101 switched around AS numbering Acked-by: Aaron Watry <awatry@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 325863
*	amdgcn: Add missing datalayout info to .ll files	Jan Vesely	2017-10-20	7	-0/+14
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Acked-by: Aaron Watry <awatry@gmail.com> llvm-svn: 316239
*	Let get_work_dim take exactly 0 arguments	Jeroen Ketema	2017-10-01	1	-1/+1
\| \| \| \| \|	Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 314634
*	Restore support for llvm-3.9	Jan Vesely	2017-09-29	5	-0/+60
\| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Acked-by: Aaron Watry <awatry@gmail.com> llvm-svn: 314543
*	Implement cl_khr_int64_extended_atomics builtins	Jan Vesely	2017-09-20	2	-0/+48
\| \| \| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 313811
*	amdgcn,waitcnt: Add datalayout info	Jan Vesely	2017-09-04	1	-0/+2
\| \| \| \| \| \| \| \|	This file is only compiled for GCN which all share the same layout Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 312493
*	amdgcn: rewrite barrier() using fence and clang __builtin_amdgcn_s_barrier	Jan Vesely	2017-08-16	3	-33/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Specs require using fences when barrier() is invoked: "The barrier function will either flush any variables stored in local memory or queue a memory fence to ensure correct ordering of memory operations to local memory." and "The barrier function will queue a memory fence to ensure correct ordering of memory operations to global memory." Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 311022
*	amdgcn: Implement {read_,write_,}mem_fence builtin	Jan Vesely	2017-08-16	3	-0/+52
\| \| \| \| \| \| \| \| \|	v2: add more detailed comment about waitcnt instruction Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 311021
*	amdgcn: Fix return type of get_num_groups	Matt Arsenault	2016-08-25	2	-0/+22
\| \| \| \|	llvm-svn: 279723
*	amdgcn: Fix return type for get_global_size	Matt Arsenault	2016-08-24	2	-0/+22
\| \| \| \|	llvm-svn: 279644
*	amdgpu: Fix default case value for get_local_size	Matt Arsenault	2016-08-20	1	-1/+1
\| \| \| \|	llvm-svn: 279359
*	amdgcn: Fix get_local_size IR return type	Matt Arsenault	2016-08-20	2	-0/+22
\| \| \| \|	llvm-svn: 279350
*	amdgcn: Correct return types to be size_t	Matt Arsenault	2016-08-19	3	-3/+3
\| \| \| \|	llvm-svn: 279343
*	AMDGPU: Implement get_global_offset builtin	Jan Vesely	2016-07-22	2	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \|	Also fix get_global_id to consider offset No idea how to add this for ptx, so they are stuck with the old get_global_id implementation. v2: split to a separate patch v3: Switch R600 to use implictarg.ptr Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276443
*	AMDGPU: Use clang intrinsics for workitem builtins	Jan Vesely	2016-07-22	6	-62/+34
\| \| \| \| \| \| \| \| \| \| \|	v2: split into 2 patches use clang builtins for other intrinsics as well v3: Fix warnings Switch r600 to use implictarg.ptr Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 276442
*	Replace llvm.AMDGPU.ldexp with llvm.amdgcn.ldexp	Matt Arsenault	2016-07-18	2	-0/+48
\| \| \| \| \| \| \|	It didn't really work on r600 to begin with, which should get its own intrinsic. llvm-svn: 275813
*	amdgcn: Use new workitem intrinsics	Matt Arsenault	2016-02-17	3	-0/+62
\| \| \| \|	llvm-svn: 261042
*	Split sources for amdgcn and r600	Matt Arsenault	2016-02-13	3	-0/+33
	Most files remain in a common amdgpu directory. Also switches barriers to to use convergent, and use llvm.amdgcn.s.barrier. This now requires 3.9/trunk to build amdgcn. llvm-svn: 260777