bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	NVPTX: Rename __builtin_ptx_shfl -> __nvvm_shfl	Justin Bogner	2016-07-06	1	-8/+4
	To match "NVPTX: Make the llvm.nvvm.shfl intrinsics and builtin names consistent" in LLVM. llvm-svn: 274663
*	[CUDA] Implement __shfl* intrinsics in clang headers.	Justin Lebar	2016-06-09	1	-0/+70
	Summary: Clang changes to make use of the LLVM intrinsics added in D21160. Reviewers: tra Subscribers: jholewinski, cfe-commits Differential Revision: http://reviews.llvm.org/D21162 llvm-svn: 272299
*	[CUDA] Fix order of vectorized ldg intrinsics' elements.	Justin Lebar	2016-05-30	1	-28/+28
	Summary: The order is [x, y, z, w], not [w, x, y, z]. Subscribers: cfe-commits, tra Differential Revision: http://reviews.llvm.org/D20794 llvm-svn: 271215
*	[CUDA] Implement __ldg using intrinsics.	Justin Lebar	2016-05-19	1	-0/+256
	Summary: Previously it was implemented as inline asm in the CUDA headers. This change allows us to use the [addr+imm] addressing mode when executing ld.global.nc instructions. This translates into a 1.3x speedup on some benchmarks that call this instruction from within an unrolled loop. Reviewers: tra, rsmith Subscribers: jhen, cfe-commits, jholewinski Differential Revision: http://reviews.llvm.org/D19990 llvm-svn: 270150