bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU: Cleanup control flow intrinsics	Matt Arsenault	2017-03-17	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Move backend internal intrinsics along with the rest of the normal intrinsics, and use the Intrinsic::getDeclaration API instead of manually constructing the type list. It's surprising this was working before. fdiv.fast had the wrong number of parameters. The control flow intrinsic declaration attributes were not being applied, and their types were inconsistent. The actual IR use types did not match the declaration, and were closer to the types used for the patterns. The brcond lowering was changing the types, so introduce new nodes for those. llvm-svn: 298119
*	AMDGPU: Support v2i16/v2f16 packed operations	Matt Arsenault	2017-02-27	1	-5/+13
\| \| \| \|	llvm-svn: 296396
*	AMDGPU: Improve nsw/nuw/exact when promoting uniform i16 ops	Matt Arsenault	2017-02-01	1	-18/+41
\| \| \| \| \| \| \| \| \| \| \| \|	These were simply preserving the flags of the original operation, which was too conservative in most cases and incorrect for mul. nsw/nuw may be needed for some combines to cleanup messes when intermediate sext_inregs are introduced later. Tested valid combinations with alive. llvm-svn: 293776
*	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-01-20	1	-14/+26
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 292623
*	AMDGPU: Allow rcp and rsq usage with f16	Matt Arsenault	2016-12-22	1	-1/+0
\| \| \| \|	llvm-svn: 290302
*	AMDGPU: Fix crash on i16 constant expression	Matt Arsenault	2016-12-06	1	-2/+3
\| \| \| \|	llvm-svn: 288861
*	[AMDGPU] AMDGPUCodeGenPrepare: remove extra ';'	Konstantin Zhuravlyov	2016-10-07	1	-1/+1
\| \| \| \|	llvm-svn: 283558
*	[AMDGPU] Promote uniform (i1, i16] operations to i32	Konstantin Zhuravlyov	2016-10-07	1	-97/+101
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25302 llvm-svn: 283555
*	[AMDGPU] Promote uniform i16 bitreverse intrinsic to i32	Konstantin Zhuravlyov	2016-10-06	1	-11/+65
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25121 llvm-svn: 283415
*	[AMDGPU] Sign extend AShr when promoting (instead of zero extending)	Konstantin Zhuravlyov	2016-10-03	1	-2/+2
\| \| \| \|	llvm-svn: 283130
*	Use StringRef in Pass/PassManager APIs (NFC)	Mehdi Amini	2016-10-01	1	-3/+1
\| \| \| \|	llvm-svn: 283004
*	[AMDGPU] Promote uniform i16 ops to i32 ops for targets that have 16 bit ↵	Konstantin Zhuravlyov	2016-09-28	1	-3/+234
\| \| \| \| \| \| \| \|	instructions Differential Revision: https://reviews.llvm.org/D24125 llvm-svn: 282624
*	AMDGPU: Use rcp for fdiv 1, x with fpmath metadata	Matt Arsenault	2016-07-26	1	-1/+1
\| \| \| \| \| \| \|	Using rcp should be OK for safe math usually, so this should not be replacing the original fdiv. llvm-svn: 276823
*	AMDGPU: Change fdiv lowering based on !fpmath metadata	Matt Arsenault	2016-07-19	1	-6/+117
\| \| \| \| \| \| \| \| \| \| \|	If 2.5 ulp is acceptable, denormals are not required, and isn't a reciprocal which will already be handled, replace with a faster fdiv. Simplify the lowering tests by using per function subtarget features. llvm-svn: 276051
*	AMDGPU: Add stub custom CodeGenPrepare pass	Matt Arsenault	2016-06-24	1	-0/+82
	This will do various things including ones CodeGenPrepare does, but with knowledge of uniform values. llvm-svn: 273657