bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	AMDGPU: Fix not using v_cvt_f16_[iu]16	Matt Arsenault	2020-01-07	1	-2/+2
\| \| \| \| \|	We weren't treating i16->f16 casts as legal on targets with these instructions, and always using a pair of casts through i32.
*	AMDGPU: Add run line to int_to_fp tests	Matt Arsenault	2020-01-06	1	-35/+62
\| \| \| \| \|	This wasn't catching a regression on targets with legal i16 triggered in a future commit.
*	AMDGPU: Mark all unspecified CC functions in tests as amdgpu_kernel	Matt Arsenault	2017-03-21	1	-9/+9
\| \| \| \| \| \| \| \| \| \| \| \|	Currently the default C calling convention functions are treated the same as compute kernels. Make this explicit so the default calling convention can be changed to a non-kernel. Converted with perl -pi -e 's/define void/define amdgpu_kernel void/' on the relevant test directories (and undoing in one place that actually wanted a non-kernel). llvm-svn: 298444
*	AMDGPU: Use unsigned compare for eq/ne	Matt Arsenault	2016-09-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	For some reason there are both of these available, except for scalar 64-bit compares which only has u64. I'm not sure why there are both (I'm guessing it's for the one bit inputs we don't use), but for consistency always using the unsigned one. llvm-svn: 282832
*	AMDGPU: Run SIFoldOperands after PeepholeOptimizer	Matt Arsenault	2016-04-14	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	PeepholeOptimizer cleans up redundant copies, which makes the operand folding more effective. shader-db stats: Totals: SGPRS: 34200 -> 34336 (0.40 %) VGPRS: 22118 -> 21655 (-2.09 %) Code Size: 632144 -> 633460 (0.21 %) bytes LDS: 11 -> 11 (0.00 %) blocks Scratch: 10240 -> 11264 (10.00 %) bytes per wave Max Waves: 8822 -> 8918 (1.09 %) Wait states: 0 -> 0 (0.00 %) Totals from affected shaders: SGPRS: 7704 -> 7840 (1.77 %) VGPRS: 5169 -> 4706 (-8.96 %) Code Size: 234444 -> 235760 (0.56 %) bytes LDS: 2 -> 2 (0.00 %) blocks Scratch: 0 -> 1024 (0.00 %) bytes per wave Max Waves: 1188 -> 1284 (8.08 %) Wait states: 0 -> 0 (0.00 %) Increases: SGPRS: 35 (0.01 %) VGPRS: 1 (0.00 %) Code Size: 59 (0.02 %) LDS: 0 (0.00 %) Scratch: 1 (0.00 %) Max Waves: 48 (0.02 %) Wait states: 0 (0.00 %) Decreases: SGPRS: 26 (0.01 %) VGPRS: 54 (0.02 %) Code Size: 68 (0.03 %) LDS: 0 (0.00 %) Scratch: 0 (0.00 %) Max Waves: 4 (0.00 %) Wait states: 0 (0.00 %) llvm-svn: 266378
*	AMDGPU: Fold bitcasts of scalar constants to vectors	Matt Arsenault	2016-04-14	1	-4/+3
\| \| \| \| \| \| \|	This cleans up some messes since the individual scalar components can be CSEed. llvm-svn: 266376
*	AMDGPU: Remove some old intrinsic uses from tests	Matt Arsenault	2016-02-11	1	-3/+3
\| \| \| \|	llvm-svn: 260493
*	AMDGPU: Improve accuracy of instruction rates for some FP instructions	Matt Arsenault	2015-08-22	1	-3/+3
\| \| \| \|	llvm-svn: 245774
*	AMDGPU/SI: Add support for shrinking v_cndmask_b32_e32 instructions	Tom Stellard	2015-07-14	1	-5/+5
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11061 llvm-svn: 242146
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+98
	llvm-svn: 239657