[DAGCombiner] Improve division estimation of floating points.

Current implementation of estimating divisions loses precision since it estimates reciprocal first and does multiplication. This patch is to re-order arithmetic operations in the last iteration in DAGCombiner to improve the accuracy. Reviewed By: Sanjay Patel, Jinsong Ji Differential Revision: https://reviews.llvm.org/D66050 llvm-svn: 371713
author: Qiu Chaofan <qiucofan@cn.ibm.com> 2019-09-12 07:51:24 +0000
committer: Qiu Chaofan <qiucofan@cn.ibm.com> 2019-09-12 07:51:24 +0000
commit: b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e (patch)
tree: b5fb0cbe9b96f54540d31e4c268e1cf633b249d2 /llvm/test/CodeGen/AMDGPU/rsq.ll
parent: 75f65fe8d316407c5dd7cd9bfdfc760203947aa1 (diff)
download: bcm5719-llvm-b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e.tar.gz
bcm5719-llvm-b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e.zip
1 files changed, 1 insertions, 1 deletions
diff --git a/llvm/test/CodeGen/AMDGPU/rsq.ll b/llvm/test/CodeGen/AMDGPU/rsq.ll
index 204eeb99838..40c3c94246e 100644
--- a/llvm/test/CodeGen/AMDGPU/rsq.ll
+++ b/llvm/test/CodeGen/AMDGPU/rsq.ll
@@ -48,7 +48,7 @@ define amdgpu_kernel void @rsq_f32_sgpr(float addrspace(1)* noalias %out, float
 
 ; SI-UNSAFE-DAG: v_rsq_f32_e32 [[RSQA:v[0-9]+]], [[A]]
 ; SI-UNSAFE-DAG: v_rcp_f32_e32 [[RCPB:v[0-9]+]], [[B]]
-; SI-UNSAFE-DAG: v_mul_f32_e32 [[TMP:v[0-9]+]], [[RSQA]], [[RCPB]]
+; SI-UNSAFE-DAG: v_mul_f32_e32 [[TMP:v[0-9]+]], [[RCPB]], [[RSQA]]
 ; SI-UNSAFE: v_mul_f32_e32 [[RESULT:v[0-9]+]], [[C]], [[TMP]]
 ; SI-UNSAFE: buffer_store_dword [[RESULT]]
author	Qiu Chaofan <qiucofan@cn.ibm.com>	2019-09-12 07:51:24 +0000
committer	Qiu Chaofan <qiucofan@cn.ibm.com>	2019-09-12 07:51:24 +0000
commit	b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e (patch)
tree	b5fb0cbe9b96f54540d31e4c268e1cf633b249d2 /llvm/test/CodeGen/AMDGPU/rsq.ll
parent	75f65fe8d316407c5dd7cd9bfdfc760203947aa1 (diff)
download	bcm5719-llvm-b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e.tar.gz bcm5719-llvm-b7fb5d0f6f2abf07f0ce6edc3397b4c291c1d25e.zip