Add DAG-combines for aggressive FMA formation.

This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757
author: Lang Hames <lhames@gmail.com> 2012-06-19 22:51:23 +0000
committer: Lang Hames <lhames@gmail.com> 2012-06-19 22:51:23 +0000
commit: 39fb1d08dce7b7c78a9633ce32c5610d72d1996a (patch)
tree: 5ce060817ee61c35a865fce016bc1d7fa2020fe4 /llvm/test/CodeGen
parent: 63d40202fb478b331abeddcf2b12517f42db0f86 (diff)
download: bcm5719-llvm-39fb1d08dce7b7c78a9633ce32c5610d72d1996a.tar.gz
bcm5719-llvm-39fb1d08dce7b7c78a9633ce32c5610d72d1996a.zip
4 files changed, 4 insertions, 4 deletions
diff --git a/llvm/test/CodeGen/ARM/fusedMAC.ll b/llvm/test/CodeGen/ARM/fusedMAC.ll
index 3bf1ef4ad26..61e7d7b1a21 100644
--- a/llvm/test/CodeGen/ARM/fusedMAC.ll
+++ b/llvm/test/CodeGen/ARM/fusedMAC.ll
@@ -1,4 +1,4 @@
-; RUN: llc < %s -mtriple=armv7-eabi -mattr=+neon,+vfp4 | FileCheck %s
+; RUN: llc < %s -mtriple=armv7-eabi -mattr=+neon,+vfp4 -enable-excess-fp-precision | FileCheck %s
 ; Check generated fused MAC and MLS.
 
 define double @fusedMACTest1(double %d1, double %d2, double %d3) {
diff --git a/llvm/test/CodeGen/PowerPC/a2-fp-basic.ll b/llvm/test/CodeGen/PowerPC/a2-fp-basic.ll
index 932ad7a63ce..a47e662cc87 100644
--- a/llvm/test/CodeGen/PowerPC/a2-fp-basic.ll
+++ b/llvm/test/CodeGen/PowerPC/a2-fp-basic.ll
@@ -1,4 +1,4 @@
-; RUN: llc < %s -march=ppc64 -mcpu=a2 | FileCheck %s
+; RUN: llc < %s -march=ppc64 -mcpu=a2 -enable-excess-fp-precision | FileCheck %s
 
 %0 = type { double, double }
 
diff --git a/llvm/test/CodeGen/PowerPC/fma.ll b/llvm/test/CodeGen/PowerPC/fma.ll
index 815c72c1f8a..02847147edb 100644
--- a/llvm/test/CodeGen/PowerPC/fma.ll
+++ b/llvm/test/CodeGen/PowerPC/fma.ll
@@ -1,4 +1,4 @@
-; RUN: llc < %s -march=ppc32 | \
+; RUN: llc < %s -march=ppc32 -enable-excess-fp-precision | \
 ; RUN:   egrep {fn?madd|fn?msub} | count 8
 
 define double @test_FMADD1(double %A, double %B, double %C) {
diff --git a/llvm/test/CodeGen/PowerPC/ppc440-fp-basic.ll b/llvm/test/CodeGen/PowerPC/ppc440-fp-basic.ll
index 1fad2fa3aaf..25ec5f892c5 100644
--- a/llvm/test/CodeGen/PowerPC/ppc440-fp-basic.ll
+++ b/llvm/test/CodeGen/PowerPC/ppc440-fp-basic.ll
@@ -1,4 +1,4 @@
-; RUN: llc < %s -march=ppc32 -mcpu=440 | FileCheck %s
+; RUN: llc < %s -march=ppc32 -mcpu=440 -enable-excess-fp-precision | FileCheck %s
 
 %0 = type { double, double }
author	Lang Hames <lhames@gmail.com>	2012-06-19 22:51:23 +0000
committer	Lang Hames <lhames@gmail.com>	2012-06-19 22:51:23 +0000
commit	39fb1d08dce7b7c78a9633ce32c5610d72d1996a (patch)
tree	5ce060817ee61c35a865fce016bc1d7fa2020fe4 /llvm/test/CodeGen
parent	63d40202fb478b331abeddcf2b12517f42db0f86 (diff)
download	bcm5719-llvm-39fb1d08dce7b7c78a9633ce32c5610d72d1996a.tar.gz bcm5719-llvm-39fb1d08dce7b7c78a9633ce32c5610d72d1996a.zip