AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all

in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956
author: Stephen Lin <stephenwlin@gmail.com> 2013-07-09 18:16:56 +0000
committer: Stephen Lin <stephenwlin@gmail.com> 2013-07-09 18:16:56 +0000
commit: 73de7bf5dec63e8ca45a446373ab61a2e22d103c (patch)
tree: 61dd22e9276131538d96c046212d55199febb05b /llvm/lib/Target/X86/X86ISelLowering.cpp
parent: ff666bd962a4446d80955fe75619201c29795501 (diff)
download: bcm5719-llvm-73de7bf5dec63e8ca45a446373ab61a2e22d103c.tar.gz
bcm5719-llvm-73de7bf5dec63e8ca45a446373ab61a2e22d103c.zip
1 files changed, 21 insertions, 0 deletions
diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index a680ac09b5f..f00df3543a8 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -12966,6 +12966,27 @@ bool X86TargetLowering::isZExtFree(SDValue Val, EVT VT2) const {
   return false;
 }
 
+bool
+X86TargetLowering::isFMAFasterThanFMulAndFAdd(EVT VT) const {
+  if (!(Subtarget->hasFMA() || Subtarget->hasFMA4()))
+    return false;
+
+  VT = VT.getScalarType();
+
+  if (!VT.isSimple())
+    return false;
+
+  switch (VT.getSimpleVT().SimpleTy) {
+  case MVT::f32:
+  case MVT::f64:
+    return true;
+  default:
+    break;
+  }
+
+  return false;
+}
+
 bool X86TargetLowering::isNarrowingProfitable(EVT VT1, EVT VT2) const {
   // i16 instructions are longer (0x66 prefix) and potentially slower.
   return !(VT1 == MVT::i32 && VT2 == MVT::i16);
author	Stephen Lin <stephenwlin@gmail.com>	2013-07-09 18:16:56 +0000
committer	Stephen Lin <stephenwlin@gmail.com>	2013-07-09 18:16:56 +0000
commit	73de7bf5dec63e8ca45a446373ab61a2e22d103c (patch)
tree	61dd22e9276131538d96c046212d55199febb05b /llvm/lib/Target/X86/X86ISelLowering.cpp
parent	ff666bd962a4446d80955fe75619201c29795501 (diff)
download	bcm5719-llvm-73de7bf5dec63e8ca45a446373ab61a2e22d103c.tar.gz bcm5719-llvm-73de7bf5dec63e8ca45a446373ab61a2e22d103c.zip