MachineCombiner Pass for selecting faster instruction

sequence - target independent framework When the DAGcombiner selects instruction sequences it could increase the critical path or resource len. For example, on arm64 there are multiply-accumulate instructions (madd, msub). If e.g. the equivalent multiply-add sequence is not on the crictial path it makes sense to select it instead of the combined, single accumulate instruction (madd/msub). The reason is that the conversion from add+mul to the madd could lengthen the critical path by the latency of the multiply. But the DAGCombiner would always combine and select the madd/msub instruction. This patch uses machine trace metrics to estimate critical path length and resource length of an original instruction sequence vs a combined instruction sequence and picks the faster code based on its estimates. This patch only commits the target independent framework that evaluates and selects code sequences. The machine instruction combiner is turned off for all targets and expected to evolve over time by gradually handling DAGCombiner pattern in the target specific code. This framework lays the groundwork for fixing rdar://16319955 llvm-svn: 214666
author: Gerolf Hoflehner <ghoflehner@apple.com> 2014-08-03 21:35:39 +0000
committer: Gerolf Hoflehner <ghoflehner@apple.com> 2014-08-03 21:35:39 +0000
commit: 5e1207e54ca646fe0f6d58e655451380041a4886 (patch)
tree: 4a3ccf77d24a3856878c39103aa07bb62311150f /llvm/lib/CodeGen/TargetSchedule.cpp
parent: f57d63f906e080417437dc42681c9e5ac02501a0 (diff)
download: bcm5719-llvm-5e1207e54ca646fe0f6d58e655451380041a4886.tar.gz
bcm5719-llvm-5e1207e54ca646fe0f6d58e655451380041a4886.zip
1 files changed, 22 insertions, 0 deletions
diff --git a/llvm/lib/CodeGen/TargetSchedule.cpp b/llvm/lib/CodeGen/TargetSchedule.cpp
index b0f2ca68884..f42946f35ef 100644
--- a/llvm/lib/CodeGen/TargetSchedule.cpp
+++ b/llvm/lib/CodeGen/TargetSchedule.cpp
@@ -225,6 +225,28 @@ unsigned TargetSchedModel::computeOperandLatency(
   return DefMI->isTransient() ? 0 : TII->defaultDefLatency(&SchedModel, DefMI);
 }
 
+unsigned TargetSchedModel::computeInstrLatency(unsigned Opcode) const {
+  assert(hasInstrSchedModel() && "Only call this function with a SchedModel");
+
+  unsigned SCIdx = TII->get(Opcode).getSchedClass();
+  const MCSchedClassDesc *SCDesc = SchedModel.getSchedClassDesc(SCIdx);
+  unsigned Latency = 0;
+
+  if (SCDesc->isValid() && !SCDesc->isVariant()) {
+    for (unsigned DefIdx = 0, DefEnd = SCDesc->NumWriteLatencyEntries;
+         DefIdx != DefEnd; ++DefIdx) {
+      // Lookup the definition's write latency in SubtargetInfo.
+      const MCWriteLatencyEntry *WLEntry =
+          STI->getWriteLatencyEntry(SCDesc, DefIdx);
+      Latency = std::max(Latency, capLatency(WLEntry->Cycles));
+    }
+    return Latency;
+  }
+
+  assert(Latency && "No MI sched latency");
+  return 0;
+}
+
 unsigned
 TargetSchedModel::computeInstrLatency(const MachineInstr *MI,
                                       bool UseDefaultDefLatency) const {
author	Gerolf Hoflehner <ghoflehner@apple.com>	2014-08-03 21:35:39 +0000
committer	Gerolf Hoflehner <ghoflehner@apple.com>	2014-08-03 21:35:39 +0000
commit	5e1207e54ca646fe0f6d58e655451380041a4886 (patch)
tree	4a3ccf77d24a3856878c39103aa07bb62311150f /llvm/lib/CodeGen/TargetSchedule.cpp
parent	f57d63f906e080417437dc42681c9e5ac02501a0 (diff)
download	bcm5719-llvm-5e1207e54ca646fe0f6d58e655451380041a4886.tar.gz bcm5719-llvm-5e1207e54ca646fe0f6d58e655451380041a4886.zip