[X86] condition branches folding for three-way conditional codes

This patch implements a pass that optimizes condition branches on x86 by taking advantage of the three-way conditional code generated by compare instructions. Currently, it tries to hoisting EQ and NE conditional branch to a dominant conditional branch condition where the same EQ/NE conditional code is computed. An example: bb_0: cmp %0, 19 jg bb_1 jmp bb_2 bb_1: cmp %0, 40 jg bb_3 jmp bb_4 bb_4: cmp %0, 20 je bb_5 jmp bb_6 Here we could combine the two compares in bb_0 and bb_4 and have the following code: bb_0: cmp %0, 20 jg bb_1 jl bb_2 jmp bb_5 bb_1: cmp %0, 40 jg bb_3 jmp bb_6 For the case of %0 == 20 (bb_5), we eliminate two jumps, and the control height for bb_6 is also reduced. bb_4 is gone after the optimization. This optimization is motivated by the branch pattern generated by the switch lowering: we always have pivot-1 compare for the inner nodes and we do a pivot compare again the leaf (like above pattern). This pass currently is enabled on Intel's Sandybridge and later arches. Some reviewers pointed out that on some arches (like AMD Jaguar), this pass may increase branch density to the point where it hurts the performance of the branch predictor. Differential Revision: https://reviews.llvm.org/D46662 llvm-svn: 343993
author: Rong Xu <xur@google.com> 2018-10-08 18:52:39 +0000
committer: Rong Xu <xur@google.com> 2018-10-08 18:52:39 +0000
commit: 67b1b328f702df70e52d93e67cf2eebf647cef76 (patch)
tree: bcef9f5223dd45c676ee365f03285c295cb22887 /llvm/lib/Target/X86/X86TargetMachine.cpp
parent: 823549a6ec08340d1c2f329fa450f572546175c7 (diff)
download: bcm5719-llvm-67b1b328f702df70e52d93e67cf2eebf647cef76.tar.gz
bcm5719-llvm-67b1b328f702df70e52d93e67cf2eebf647cef76.zip
1 files changed, 7 insertions, 0 deletions
diff --git a/llvm/lib/Target/X86/X86TargetMachine.cpp b/llvm/lib/Target/X86/X86TargetMachine.cpp
index dd1bbc761b2..812b8b28ebd 100644
--- a/llvm/lib/Target/X86/X86TargetMachine.cpp
+++ b/llvm/lib/Target/X86/X86TargetMachine.cpp
@@ -54,6 +54,11 @@ static cl::opt<bool> EnableMachineCombinerPass("x86-machine-combiner",
                                cl::desc("Enable the machine combiner pass"),
                                cl::init(true), cl::Hidden);
 
+static cl::opt<bool> EnableCondBrFoldingPass("x86-condbr-folding",
+                               cl::desc("Enable the conditional branch "
+                                        "folding pass"),
+                               cl::init(true), cl::Hidden);
+
 namespace llvm {
 
 void initializeWinEHStatePassPass(PassRegistry &);
@@ -447,6 +452,8 @@ bool X86PassConfig::addGlobalInstructionSelect() {
 }
 
 bool X86PassConfig::addILPOpts() {
+  if (EnableCondBrFoldingPass)
+    addPass(createX86CondBrFolding());
   addPass(&EarlyIfConverterID);
   if (EnableMachineCombinerPass)
     addPass(&MachineCombinerID);
author	Rong Xu <xur@google.com>	2018-10-08 18:52:39 +0000
committer	Rong Xu <xur@google.com>	2018-10-08 18:52:39 +0000
commit	67b1b328f702df70e52d93e67cf2eebf647cef76 (patch)
tree	bcef9f5223dd45c676ee365f03285c295cb22887 /llvm/lib/Target/X86/X86TargetMachine.cpp
parent	823549a6ec08340d1c2f329fa450f572546175c7 (diff)
download	bcm5719-llvm-67b1b328f702df70e52d93e67cf2eebf647cef76.tar.gz bcm5719-llvm-67b1b328f702df70e52d93e67cf2eebf647cef76.zip