summaryrefslogtreecommitdiffstats
path: root/llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll
diff options
context:
space:
mode:
authorDehao Chen <dehao@google.com>2016-12-30 00:50:28 +0000
committerDehao Chen <dehao@google.com>2016-12-30 00:50:28 +0000
commitcc76344ef5f2b5752759f92111efe00dd736a11e (patch)
treed8e78eea33ff792ed68626205aa9a61afe3c810d /llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll
parent30a9b6bb4e7a2c95ea4935c2c6b9652fad1fd27b (diff)
downloadbcm5719-llvm-cc76344ef5f2b5752759f92111efe00dd736a11e.tar.gz
bcm5719-llvm-cc76344ef5f2b5752759f92111efe00dd736a11e.zip
Use continuous boosting factor for complete unroll.
Summary: The current loop complete unroll algorithm checks if unrolling complete will reduce the runtime by a certain percentage. If yes, it will apply a fixed boosting factor to the threshold (by discounting cost). The problem for this approach is that the threshold abruptly. This patch makes the boosting factor a function of runtime reduction percentage, capped by a fixed threshold. In this way, the threshold changes continuously. The patch also simplified the code by reducing one parameter in UP. The patch only affects code-gen of two speccpu2006 benchmark: 445.gobmk binary size decreases 0.08%, no performance change. 464.h264ref binary size increases 0.24%, no performance change. Reviewers: mzolotukhin, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26989 llvm-svn: 290737
Diffstat (limited to 'llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll')
-rw-r--r--llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll2
1 files changed, 1 insertions, 1 deletions
diff --git a/llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll b/llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll
index 723a384ea2d..230912538d2 100644
--- a/llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll
+++ b/llvm/test/Transforms/LoopUnroll/full-unroll-heuristics-geps.ll
@@ -1,4 +1,4 @@
-; RUN: opt < %s -S -loop-unroll -unroll-max-iteration-count-to-analyze=100 -unroll-dynamic-cost-savings-discount=1000 -unroll-threshold=10 -unroll-percent-dynamic-cost-saved-threshold=60 | FileCheck %s
+; RUN: opt < %s -S -loop-unroll -unroll-max-iteration-count-to-analyze=100 -unroll-threshold=10 -unroll-max-percent-threshold-boost=200 | FileCheck %s
target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"
; When examining gep-instructions we shouldn't consider them simplified if the
OpenPOWER on IntegriCloud