diff options
| author | Dehao Chen <dehao@google.com> | 2016-12-30 00:50:28 +0000 |
|---|---|---|
| committer | Dehao Chen <dehao@google.com> | 2016-12-30 00:50:28 +0000 |
| commit | cc76344ef5f2b5752759f92111efe00dd736a11e (patch) | |
| tree | d8e78eea33ff792ed68626205aa9a61afe3c810d /llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll | |
| parent | 30a9b6bb4e7a2c95ea4935c2c6b9652fad1fd27b (diff) | |
| download | bcm5719-llvm-cc76344ef5f2b5752759f92111efe00dd736a11e.tar.gz bcm5719-llvm-cc76344ef5f2b5752759f92111efe00dd736a11e.zip | |
Use continuous boosting factor for complete unroll.
Summary:
The current loop complete unroll algorithm checks if unrolling complete will reduce the runtime by a certain percentage. If yes, it will apply a fixed boosting factor to the threshold (by discounting cost). The problem for this approach is that the threshold abruptly. This patch makes the boosting factor a function of runtime reduction percentage, capped by a fixed threshold. In this way, the threshold changes continuously.
The patch also simplified the code by reducing one parameter in UP.
The patch only affects code-gen of two speccpu2006 benchmark:
445.gobmk binary size decreases 0.08%, no performance change.
464.h264ref binary size increases 0.24%, no performance change.
Reviewers: mzolotukhin, chandlerc
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D26989
llvm-svn: 290737
Diffstat (limited to 'llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll')
| -rw-r--r-- | llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll b/llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll index f5c7734de96..9f1529139de 100644 --- a/llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll +++ b/llvm/test/Transforms/LoopUnroll/full-unroll-crashers.ll @@ -1,5 +1,5 @@ ; Check that we don't crash on corner cases. -; RUN: opt < %s -S -loop-unroll -unroll-max-iteration-count-to-analyze=1000 -unroll-threshold=1 -unroll-percent-dynamic-cost-saved-threshold=20 -o /dev/null +; RUN: opt < %s -S -loop-unroll -unroll-max-iteration-count-to-analyze=1000 -unroll-threshold=1 -unroll-max-percent-threshold-boost=200 -o /dev/null target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128" @known_constant = internal unnamed_addr constant [10 x i32] [i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1, i32 1], align 16 |

