path: root/clang/lib/Frontend/CompilerInstance.cpp
author Anna Thomas <anna@azul.com> 2017-09-12 16:32:45 +0000
committer Anna Thomas <anna@azul.com> 2017-09-12 16:32:45 +0000
commit 9f1be02fa333f7d9d4a6b3021181d15df847c03c
tree 85fe6d52eeb65d5a691e69796489555e1167fa5d /clang/lib/Frontend/CompilerInstance.cpp
parent a54ed0a4074d255665b3ffcc6602244800e13f2c
[LV] Clamp the VF to the trip count

Summary:
When the MaxVectorSize > ConstantTripCount, we should just clamp the vectorization factor to be the ConstantTripCount.

This vectorizes loops where the TinyTripCountThreshold >= TripCount < MaxVF.

Earlier we were finding the maximum vector width, which could be greater than the trip count itself. The Loop vectorizer does all the work for generating a vectorizable loop, but in the end we would always choose the scalar loop (since the VF > trip count). This allows us to choose the VF keeping in mind the trip count if available.

This is a fix on top of rL312472.

Reviewers: Ayal, zvi, hfinkel, dneilson

Reviewed by: Ayal

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D37702

llvm-svn: 313046
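
The clamping described in the summary can be illustrated with a minimal sketch. This is not the code from the patch: the helper name `clampVFToTripCount`, its parameters, and the convention that a trip count of 0 means "not known at compile time" are all assumptions made for illustration, and the sketch ignores any further constraints the real vectorizer may place on the chosen VF.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper (not part of LLVM): pick a vectorization factor that
// never exceeds a compile-time-constant trip count.
//
// maxVF          - widest vectorization factor the target would allow
// constTripCount - constant trip count of the loop; 0 is used here (by
//                  assumption) to mean the trip count is unknown
std::uint32_t clampVFToTripCount(std::uint32_t maxVF,
                                 std::uint32_t constTripCount) {
  // Without a known trip count there is nothing to clamp against.
  if (constTripCount == 0)
    return maxVF;
  // A VF wider than the trip count would never execute a full vector
  // iteration, so cap it at the trip count.
  return std::min(maxVF, constTripCount);
}
```

Under these assumptions, a loop with a constant trip count of 4 and a maximum VF of 8 would be given a VF of 4 rather than falling back to the scalar loop.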
Diffstat (limited to 'clang/lib/Frontend/CompilerInstance.cpp')
0 files changed, 0 insertions, 0 deletions