path: root/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
author: David Green <david.green@arm.com> 2018-07-01 12:47:30 +0000
committer: David Green <david.green@arm.com> 2018-07-01 12:47:30 +0000
commit: 963401d2be2d5ba1cfe3471f241f4c7585c33010
tree: a24eb44f15e16e9b5240a358168503101ac9b5c3 /llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
parent: 8dabda70afba00d82f30e83bf0d513076eba37ab
[UnrollAndJam] New Unroll and Jam pass
This is a simple implementation of the unroll-and-jam classical loop
optimisation.

The basic idea is that we take an outer loop of the form:

    for i..
      ForeBlocks(i)
      for j..
        SubLoopBlocks(i, j)
      AftBlocks(i)

Instead of doing normal inner or outer unrolling, we unroll as follows:

    for i... i+=2
      ForeBlocks(i)
      ForeBlocks(i+1)
      for j..
        SubLoopBlocks(i, j)
        SubLoopBlocks(i+1, j)
      AftBlocks(i)
      AftBlocks(i+1)
    Remainder Loop

So we have unrolled the outer loop, then jammed the two inner loops into one.
This can lead to a simpler inner loop if memory accesses can be shared between
the now jammed loops.

To do this we have to prove that this is all safe, both for the memory
accesses (using dependence analysis) and that ForeBlocks(i+1) can move before
AftBlocks(i) and SubLoopBlocks(i, j).

Differential Revision: https://reviews.llvm.org/D41953

llvm-svn: 336062
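As an illustration of the transformation described above, here is a minimal hand-written sketch in C++. It is not taken from the pass itself; the function names `matVecPlain` and `matVecUnrollAndJam` and the choice of a matrix-vector product are assumptions made purely for the example. The comments map each statement onto the ForeBlocks / SubLoopBlocks / AftBlocks terminology of the commit message, and the jammed inner loop shows how the load of `x[j]` can be shared between the two outer iterations.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<int>>;

// Original loop nest (hypothetical example): y = a * x.
std::vector<int> matVecPlain(const Matrix& a, const std::vector<int>& x) {
  assert(a.empty() || a[0].size() == x.size());
  std::vector<int> y(a.size());
  for (std::size_t i = 0; i < a.size(); ++i) {
    int acc = 0;                                  // ForeBlocks(i)
    for (std::size_t j = 0; j < x.size(); ++j)
      acc += a[i][j] * x[j];                      // SubLoopBlocks(i, j)
    y[i] = acc;                                   // AftBlocks(i)
  }
  return y;
}

// Same computation, unrolled-and-jammed by a factor of 2 by hand.
std::vector<int> matVecUnrollAndJam(const Matrix& a, const std::vector<int>& x) {
  assert(a.empty() || a[0].size() == x.size());
  std::vector<int> y(a.size());
  std::size_t i = 0;
  for (; i + 1 < a.size(); i += 2) {
    int acc0 = 0;                                 // ForeBlocks(i)
    int acc1 = 0;                                 // ForeBlocks(i+1)
    for (std::size_t j = 0; j < x.size(); ++j) {
      int xj = x[j];                              // load shared by both copies
      acc0 += a[i][j] * xj;                       // SubLoopBlocks(i, j)
      acc1 += a[i + 1][j] * xj;                   // SubLoopBlocks(i+1, j)
    }
    y[i] = acc0;                                  // AftBlocks(i)
    y[i + 1] = acc1;                              // AftBlocks(i+1)
  }
  // Remainder loop: handles the last row when the row count is odd.
  for (; i < a.size(); ++i) {
    int acc = 0;
    for (std::size_t j = 0; j < x.size(); ++j)
      acc += a[i][j] * x[j];
    y[i] = acc;
  }
  return y;
}
```

In this sketch the reordering is safe because the two outer iterations write disjoint elements of `y` and only read `a` and `x`; in general that is the property the pass has to establish through dependence analysis before jamming.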
Diffstat (limited to 'llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp')
0 files changed, 0 insertions, 0 deletions