summaryrefslogtreecommitdiffstats
path: root/llvm/test/CodeGen/AMDGPU/vector-extract-insert.ll
diff options
context:
space:
mode:
authorTobias Grosser <tobias@grosser.es>2017-02-18 16:39:04 +0000
committerTobias Grosser <tobias@grosser.es>2017-02-18 16:39:04 +0000
commit8ee46985d265ffd1a8b2004e4dc40e8d67c90fd5 (patch)
treee8db71d212f9bede37d528f7f0e2e17daca4b44f /llvm/test/CodeGen/AMDGPU/vector-extract-insert.ll
parent41f0d81b314f50846846ec89a8ae65c965af1ece (diff)
downloadbcm5719-llvm-8ee46985d265ffd1a8b2004e4dc40e8d67c90fd5.tar.gz
bcm5719-llvm-8ee46985d265ffd1a8b2004e4dc40e8d67c90fd5.zip
[Dependences] Compute reduction dependences on schedule tree [NFC]
This change gets rid of the need for zero padding, makes the reduction computation code more similar to the normal dependence computation, and also better documents what we do at the moment. Making the dependence computation for reductions a little bit easier to understand will hopefully help us to further reduce code duplication. This reduces the time spent only in the reduction dependence pass from 260ms to 150ms for test/DependenceInfo/reduction_sequence.ll. This is a reduction of over 40% in dependence computation time. This change was inspired by discussions with Michael Kruse, Utpal Bora, Siddharth Bhat, and Johannes Doerfert. It can hopefully lay the base for further cleanups of the reduction code. llvm-svn: 295550
Diffstat (limited to 'llvm/test/CodeGen/AMDGPU/vector-extract-insert.ll')
0 files changed, 0 insertions, 0 deletions
OpenPOWER on IntegriCloud