summaryrefslogtreecommitdiffstats
path: root/llvm/docs/ReleaseNotes.rst
diff options
context:
space:
mode:
authorChandler Carruth <chandlerc@gmail.com>2012-12-14 13:37:17 +0000
committerChandler Carruth <chandlerc@gmail.com>2012-12-14 13:37:17 +0000
commit108d19e6b57d598e9addf233870a43711d59e035 (patch)
treec61f4fba94eea5eed285f5b17589a58025403c99 /llvm/docs/ReleaseNotes.rst
parentb745ac48a6fd0e8efb30fb950d8bca274662e26c (diff)
downloadbcm5719-llvm-108d19e6b57d598e9addf233870a43711d59e035.tar.gz
bcm5719-llvm-108d19e6b57d598e9addf233870a43711d59e035.zip
Add a rough draft of some content about the new SROA. I'll try to proof
read this and clean it up tomorrow, but hopefully it's a good placeholder. llvm-svn: 170196
Diffstat (limited to 'llvm/docs/ReleaseNotes.rst')
-rw-r--r--llvm/docs/ReleaseNotes.rst16
1 files changed, 15 insertions, 1 deletions
diff --git a/llvm/docs/ReleaseNotes.rst b/llvm/docs/ReleaseNotes.rst
index 486074914b8..31c567b33ac 100644
--- a/llvm/docs/ReleaseNotes.rst
+++ b/llvm/docs/ReleaseNotes.rst
@@ -351,7 +351,21 @@ We vectorize under the following loops:
'``noalias``' and are checked at runtime.
#. ...
-SROA - We've re-written SROA to be significantly more powerful.
+SROA - We've re-written SROA to be significantly more powerful and generate
+code which is much more friendly to the rest of the optimization pipeline.
+Previously this pass had scaling problems that required it to only operate on
+relatively small aggregates, and at times it would mistakenly replace a large
+aggregate with a single very large integer in order to make it a scalar SSA
+value. The result was a large number of i1024 and i2048 values representing any
+small stack buffer. These in turn slowed down many subsequent optimization
+paths.
+
+The new SROA pass uses a different algorithm that allows it to only promote to
+scalars the pieces of the aggregate actively in use. Because of this it doesn't
+require any thresholds. It also always deduces the scalar values from the uses
+of the aggregate rather than the specific LLVM type of the aggregate. These
+features combine to both optimize more code with the pass but to improve the
+compile time of many functions dramatically.
#. Branch weight metadata is preseved through more of the optimizer.
#. ...
OpenPOWER on IntegriCloud