Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | [llvm-mca] Add fields "Total uOps" and "uOps Per Cycle" to the report ↵ | Andrea Di Biagio | 2018-08-29 | 1 | -1/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | generated by the SummaryView. This patch adds two new fields to the perf report generated by the SummaryView. Fields are now logically organized into two small groups; only the second group contains throughput indicators. Example: ``` Iterations: 100 Instructions: 300 Total Cycles: 414 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.69 IPC: 0.72 Block RThroughput: 4.0 ``` This patch also updates the docs for llvm-mca. Due to the nature of this change, several tests in the tools/llvm-mca directory were affected, and had to be updated using script `update_mca_test_checks.py`. llvm-svn: 340946 | ||||
* | [llvm-mca] Fix PR38575: Avoid an invalid implicit truncation of a processor ↵ | Andrea Di Biagio | 2018-08-15 | 1 | -0/+82 |
resource mask (an uint64_t value) to unsigned. This patch fixes a regression introduced at revision 338702. A processor resource mask was incorrectly implicitly truncated to an unsigned quantity. Later on, the truncated mask was used to initialize an element of a vector of processor resource descriptors. On targets with more than 32 processor resources, some elements of the vector are left uninitialized. As a consequence, this bug might have eventually caused a crash due to null dereference in the Scheduler. This patch fixes PR38575, and adds a test for it. llvm-svn: 339768 |