bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[llvm-mca] Add support for move elimination in class RegisterFile.	Andrea Di Biagio	2018-10-03	7	-10/+164
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch teaches class RegisterFile how to analyze register writes from instructions that are move elimination candidates. In particular, it teaches it how to check if a move can be effectively eliminated by the underlying PRF, and (if necessary) how to perform move elimination. The long term goal is to allow processor models to describe instructions that are valid move elimination candidates. The idea is to let register file definitions in tablegen declare if/when moves can be eliminated. This patch is a non functional change. The logic that performs move elimination is currently disabled. A future patch will add support for move elimination in the processor models, and enable this new code path. llvm-svn: 343691
*	[llvm-mca] Remove unecessary forward decls. NFC.	Matt Davis	2018-10-02	4	-5/+0
\| \| \| \| \| \|	This patch also removes an unecessary include. llvm-svn: 343621
*	[llvm-mca] Constify the 'notify' routines. NFC.	Matt Davis	2018-10-02	6	-16/+18
\| \| \| \| \| \|	Also fixed up some whitespace formatting in DispatchStage.cpp. llvm-svn: 343615
*	[MCA] Remove SM.hasNext() call in FetchStage::execute.	Owen Rodley	2018-10-02	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is redundant, as FetchStage::getNextInstruction already checks this and returns llvm::ErrorSuccess() as appropriate. NFC. Reviewers: andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D52642 llvm-svn: 343555
*	[llvm-mca] Rename the 'Subtract' method to 'subtract'	Matt Davis	2018-10-01	2	-2/+2
\| \| \| \|	llvm-svn: 343549
*	[llvm-mca] Remove redundant namespace prefixes. NFC	Andrea Di Biagio	2018-09-28	8	-37/+36
\| \| \| \| \| \|	We are already "using" namespace llvm in all the files modified by this change. llvm-svn: 343312
*	[llvm-mca] Teach how to track zero registers in class RegisterFile.	Andrea Di Biagio	2018-09-28	3	-26/+78
\| \| \| \| \| \| \| \| \|	This change is in preparation for a future work on improving support for optimizable register moves. We already know if a write is from a zero-idiom, so we can propagate that bit of information to the PRF. We use an APInt mask to identify registers that are set to zero. llvm-svn: 343307
*	Test commit. NFC.	Owen Rodley	2018-09-28	1	-1/+1
\| \| \| \|	llvm-svn: 343296
*	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	Fangrui Song	2018-09-27	2	-5/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: The convenience wrapper in STLExtras is available since rL342102. Reviewers: dblaikie, javed.absar, JDevlieghere, andreadb Subscribers: MatzeB, sanjoy, arsenm, dschuff, mehdi_amini, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, eraman, aheejin, kbarton, JDevlieghere, javed.absar, gbedwell, jrtc27, mgrang, atanasyan, steven_wu, george.burgess.iv, dexonsmith, kristina, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D52573 llvm-svn: 343163
*	[llvm-mca] Improve code comments in LSUnit.{h, cpp}. NFC	Andrea Di Biagio	2018-09-24	2	-15/+25
\| \| \| \|	llvm-svn: 342877
*	[MCA] Remove dependency on CodeGen.	Dean Michael Berris	2018-09-21	3	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There isn't any actual dependency - there's one #include from CodeGen but nothing from the header is actually used. With this change we can use the MCA library from CodeGen without circular dependencies (e.g. for scheduling). Reviewers: andreadb Reviewed By: andreadb Authored By: orodley Subscribers: mgorny, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D52288 llvm-svn: 342706
*	[TableGen][SubtargetEmitter] Add the ability for processor models to ↵	Andrea Di Biagio	2018-09-19	1	-5/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	describe dependency breaking instructions. This patch adds the ability for processor models to describe dependency breaking instructions. Different processors may specify a different set of dependency-breaking instructions. That means, we cannot assume that all processors of the same target would use the same rules to classify dependency breaking instructions. The main goal of this patch is to provide the means to describe dependency breaking instructions directly via tablegen, and have the following TargetSubtargetInfo hooks redefined in overrides by tabegen'd XXXGenSubtargetInfo classes (here, XXX is a Target name). ``` virtual bool isZeroIdiom(const MachineInstr MI, APInt &Mask) const { return false; } virtual bool isDependencyBreaking(const MachineInstr MI, APInt &Mask) const { return isZeroIdiom(MI); } ``` An instruction MI is a dependency-breaking instruction if a call to method isDependencyBreaking(MI) on the STI (TargetSubtargetInfo object) evaluates to true. Similarly, an instruction MI is a special case of zero-idiom dependency breaking instruction if a call to STI.isZeroIdiom(MI) returns true. The extra APInt is used for those targets that may want to select which machine operands have their dependency broken (see comments in code). Note that by default, subtargets don't know about the existence of dependency-breaking. In the absence of external information, those method calls would always return false. A new tablegen class named STIPredicate has been added by this patch to let processor models classify instructions that have properties in common. The idea is that, a MCInstrPredicate definition can be used to "generate" an instruction equivalence class, with the idea that instructions of a same class all have a property in common. STIPredicate definitions are essentially a collection of instruction equivalence classes. Also, different processor models can specify a different variant of the same STIPredicate with different rules (i.e. predicates) to classify instructions. Tablegen backends (in this particular case, the SubtargetEmitter) will be able to process STIPredicate definitions, and automatically generate functions in XXXGenSubtargetInfo. This patch introduces two special kind of STIPredicate classes named IsZeroIdiomFunction and IsDepBreakingFunction in tablegen. It also adds a definition for those in the BtVer2 scheduling model only. This patch supersedes the one committed at r338372 (phabricator review: D49310). The main advantages are: - We can describe subtarget predicates via tablegen using STIPredicates. - We can describe zero-idioms / dep-breaking instructions directly via tablegen in the scheduling models. In future, the STIPredicates framework can be used for solving other problems. Examples of future developments are: - Teach how to identify optimizable register-register moves - Teach how to identify slow LEA instructions (each subtarget defining its own concept of "slow" LEA). - Teach how to identify instructions that have undocumented false dependencies on the output registers on some processors only. It is also (in my opinion) an elegant way to expose knowledge to both external tools like llvm-mca, and codegen passes. For example, machine schedulers in LLVM could reuse that information when internally constructing the data dependency graph for a code region. This new design feature is also an "opt-in" feature. Processor models don't have to use the new STIPredicates. It has all been designed to be as unintrusive as possible. Differential Revision: https://reviews.llvm.org/D52174 llvm-svn: 342555
*	[llvm-mca] Add the ability to mark register reads/writes associated with ↵	Andrea Di Biagio	2018-09-18	6	-41/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	dep-breaking instructions. NFCI This patch adds two new boolean fields: - Field `ReadState::IndependentFromDef`. - Field `WriteState::WritesZero`. Field `IndependentFromDef` is set for ReadState objects associated with dependency-breaking instructions. It is used by the simulator when updating data dependencies between registers. Field `WritesZero` is set by WriteState objects associated with dependency breaking zero-idiom instructions. It helps the PRF identify which writes don't consume any physical registers. llvm-svn: 342483
*	[llvm-mca] Slightly refactor class InstRef. NFC.	Andrea Di Biagio	2018-09-18	2	-10/+13
\| \| \| \|	llvm-svn: 342480
*	Revert r342148 (and follow-on fix attempts r342154, r342180, r342182, r342193)	Nico Weber	2018-09-15	1	-4/+8
\| \| \| \| \| \| \|	Many bots buildling with make have been broken for several days, e.g. http://lab.llvm.org:8011/builders/lld-x86_64-darwin13 llvm-svn: 342336
*	Renovate CMake files in the `llvm-(cfi-verify\|exegesis\|mca)` tools.	Richard Diamond	2018-09-13	1	-8/+4
\| \| \| \|	llvm-svn: 342148
*	[llvm-mca] Delay calculation of Cycles per Resources, separate the cycles ↵	Matt Davis	2018-09-11	12	-31/+76
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and resource quantities. Summary: This patch removes the storing of accumulated floating point data within the llvm-mca library. This patch splits-up the two quantities: cycles and number of resource units. By splitting-up these two quantities, we delay the calculation of "cycles per resource unit" until that value is read, reducing the chance of accumulating floating point error. I considered using the APFloat, but after measuring performance, for a large (many iteration) sample, I decided to go with this faster solution. Reviewers: andreadb, courbet, RKSimon Reviewed By: andreadb Subscribers: llvm-commits, javed.absar, tschuett, gbedwell Differential Revision: https://reviews.llvm.org/D51903 llvm-svn: 341980
*	[llvm-mca] Fix typo in debug output. NFC.	Matt Davis	2018-09-01	1	-1/+1
\| \| \| \|	llvm-svn: 341281
*	[llvm-mca] correctly initialize field 'CycleRetired' in the TimelineView.	Andrea Di Biagio	2018-08-30	1	-1/+1
\| \| \| \| \| \| \|	This fixes a [-Wmissing-field-initializers] warning reported by buildbot lld-x86_64-darwin13, build #25152. llvm-svn: 341056
*	[llvm-mca] Report the number of dispatched micro opcodes in the ↵	Andrea Di Biagio	2018-08-30	8	-45/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DispatchStatistics view. This patch introduces the following changes to the DispatchStatistics view: * DispatchStatistics now reports the number of dispatched opcodes instead of the number of dispatched instructions. * The "Dynamic Dispatch Stall Cycles" table now also reports the percentage of stall cycles against the total simulated cycles. This change allows users to easily compare dispatch group sizes with the processor DispatchWidth. Before this change, it was difficult to correlate the two numbers, since DispatchStatistics view reported numbers of instructions (instead of opcodes). DispatchWidth defines the maximum size of a dispatch group in terms of number of micro opcodes. The other change introduced by this patch is related to how DispatchStage generates "instruction dispatch" events. In particular: * There can be multiple dispatch events associated with a same instruction * Each dispatch event now encapsulates the number of dispatched micro opcodes. The number of micro opcodes declared by an instruction may exceed the processor DispatchWidth. Therefore, we cannot assume that instructions are always fully dispatched in a single cycle. DispatchStage knows already how to handle instructions declaring a number of opcodes bigger that DispatchWidth. However, DispatchStage always emitted a single instruction dispatch event (during the first simulated dispatch cycle) for instructions dispatched. With this patch, DispatchStage now correctly notifies multiple dispatch events for instructions that cannot be dispatched in a single cycle. A few views had to be modified. Views can no longer assume that there can only be one dispatch event per instruction. Tests (and docs) have been updated. Differential Revision: https://reviews.llvm.org/D51430 llvm-svn: 341055
*	[llvm-mca] Add fields "Total uOps" and "uOps Per Cycle" to the report ↵	Andrea Di Biagio	2018-08-29	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	generated by the SummaryView. This patch adds two new fields to the perf report generated by the SummaryView. Fields are now logically organized into two small groups; only the second group contains throughput indicators. Example: ``` Iterations: 100 Instructions: 300 Total Cycles: 414 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.69 IPC: 0.72 Block RThroughput: 4.0 ``` This patch also updates the docs for llvm-mca. Due to the nature of this change, several tests in the tools/llvm-mca directory were affected, and had to be updated using script `update_mca_test_checks.py`. llvm-svn: 340946
*	[llvm-mca] Don't disable the SummaryView if flag `-all-stats` is false.	Andrea Di Biagio	2018-08-29	1	-1/+0
\| \| \| \|	llvm-svn: 340945
*	[llvm-mca] Remove unused formal. NFC.	Matt Davis	2018-08-29	2	-5/+4
\| \| \| \|	llvm-svn: 340888
*	[llvm-mca] Move the initialization of Pipeline. NFC.	Matt Davis	2018-08-29	1	-2/+2
\| \| \| \| \| \|	Code cleanup to make the pipeline creation routine easier to read. llvm-svn: 340887
*	[llvm-mca] use llvm::any_of instead of std::any_of. NFC	Andrea Di Biagio	2018-08-28	1	-1/+2
\| \| \| \|	llvm-svn: 340863
*	[llvm-mca] Initialize each element in vector TimelineView::UsedBuffers to a ↵	Andrea Di Biagio	2018-08-28	2	-14/+18
\| \| \| \| \| \| \| \| \|	default invalid buffer descriptor. NFCI Also change the default buffer size for UsedBuffer entries to -1 (i.e. "unknown size"). No functional change intended. llvm-svn: 340830
*	[llvm-mca][TimelineView] Force the same number of executions for every entry ↵	Andrea Di Biagio	2018-08-28	2	-70/+106
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	in the 'wait-times' table. This patch also uses colors to highlight problematic wait-time entries. A problematic entry is an entry with an high wait time that tends to match (or exceed) the size of the scheduler's buffer. Color RED is used if an instruction had to wait an average number of cycles which is bigger than (or equal to) the size of the underlying scheduler's buffer. Color YELLOW is used if the time (in cycles) spend waiting for the operands or pipeline resources is bigger than half the size of the underlying scheduler's buffer. Color MAGENTA is used if an instruction does not consume buffer resources according to the scheduling model. llvm-svn: 340825
*	[llvm-mca] Pass an instruction reference when notifying event listeners ↵	Andrea Di Biagio	2018-08-28	5	-32/+30
\| \| \| \| \| \|	about reserved/released buffer resources. NFC llvm-svn: 340821
*	[llvm-mca] Remove unused include. NFC	Andrea Di Biagio	2018-08-27	1	-1/+0
\| \| \| \|	llvm-svn: 340768
*	[llvm-mca] Introduce the llvm-mca library and organize the directory ↵	Matt Davis	2018-08-27	41	-62/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	accordingly. NFC. Summary: This patch introduces llvm-mca as a library. The driver (llvm-mca.cpp), views, and stats, are not part of the library. Those are separate components that are not required for the functioning of llvm-mca. The directory has been organized as follows: All library source files now reside in: - `lib/HardwareUnits/` - All subclasses of HardwareUnit (these represent the simulated hardware components of a backend). (LSUnit does not inherit from HardwareUnit, but Scheduler does which uses LSUnit). - `lib/Stages/` - All subclasses of the pipeline stages. - `lib/` - This is the root of the library and contains library code that does not fit into the Stages or HardwareUnit subdirs. All library header files now reside in the `include` directory and mimic the same layout as the `lib` directory mentioned above. In the (near) future we would like to move the library (include and lib) contents from tools and into the core of llvm somewhere. That change would allow various analysis and optimization passes to make use of MCA functionality for things like cost modeling. I left all of the non-library code just where it has always been, in the root of the llvm-mca directory. The include directives for the non-library source file have been updated to refer to the llvm-mca library headers. I updated the llvm-mca/CMakeLists.txt file to include the library headers, but I made the non-library code explicitly reference the library's 'include' directory. Once we eventually (hopefully) migrate the MCA library components into llvm the include directives used by the non-library source files will be updated to point to the proper location in llvm. Reviewers: andreadb, courbet, RKSimon Reviewed By: andreadb Subscribers: mgorny, javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D50929 llvm-svn: 340755
*	[llvm-mca] Remove unused method. NFC.	Matt Davis	2018-08-27	1	-1/+0
\| \| \| \|	llvm-svn: 340754
*	[llvm-mca] Improved report generated by the SchedulerStatistics view.	Andrea Di Biagio	2018-08-27	2	-67/+99
\| \| \| \| \| \| \| \| \| \| \| \| \|	Before this patch, the SchedulerStatistics only printed the maximum number of buffer entries consumed in each scheduler's queue at a given point of the simulation. This patch restructures the reported table, and adds an extra field named "Average number of used buffer entries" to it. This patch also uses different colors to help identifying bottlenecks caused by high scheduler's buffer pressure. llvm-svn: 340746
*	[llvm-mca] Move ResourceManager from Scheduler into its own file. NFC.	Matt Davis	2018-08-24	5	-621/+672
\| \| \| \| \| \|	This time I should be preserving history of the ResourceManager changes. llvm-svn: 340668
*	[llvm-mca] Revert r340659. NFC.	Matt Davis	2018-08-24	5	-672/+621
\| \| \| \| \| \| \| \|	Choosing to revert the change and do it again, hopefully preserving the history of the changes by using svn copy instead of simply creating a new file from the contents within Scheduler. llvm-svn: 340661
*	[llvm-mca] Move the ResourceManger from the Scheduler into its own file. NFC.	Matt Davis	2018-08-24	5	-621/+672
\| \| \| \|	llvm-svn: 340659
*	[llvm-mca] Move views and stats into a Views subdir. NFC.	Matt Davis	2018-08-24	23	-38/+37
\| \| \| \|	llvm-svn: 340645
*	[llvm-mca] Fix parameter name. NFC.	Walter Lee	2018-08-23	1	-2/+2
\| \| \| \|	llvm-svn: 340570
*	[llvm-mca] Set the Selection strategy to Default if nullptr is passed.	Matt Davis	2018-08-23	2	-7/+20
\| \| \| \| \| \|	* Set (not reset) the strategy in Scheduler::setCustomStrategyImpl() llvm-svn: 340566
*	[llvm-mca] Fix wrong call to setCustomStrategy().	Andrea Di Biagio	2018-08-23	2	-9/+9
\| \| \| \| \| \| \| \| \|	Thanks to @waltl for reporting this issue. I have also added an assert to check for invalid null strategy objects, and I have reworded a couple of code comments in Scheduler.h llvm-svn: 340545
*	[llvm-mca] Allow the definition of custom strategies for selecting processor ↵	Andrea Di Biagio	2018-08-23	2	-124/+182
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	resource units. With this patch, users can now customize the pipeline selection strategy for scheduler resources. The resource selection strategy can be defined at processor resource granularity. This enables the definition of different strategies for different hardware schedulers. To override the strategy associated with a processor resource, users can call method ResourceManager::setCustomStrategy(), and pass a 'ResourceStrategy' object in input. Class ResourceStrategy is an abstract class which declares virtual method `ResourceStrategy::select()`. Method select() is meant to implement the actual strategy; it is responsible for picking the next best resource from a set of available pipeline resources. Custom strategy must simply override that method. By default, processor resources are associated with instances of 'DefaultResourceStrategy'. A 'DefaultResourceStrategy' internally implements a simple round-robin selector. For more details, please refer to the code comments in Scheduler.h. llvm-svn: 340536
*	[llvm-mca] Clean up a comment about the Context class. NFC.	Matt Davis	2018-08-22	2	-2/+2
\| \| \| \|	llvm-svn: 340431
*	[llvm-mca] Remove unused decl. NFC.	Matt Davis	2018-08-22	1	-2/+0
\| \| \| \|	llvm-svn: 340422
*	[llvm-mca] Improved code comments and moved some method definitions from ↵	Andrea Di Biagio	2018-08-22	2	-81/+98
\| \| \| \| \| \|	Scheduler.h to Scheduler.cpp. NFC llvm-svn: 340395
*	[llvm-mca] Remove unused decl. NFC.	Matt Davis	2018-08-21	1	-1/+0
\| \| \| \|	llvm-svn: 340316
*	[llvm-mca] Add the ability to customize the instruction selection strategy ↵	Andrea Di Biagio	2018-08-21	2	-34/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	in the Scheduler. The constructor of Scheduler now accepts a SchedulerStrategy object, which is used internally by method Scheduler::select() to drive the instruction selection process. The goal of this patch is to enable the definition of custom selection strategies while reusing the same algorithms implemented by class Scheduler. The motivation is that, on some targets, the default strategy may not well approximate the selection logic in the hardware schedulers. This patch also adds the ability to pass a ResourceManager object to the constructor of Scheduler. This gives a bit more flexibility to the design, and potentially it allows to expose processor resources to SchedulerStrategy objects. Differential Revision: https://reviews.llvm.org/D51051 llvm-svn: 340314
*	[llvm-mca] Replace use of llvm::any_of with std::any_of.	Andrea Di Biagio	2018-08-21	1	-2/+3
\| \| \| \| \| \|	This should unbreak the buildbots. llvm-svn: 340274
*	[llvm-mca] Add method cycleEvent() to class Scheduler. NFCI	Andrea Di Biagio	2018-08-21	6	-110/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The goal of this patch is to simplify the Scheduler's interface in preparation for D50929. Some methods in the Scheduler's interface should not be exposed to external users, since their presence makes it hard to both understand, and extend the Scheduler's interface. This patch removes the following two methods from the public Scheduler's API: - reclaimSimulatedResources() - updatePendingQueue() Their logic has been migrated to a new method named 'cycleEvent()'. Methods 'updateIssuedSet()' and 'promoteToReadySet()' still exist. However, they are now private members of class Scheduler. This simplifies the interaction with the Scheduler from the ExecuteStage. llvm-svn: 340273
*	[llvm-mca] Remove unused formal parameter. NFC.	Matt Davis	2018-08-20	1	-4/+4
\| \| \| \|	llvm-svn: 340227
*	[llvm-mca] Make the LSUnit a HardwareUnit, and allow derived classes to ↵	Andrea Di Biagio	2018-08-20	6	-143/+153
\| \| \| \| \| \| \| \| \| \| \| \| \|	implement a different memory consistency model. The LSUnit is now a HardwareUnit, and it is owned by the mca::Context. Derived classes can now implement a different consistency model by overriding method `LSUnit::isReady()`. This patch also slightly refactors the Scheduler interface in the attempt to simplifying the interaction between ExecuteStage and the underlying Scheduler. llvm-svn: 340176
*	[llvm-mca] Reformat a few lines (fix spacing). NFC.	Matt Davis	2018-08-17	3	-7/+5
\| \| \| \|	llvm-svn: 340065