bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Remove remaining usages of OperationInst in lib/Transforms.	River Riddle	2019-03-29	3	-37/+27
\| \| \| \|	PiperOrigin-RevId: 232323671
*	Replace the walkOps/visitOperationInst variants from the InstWalkers with ↵	River Riddle	2019-03-29	3	-4/+4
\| \| \| \| \| \|	the Instruction variants. PiperOrigin-RevId: 232322030
*	Update dma-generate pass to (1) work on blocks of instructions (instead of just	Uday Bondhugula	2019-03-29	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	loops), (2) take into account fast memory space capacity and lower 'dmaDepth' to fit, (3) add location information for debug info / errors - change dma-generate pass to work on blocks of instructions (start/end iterators) instead of 'for' loops; complete TODOs - allows DMA generation for straightline blocks of operation instructions interspersed b/w loops - take into account fast memory capacity: check whether memory footprint fits in fastMemoryCapacity parameter, and recurse/lower the depth at which DMA generation is performed until it does fit in the provided memory - add location information to MemRefRegion; any insufficient fast memory capacity errors or debug info w.r.t dma generation shows location information - allow DMA generation pass to be instantiated with a fast memory capacity option (besides command line flag) - change getMemRefRegion to return unique_ptr's - change getMemRefFootprintBytes to work on a 'Block' instead of 'ForInst' - other helper methods; add postDomInstFilter option for replaceAllMemRefUsesWith; drop forInst->walkOps, add Block::walkOps methods Eg. output $ mlir-opt -dma-generate -dma-fast-mem-capacity=1 /tmp/single.mlir /tmp/single.mlir:9:13: error: Total size of all DMA buffers' for this block exceeds fast memory capacity for %i3 = (d0) -> (d0)(%i1) to (d0) -> (d0 + 32)(%i1) { ^ $ mlir-opt -debug-only=dma-generate -dma-generate -dma-fast-mem-capacity=400 /tmp/single.mlir /tmp/single.mlir:9:13: note: 8 KiB of DMA buffers in fast memory space for this block for %i3 = (d0) -> (d0)(%i1) to (d0) -> (d0 + 32)(%i1) { PiperOrigin-RevId: 232297044
*	Fold the functionality of OperationInst into Instruction. OperationInst ↵	River Riddle	2019-03-29	1	-1/+1
\| \| \| \| \| \|	still exists as a forward declaration and will be removed incrementally in a set of followup cleanup patches. PiperOrigin-RevId: 232198540
*	Define the AffineForOp and replace ForInst with it. This patch is largely ↵	River Riddle	2019-03-29	2	-83/+92
\| \| \| \| \| \|	mechanical, i.e. changing usages of ForInst to OpPointer<AffineForOp>. An important difference is that upon construction an AffineForOp no longer automatically creates the body and induction variable. To generate the body/iv, 'createBody' can be called on an AffineForOp with no body. PiperOrigin-RevId: 232060516
*	Define an detail::OperandStorage class to handle managing instruction ↵	River Riddle	2019-03-29	1	-0/+1
\| \| \| \| \| \|	operands. This class stores operands in a similar way to SmallVector except for two key differences. The first is the inline storage, which is a trailing objects array. The second is that being able to dynamically resize the operand list is optional. This means that we can enable the cases where operations need to change the number of operands after construction without losing the spatial locality benefits of the common case (operation instructions / non-control flow instructions with a lifetime fixed number of operands). PiperOrigin-RevId: 231910497
*	Change AffineApplyOp to produce a single result, simplifying the code that	Chris Lattner	2019-03-29	2	-12/+10
\| \| \| \| \| \|	works with it, and updating the g3docs. PiperOrigin-RevId: 231120927
*	Change the ForInst induction variable to be a block argument of the body ↵	River Riddle	2019-03-29	1	-11/+15
\| \| \| \| \| \|	instead of the ForInst itself. This is a necessary step in converting ForInst into an operation. PiperOrigin-RevId: 231064139
*	Drop AffineMap::Null and IntegerSet::Null	Nicolas Vasilache	2019-03-29	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Addresses b/122486036 This CL addresses some leftover crumbs in AffineMap and IntegerSet by removing the Null method and cleaning up the constructors. As the ::Null uses were tracked down, opportunities appeared to untangle some of the Parsing logic and make it explicit where AffineMap/IntegerSet have ambiguous syntax. Previously, ambiguous cases were hidden behind the implicit pointer values of AffineMap* and IntegerSet* that were passed as function parameters. Depending the values of those pointers one of 3 behaviors could occur. This parsing logic convolution is one of the rare cases where I would advocate for code duplication. The more proper fix would be to make the syntax unambiguous or to allow some lookahead. PiperOrigin-RevId: 231058512
*	Update replaceAllMemRefUsesWith to generate single result affine_apply's for	Uday Bondhugula	2019-03-29	1	-4/+9
\| \| \| \| \| \| \| \| \| \| \|	index remapping - generate a sequence of single result affine_apply's for the index remapping (instead of one multi result affine_apply) - update dma-generate and loop-fusion test cases; while on this, change test cases to use single result affine apply ops - some fusion comment fix/cleanup PiperOrigin-RevId: 230985830
*	Update createAffineComputationSlice to generate single result affine maps	Uday Bondhugula	2019-03-29	1	-19/+25
\| \| \| \| \| \| \| \| \| \|	- Update createAffineComputationSlice to generate a sequence of single result affine apply ops instead of one multi-result affine apply - update pipeline-data-transfer test case; while on this, also update the test case to use only single result affine maps, and make it more robust to change. PiperOrigin-RevId: 230965478
*	Update dma-generate: update for multiple load/store op's per memref	Uday Bondhugula	2019-03-29	1	-1/+3
\| \| \| \| \| \| \| \| \| \|	- introduce a way to compute union using symbolic rectangular bounding boxes - handle multiple load/store op's to the same memref by taking a union of the regions - command-line argument to provide capacity of the fast memory space - minor change to replaceAllMemRefUsesWith to not generate affine_apply if the supplied index remap was identity PiperOrigin-RevId: 230848185
*	Introduce a new operation hook point for implementing simple local	Chris Lattner	2019-03-29	1	-7/+44
\| \| \| \| \| \| \| \| \| \| \|	canonicalizations of operations. The ultimate important user of this is going to be a funcBuilder->foldOrCreate<YourOp>(...) API, but for now it is just a more convenient way to write certain classes of canonicalizations (see the change in StandardOps.cpp). NFC. PiperOrigin-RevId: 230770021
*	Add cloning functionality to Block and Function, this also adds support for ↵	River Riddle	2019-03-29	1	-7/+7
\| \| \| \| \| \|	remapping successor block operands of terminator operations. We define a new BlockAndValueMapping class to simplify mapping between cloned values. PiperOrigin-RevId: 230768759
*	loop unroll update: unroll factor one for a single iteration loop	Uday Bondhugula	2019-03-29	1	-1/+4
\| \| \| \| \| \| \| \|	- unrolling a single iteration loop by a factor of one should promote its body into its parent; this makes it consistent with the behavior/expectation that unrolling a loop by a factor equal to its trip count makes the loop go away. PiperOrigin-RevId: 230426499
*	Allocate private/local buffers for slices accurately during fusion	Uday Bondhugula	2019-03-29	2	-21/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- the size of the private memref created for the slice should be based on the memref region accessed at the depth at which the slice is being materialized, i.e., symbolic in the outer IVs up until that depth, as opposed to the region accessed based on the entire domain. - leads to a significant contraction of the temporary / intermediate memref whenever the memref isn't reduced to a single scalar (through store fwd'ing). Other changes - update to promoteIfSingleIteration - avoid introducing unnecessary identity map affine_apply from IV; makes it much easier to write and read test cases and pass output for all passes that use promoteIfSingleIteration; loop-fusion test cases become much simpler - fix replaceAllMemrefUsesWith bug that was exposed by the above update - 'domInstFilter' could be one of the ops erased due to a memref replacement in it. - fix getConstantBoundOnDimSize bug: a division by the coefficient of the identifier was missing (the latter need not always be 1); add lbFloorDivisors output argument - rename getBoundingConstantSizeAndShape -> getConstantBoundingSizeAndShape PiperOrigin-RevId: 230405218
*	Minor code cleanup - NFC.	Uday Bondhugula	2019-03-29	1	-8/+4
\| \| \| \| \| \|	- readability changes PiperOrigin-RevId: 229443430
*	Swap the type and attribute parameter in ConstantOp::build()	Lei Zhang	2019-03-29	1	-2/+2
\| \| \| \| \| \| \|	This is to keep consistent with other TableGen generated builders so that we can also use this builder in TableGen rules. PiperOrigin-RevId: 229244630
*	Simplify compositions of AffineApply	Nicolas Vasilache	2019-03-29	1	-81/+7
\| \| \| \| \| \| \| \|	This CL is the 6th and last on the path to simplifying AffineMap composition. This removes `AffineValueMap::forwardSubstitutions` and replaces it by simple calls to `fullyComposeAffineMapAndOperands`. PiperOrigin-RevId: 228962580
*	Misc readability and doc / code comment related improvements - NFC	Uday Bondhugula	2019-03-29	1	-22/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- when SSAValue/MLValue existed, code at several places was forced to create additional aggregate temporaries of SmallVector<SSAValue/MLValue> to handle the conversion; get rid of such redundant code - use filling ctors instead of explicit loops - for smallvectors, change insert(list.end(), ...) -> append(... - improve comments at various places - turn getMemRefAccess into MemRefAccess ctor and drop duplicated getMemRefAccess. In the next CL, provide getAccess() accessors for load, store, DMA op's to return a MemRefAccess. PiperOrigin-RevId: 228243638
*	Merge LowerAffineApplyPass into LowerIfAndForPass, rename to LowerAffinePass	Alex Zinenko	2019-03-29	1	-137/+0
\| \| \| \| \| \| \| \| \| \| \| \|	This change is mechanical and merges the LowerAffineApplyPass and LowerIfAndForPass into a single LowerAffinePass. It makes a step towards defining an "affine dialect" that would contain all polyhedral-related constructs. The motivation for merging these two passes is based on retiring MLFunctions and, eventually, transforming If and For statements into regular operations. After that happens, LowerAffinePass becomes yet another legalization. PiperOrigin-RevId: 227566113
*	LowerForAndIf: expand affine_apply's inplace	Alex Zinenko	2019-03-29	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Existing implementation was created before ML/CFG unification refactoring and did not concern itself with further lowering to separate concerns. As a result, it emitted `affine_apply` instructions to implement `for` loop bounds and `if` conditions and required a follow-up function pass to lower those `affine_apply` to arithmetic primitives. In the unified function world, LowerForAndIf is mostly a lowering pass with low complexity. As we move towards a dialect for affine operations (including `for` and `if`), it makes sense to lower `for` and `if` conditions directly to arithmetic primitives instead of relying on `affine_apply`. Expose `expandAffineExpr` function in LoweringUtils. Use this function together with `expandAffineMaps` to emit primitives that implement loop and branch conditions directly. Also remove tests that become unnecessary after transforming LowerForAndIf into a function pass. PiperOrigin-RevId: 227563608
*	Refactor LowerAffineApply	Alex Zinenko	2019-03-29	1	-35/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In LoweringUtils, extract out `expandAffineMap`. This function takes an affine map and a list of values the map should be applied to and emits a sequence of arithmetic instructions that implement the affine map. It is independent of the AffineApplyOp and can be used in places where we need to insert an evaluation of an affine map without relying on a (temporary) `affine_apply` instruction. This prepares for a merge between LowerAffineApply and LowerForAndIf passes. Move the `expandAffineApply` function to the LowerAffineApply pass since it is the only place that must be aware of the `affine_apply` instructions. PiperOrigin-RevId: 227563439
*	Update and generalize various passes to work on both CFG and ML functions,	Chris Lattner	2019-03-29	1	-2/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	simplifying them in minor ways. The only significant cleanup here is the constant folding pass. All the other changes are simple and easy, but this is still enough to shrink the compiler by 45LOC. The one pass left to merge is the CSE pass, which will be move involved, so I'm splitting it out to its own patch (which I'll tackle right after this). This is step 28/n towards merging instructions and statements. PiperOrigin-RevId: 227328115
*	Simplify the remapFunctionAttrs logic, merging CFG/ML function handling.	Chris Lattner	2019-03-29	1	-30/+3
\| \| \| \| \| \| \| \| \| \| \|	Remove an unnecessary restriction in forward substitution. Slightly simplify LLVM IR lowering, which previously would crash if given an ML function, it should now produce a clean error if given a function with an if/for instruction in it, just like it does any other unsupported op. This is step 27/n towards merging instructions and statements. PiperOrigin-RevId: 227324542
*	Simplify GreedyPatternRewriteDriver now that functions are merged into one	Chris Lattner	2019-03-29	1	-116/+50
\| \| \| \| \| \| \| \| \|	representation, shrinking by 70LOC. The PatternRewriter class can probably also be simplified as well, but one step at a time. This is step 26/n towards merging instructions and statements. NFC. PiperOrigin-RevId: 227324218
*	Introduce PostDominanceInfo, fix properlyDominates() for Instructions	Uday Bondhugula	2019-03-29	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- introduce PostDominanceInfo in the right/complete way and use that for post dominance check in store-load forwarding - replace all uses of Analysis/Utils::dominates/properlyDominates with DominanceInfo::dominates/properlyDominates - drop all redundant copies of dominance methods in Analysis/Utils/ - in pipeline-data-transfer, replace dominates call with a much less expensive check; similarly, substitute dominates() in checkMemRefAccessDependence with a simpler check suitable for that context - fix a bug in properlyDominates - improve doc for 'for' instruction 'body' PiperOrigin-RevId: 227320507
*	Extend InstVisitor and Walker to handle arbitrary CFG functions, expand the	Chris Lattner	2019-03-29	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	Function::walk functionality into f->walkInsts/Ops which allows visiting all instructions, not just ops. Eliminate Function::getBody() and Function::getReturn() helpers which crash in CFG functions, and were only kept around as a bridge. This is step 25/n towards merging instructions and statements. PiperOrigin-RevId: 227243966
*	Standardize naming of statements -> instructions, revisting the code base to be	Chris Lattner	2019-03-29	3	-222/+222
\| \| \| \| \| \| \| \| \|	consistent and moving the using declarations over. Hopefully this is the last truly massive patch in this refactoring. This is step 21/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227178245
*	Rename BasicBlock and StmtBlock to Block, and make a pass cleaning it up. I ↵	Chris Lattner	2019-03-29	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \|	did not make an effort to rename all of the 'bb' names in the codebase, since they are still correct and any specific missed once can be fixed up on demand. The last major renaming is Statement -> Instruction, which is why Statement and Stmt still appears in various places. This is step 19/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227163082
*	Eliminate the using decls for MLFunction and CFGFunction standardizing on	Chris Lattner	2019-03-29	3	-7/+7
\| \| \| \| \| \| \| \|	Function. This is step 18/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227139399
*	Rename BBArgument -> BlockArgument, Op::getOperation -> Op::getInst(),	Chris Lattner	2019-03-29	2	-8/+8
\| \| \| \| \| \| \| \|	StmtResult -> InstResult, StmtOperand -> InstOperand, and remove the old names. This is step 17/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227121537
*	Merge Operation into OperationInst and standardize nomenclature around	Chris Lattner	2019-03-29	4	-38/+38
\| \| \| \| \| \| \| \|	OperationInst. This is a big mechanical patch. This is step 16/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227093712
*	Rework inherentance hierarchy: Operation now derives from Statement, and	Chris Lattner	2019-03-29	1	-13/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	OperationInst derives from it. This allows eliminating some forwarding functions, other complex code handling multiple paths, and the 'isStatement' bit tracked by Operation. This is the last patch I think I can make before the big mechanical change merging Operation into OperationInst, coming next. This is step 15/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227077411
*	Minor renamings: Trim the "Stmt" prefix off	Chris Lattner	2019-03-29	1	-4/+3
\| \| \| \| \| \| \| \| \|	StmtSuccessorIterator/StmtSuccessorIterator, and rename and move the CFGFunctionViewGraph pass to ViewFunctionGraph. This is step 13/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227069438
*	Merge CFGFuncBuilder/MLFuncBuilder/FuncBuilder together into a single new	Chris Lattner	2019-03-29	3	-18/+18
\| \| \| \| \| \| \| \|	FuncBuilder class. Also rename SSAValue.cpp to Value.cpp This is step 12/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227067644
*	Merge SSAValue, CFGValue, and MLValue together into a single Value class, which	Chris Lattner	2019-03-29	4	-45/+40
\| \| \| \| \| \| \| \| \|	is the new base of the SSA value hierarchy. This CL also standardizes all the nomenclature and comments to use 'Value' where appropriate. This also eliminates a large number of cast<MLValue>(x)'s, which is very soothing. This is step 11/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227064624
*	Eliminate the Instruction, BasicBlock, CFGFunction, MLFunction, and ↵	Chris Lattner	2019-03-29	2	-17/+18
\| \| \| \| \| \| \| \| \| \| \| \|	ExtFunction classes, using the Statement/StmtBlock hierarchy and Function instead. This only changes the internal data structures, it does not affect the user visible syntax or structure of MLIR code. Function gets new "isCFG()" sorts of predicates as a transitional measure. This patch is gross in a number of ways, largely in an effort to reduce the amount of mechanical churn in one go. It introduces a bunch of using decls to keep the old names alive for now, and a bunch of stuff needs to be renamed. This is step 10/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 227044402
*	Rename findFunction from the ML side of the house to be named getFunction(),	Chris Lattner	2019-03-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	making it more similar to the CFG side of things. It is true that in a deeply nested case that this is not a guaranteed O(1) time operation, and that 'get' could lead compiler hackers to think this is cheap, but we need to merge these and we can look into solutions for this in the future if it becomes a problem in practice. This is step 9/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 226983931
*	Refactor MLFunction to contain a StmtBlock for its body instead of inheriting	Chris Lattner	2019-03-29	2	-2/+3
\| \| \| \| \| \| \| \| \| \|	from it. This is necessary progress to squaring away the parent relationship that a StmtBlock has with its enclosing if/for/fn, and makes room for functions to have more than one block in the future. This also removes IfClause and ForStmtBody. This is step 5/n towards merging instructions and statements, NFC. PiperOrigin-RevId: 226936541
*	Refactor ForStmt: having it contain a StmtBlock instead of subclassing	Chris Lattner	2019-03-29	1	-9/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	StmtBlock. This is more consistent with IfStmt and also conceptually makes more sense - a forstmt "isn't" its body, it contains its body. This is step 1/N towards merging BasicBlock and StmtBlock. This is required because in the new regime StmtBlock will have a use list (just like BasicBlock does) of operands, and ForStmt already has a use list for its induction variable. This is a mechanical patch, NFC. PiperOrigin-RevId: 226684158
*	Check if the operation is already in the worklist before adding it.	River Riddle	2019-03-29	1	-0/+4
\| \| \| \|	PiperOrigin-RevId: 225379496
*	Update/Fix LoopUtils::stmtBodySkew to handle loop step.	Uday Bondhugula	2019-03-29	1	-56/+64
\| \| \| \| \| \| \| \| \| \| \|	- loop step wasn't handled and there wasn't a TODO or an assertion; fix this. - rename 'delay' to shift for consistency/readability. - other readability changes. - remove duplicate attribute print for DmaStartOp; fix misplaced attribute print for DmaWaitOp - add build method for AddFOp (unrelated to this CL, but add it anyway) PiperOrigin-RevId: 224892958
*	Update/fix -pipeline-data-transfer; fix b/120770946	Uday Bondhugula	2019-03-29	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	- fix replaceAllMemRefUsesWith call to replace only inside loop body. - handle the case where DMA buffers are dynamic; extend doubleBuffer() method to handle dynamically shaped DMA buffers (pass the right operands to AllocOp) - place alloc's for DMA buffers at the depth at which pipelining is being done (instead of at top-level) - add more test cases PiperOrigin-RevId: 224852231
*	Fix cases where unsigned / signed arithmetic was being mixed (following up on	Uday Bondhugula	2019-03-29	1	-4/+4
\| \| \| \| \| \| \| \| \|	cl/224246657); eliminate repeated evaluation of exprs in loop upper bounds. - while on this, sweep through and fix potential repeated evaluation of expressions in loop upper bounds PiperOrigin-RevId: 224268918
*	Minor fix for replaceAllMemRefUsesWith.	Uday Bondhugula	2019-03-29	1	-12/+10
\| \| \| \| \| \| \| \| \| \| \|	The check for whether the memref was used in a non-derefencing context had to be done inside, i.e., only for the op stmt's that the replacement was specified to be performed on (by the domStmtFilter arg if provided). As such, it is completely fine for example for a function to return a memref while the replacement is being performed only a specific loop's body (as in the case of DMA generation). PiperOrigin-RevId: 223827753
*	Split "rewrite" functionality out of Pattern into a new RewritePattern derived	Chris Lattner	2019-03-29	1	-5/+11
\| \| \| \| \| \| \| \|	class. This change is NFC, but allows for new kinds of patterns, specifically LegalizationPatterns which will be allowed to change the types of things they rewrite. PiperOrigin-RevId: 223243783
*	Rename Deaffinator to LowerAffineApply and patch it.	Alex Zinenko	2019-03-29	1	-14/+15
\| \| \| \| \| \| \| \|	Several things were suggested in post-submission reviews. In particular, use pointers in function interfaces instead of references (still use references internally). Clarify the behavior of the pass in presence of MLFunctions. PiperOrigin-RevId: 222556851
*	Fix bugs in DMA generation and FlatAffineConstraints; add more test	Uday Bondhugula	2019-03-29	1	-9/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	cases. - fix bug in calculating index expressions for DMA buffers in certain cases (affected tiled loop nests); add more test cases for better coverage. - introduce an additional optional argument to replaceAllMemRefUsesWith; additional operands to the index remap AffineMap can now be supplied by the client. - FlatAffineConstraints::addBoundsForStmt - fix off by one upper bound, ::composeMap - fix position bug. - Some clean up and more comments PiperOrigin-RevId: 222434628
*	Introduce Deaffinator pass.	Alex Zinenko	2019-03-29	1	-0/+136
\| \| \| \| \| \| \| \| \| \| \| \| \|	This function pass replaces affine_apply operations in CFG functions with sequences of primitive arithmetic instructions that form the affine map. The actual replacement functionality is located in LoweringUtils as a standalone function operating on an individual affine_apply operation and inserting the result at the location of the original operation. It is expected to be useful for other, target-specific lowering passes that may start at MLFunction level that Deaffinator does not support. PiperOrigin-RevId: 222406692