bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[StackProtector] Ignore certain intrinsics when calculating sspstrong heuristic.	Matt Davis	2018-04-06	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The 'strong' StackProtector heuristic takes into consideration call instructions. Certain intrinsics, such as lifetime.start, can cause the StackProtector to protect functions that do not need to be protected. Specifically, a volatile variable, (not optimized away), but belonging to a stack allocation will encourage a llvm.lifetime.start to be inserted during compilation. Because that intrinsic is a 'call' the strong StackProtector will see that the alloca'd variable is being passed to a call instruction, and insert a stack protector. In this case the intrinsic isn't really lowered to a call. This can cause unnecessary stack checking, at the cost of additional (wasted) CPU cycles. In the future we should rely on TargetTransformInfo::isLoweredToCall, but as of now that routine considers all intrinsics as not being lowerable. That needs to be corrected, and such a change is on my list of things to get moving on. As a side note, the updated stack-protector-dbginfo.ll test always seems to pass. I never see the dbg.declare/dbg.value reaching the StackProtector::HasAddressTaken, but I don't see any code excluding dbg intrinsic calls either, so I think it's the safest thing to do. Reviewers: void, timshen Reviewed By: timshen Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D45331 llvm-svn: 329450
*	[CodeGen] Change std::sort to llvm::sort in response to r327219	Mandeep Singh Grang	2018-04-06	20	-76/+79
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: r327219 added wrappers to std::sort which randomly shuffle the container before sorting. This will help in uncovering non-determinism caused due to undefined sorting order of objects having the same key. To make use of that infrastructure we need to invoke llvm::sort instead of std::sort. Note: This patch is one of a series of patches to replace all std::sort to llvm::sort. Refer the comments section in D44363 for a list of all the required patches. Reviewers: bogner, rnk, MatzeB, RKSimon Reviewed By: rnk Subscribers: JDevlieghere, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D45133 llvm-svn: 329435
*	[MIR] Add support for MachineFrameInfo::LocalFrameSize	Francis Visoiu Mistrih	2018-04-06	2	-0/+2
\| \| \| \| \| \| \| \| \| \|	MFI.LocalFrameSize was not serialized. It is usually set from LocalStackSlotAllocation, so if that pass doesn't run it is impossible do deduce it from the stack objects. Until now, this information was lost. llvm-svn: 329382
*	ARM: Do not spill CSR to stack on entry to noreturn functions	Tim Northover	2018-04-05	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	A noreturn nounwind function can be expected to never return in any way, and by never returning it will also never have to restore any callee-saved registers for its caller. This makes it possible to skip spills of those registers during function entry, saving some stack space and time in the process. This is rather useful for embedded targets with limited stack space. Should fix PR9970. Patch by myeisha (pmb). llvm-svn: 329287
*	[DAGCombine] Revert r329160	Sam Parker	2018-04-05	1	-26/+0
\| \| \| \| \| \|	Again, broke the big endian stage 2 builders. llvm-svn: 329283
*	[MIR-Canon] Fixing warnings in Non-assert builds.	Puyan Lotfi	2018-04-05	1	-2/+2
\| \| \| \|	llvm-svn: 329258
*	[MIR-Canon] Improving performance by switching to named vregs.	Puyan Lotfi	2018-04-05	1	-62/+61
\| \| \| \| \| \|	No more skipping thounsands of vregs. Much faster running time. llvm-svn: 329246
*	[MIR-Canon] Adding support for multi-def -> user distance reduction.	Puyan Lotfi	2018-04-05	1	-0/+17
\| \| \| \|	llvm-svn: 329243
*	[MachineOutliner] Add `useMachineOutliner` target hook	Jessica Paquette	2018-04-04	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	The MachineOutliner has a bunch of target hooks that will call llvm_unreachable if the target doesn't implement them. Therefore, if you enable the outliner on such a target, it'll just crash. It'd be much better if it'd just not run the outliner at all in this case. This commit adds a hook to TargetInstrInfo that returns false by default. Targets that implement the hook make it return true. The outliner checks the return value of this hook to decide whether or not to continue. llvm-svn: 329220
*	Fix build breakage from r329201	Pavel Labath	2018-04-04	2	-5/+5
\| \| \| \| \| \| \| \| \|	Some compilers do not like having an enum type and a variable with the same name (AccelTableKind). I rename the variable to TheAccelTableKind. Suggestions for a better name welcome. llvm-svn: 329202
*	Re-commit r329179 after fixing build&test issues	Pavel Labath	2018-04-04	3	-25/+332
\| \| \| \| \| \| \| \| \| \| \|	- MSVC was not OK with a static_assert referencing a non-static member variable, even though it was just in a sizeof(expression). I move the assert into the emit function, where it is probably more useful. - Tests were failing in builds which did not have the X86 target configured. Since this functionality is not target-specific, I have removed the target specifiers from the .ll files. llvm-svn: 329201
*	Revert r329179 (and follow-up unsuccessful fix attempts 329184, 329186); it ↵	Nico Weber	2018-04-04	3	-332/+25
\| \| \| \| \| \|	doesn't build. llvm-svn: 329190
*	Attempt to fix bots more after r329179.	Nico Weber	2018-04-04	1	-1/+1
\| \| \| \|	llvm-svn: 329186
*	Attempt to fix bots after r329179.	Nico Weber	2018-04-04	1	-2/+2
\| \| \| \|	llvm-svn: 329184
*	[CodeGen] Generate DWARF v5 Accelerator Tables	Pavel Labath	2018-04-04	3	-25/+332
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch adds a DwarfAccelTableEmitter class, which generates an accelerator table, as specified in DWARF v5 standard. At the moment it only generates a DIE offset column and (if we are indexing more than one compile unit) a CU column. Indexing type units is not currently supported, as we don't even have the ability to generate DWARF v5-compatible compile units. The implementation is not data-source agnostic like the one generating apple tables. This was not necessary as we currently only have one user of this code, and without a second user it was not obvious to me how to best abstract this. (The difference between these tables and the apple ones is that they need a lot more metadata about the debug info they are indexing). The generation is triggered by the --accel-tables argument, which supersedes the --dwarf-accel-tables arg -- the latter was a simple on-off switch, but not we can choose between two kinds of accelerator tables we can generate. This is tested by parsing the generated tables with llvm-dwarfdump and the DWARFVerifier, and I've also checked that GNU readelf is able to make sense of the tables. Differential Revision: https://reviews.llvm.org/D43286 llvm-svn: 329179
*	[DAGCombine] Improve ReduceLoadWidth for SRL	Sam Parker	2018-04-04	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Recommitting rL321259. Previosuly this caused an issue with PPCBE but I didn't receieve a reproducer and didn't have the time to follow up. If the issue appears again, please provide a reproducer so I can fix it. Original commit message: If the SRL node is only used by an AND, we may be able to set the ExtVT to the width of the mask, making the AND redundant. To support this, another check has been added in isLegalNarrowLoad which queries whether the load is valid. Differential Revision: https://reviews.llvm.org/D41350 llvm-svn: 329160
*	[MachineOutliner][NFC] Make outlined functions have internal linkage	Jessica Paquette	2018-04-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The linkage type on outlined functions was private before. This meant that if you set a breakpoint in an outlined function, the debugger wouldn't be able to give a sane name to the outlined function. This commit changes the linkage type to internal and updates any tests that relied on the prefixes on the names of outlined functions. llvm-svn: 329116
*	[CodeGen]Add NoVRegs property on PostRASink and ShrinkWrap	Jun Bum Lim	2018-04-03	2	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change declare that PostRAMachineSinking and ShrinkWrap require NoVRegs property, so now the MachineFunctionPass can enforce this check. These passes are disabled in NVPTX & WebAssembly. Reviewers: dschuff, jlebar, tra, jgravelle-google, MatzeB, sebpop, thegameg, mcrosier Reviewed By: dschuff, thegameg Subscribers: jholewinski, jfb, sbc100, aheejin, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D45183 llvm-svn: 329095
*	Adding optional Name parameter to createVirtualRegister and ↵	Puyan Lotfi	2018-04-03	1	-4/+5
\| \| \| \| \| \|	createGenericVirtualRegister. llvm-svn: 329076
*	Align stubs for external and common global variables to pointer size.	Rafael Espindola	2018-04-02	1	-0/+1
\| \| \| \| \| \| \| \| \|	This patch fixes PR36885: clang++ generates unaligned stub symbol holding a pointer. Patch by Rahul Chaudhry! llvm-svn: 329030
*	[dsymutil] Upstream emitting of papertrail warnings.	Jonas Devlieghere	2018-04-02	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	When running dsymutil as part of your build system, it can be desirable for warnings to be part of the end product, rather than just being emitted to the output stream. This patch upstreams that functionality. Differential revision: https://reviews.llvm.org/D44639 llvm-svn: 328965
*	[DAGCombine] (float)((int) f) --> ftrunc (PR36617)	Sanjay Patel	2018-03-31	1	-0/+18
\| \| \| \| \| \| \| \| \| \|	fptosi / fptoui round towards zero, and that's the same behavior as ISD::FTRUNC, so replace a pair of casts with the equivalent node. We don't have to account for special cases (NaN, INF) because out-of-range casts are undefined. Differential Revision: https://reviews.llvm.org/D44909 llvm-svn: 328921
*	[MIR-Canon] Adding support for local idempotent instruction hoisting.	Puyan Lotfi	2018-03-31	1	-5/+116
\| \| \| \|	llvm-svn: 328915
*	Fix a bunch of typoes. NFC	Fangrui Song	2018-03-30	3	-3/+3
\| \| \| \|	llvm-svn: 328907
*	[MIR] Adding support for Named Virtual Registers in MIR.	Puyan Lotfi	2018-03-30	9	-11/+79
\| \| \| \|	llvm-svn: 328887
*	[BlockPlacement] Disable block placement tail duplciation in structured CFG.	Tim Shen	2018-03-30	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Tail duplication easily breaks the structure of CFG, e.g. duplicating on a region entry. If the structure is intended to be preserved, then we may want to configure tail duplication, or disable it for structured CFG. From our benchmark results disabling it doesn't cause performance regression. Notice that this currently affects AMDGPU backend. In the next patch, I also plan to turn on requiresStructuredCFG for NVPTX. All unit tests still pass. Reviewers: jlebar, arsenm Subscribers: jholewinski, sanjoy, wdng, tpr, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45008 llvm-svn: 328884
*	[SelectionDAG] Removing FABS folding from DAGCombiner	Sanjay Patel	2018-03-30	1	-18/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code has bugs dealing with -0.0. Since D44550 introduced FABS pattern folding in InstCombine, this patch removes the now-redundant code that causes https://bugs.llvm.org/show_bug.cgi?id=36600. Patch by Mikhail Dvoretckii! Differential Revision: https://reviews.llvm.org/D44683 llvm-svn: 328872
*	[MachineCopyPropagation] Handle COPY with overlapping source/dest.	Eli Friedman	2018-03-30	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MachineCopyPropagation::CopyPropagateBlock has a bunch of special handling for COPY instructions. This handling assumes that COPY instructions do not modify the source of the copy; this is wrong if the COPY destination overlaps the source. To fix the bug, check explicitly for this situation, and fall back to the generic instruction handling. This bug can't happen for most register classes because they don't have this sort of overlap, but there are a few register classes where this is possible. The testcase uses the AArch64 QQQQ register class. Differential Revision: https://reviews.llvm.org/D44911 llvm-svn: 328851
*	Style update. NFC.	Rafael Espindola	2018-03-29	10	-75/+75
\| \| \| \| \| \| \|	Rename 3 functions to start with lowercase letters. Don't repeat the name in the comments. llvm-svn: 328848
*	[IR][CodeGen] Remove dependency on EVT from IR/Function.cpp. Move EVT to ↵	Craig Topper	2018-03-29	15	-13/+335
\| \| \| \| \| \| \| \| \| \| \| \|	CodeGen layer. Currently EVT is in the IR layer only because of Function.cpp needing a very small piece of the functionality of EVT::getEVTString(). The rest of EVT is used in codegen making CodeGen a better place for it. The previous code converted a Type* to EVT and then called getEVTString. This was only expected to handle the primitive types from Type*. Since there only a few primitive types, we can just print them as strings directly. Differential Revision: https://reviews.llvm.org/D45017 llvm-svn: 328806
*	Reapply "[DWARFv5] Emit file 0 to the line table."	Paul Robinson	2018-03-29	3	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DWARF v5 specifies that the root file (also given in the DW_AT_name attribute of the compilation unit DIE) should be emitted explicitly to the line table's list of files. This makes the line table more independent of the .debug_info section. We emit the new syntax only for DWARF v5 and later. Fixes the bug found by asan. Also XFAIL the new test for Darwin, which is stuck on DWARF v2, and fix up other tests so they stop failing on Windows. Last but not least, don't break "clang -g" of an assembler file that has .file directives in it. Differential Revision: https://reviews.llvm.org/D44054 llvm-svn: 328805
*	Plumb useAA through TargetTransformInfo to remove Transforms->CodeGen header ↵	David Blaikie	2018-03-28	1	-2/+1
\| \| \| \| \| \| \| \|	dependency Thanks to echristo for the pointers on direction. llvm-svn: 328737
*	[PostRAMachineSink] preserve CFG	Jun Bum Lim	2018-03-28	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Mark CFG is preserved since this pass do not make any change in CFG. Reviewers: sebpop, mzolotukhin, mcrosier Reviewed By: mzolotukhin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D44845 llvm-svn: 328727
*	Transforms: Introduce Transforms/Utils.h rather than spreading the ↵	David Blaikie	2018-03-28	1	-0/+1
\| \| \| \| \| \| \| \| \|	declarations amongst Scalar.h and IPO.h Fixes layering - Transforms/Utils shouldn't depend on including a Scalar or IPO header, because Scalar and IPO depend on Utils. llvm-svn: 328717
*	Revert "Reapply "[DWARFv5] Emit file 0 to the line table.""	Alexander Potapenko	2018-03-28	3	-12/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r328676. Commit r328676 broke the -no-integrated-as flag necessary to build Linux kernel with Clang: $ cat t.c void foo() {} $ clang -no-integrated-as -c t.c -g /tmp/t-dcdec5.s: Assembler messages: /tmp/t-dcdec5.s:8: Error: file number less than one clang-7.0: error: assembler command failed with exit code 1 (use -v to see invocation) llvm-svn: 328699
*	[RegisterCoalescing] Don't move COPY if it would interfere with another value	Mikael Holmen	2018-03-28	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: RegisterCoalescer::removePartialRedundancy tries to hoist B = A from BB0/BB2 to BB1: BB1: ... BB0/BB2: ---- B = A; \| ... \| A = B; \| \|------- \| It does so if a number of conditions are fulfilled. However, it failed to check if B was used by any of the terminators in BB1. Since we must insert B = A before the terminators (since it's not a terminator itself), this means that we could erroneously insert a new definition of B before a use of it. Reviewers: wmi, qcolombet Reviewed By: wmi Subscribers: MatzeB, llvm-commits, sdardis Differential Revision: https://reviews.llvm.org/D44918 llvm-svn: 328689
*	Reapply "[DWARFv5] Emit file 0 to the line table."	Paul Robinson	2018-03-27	3	-9/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DWARF v5 specifies that the root file (also given in the DW_AT_name attribute of the compilation unit DIE) should be emitted explicitly to the line table's list of files. This makes the line table more independent of the .debug_info section. Fixes the bug found by asan. Also XFAIL the new test for Darwin, which is stuck on DWARF v2, and fix up other tests so they stop failing on Windows. Last but not least, don't break "clang -g" of an assembler file that has .file directives in it. Differential Revision: https://reviews.llvm.org/D44054 llvm-svn: 328676
*	[DWARF] Suppress split line tables more carefully.	Paul Robinson	2018-03-27	4	-18/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a given split type unit does not have source locations, don't have it refer to the split line table. If no split type unit refers to the split line table, don't emit the line table at all. This will save a little space on rare occasions, but also refactors things a bit to improve which class is responsible for what. Responding to review comments on r326395. Differential Revision: https://reviews.llvm.org/D44220 llvm-svn: 328670
*	[CodeGen] Fixed unreachable with -print-machineinstrs and custom pseudo ↵	Tim Renouf	2018-03-27	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	source value Summary: Rev 327580 "[CodeGen] Use MIR syntax for MachineMemOperand printing" broke -print-machineinstrs for us on AMDGPU, because we have custom pseudo source values, and MIR serialization does not implement that. This commit at least restores the functionality of -print-machineinstrs, even if it does not properly implement the missing MIR serialization functionality. Differential Revision: https://reviews.llvm.org/D44871 Change-Id: I44961c0b90bf6d48c01484ed7a4e466fd300db66 llvm-svn: 328668
*	Remap values in PromotedFloats	Pirama Arumuga Nainar	2018-03-27	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When a node is about to be erased from ReplacedValues, we should also remap its corresponding values in PromotedFloats. Patch by Yan Luo (Yan.Luo2@synopsys.com) Reviewers: pirama Reviewed By: pirama Subscribers: lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D44872 llvm-svn: 328644
*	Use local symbols for creating .stack-size.	Rafael Espindola	2018-03-26	1	-2/+3
\| \| \| \|	llvm-svn: 328581
*	Disable [MachineLICM] Add functions to MachineLICM to hoist invariant stores	Zaara Syeda	2018-03-26	1	-1/+1
\| \| \| \| \| \| \|	Disable https://reviews.llvm.org/D40196 with setting option hoist-const-stores to false since failing s390 buildbot. llvm-svn: 328555
*	[Pipeliner] Several node-ordering fixes	Krzysztof Parzyszek	2018-03-26	1	-18/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	First, we change the heuristic that is used to ignore the recurrent node-sets in the node ordering. In certain cases it's not important to focus on the recurrent node-sets. Instead, the algorithm begins by considering all the instructions in the node ordering step. Second, a minor change to the bottom up traversal, which needs to consider loop carried dependences (modeled as anti dependences). Previously, these instructions were skipped, which caused problems because the instruction ends up having both predecessors and sucessors in the schedule. Third, consider anti-dependences as a tie breaker when choosing between instructions in the node ordering. We want to make sure that the source of the anti-dependence does not end up with both predecesssors and sucessors in the final node ordering. Patch by Brendon Cahoon. llvm-svn: 328554
*	[Pipeliner] Check for affine expression in isLoopCarriedOrder	Krzysztof Parzyszek	2018-03-26	1	-1/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pipeliner must add a loop carried dependence between two memory operations if the base register is not an affine (linear) exression. The current implementation doesn't check how the base register is defined, which allows non-affine expressions, and then the pipeliner does not add a loop carried dependence when one is needed. This patch adds code to isLoopCarriedOrder that checks if the base register of the memory operations is defined by a phi, and the loop definition for the phi is a constant increment value. This is a very simple check for a linear expression. Patch by Brendon Cahoon. llvm-svn: 328550
*	[Pipeliner] Add missing loop carried dependences	Krzysztof Parzyszek	2018-03-26	1	-14/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pipeliner is not adding a dependence edge for a loop carried dependence, and ends up scheduling a load from iteration n prior to an aliased store in iteration n-1. The code that adds the loop carried dependences in the pipeliner doesn't check if the memory objects for loads and stores are "identified" (i.e., distinct) objects. If they are not, then the code that adds the dependences needs to be conservative. The objects can be used to check dependences only when they are distinct objects. The code that checks for loop carried dependences has been updated to classify loads and stores that are not identified as "unknown" values. A store with an "unknown" value can potentially create a loop carried dependence with any pending load. Patch by Brendon Cahoon. llvm-svn: 328547
*	[Pipeliner] Fix renaming in pipeliner when eliminating phis	Krzysztof Parzyszek	2018-03-26	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The phi renaming code in the pipeliner uses the wrong value when rewriting phi uses, which results in an undefined value. In this case, the original phi is no longer needed due to the order of instruction in the pipelined loop. The pipeliner was assuming, in this case, the the phi loop definition should be used to rewrite the uses. However, the pipeliner needs to check to make sure that the loop definition has already been scheduled. If not, then the phi initial value needs to be used instead. Patch by Brendon Cahoon. llvm-svn: 328545
*	[Pipeliner] Fix number of phis to generate in the epilog	Krzysztof Parzyszek	2018-03-26	1	-10/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pipeliner was generating too many phis in the epilog blocks, which caused incorrect code generation when rewriting an instruction that uses the phi. In this case, there 3 prolog and epilog stages. An existing phi was scheduled at stage 1. When generating the code for the 2nd epilog an extra new phi was generated. To fix this, we need to update the code that calculates the maximum number of phis that can be generated, which is based upon the current prolog stage and the stage of the original phi. In this case, when the prolog stage is 1 and the original phi stage is 1, the maximum number of phis to generate is 2. Patch by Brendon Cahoon. llvm-svn: 328543
*	[Pipeliner] Use latency to compute RecMII	Krzysztof Parzyszek	2018-03-26	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The patch contains severals changes needed to pipeline an example that was transformed so that a Phi with a subreg is converted to copies. The pipeliner wasn't working for a couple of reasons. - The RecMII was 3 instead of 2 due to the extra copies. - Copy instructions contained a latency of 1. - The node order algorithm was not choosing the best "bottom" node, which caused an instruction to be scheduled that had a predecessor and successor already scheduled. - Updated the Hexagon Machine Scheduler to check if the node is latency bound when adding the cost for a 0-latency dependence. The RecMII was 3 because the computation looks at the number of nodes in the recurrence. The extra copy is an extra node but it shouldn't increase the latency. The new RecMII computation looks at the latency of the instructions in the recurrence. We changed the latency of the dependence of a copy to 0. The latency computation for the copy also checks the use of the copy (similar to a reg_sequence). The node order algorithm was not choosing the last instruction in the recurrence for a bottom up traversal. This was when the last instruction is a copy. A check was added when choosing the instruction to check for NodeNum if the maxASAP is the same. This means that the scheduler will not end up with another node in the recurrence that has both a predecessor and successor already scheduled. The cost computation in Hexagon Machine Scheduler adds cost when an instruction can be packetized with a zero-latency instruction. We should only do this if the schedule is latency bound. Patch by Brendon Cahoon. llvm-svn: 328542
*	[Pipeliner] Fix assert caused by pipeliner serialization	Krzysztof Parzyszek	2018-03-26	1	-28/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pipeliner is asserting because the serialization step that occurs at the end is deleting an instruction. The assert occurs later on because there is a use without a definition. The problem occurs when an instruction defines a value used by a REQ_SEQUENCE and that value is used by a COPY instruction. The latencies between these instructions are zero, so they are put in to the same packet. The serialization code is unable to handle this correctly, and ends up putting the REG_SEQUENCE before its definition. There is special code in the serialization step that attempts to handle zero-cost instructions (phis, copy, reg_sequence) differently than regular instructions. Unfortunately, this means the order does not come out correct. This patch simplifies the code by changing the seperate steps for handling zero-cost and regular instructions. Only phis are handled separate now, since they should occurs first. Then, this patch adds checks to make use the MoveUse is set to the smallest value if there are multiple uses in a cycle. Patch by Brendon Cahoon. llvm-svn: 328540
*	[Pipeliner] Enable more base+offset dependence changes in pipeliner	Krzysztof Parzyszek	2018-03-26	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pipeliner changes dependences between base+offset instructions (loads and stores) so that the instructions have more flexibility to be scheduled with respect to each other. This occurs when the pipeliner is able to compute that the instructions will not alias if their order is changed. The prevous code enforced the alias property by checking if the base register is the same, and that the offset values are either both positive or negative. This patch improves the alias check by using the API areMemAccessesTriviallyDisjoint instead. This enables more cases, especially if the offset is a negative value. The pipeliner uses the function by creating a new instruction with the offset used in the next iteration. Patch by Brendon Cahoon. llvm-svn: 328538