bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Target/TargetInstrInfo.h -> CodeGen/TargetInstrInfo.h to match layering	David Blaikie	2017-11-08	1	-1/+1
\| \| \| \| \| \| \| \|	This header includes CodeGen headers, and is not, itself, included by any Target headers, so move it into CodeGen to match the layering of its implementation. llvm-svn: 317647
*	[SystemZ] Don't drop any operands in expandZExtPseudo()	Jonas Paulsson	2017-03-22	1	-4/+8
\| \| \| \| \| \| \| \|	Make sure that any operands, e.g. of an implicit def of a super reg is transferred to the new instruction. Review: Ulrich Weigand llvm-svn: 298484
*	Make TargetInstrInfo::isPredicable take a const reference, NFC	Krzysztof Parzyszek	2017-03-03	1	-1/+1
\| \| \| \|	llvm-svn: 296901
*	[SystemZ] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-01-24	1	-3/+15
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 292983
*	[SystemZ] Proper handling of undef flag while expanding pseudo.	Jonas Paulsson	2017-01-18	1	-1/+2
\| \| \| \| \| \| \| \|	During post-RA pseudo expansion, an 'undef' flag of the source operand should be propagated by emitGRX32Move(). Review: Ulrich Weigand llvm-svn: 292353
*	[SystemZ] Support load-and-trap instructions	Ulrich Weigand	2016-11-28	1	-0/+4
\| \| \| \| \| \| \|	This adds support for the instructions provided with the load-and-trap facility. llvm-svn: 288030
*	[SystemZ] Improve use of conditional instructions	Ulrich Weigand	2016-11-28	1	-1/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch moves formation of LOC-type instructions from (late) IfConversion to the early if-conversion pass, and in some cases additionally creates them directly from select instructions during DAG instruction selection. To make early if-conversion work, the patch implements the canInsertSelect / insertSelect callbacks. It also implements the commuteInstructionImpl and FoldImmediate callbacks to enable generation of the full range of LOC instructions. Finally, the patch adds support for all instructions of the load-store-on-condition-2 facility, which allows using LOC instructions also for high registers. Due to the use of the GRX32 register class to enable high registers, we now also have to handle the cases where there are still no single hardware instructions (conditional move from a low register to a high register or vice versa). These are converted back to a branch sequence after register allocation. Since the expandRAPseudos callback is not allowed to create new basic blocks, this requires a simple new pass, modelled after the ARM/AArch64 ExpandPseudos pass. Overall, this patch causes significantly more LOC-type instructions to be used, and results in a measurable performance improvement. llvm-svn: 288028
*	[SystemZ] Post-RA scheduler implementation	Jonas Paulsson	2016-10-20	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Post-RA sched strategy and scheduling instruction annotations for z196, zEC12 and z13. This scheduler optimizes decoder grouping and balances processor resources (including side steering the FPd unit instructions). The SystemZHazardRecognizer keeps track of the scheduling state, which can be dumped with -debug-only=misched. Reviers: Ulrich Weigand, Andrew Trick. https://reviews.llvm.org/D17260 llvm-svn: 284704
*	Finish renaming remaining analyzeBranch functions	Matt Arsenault	2016-09-14	1	-2/+2
\| \| \| \|	llvm-svn: 281535
*	Make analyzeBranch family of instruction names consistent	Matt Arsenault	2016-09-14	1	-1/+1
\| \| \| \| \| \| \|	analyzeBranch was renamed to use lowercase first, rename the related set to match. llvm-svn: 281506
*	AArch64: Use TTI branch functions in branch relaxation	Matt Arsenault	2016-09-14	1	-2/+4
\| \| \| \| \| \| \| \| \|	The main change is to return the code size from InsertBranch/RemoveBranch. Patch mostly by Tim Northover llvm-svn: 281505
*	TargetInstrInfo: add virtual function getInstSizeInBytes	Sjoerd Meijer	2016-07-29	1	-1/+1
\| \| \| \| \| \| \| \| \|	This adds a target hook getInstSizeInBytes to TargetInstrInfo that a lot of subclasses already implement. Differential Revision: https://reviews.llvm.org/D22885 llvm-svn: 277126
*	Rename AnalyzeBranch* to analyzeBranch*.	Jacques Pienaar	2016-07-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: NFC. Rename AnalyzeBranch/AnalyzeBranchPredicate to analyzeBranch/analyzeBranchPredicate to follow LLVM coding style and be consistent with TargetInstrInfo's analyzeCompare and analyzeSelect. Reviewers: tstellarAMD, mcrosier Subscribers: mcrosier, jholewinski, jfb, arsenm, dschuff, jyknight, dsanders, nemanjai Differential Revision: https://reviews.llvm.org/D22409 llvm-svn: 275564
*	CodeGen: Use MachineInstr& in TargetInstrInfo, NFC	Duncan P. N. Exon Smith	2016-06-30	1	-24/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is mostly a mechanical change to make TargetInstrInfo API take MachineInstr& (instead of MachineInstr* or MachineBasicBlock::iterator) when the argument is expected to be a valid MachineInstr. This is a general API improvement. Although it would be possible to do this one function at a time, that would demand a quadratic amount of churn since many of these functions call each other. Instead I've done everything as a block and just updated what was necessary. This is mostly mechanical fixes: adding and removing `` and `&` operators. The only non-mechanical change is to split ARMBaseInstrInfo::getOperandLatencyImpl out from ARMBaseInstrInfo::getOperandLatency. Previously, the latter took a `MachineInstr` which it updated to the instruction bundle leader; now, the latter calls the former either with the same `MachineInstr&` or the bundle leader. As a side effect, this removes a bunch of MachineInstr* to MachineBasicBlock::iterator implicit conversions, a necessary step toward fixing PR26753. Note: I updated WebAssembly, Lanai, and AVR (despite being off-by-default) since it turned out to be easy. I couldn't run tests for AVR since llc doesn't link with it turned on. llvm-svn: 274189
*	Pass DebugLoc and SDLoc by const ref.	Benjamin Kramer	2016-06-12	1	-3/+3
\| \| \| \| \| \| \| \|	This used to be free, copying and moving DebugLocs became expensive after the metadata rewrite. Passing by reference eliminates a ton of track/untrack operations. No functionality change intended. llvm-svn: 272512
*	[SystemZ] Support Compare and Traps	Zhan Jun Liau	2016-06-10	1	-9/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Support and generate Compare and Traps like CRT, CIT, etc. Support Trap as legal DAG opcodes and generate "j .+2" for them by default. Add support for Conditional Traps and use the If Converter to convert them into the corresponding compare and trap opcodes. Differential Revision: http://reviews.llvm.org/D21155 llvm-svn: 272419
*	[foldMemoryOperand()] Pass LiveIntervals to enable liveness check.	Jonas Paulsson	2016-05-10	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SystemZ (and probably other targets as well) can fold a memory operand by changing the opcode into a new instruction that as a side-effect also clobbers the CC-reg. In order to do this, liveness of that reg must first be checked. When LIS is passed, getRegUnit() can be called on it and the right LiveRange is computed on demand. Reviewed by Matthias Braun. http://reviews.llvm.org/D19861 llvm-svn: 269026
*	[SystemZ] [SSP] Add support for LOAD_STACK_GUARD.	Marcin Koscielnicki	2016-04-24	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes PR22248 on s390x. The previous attempt at this was D19101, which was before LOAD_STACK_GUARD existed. Compared to the previous version, this always emits a rather ugly block of 4 instructions, involving a thread pointer load that can't be shared with other potential users. However, this is necessary for SSP - spilling the guard value (or thread pointer used to load it) is counter to the goal, since it could be overwritten along with the frame it protects. Differential Revision: http://reviews.llvm.org/D19363 llvm-svn: 267340
*	[SystemZ] Support conditional indirect sibling calls via BCR	Ulrich Weigand	2016-04-11	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a conditional variant of CallBR instruction, CallBCR. Also, it can be fused with integer comparisons, resulting in one of the new C*BCall instructions. In addition to CallBRCL limitations, this has another one: it won't trigger if the function to call isn't already in %r1 - see f22 in the test for an example (it's also why the loads in tests are volatile). Author: koriakin Differential Revision: http://reviews.llvm.org/D18928 llvm-svn: 265933
*	[SystemZ] Implement conditional returns	Ulrich Weigand	2016-04-07	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Return is now considered a predicable instruction, and is converted to a newly-added CondReturn (which maps to BCR to %r14) instruction by the if conversion pass. Also, fused compare-and-branch transform knows about conditional returns, emitting the proper fused instructions for them. This transform triggers on a lot of tests, hence the huge diffstat. The changes are mostly jX to br %r14 -> bXr %r14. Author: koriakin Differential Revision: http://reviews.llvm.org/D17339 llvm-svn: 265689
*	CodeGen: TII: Take MachineInstr& in predicate API, NFC	Duncan P. N. Exon Smith	2016-02-23	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Change TargetInstrInfo API to take `MachineInstr&` instead of `MachineInstr*` in the functions related to predicated instructions (I'll try to come back later and get some of the rest). All of these functions require non-null parameters already, so references are more clear. As a bonus, this happens to factor away a host of implicit iterator => pointer conversions. No functionality change intended. llvm-svn: 261605
*	Pass BranchProbability/BlockMass by value instead of const& as they are ↵	Cong Hou	2015-09-10	1	-2/+2
\| \| \| \| \| \|	small. NFC. llvm-svn: 247357
*	[CodeGen] ArrayRef'ize cond/pred in various TII APIs. NFC.	Ahmed Bougacha	2015-06-11	1	-4/+2
\| \| \| \|	llvm-svn: 239553
*	[InstrInfo] Refactor foldOperandImpl to thread through InsertPt. NFC	Keno Fischer	2015-06-08	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This was a longstanding FIXME and is a necessary precursor to cases where foldOperandImpl may have to create more than one instruction (e.g. to constrain a register class). This is the split out NFC changes from D6262. Reviewers: pete, ributzka, uweigand, mcrosier Reviewed By: mcrosier Subscribers: mcrosier, ted, llvm-commits Differential Revision: http://reviews.llvm.org/D10174 llvm-svn: 239336
*	ArrayRefize memory operand folding. NFC.	Benjamin Kramer	2015-02-28	1	-4/+4
\| \| \| \|	llvm-svn: 230846
*	[SystemZ] Support all TLS access models - CodeGen part	Ulrich Weigand	2015-02-18	1	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current SystemZ back-end only supports the local-exec TLS access model. This patch adds all required CodeGen support for the other TLS models, which means in particular: - Expand initial-exec TLS accesses by loading TLS offsets from the GOT using @indntpoff relocations. - Expand general-dynamic and local-dynamic accesses by generating the appropriate calls to __tls_get_offset. Note that this routine has a non-standard ABI and requires loading the GOT pointer into %r12, so the patch also adds support for the GLOBAL_OFFSET_TABLE ISD node. - Add a new platform-specific optimization pass to remove redundant __tls_get_offset calls in the local-dynamic model (modeled after the corresponding X86 pass). - Add test cases verifying all access models and optimizations. llvm-svn: 229654
*	Canonicalize header guards into a common format.	Benjamin Kramer	2014-08-13	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	Add header guards to files that were missing guards. Remove #endif comments as they don't seem common in LLVM (we can easily add them back if we decide they're useful) Changes made by clang-tidy with minor tweaks. llvm-svn: 215558
*	Remove target machine caching from SystemZInstrInfo and	Eric Christopher	2014-06-27	1	-2/+3
\| \| \| \| \| \| \| \|	SystemZRegisterInfo and replace it with the subtarget as that's all they needed in the first place. Update all uses and calls accordingly. llvm-svn: 211877
*	[C++] Use 'nullptr'.	Craig Topper	2014-04-28	1	-1/+1
\| \| \| \|	llvm-svn: 207394
*	[SystemZ] Remove "virtual" from override methods	Richard Sandiford	2014-03-06	1	-64/+52
\| \| \| \| \| \| \|	Also fix a couple of cases where "override" was missing. No behavioural change intended. llvm-svn: 203110
*	[SystemZ] Update namespace formatting to match current guidelines	Richard Sandiford	2014-03-06	1	-82/+82
\| \| \| \| \| \|	No functional change intended. llvm-svn: 203103
*	Switch all uses of LLVM_OVERRIDE to just use 'override' directly.	Craig Topper	2014-03-02	1	-30/+27
\| \| \| \|	llvm-svn: 202621
*	[weak vtables] Remove a bunch of weak vtables	Juergen Ributzka	2013-11-19	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. The memory leaks in this version have been fixed. Thanks Alexey for pointing them out. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 195064
*	Revert r194865 and r194874.	Alexey Samsonov	2013-11-18	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \|	This change is incorrect. If you delete virtual destructor of both a base class and a subclass, then the following code: Base *foo = new Child(); delete foo; will not cause the destructor for members of Child class. As a result, I observe plently of memory leaks. Notable examples I investigated are: ObjectBuffer and ObjectBufferStream, AttributeImpl and StringSAttributeImpl. llvm-svn: 194997
*	[weak vtables] Remove a bunch of weak vtables	Juergen Ributzka	2013-11-15	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 194865
*	[SystemZ] Add immediate addition involving high words	Richard Sandiford	2013-10-01	1	-0/+2
\| \| \| \|	llvm-svn: 191774
*	[SystemZ] Add patterns to load a constant into a high word (IIHF)	Richard Sandiford	2013-10-01	1	-0/+2
\| \| \| \| \| \| \|	Similar to low words, we can use the shorter LLIHL and LLIHH if it turns out that the other half of the GR64 isn't live. llvm-svn: 191750
*	[SystemZ] Add register zero extensions involving at least one high word	Richard Sandiford	2013-10-01	1	-0/+2
\| \| \| \|	llvm-svn: 191746
*	[SystemZ] Use upper words of GR64s for codegen	Richard Sandiford	2013-10-01	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This just adds the basics necessary for allocating the upper words to virtual registers (move, load and store). The move support is parameterised in a way that makes it easy to handle zero extensions, but the associated zero-extend patterns are added by a later patch. The easiest way of testing this seemed to be add a new "h" register constraint for high words. I don't expect the constraint to be useful in real inline asms, but it should work, so I didn't try to hide it behind an option. llvm-svn: 191739
*	[SystemZ] Add unsigned compare-and-branch instructions	Richard Sandiford	2013-09-18	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For some reason I never got around to adding these at the same time as the signed versions. No idea why. I'm not sure whether this SystemZII::BranchC* stuff is useful, or whether it should just be replaced with an "is normal" flag. I'll leave that for later though. There are some boundary conditions that can be tweaked, such as preferring unsigned comparisons for equality with [128, 256), and "<= 255" over "< 256", but again I'll leave those for a separate patch. llvm-svn: 190930
*	[SystemZ] Use CLC and IPM to implement memcmp	Richard Sandiford	2013-08-12	1	-0/+6
\| \| \| \| \| \| \|	For now this is restricted to fixed-length comparisons with a length in the range [1, 256], as for memcpy() and MVC. llvm-svn: 188163
*	[SystemZ] Optimize floating-point comparisons with zero	Richard Sandiford	2013-08-07	1	-14/+17
\| \| \| \| \| \| \| \| \|	This follows the same lines as the integer code. In the end it seemed easier to have a second 4-bit mask in TSFlags to specify the compare-like CC values. That eats one more TSFlags bit than adding a CCHasUnordered would have done, but it feels more concise. llvm-svn: 187883
*	[SystemZ] Use BRCT and BRCTG to eliminate add-&-compare sequences	Richard Sandiford	2013-08-05	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch just uses a peephole test for "add; compare; branch" sequences within a single block. The IR optimizers already convert loops to decrement-and-branch-on-nonzero form in some cases, so even this simplistic test triggers many times during a clang bootstrap and projects/test-suite run. It looks like there are still cases where we need to more strongly prefer branches on nonzero though. E.g. I saw a case where a loop that started out with a check for 0 ended up with a check for -1. I'll try to look at that sometime. I ended up adding the Reference class because MachineInstr::readsRegister() doesn't check for subregisters (by design, as far as I could tell). llvm-svn: 187723
*	[SystemZ] Use LOAD AND TEST to eliminate comparisons against zero	Richard Sandiford	2013-08-05	1	-0/+4
\| \| \| \|	llvm-svn: 187720
*	[SystemZ] Reuse CC results for integer comparisons with zero	Richard Sandiford	2013-08-01	1	-7/+17
\| \| \| \| \| \| \| \| \| \|	This also fixes a bug in the predication of LR to LOCR: I'd forgotten that with these in-place instruction builds, the implicit operands need to be added manually. I think this was latent until now, but is tested by int-cmp-45.c. It also adds a CC valid mask to STOC, again tested by int-cmp-45.c. llvm-svn: 187573
*	[SystemZ] Be more careful about inverting CC masks	Richard Sandiford	2013-07-31	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	System z branches have a mask to select which of the 4 CC values should cause the branch to be taken. We can invert a branch by inverting the mask. However, not all instructions can produce all 4 CC values, so inverting the branch like this can lead to some oddities. For example, integer comparisons only produce a CC of 0 (equal), 1 (less) or 2 (greater). If an integer EQ is reversed to NE before instruction selection, the branch will test for 1 or 2. If instead the branch is reversed after instruction selection (by inverting the mask), it will test for 1, 2 or 3. Both are correct, but the second isn't really canonical. This patch therefore keeps track of which CC values are possible and uses this when inverting a mask. Although this is mostly cosmestic, it fixes undefined behavior for the CIJNLH in branch-08.ll. Another fix would have been to mask out bit 0 when generating the fused compare and branch, but the point of this patch is that we shouldn't need to do that in the first place. The patch also makes it easier to reuse CC results from other instructions. llvm-svn: 187495
*	[SystemZ] Move compare-and-branch generation even later	Richard Sandiford	2013-07-31	1	-8/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r187116 moved compare-and-branch generation from the instruction-selection pass to the peephole optimizer (via optimizeCompare). It turns out that even this is a bit too early. Fused compare-and-branch instructions don't interact well with predication, where a CC result is needed. They also make it harder to reuse the CC side-effects of earlier instructions (not yet implemented, but the subject of a later patch). Another problem was that the AnalyzeBranch family of routines weren't handling compares and branches, so we weren't able to reverse the fused form in cases where we would reverse a separate branch. This could have been fixed by extending AnalyzeBranch, but given the other problems, I've instead moved the fusing to the long-branch pass, which is also responsible for the opposite transformation: splitting out-of-range compares and branches into separate compares and long branches. I've added a test for the AnalyzeBranch problem. A test for the predication problem is included in the next patch, which fixes a bug in the choice of CC mask. llvm-svn: 187494
*	[SystemZ] Postpone NI->RISBG conversion to convertToThreeAddress()	Richard Sandiford	2013-07-31	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r186399 aggressively used the RISBG instruction for immediate ANDs, both because it can handle some values that AND IMMEDIATE can't, and because it allows the destination register to be different from the source. I realized later while implementing the distinct-ops support that it would be better to leave the choice up to convertToThreeAddress() instead. The AND IMMEDIATE form is shorter and is less likely to be cracked. This is a problem for 32-bit ANDs because we assume that all 32-bit operations will leave the high word untouched, whereas RISBG used in this way will either clear the high word or copy it from the source register. The patch uses the z196 instruction RISBLG for this instead. This means that z10 will be restricted to NILL, NILH and NILF for 32-bit ANDs, but I think that should be OK for now. Although we're using z10 as the base architecture, the optimization work is going to be focused more on z196 and zEC12. llvm-svn: 187492
*	[SystemZ] Rework compare and branch support	Richard Sandiford	2013-07-25	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before the patch we took advantage of the fact that the compare and branch are glued together in the selection DAG and fused them together (where possible) while emitting them. This seemed to work well in practice. However, fusing the compare so early makes it harder to remove redundant compares in cases where CC already has a suitable value. This patch therefore uses the peephole analyzeCompare/optimizeCompareInstr pair of functions instead. No behavioral change intended, but it paves the way for a later patch. llvm-svn: 187116
*	[SystemZ] Add LOCR and LOCGR	Richard Sandiford	2013-07-25	1	-0/+17
\| \| \| \|	llvm-svn: 187113