bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[opaque pointer types] Remove some calls to generic Type subtype accessors.	James Y Knight	2019-01-10	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \|	That is, remove many of the calls to Type::getNumContainedTypes(), Type::subtypes(), and Type::getContainedType(N). I'm not intending to remove these accessors -- they are useful/necessary in some cases. However, removing the pointee type from pointers would potentially break some uses, and reducing the number of calls makes it easier to audit. llvm-svn: 350835
*	[CodeGen] Ignore return sext/zext attributes of unused results for tail calls	Francis Visoiu Mistrih	2019-01-09	1	-0/+15
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the caller's return type does not have a zeroext attribute but the callee does a tail call zeroext, we won't consider the tail call during CodeGenPrepare because the attributes don't match. However, if the result of the tail call has no uses, it makes sense to drop the sext/zext attributes. Differential Revision: https://reviews.llvm.org/D56486 llvm-svn: 350753
*	[DebugInfo] Omit location list entries with empty ranges	David Stenberg	2019-01-09	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes PR39710. In that case we emitted a location list looking like this: .Ldebug_loc0: .quad .Lfunc_begin0-.Lfunc_begin0 .quad .Lfunc_begin0-.Lfunc_begin0 .short 1 # Loc expr size .byte 85 # DW_OP_reg5 .quad .Lfunc_begin0-.Lfunc_begin0 .quad .Lfunc_end0-.Lfunc_begin0 .short 1 # Loc expr size .byte 85 # super-register DW_OP_reg5 .quad 0 .quad 0 As seen, the first entry's beginning and ending addresses evalute to 0, which meant that the entry inadvertently became an "end of list" entry, resulting in the location list ending sooner than expected. To fix this, omit all entries with empty ranges. Location list entries with empty ranges do not have any effect, as specified by DWARF, so we might as well drop them: "A location list entry (but not a base address selection or end of list entry) whose beginning and ending addresses are equal has no effect because the size of the range covered by such an entry is zero." Reviewers: davide, aprantl, dblaikie Reviewed By: aprantl Subscribers: javed.absar, JDevlieghere, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D55919 llvm-svn: 350698
*	GlobalISel: Implement fewerElements for implicit_def	Matt Arsenault	2019-01-09	1	-0/+26
\| \| \| \|	llvm-svn: 350697
*	GlobalISel: Implement widenScalar for implicit_def	Matt Arsenault	2019-01-09	1	-0/+6
\| \| \| \|	llvm-svn: 350695
*	[NFC] fix trivial typos in comments	Hiroshi Inoue	2019-01-09	7	-10/+10
\| \| \| \|	llvm-svn: 350690
*	Remove check for single use in ShrinkDemandedConstant	Stanislav Mekhanoshin	2019-01-09	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This removes check for single use from general ShrinkDemandedConstant to the BE because of the AArch64 regression after D56289/rL350475. After several hours of experiments I did not come up with a testcase failing on any other targets if check is not performed. Moreover, direct call to ShrinkDemandedConstant is not really needed and superceed by SimplifyDemandedBits. Differential Revision: https://reviews.llvm.org/D56406 llvm-svn: 350684
*	RegisterCoalescer: Assume CR_Replace for SubRangeJoin	Matt Arsenault	2019-01-08	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently it's possible for following check on V.WriteLanes (which is not really meaningful during SubRangeJoin) to pass for one half of the pair, and then fall through to to one of the impossible or unresolved states. This then fails as inconsistent on the other half. During the main range join, the check between V.WriteLanes and OtherV.ValidLanes must have passed, meaning this should be a CR_Replace. Fixes most of the testcases in bugs 39542 and 39602 llvm-svn: 350678
*	RegisterCoalescer: Defer clearing implicit_def lanes	Matt Arsenault	2019-01-08	1	-16/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can't go back and recover the lanes if it turns out the implicit_def really can't be erased. Assume all lanes are valid if an unresolved conflict is encountered. There aren't any tests where this seems to matter either way, but this seems like a safer option. Fixes bug 39602 llvm-svn: 350676
*	Revert "Revert "Revert "Resubmit rL345008 "Split MachinePipeliner code into ↵	Adrian Prantl	2019-01-08	1	-5/+595
\| \| \| \| \| \| \| \|	header and cpp files"""" This reverts commit D56084. llvm-svn: 350654
*	Rename DIFlagFixedEnum to DIFlagEnumClass. NFC	Paul Robinson	2019-01-08	1	-1/+1
\| \| \| \|	llvm-svn: 350641
*	[MachineVerifier] Include offending register in allocatable live-in error msg.	Florian Hahn	2019-01-08	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a convenience report() method for physical registers and uses it to print the offending register with the 'MBB has allocatable live-in' error. Reviewers: MatzeB, rtereshin, dsanders Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D55946 llvm-svn: 350630
*	[GlobalISel] Fix choice of instruction selector for AArch64 at -O0 with ↵	Petr Pavlu	2019-01-08	1	-12/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	-global-isel=0 Commit rL347861 introduced an unintentional change in the behaviour when compiling for AArch64 at -O0 with -global-isel=0. Previously, explicitly disabling GlobalISel resulted in using FastISel but an updated condition in the commit changed it to using SelectionDAG. The patch fixes this condition and slightly better organizes the code that chooses the instruction selector. Fixes PR40131. Differential Revision: https://reviews.llvm.org/D56266 llvm-svn: 350626
*	Revert "Revert "Resubmit rL345008 "Split MachinePipeliner code into header ↵	Lama Saba	2019-01-08	1	-595/+5
\| \| \| \| \| \| \| \| \| \|	and cpp files""" This reverts commit rL350497 reported remaining issues seem to be unrelated to modules or this change. more info: https://reviews.llvm.org/D56084 llvm-svn: 350621
*	[GlobalISel] Fix unused variable warning in Release builds.	Benjamin Kramer	2019-01-08	1	-3/+3
\| \| \| \|	llvm-svn: 350618
*	Fix typos	Matt Arsenault	2019-01-08	1	-2/+2
\| \| \| \|	llvm-svn: 350597
*	RegBankSelect: Fix copy insertion point for terminators	Matt Arsenault	2019-01-08	1	-9/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a copy was needed to handle the condition of brcond, it was being inserted before the defining instruction. Add tests for iterator edge cases. I find the existing code here suspect for the case where it's looking for terminators that modify the register. It's going to insert a copy in the middle of the terminators, which isn't allowed (it might be necessary to have a COPY_terminator if anybody actually needs this). Also legalize brcond for AMDGPU. llvm-svn: 350595
*	[RegisterCoalescer] dst register's live interval needs to be updated when	Wei Mi	2019-01-08	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	merging a src register in ToBeUpdated set. This is to fix PR40061 related with https://reviews.llvm.org/rL339035. In https://reviews.llvm.org/rL339035, live interval of source pseudo register in rematerialized copy may be saved in ToBeUpdated set and its update may be postponed. In PR40061, %t2 = %t1 is rematerialized and %t1 is added into toBeUpdated set to postpone its live interval update. After the rematerialization, the live interval of %t1 is larger than necessary. Then %t1 is merged into %t3 and %t1 gets removed. After the merge, %t3 contains live interval larger than necessary. Because %t3 is not in toBeUpdated set, its live interval is not updated after register coalescing and it will break some assumption in regalloc. The patch requires the live interval of destination register in a merge to be updated if the source register is in ToBeUpdated. Differential revision: https://reviews.llvm.org/D55867 llvm-svn: 350586
*	[TargetLowering][AMDGPU] Remove the SimplifyDemandedBits function that takes ↵	Craig Topper	2019-01-07	1	-50/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	a User and OpIdx. Stop using it in AMDGPU target for simplifyI24. As we saw in D56057 when we tried to use this function on X86, it's unsafe. It allows the operand node to have multiple users, but doesn't prevent recursing past the first node when it does have multiple users. This can cause other simplifications earlier in the graph without regard to what bits are needed by the other users of the first node. Ideally all we should do to the first node if it has multiple uses is bypass it when its not needed by the user we started from. Doing any other transformation that SimplifyDemandedBits can do like turning ZEXT/SEXT into AEXT would result in an increase in instructions. Fortunately, we already have a function that can do just that, GetDemandedBits. It will only make transformations that involve bypassing a node. This patch changes AMDGPU's simplifyI24, to use a combination of GetDemandedBits to handle the multiple use simplifications. And then uses the regular SimplifyDemandedBits on each operand to handle simplifications allowed when the operand only has a single use. Unfortunately, GetDemandedBits simplifies constants more aggressively than SimplifyDemandedBits. This caused the -7 constant in the changed test to be simplified to remove the upper bits. I had to modify computeKnownBits to account for this by ignoring the upper 8 bits of the input. Differential Revision: https://reviews.llvm.org/D56087 llvm-svn: 350560
*	Revert "Resubmit rL345008 "Split MachinePipeliner code into header and cpp ↵	Lama Saba	2019-01-06	1	-5/+595
\| \| \| \| \| \| \| \| \|	files"" This reverts commit rL350493 issues related to modules still appear in http://green.lab.llvm.org/green/job/lldb-cmake llvm-svn: 350497
*	Resubmit rL345008 "Split MachinePipeliner code into header and cpp files"	Lama Saba	2019-01-06	1	-595/+5
\| \| \| \| \| \| \| \| \|	Resubmitted in rL345290 and reverted in rL350345 due to failures in http://green.lab.llvm.org/green/job/lldb-cmake/ Resubmitting after a workaround to lldb-cmake failure was committed in rL350346, more info in https://reviews.llvm.org/D56084 llvm-svn: 350493
*	[LegalizeVectorOps] Add FSHL/FSHR to the list of vector operations that ↵	Craig Topper	2019-01-06	1	-0/+2
\| \| \| \| \| \| \| \|	should be handled. The FSHL/FSHR nodes are handled in the expand function, but they need to also be listed in the code that queries for the operation action too. llvm-svn: 350490
*	Added single use check to ShrinkDemandedConstant	Stanislav Mekhanoshin	2019-01-05	1	-0/+3
\| \| \| \| \| \| \| \| \|	Fixes cvt_f32_ubyte combine. performCvtF32UByteNCombine() could shrink source node to demanded bits only even if there are other uses. Differential Revision: https://reviews.llvm.org/D56289 llvm-svn: 350475
*	[X86] Add INSERT_SUBVECTOR to ComputeNumSignBits	Craig Topper	2019-01-04	1	-1/+35
\| \| \| \| \| \| \| \| \| \|	This adds support for calculating sign bits of insert_subvector. I based it on the computeKnownBits. My motivating case is propagating sign bits information across basic blocks on AVX targets where concatenating using insert_subvector is common. Differential Revision: https://reviews.llvm.org/D56283 llvm-svn: 350432
*	[DAGCombiner][x86] scalarize binop followed by extractelement	Sanjay Patel	2019-01-03	1	-5/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As noted in PR39973 and D55558: https://bugs.llvm.org/show_bug.cgi?id=39973 ...this is a partial implementation of a fold that we do as an IR canonicalization in instcombine: // extelt (binop X, Y), Index --> binop (extelt X, Index), (extelt Y, Index) We want to have this in the DAG too because as we can see in some of the test diffs (reductions), the pattern may not be visible in IR. Given that this is already an IR canonicalization, any backend that would prefer a vector op over a scalar op is expected to already have the reverse transform in DAG lowering (not sure if that's a realistic expectation though). The transform is limited with a TLI hook because there's an existing transform in CodeGenPrepare that tries to do the opposite transform. Differential Revision: https://reviews.llvm.org/D55722 llvm-svn: 350354
*	Revert "Resubmit rL345008 "Split MachinePipeliner code into header and cpp ↵	Stefan Granitz	2019-01-03	1	-5/+595
\| \| \| \| \| \| \| \|	files"" This reverts commit r350290. llvm-svn: 350345
*	Resubmit rL345008 "Split MachinePipeliner code into header and cpp files"	Lama Saba	2019-01-03	1	-595/+5
\| \| \| \| \| \| \| \| \|	The commit caused unclear failures in http://green.lab.llvm.org/green//job/lldb-cmake/ will revert if the error reappears Differential Revision: https://reviews.llvm.org/D56084 llvm-svn: 350290
*	[CodeGen] Skip over dbg-instr in twoaddr pass	Markus Lavin	2019-01-03	1	-3/+6
\| \| \| \| \| \| \| \| \| \|	A DBG_VALUE between a two-address instruction and a following COPY would prevent rescheduleMIBelowKill optimization inside TwoAddressInstructionPass. Differential Revision: https://reviews.llvm.org/D55987 llvm-svn: 350289
*	[DAGCombiner] After performing the division by constant optimization for a ↵	Craig Topper	2019-01-02	1	-2/+29
\| \| \| \| \| \| \| \| \| \| \| \|	DIV or REM node, replace the users of the corresponding REM or DIV node if it exists. Currently we expand the two nodes separately. This gives DAG combiner an opportunity to optimize the expanded sequence taking into account only one set of users. When we expand the other node we'll create the expansion again, but might not be able to optimize it the same way. So the nodes won't CSE and we'll have two similarish sequences in the same basic block. By expanding both nodes at the same time we'll avoid prematurely optimizing the expansion until both the division and remainder have been replaced. Improves the test case from PR38217. There may be additional opportunities after this. Differential Revision: https://reviews.llvm.org/D56145 llvm-svn: 350239
*	[LegalizeIntegerTypes] When promoting the result of an extract_vector_elt ↵	Craig Topper	2019-01-02	1	-2/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	also promote the input type if necessary By also promoting the input type we get a better idea for what scalar type to use. This can provide better results if the result of the extract is sign extended. What was previously happening is that the extract result would be legalized, sometime later the input of the sign extend would be legalized using the result of the extract. Then later the extract input would be legalized forcing a truncate into the input of the sign extend using a replace all uses. This requires DAG combine to combine out the sext/truncate pair. But sometimes we visited the truncate first and messed things up before the sext could be combined. By creating the extract with the correct scalar type when we create legalize the result type, the truncate will be added right away. Then when the sign_extend input is legalized it will create an any_extend of the truncate which can be optimized by getNode to maybe remove the truncate. And then a sign_extend_inreg. Now DAG combine doesn't have to worry about getting rid of the extend. This fixes the regression on X86 in D56156. Differential Revision: https://reviews.llvm.org/D56176 llvm-svn: 350236
*	[DAGCombiner][X86][PowerPC] Teach visitSIGN_EXTEND_INREG to fold ↵	Craig Topper	2019-01-02	1	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \|	(sext_in_reg (aext/sext x)) -> (sext x) when x has more than 1 sign bit and the sext_inreg is from one of them. If x has multiple sign bits than it doesn't matter which one we extend from so we can sext from x's msb instead. The X86 setcc-combine.ll changes are a little weird. It appears we ended up with a (sext_inreg (aext (trunc (extractelt)))) after type legalization. The sext_inreg+aext now gets optimized by this combine to leave (sext (trunc (extractelt))). Then we visit the trunc before we visit the sext. This ends up changing the truncate to an extractvectorelt from a bitcasted vector. I have a follow up patch to fix this. Differential Revision: https://reviews.llvm.org/D56156 llvm-svn: 350235
*	Reversing the commit in revision 350186. Revision causes regression in 4	Ayonam Ray	2019-01-01	2	-33/+53
\| \| \| \| \| \|	tests. llvm-svn: 350187
*	Omit range checks from jump tables when lowering switches with unreachable	Ayonam Ray	2019-01-01	2	-53/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	default During the lowering of a switch that would result in the generation of a jump table, a range check is performed before indexing into the jump table, for the switch value being outside the jump table range and a conditional branch is inserted to jump to the default block. In case the default block is unreachable, this conditional jump can be omitted. This patch implements omitting this conditional branch for unreachable defaults. Review Reference: D52002 llvm-svn: 350186
*	[SelectionDAG] Add SIGN_EXTEND_VECTOR_INREG support to computeKnownBits.	Craig Topper	2018-12-31	1	-1/+9
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D56168 llvm-svn: 350179
*	[DAGCombiner] Add missing one use check on the shuffle in the ↵	Craig Topper	2018-12-31	1	-1/+1
\| \| \| \| \| \| \| \|	bitcast(shuffle(bitcast(s0),bitcast(s1))) -> shuffle(s0,s1) transform. Found while trying out some other changes so I don't really have a test case. llvm-svn: 350172
*	[PowerPC] Fix ADDE, SUBE do not know how to promote operator	Kang Zhang	2018-12-30	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch is created to fix the Bugzilla bug 39815: https://bugs.llvm.org/show_bug.cgi?id=39815 This patch is to support promotion integer result for the instruction ADDE, SUBE. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D56119 llvm-svn: 350161
*	Add vtable anchor to classes.	Richard Trieu	2018-12-29	2	-0/+6
\| \| \| \|	llvm-svn: 350142
*	[codeview] Check if this 'this' type of a method is a pointer	Reid Kleckner	2018-12-26	2	-8/+19
\| \| \| \| \| \| \| \| \|	Fixes crash reported after r347354 for frontends that don't always emit 'this' pointers for methods. Now we will silently produce debug info that makes functions like this look like static methods, which seems reasonable. llvm-svn: 350073
*	[NVPTX] Allow libcalls that are defined in the current module.	Justin Lebar	2018-12-26	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The patch adds a possibility to make library calls on NVPTX. An important thing about library functions - they must be defined within the current module. This basically should guarantee that we produce a valid PTX assembly (without calls to not defined functions). The one who wants to use the libcalls is probably will have to link against compiler-rt or any other implementation. Currently, it's completely impossible to make library calls because of error LLVM ERROR: Cannot select: i32 = ExternalSymbol '...'. But we can lower ExternalSymbol to TargetExternalSymbol and verify if the function definition is available. Also, there was an issue with a DAG during legalisation. When we expand instruction into libcall, the inner call-chain isn't being "integrated" into outer chain. Since the last "data-flow" (call retval load) node is located in call-chain earlier than CALLSEQ_END node, the latter becomes a leaf and therefore a dead node (and is being removed quite fast). Proposed here solution relies on another data-flow pseudo nodes (ProxyReg) which purpose is only to keep CALLSEQ_END at legalisation and instruction selection phases - we remove the pseudo instructions before register scheduling phase. Patch by Denys Zariaiev! Differential Revision: https://reviews.llvm.org/D34708 llvm-svn: 350069
*	[MIPS GlobalISel] Select G_SELECT	Petar Avramovic	2018-12-25	1	-8/+11
\| \| \| \| \| \| \| \| \| \|	Add widen scalar for type index 1 (i1 condition) for G_SELECT. Select G_SELECT for pointer, s32(integer) and smaller low level types on MIPS32. Differential Revision: https://reviews.llvm.org/D56001 llvm-svn: 350063
*	[X86] Use GetDemandedBits to simplify the operands of PMULDQ/PMULUDQ.	Craig Topper	2018-12-24	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an alternative to what I attempted in D56057. GetDemandedBits is a special version of SimplifyDemandedBits that allows simplifications even when the operand has other uses. GetDemandedBits will only do simplifications that allow a node to be bypassed. It won't create new nodes or alter any of the other users. I had to add support for bypassing SIGN_EXTEND_INREG to GetDemandedBits. Based on a patch that Simon Pilgrim sent me in email. Fixes PR40142. llvm-svn: 350059
*	Revert rL350048 and rL350050	Max Kazantsev	2018-12-24	2	-16/+9
\| \| \| \| \| \| \|	These patches have broken almost all buildbots on test DebugInfo/X86/addr_comments.ll. Reverting to green. llvm-svn: 350052
*	Fix build - follow-up to r350048 which broke headerless (v4) address pool	David Blaikie	2018-12-24	2	-8/+10
\| \| \| \|	llvm-svn: 350050
*	DebugInfo: Use assembly label arithmetic for address pool size for easier ↵	David Blaikie	2018-12-24	2	-5/+10
\| \| \| \| \| \|	reading/editing llvm-svn: 350048
*	DebugInfo: Add assembly comments for debug_addr contribution header fields	David Blaikie	2018-12-24	1	-0/+4
\| \| \| \|	llvm-svn: 350047
*	[SelectionDAGBuilder] Use ::precise LocationSizes; NFC	George Burgess IV	2018-12-24	1	-11/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	More migration so we can disable the implicit int -> LocationSize conversion. All of these are either scatter/gather'ed vector instructions, or direct loads. Hence, they're all precise. Perhaps if we see way more getTypeStoreSize calls, we can make a getTypeStoreLocationSize (or similar) as a wrapper that applies this ::precise. Doesn't appear that it's a good idea to make getTypeStoreSize return a LocationSize itself, however. llvm-svn: 350042
*	[DAGCombiner] limit shuffle to extend transform (PR40146)	Sanjay Patel	2018-12-23	1	-4/+5
\| \| \| \| \| \| \| \| \| \|	It's dangerous to knowingly create an illegal vector type no matter what stage of combining we're in. This prevents the missed folding/scalarization seen in: https://bugs.llvm.org/show_bug.cgi?id=40146 llvm-svn: 350034
*	[DAGCombiner] allow hoisting vector bitwise logic ahead of extends	Sanjay Patel	2018-12-23	1	-6/+5
\| \| \| \|	llvm-svn: 350032
*	[DAGCombiner] allow narrowing of add followed by truncate	Sanjay Patel	2018-12-22	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	trunc (add X, C ) --> add (trunc X), C' If we're throwing away the top bits of an 'add' instruction, do it in the narrow destination type. This makes the truncate-able opcode list identical to the sibling transform done in IR (in instcombine). This change used to show regressions for x86, but those are gone after D55494. This gets us closer to deleting the x86 custom function (combineTruncatedArithmetic) that does almost the same thing. Differential Revision: https://reviews.llvm.org/D55866 llvm-svn: 350006
*	[IR] Add Instruction::isLifetimeStartOrEnd, NFC	Vedant Kumar	2018-12-21	3	-17/+5
\| \| \| \| \| \| \| \| \| \| \|	Instruction::isLifetimeStartOrEnd() checks whether an Instruction is an llvm.lifetime.start or an llvm.lifetime.end intrinsic. This was suggested as a cleanup in D55967. Differential Revision: https://reviews.llvm.org/D56019 llvm-svn: 349964