bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SLP] Visualize SLP trees with -view-slp-tree	Adam Nemet	2017-03-08	1	-62/+167
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Analyzing larger trees is extremely difficult with the current debug output so this adds GraphTraits and DOTGraphTraits on top of the VectorizableTree data structure. We can now display the SLP trees with Graphviz as in https://reviews.llvm.org/F3132765. I decorated the graph where a value needs to be gathered for one reason or another. These are the red nodes. There are other improvement I am planning to make as I work through my case here. For example, I would also like to mark nodes that need to be extracted. Differential Revision: https://reviews.llvm.org/D30731 llvm-svn: 297303
*	[LV] Select legal insert point when fixing first-order recurrences	Matthew Simpson	2017-03-08	1	-7/+9
\| \| \| \| \| \| \| \| \| \|	Because IRBuilder performs constant-folding, it's not guaranteed that an instruction in the original loop map to an instruction in the vector loop. It could map to a constant vector instead. The handling of first-order recurrences was incorrectly making this assumption when setting the IRBuilder's insert point. llvm-svn: 297302
*	[GlobalISel] Add default action for G_FNEG	Volkan Keles	2017-03-08	2	-0/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: rL297171 introduced G_FNEG for floating-point negation instruction and IRTranslator started to translate `FSUB -0.0, X` to `FNEG X`. This patch adds a default action for G_FNEG to avoid breaking existing targets. Reviewers: qcolombet, ab, kristof.beyls, t.p.northover, aditya_nandakumar, dsanders Reviewed By: qcolombet Subscribers: dberris, rovka, llvm-commits Differential Revision: https://reviews.llvm.org/D30721 llvm-svn: 297301
*	Resubmit FileSystem changes.	Zachary Turner	2017-03-08	1	-0/+12
\| \| \| \| \| \| \| \| \| \|	This was originall reverted due to some test failures in ModuleCache and TestCompDirSymlink. These issues have all been resolved and the code now passes all tests. Differential Revision: https://reviews.llvm.org/D30698 llvm-svn: 297300
*	[Hexagon] Use correct offset when extracting from the high word	Krzysztof Parzyszek	2017-03-08	1	-0/+1
\| \| \| \| \| \| \| \|	When extracting a bitfield from the high register in a register pair, the final offset should be relative to the high register (for 32-bit extracts). llvm-svn: 297288
*	[Sparc] Check register use with isPhysRegUsed() instead of reg_nodbg_empty()	Daniel Cederman	2017-03-08	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: By using reg_nodbg_empty() to determine if a function can be treated as a leaf function or not, we miss the case when the register pair L0_L1 is used but not L0 by itself. This has the effect that use_all_i32_regs(), a test in reserved-regs.ll which tries to use all registers, gets treated as a leaf function. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: davide, RKSimon, sepavloff, llvm-commits Differential Revision: https://reviews.llvm.org/D27089 llvm-svn: 297285
*	[JumpThread] Use AA in SimplifyPartiallyRedundantLoad()	Jun Bum Lim	2017-03-08	1	-11/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Use AA when scanning to find an available load value. Reviewers: rengolin, mcrosier, hfinkel, trentxintong, dberlin Reviewed By: rengolin, dberlin Subscribers: aemerson, dberlin, llvm-commits Differential Revision: https://reviews.llvm.org/D30352 llvm-svn: 297284
*	[InstCombine] avoid crashing on shuffle shrinkage when input type is not ↵	Sanjay Patel	2017-03-08	1	-1/+2
\| \| \| \| \| \|	same as result type llvm-svn: 297280
*	[LoopRotate] Propagate dbg.value intrinsics	Sam Parker	2017-03-08	1	-3/+45
\| \| \| \| \| \| \| \| \| \| \| \|	Recommitting patch which was previously reverted in r297159. These changes should address the casting issues. The original patch enables dbg.value intrinsics to be attached to newly inserted PHI nodes. Differential Review: https://reviews.llvm.org/D30701 llvm-svn: 297269
*	[X86][SSE] combineX86ShufflesRecursively can handle shuffle masks up to 64 ↵	Simon Pilgrim	2017-03-08	1	-8/+7
\| \| \| \| \| \| \| \|	elements wide By defining the mask types as SmallVector<int, 16> we were causing a lot of unnecessary heap usage. llvm-svn: 297267
*	[SLP] Fixed non-deterministic behavior in Loop Vectorizer.	Amjad Aboud	2017-03-08	1	-9/+11
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D30638 llvm-svn: 297257
*	Revert "Revert "[PowerPC][ELFv2ABI] Allocate parameter area on-demand to ↵	Tim Shen	2017-03-08	1	-5/+43
\| \| \| \| \| \| \| \| \| \| \| \| \|	reduce stack frame size"" After inspection, it's an UB in our code base. Someone cast a var-arg function pointer to a non-var-arg one. :/ Re-commit r296771 to continue testing on the patch. Sorry for the trouble! llvm-svn: 297256
*	Handle UnreachableInst in isGuaranteedToTransferExecutionToSuccessor	Sebastian Pop	2017-03-08	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	A block with an UnreachableInst does not transfer execution to a successor. The problem was exposed by GVN-hoist. This patch fixes bug 32153. Patch by Aditya Kumar. Differential Revision: https://reviews.llvm.org/D30667 llvm-svn: 297254
*	[SCCP] Merge markOverdefined and markAnythingOverdefined.	Davide Italiano	2017-03-08	1	-23/+17
\| \| \| \| \| \|	There's no need to have two separate APIs. llvm-svn: 297253
*	[NVPTX] Remove unnecessary isImageReadoOnly(), isImageWriteOnly(), & ↵	Justin Lebar	2017-03-08	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \| \|	isImageReadWrite calls This is repetition of isImage() function in NVPTXUtilities.cpp. Patch by Briana Grace! Differential Revision: https://reviews.llvm.org/D30706 llvm-svn: 297252
*	AMDGPU: Don't wait at end of block with a trivial successor	Matt Arsenault	2017-03-08	1	-2/+14
\| \| \| \| \| \| \| \| \| \|	If there is only one successor, and that successor only has one predecessor the wait can obviously be delayed until uses or the end of the next block. This avoids code quality regressions when there are trivial fallthrough blocks inserted for structurization. llvm-svn: 297251
*	[DAGCombine] Simplify ISD::AND in GetDemandedBits.	Eli Friedman	2017-03-08	1	-0/+11
\| \| \| \| \| \| \| \| \|	This helps in cases involving bitfields where an AND is exposed by legalization. Differential Revision: https://reviews.llvm.org/D30472 llvm-svn: 297249
*	AMDGPU: Constant fold rcp node	Matt Arsenault	2017-03-08	1	-2/+12
\| \| \| \| \| \| \|	When doing arcp optimization with a constant denominator, this was leaving behind rcps with constant inputs. llvm-svn: 297248
*	[DebugInfo] Make legal and emit DW_OP_swap and DW_OP_xderef	Konstantin Zhuravlyov	2017-03-08	2	-0/+19
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29672 llvm-svn: 297247
*	Fix additional constructor call missed by r297241.	Daniel Sanders	2017-03-07	1	-1/+1
\| \| \| \| \| \|	It was added between my build+test and my commit. llvm-svn: 297244
*	AMDGPU/SI: Do not insert EndCf in an unreachable block	Changpeng Fang	2017-03-07	1	-2/+3
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm Differential Revision: http://reviews.llvm.org/D22025 llvm-svn: 297243
*	[InstCombine] shrink truncated insertelement into undef vector	Sanjay Patel	2017-03-07	1	-0/+38
\| \| \| \| \| \| \| \| \| \| \| \|	This is the 2nd part of solving: http://lists.llvm.org/pipermail/llvm-dev/2017-February/110293.html D30123 moves the trunc ahead of the shuffle, and this moves the trunc ahead of the insertelement. We're limiting this transform to undef rather than any constant to avoid backend problems. Differential Revision: https://reviews.llvm.org/D30137 llvm-svn: 297242
*	Recommit: [globalisel] Change LLT constructor string into an LLT-based ↵	Daniel Sanders	2017-03-07	7	-61/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	object that knows how to generate it. Summary: This will allow future patches to inspect the details of the LLT. The implementation is now split between the Support and CodeGen libraries to allow TableGen to use this class without introducing layering concerns. Thanks to Ahmed Bougacha for finding a reasonable way to avoid the layering issue and providing the version of this patch without that problem. The problem with the previous commit appears to have been that TableGen was including CodeGen/LowLevelType.h instead of Support/LowLevelTypeImpl.h. Reviewers: t.p.northover, qcolombet, rovka, aditya_nandakumar, ab, javed.absar Subscribers: arsenm, nhaehnle, mgorny, dberris, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30046 llvm-svn: 297241
*	[Hexagon] Check for presence before looking registers up in bit tracker	Krzysztof Parzyszek	2017-03-07	1	-0/+4
\| \| \| \|	llvm-svn: 297240
*	[Hexagon] Generate bitsplit instruction	Krzysztof Parzyszek	2017-03-07	1	-1/+118
\| \| \| \|	llvm-svn: 297239
*	GlobalISel: use inserts for landingpad instead of sequences.	Tim Northover	2017-03-07	1	-26/+28
\| \| \| \|	llvm-svn: 297237
*	Fix one-after-the-end type metadata handling in globalsplit.	Evgeniy Stepanov	2017-03-07	1	-1/+10
\| \| \| \| \| \| \| \| \| \|	Itanium ABI may have an address point one byte after the end of a vtable. When such vtable global is split, the !type metadata needs to follow the right vtable. Differential Revision: https://reviews.llvm.org/D30716 llvm-svn: 297236
*	[InstCombine] shrink truncated splat shuffle (2nd try)	Sanjay Patel	2017-03-07	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was committed at r297155 and reverted at r297166 because of an over-reaching clang test. That should be fixed with r297189. This is one part of solving a recent bug report: http://lists.llvm.org/pipermail/llvm-dev/2017-February/110293.html This keeps with our general approach: changing arbitrary shuffles is off-limts, but changing splat is ok. The transform is very similar to the existing shrinkBitwiseLogic() canonicalization. Differential Revision: https://reviews.llvm.org/D30123 llvm-svn: 297232
*	[ObjectYAML] Fix issue with DWARF2 AddrSize 8	Chris Bieneman	2017-03-07	1	-2/+6
\| \| \| \| \| \| \| \|	In my refactoring I introduced a bug where we were using the reference size instead of the offset size for DW_FORM_strp and similar forms. This patch resolves the error and adds a test case testing all the DWARF forms for DWARF2 AddrSize 8. There is similar coverage already in the DWARFDebugInfoTest sources that covers the parser. Once I migrate the DWARFGenerator APIs to be built on the YAML tools they will be fully covered under the same tests. llvm-svn: 297230
*	GlobalISel: fix legalization of G_INSERT	Tim Northover	2017-03-07	1	-14/+19
\| \| \| \| \| \| \| \|	We were calculating incorrect extract/insert offsets by trying to be too tricksy with min/max. It's clearer to just split the logic up into "register starts before this segment" vs "after". llvm-svn: 297226
*	[coroutines] Add handling for unwind coro.ends	Gor Nishanov	2017-03-07	1	-4/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The purpose of coro.end intrinsic is to allow frontends to mark the cleanup and other code that is only relevant during the initial invocation of the coroutine and should not be present in resume and destroy parts. In landing pads coro.end is replaced with an appropriate instruction to unwind to caller. The handling of coro.end differs depending on whether the target is using landingpad or WinEH exception model. For landingpad based exception model, it is expected that frontend uses the `coro.end`_ intrinsic as follows: ``` ehcleanup: %InResumePart = call i1 @llvm.coro.end(i8* null, i1 true) br i1 %InResumePart, label %eh.resume, label %cleanup.cont cleanup.cont: ; rest of the cleanup eh.resume: %exn = load i8, i8* %exn.slot, align 8 %sel = load i32, i32* %ehselector.slot, align 4 %lpad.val = insertvalue { i8, i32 } undef, i8 %exn, 0 %lpad.val29 = insertvalue { i8, i32 } %lpad.val, i32 %sel, 1 resume { i8, i32 } %lpad.val29 ``` The `CoroSpit` pass replaces `coro.end` with ``True`` in the resume functions, thus leading to immediate unwind to the caller, whereas in start function it is replaced with ``False``, thus allowing to proceed to the rest of the cleanup code that is only needed during initial invocation of the coroutine. For Windows Exception handling model, a frontend should attach a funclet bundle referring to an enclosing cleanuppad as follows: ``` ehcleanup: %tok = cleanuppad within none [] %unused = call i1 @llvm.coro.end(i8* null, i1 true) [ "funclet"(token %tok) ] cleanupret from %tok unwind label %RestOfTheCleanup ``` The `CoroSplit` pass, if the funclet bundle is present, will insert ``cleanupret from %tok unwind to caller`` before the `coro.end`_ intrinsic and will remove the rest of the block. Reviewers: majnemer Reviewed By: majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D25543 llvm-svn: 297223
*	Implement FreeMachineFunction::getPassName().	Yaron Keren	2017-03-07	1	-0/+4
\| \| \| \|	llvm-svn: 297222
*	[GlobalISel] Don't translate intrinsics with metadata parameters.	Ahmed Bougacha	2017-03-07	1	-0/+3
\| \| \| \| \| \| \| \| \|	Some intrinsics take metadata parameters. These all need custom handling of some form, and cannot possibly be lowered generically to G_INTRINSIC calls with vreg operands. Reject them, instead of hitting an assert later in getOrCreateVReg. llvm-svn: 297209
*	[GlobalISel] Avoid invalidating ValToVReg when translating no-op bitcast.	Ahmed Bougacha	2017-03-07	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When we translate a no-op (same type) bitcast, we try to be clever and only emit a COPY if we already assigned a vreg to the defined value. However, when we didn't, we tried to assign to a reference into the ValToVReg DenseMap, even though the RHS of the assignment (getOrCreateVReg) could potentially grow that DenseMap, invalidating the reference. Avoid that by getting the source vreg first. I audited the rest of the translator; this is the only tricky case. The test is quite unwieldy, as the problem is caused by the DenseMap growing, which happens after the 47th mapped value. llvm-svn: 297208
*	[GlobalISel] Relax vector G_SELECT assertion.	Ahmed Bougacha	2017-03-07	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \|	For vector operands, the `select` instruction supports both vector and non-vector conditions. The MIR builder had an overly restrictive assertion, that only accepted vector conditions for vector selects (in effect implementing ISD::VSELECT). Make it possible to express the full range of G_SELECTs. llvm-svn: 297207
*	[GlobalISel] Slightly clean up DBG_VALUE FP build code.	Ahmed Bougacha	2017-03-07	1	-2/+1
\| \| \| \| \| \| \|	I messed up my rebases leading to r297200, and ended up with stale (but working) code. Fix it. llvm-svn: 297205
*	[fuzzer] Don't crash if LLVMFuzzerMutate was called by CustomCrossOver	Vitaly Buka	2017-03-07	5	-2/+40
\| \| \| \| \| \| \| \| \| \|	Reviewers: kcc Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D30682 llvm-svn: 297202
*	[GlobalISel] Ignore %noreg when applying default regbank mapping.	Ahmed Bougacha	2017-03-07	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	When computing the mapping for non-generic instructions, we skipped %noreg operands, because we can't always reason about their banks. Also skip them when applying the mapping. Otherwise, we could end up with mappings that we can't apply. While there, duplicate an assert to distinguish between the two error conditions. llvm-svn: 297201
*	[GlobalISel] Emit DBG_VALUE %noreg for non-int/fp constant values.	Ahmed Bougacha	2017-03-07	1	-1/+6
\| \| \| \| \| \| \| \|	When a dbg_value has a constant operand that isn't representable in MI, there isn't much we can do. Use %noreg (0) for those situations. This matches the SelectionDAG behavior. llvm-svn: 297200
*	[NVPTX] Fixed lowering of unaligned loads/stores of f16 scalars and vectors.	Artem Belevich	2017-03-07	1	-11/+31
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D30672 llvm-svn: 297198
*	SjLjEHPrepare: Fix the pass for swifterror arguments	Arnold Schwaighofer	2017-03-07	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \|	We cannot leave the identity copies 'select true, arg, undef' that this pass inserts for arguments to simplify handling of values on swifterror arguments. swifterror arguments have restrictions on their uses. rdar://30839288 llvm-svn: 297197
*	Fix C2712 build error on Windows	Konstantin Zhuravlyov	2017-03-07	1	-6/+12
\| \| \| \| \| \| \| \|	Move the __try/__except block outside of the set_thread_name function to avoid a conflict with object unwinding due to the use of the llvm::Storage. Differential Revision: https://reviews.llvm.org/D30707 llvm-svn: 297192
*	[AArch64] Vulcan is now ThunderXT99	Joel Jones	2017-03-07	5	-193/+203
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Broadcom Vulcan is now Cavium ThunderX2T99. LLVM Bugzilla: http://bugs.llvm.org/show_bug.cgi?id=32113 Minor fixes for the alignments of loops and functions for ThunderX T81/T83/T88 (better performance). Patch was tested with SpecCPU2006. Patch by Stefan Teleman Differential Revision: https://reviews.llvm.org/D30510 llvm-svn: 297190
*	Revert r297177: Change LLT constructor string into an LLT-based object ...	Daniel Sanders	2017-03-07	7	-77/+61
\| \| \| \| \| \| \| \| \| \|	More module problems. This time it only showed up in the stage 2 compile of clang-x86_64-linux-selfhost-modules-2 but not the stage 1 compile. Somehow, this change causes the build to need Attributes.gen before it's been generated. llvm-svn: 297188
*	[JumpThread] Simplify CmpInst-as-Condition branch-folding a bit.	Xin Tong	2017-03-07	1	-4/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Simplify CmpInst-as-Condition branch-folding a bit. Reviewers: sanjoy, efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30429 llvm-svn: 297186
*	[ObjectYAML] Add support for DWARF5 Unit header	Chris Bieneman	2017-03-07	2	-2/+11
\| \| \| \| \| \|	In DWARF5 the Unit header added a new field, UnitType, and swapped the order of the address size and abbreviation offset fields. llvm-svn: 297183
*	[LV] Consider users that are memory accesses in uniforms expansion step	Matthew Simpson	2017-03-07	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \|	When expanding the set of uniform instructions beyond the seed instructions (e.g., consecutive pointers), we mark a new instruction uniform if all its loop-varying users are uniform. We should also allow users that are consecutive or interleaved memory accesses. This fixes cases where we have an instruction that is used as the pointer operand of a consecutive access but also used by a non-memory instruction that later becomes uniform as part of the expansion. llvm-svn: 297179
*	[X86] Add option to specify preferable loop alignment	Sanjoy Das	2017-03-07	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Loop alignment can cause a significant change of the perfromance for short loops. To be able to evaluate the impact of loop alignment this change introduces the new option x86-experimental-pref-loop-alignment. The alignment will be 2^Value bytes, the default value is 4. Patch by Serguei Katkov! Reviewers: craig.topper Reviewed By: craig.topper Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D30391 llvm-svn: 297178
*	[globalisel] Change LLT constructor string into an LLT-based object that ↵	Daniel Sanders	2017-03-07	7	-61/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	knows how to generate it. Summary: This will allow future patches to inspect the details of the LLT. The implementation is now split between the Support and CodeGen libraries to allow TableGen to use this class without introducing layering concerns. Thanks to Ahmed Bougacha for finding a reasonable way to avoid the layering issue and providing the version of this patch without that problem. Reviewers: t.p.northover, qcolombet, rovka, aditya_nandakumar, ab, javed.absar Subscribers: arsenm, nhaehnle, mgorny, dberris, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30046 llvm-svn: 297177
*	[GlobalISel] Translate floating-point negation	Volkan Keles	2017-03-07	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: qcolombet, javed.absar, aditya_nandakumar, dsanders, t.p.northover, ab Reviewed By: qcolombet Subscribers: dberris, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30671 llvm-svn: 297171