bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86][SSE] Use range loop. NFCI.	Simon Pilgrim	2016-04-24	1	-3/+2
\| \| \| \|	llvm-svn: 267349
*	[Lanai] Use EVT::getEVTString() to print a type as a string instead of an ↵	Craig Topper	2016-04-24	1	-1/+1
\| \| \| \| \| \|	enum encoding value. llvm-svn: 267348
*	[X86][XOP] Fixed VPPERM permute op decoding (PR27472).	Simon Pilgrim	2016-04-24	1	-1/+1
\| \| \| \| \| \|	Fixed issue with VPPERM target shuffle mask decoding that was incorrectly masking off the 3-bit permute op with a 2-bit mask. llvm-svn: 267346
*	BitcodeReader: Delay metadata parsing until reading a function body	Duncan P. N. Exon Smith	2016-04-24	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \|	There's hardly any functionality change here. Instead of calling materializeMetadata on the first call to materialize(GlobalValue*), wait until the first one that's actually going to do something. Noticed by inspection; I don't have a concrete case where this makes a difference. Added an assertion in materializeMetadata to be sure this (or a future change) doesn't delay materializeMetadata after function-level metadata. llvm-svn: 267345
*	[ThinLTO] Remove GlobalValueInfo class from index	Teresa Johnson	2016-04-24	6	-174/+150
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Remove the GlobalValueInfo and change the ModuleSummaryIndex to directly reference summary objects. The info structure was there to support lazy parsing of the combined index summary objects, which is no longer needed and not supported. Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19462 llvm-svn: 267344
*	[X86][SSE] Improved support for decoding target shuffle masks through bitcasts	Simon Pilgrim	2016-04-24	1	-20/+26
\| \| \| \| \| \| \| \|	Reused the ability to split constants of a type wider than the shuffle mask to work with masks generated from scalar constants transfered to xmm. This fixes an issue preventing PSHUFB target shuffle masks decoding rematerialized scalar constants and also exposes the XOP VPPERM bug described in PR27472. llvm-svn: 267343
*	[SystemZ] [SSP] Add support for LOAD_STACK_GUARD.	Marcin Koscielnicki	2016-04-24	4	-0/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes PR22248 on s390x. The previous attempt at this was D19101, which was before LOAD_STACK_GUARD existed. Compared to the previous version, this always emits a rather ugly block of 4 instructions, involving a thread pointer load that can't be shared with other potential users. However, this is necessary for SSP - spilling the guard value (or thread pointer used to load it) is counter to the goal, since it could be overwritten along with the frame it protects. Differential Revision: http://reviews.llvm.org/D19363 llvm-svn: 267340
*	Silence two C4806 warnings ('\|': unsafe operation: no value of type 'bool' ↵	Aaron Ballman	2016-04-24	1	-2/+2
\| \| \| \| \| \|	promoted to type 'const unsigned int' can equal the given constant). The fact that they trigger with this code seems like it may be a bug, but the warning itself is still generally useful enough to retain it for now. llvm-svn: 267337
*	BitcodeReader: Fix some holes in upgrade from r267296	Duncan P. N. Exon Smith	2016-04-24	1	-8/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add tests for some missing cases to bitcode upgrade in r267296. - DICompositeType with an 'elements:' field, which will cause it to be involved in a cycle after the upgrade. - A DIDerivedType that references a class in 'extraData:'. I updated test/Bitcode/dityperefs-3.8.ll with the missing cases and regenerated test/Bitcode/dityperefs-3.8.ll.bc. llvm-svn: 267332
*	[X86] Merge LowerCTLZ and LowerCTLZ_ZERO_UNDEF into a single function that ↵	Craig Topper	2016-04-24	1	-38/+16
\| \| \| \| \| \|	branches internally for the one difference, allowing the rest of the code to be common. NFC llvm-svn: 267331
*	[X86] Node need to check if AVX512 is supported when lowering vector CTLZ. ↵	Craig Topper	2016-04-24	1	-7/+5
\| \| \| \| \| \|	The CTLZ operation is only Custom for vectors if AVX512 is enabled so if a vector gets here AVX512 is implied. NFC llvm-svn: 267330
*	Add "hasSection" flag in the Summary	Mehdi Amini	2016-04-24	2	-3/+11
\| \| \| \| \| \| \| \| \| \| \|	Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19405 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267329
*	[MachineCombiner] Support for floating-point FMA on ARM64 (re-commit r267098)	Gerolf Hoflehner	2016-04-24	7	-38/+583
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original patch caused crashes because it could derefence a null pointer for SelectionDAGTargetInfo for targets that do not define it. Evaluates fmul+fadd -> fmadd combines and similar code sequences in the machine combiner. It adds support for float and double similar to the existing integer implementation. The key features are: - DAGCombiner checks whether it should combine greedily or let the machine combiner do the evaluation. This is only supported on ARM64. - It gives preference to throughput over latency: the heuristic used is to combine always in loops. The targets decides whether the machine combiner should optimize for throughput or latency. - Supports for fmadd, f(n)msub, fmla, fmls patterns - On by default at O3 ffast-math llvm-svn: 267328
*	[X86] Remove isel patterns for selecting tzcnt/lzcnt from ↵	Craig Topper	2016-04-24	1	-80/+0
\| \| \| \| \| \|	cmove/ne+cttz/ctlz. These are folded by DAG combine now. llvm-svn: 267326
*	[CodeGen] Teach DAG combine to fold select_cc seteq X, 0, sizeof(X), ↵	Craig Topper	2016-04-24	1	-0/+35
\| \| \| \| \| \| \| \|	ctlz_zero_undef(X) -> ctlz(X). InstCombine already does this for IR and X86 pattern matches this during isel. A follow up commit will remove the X86 patterns to allow this to be tested. llvm-svn: 267325
*	Fix an assertion that can never fire because the condition ANDed with the ↵	Craig Topper	2016-04-24	1	-1/+1
\| \| \| \| \| \|	string is just true or 1. llvm-svn: 267324
*	Revert "Verifier: Verify that each inlinable callsite of a ↵	Adrian Prantl	2016-04-24	1	-9/+0
\| \| \| \| \| \| \| \|	debug-info-bearing function" This reverts commit r267320 while investigating an OpenMP buildbot failure. llvm-svn: 267322
*	Verifier: Verify that each inlinable callsite of a debug-info-bearing function	Adrian Prantl	2016-04-24	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	in a debug-info-bearing function has a debug location attached to it. Failure to do so causes an "!dbg attachment points at wrong subprogram for function" assertion failure when the inliner sets up inline scope info. rdar://problem/25878916 llvm-svn: 267320
*	Reorganize GlobalValueSummary with a "Flags" bitfield.	Mehdi Amini	2016-04-24	3	-41/+65
\| \| \| \| \| \| \| \| \| \|	Right now it only contains the LinkageType, but will be extended with "hasSection", "isOptSize", "hasInlineAssembly", etc. Differential Revision: http://reviews.llvm.org/D19404 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267319
*	Add a version field in the bitcode for the summary	Mehdi Amini	2016-04-24	2	-1/+22
\| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19456 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267318
*	Add an internalization step to the ThinLTOCodeGenerator	Mehdi Amini	2016-04-24	1	-18/+151
\| \| \| \| \| \| \| \| \| \| \| \| \|	Keeping as much as possible internal/private is known to help the optimizer. Let's try to benefit from this in ThinLTO. Note: this is early work, but is enough to build clang (and all the LLVM tools). I still need to write some lit-tests... Differential Revision: http://reviews.llvm.org/D19103 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267317
*	Fix a couple assertions that can never fire because they just contained the ↵	Craig Topper	2016-04-24	2	-2/+2
\| \| \| \| \| \|	text string which always evaluates to true. Add a ! so they'll evaluate to false. llvm-svn: 267312
*	[X86] Fix patterns that turn cmove/cmovne+ctlz/cttz into lzcnt/tzcnt ↵	Craig Topper	2016-04-24	1	-30/+24
\| \| \| \| \| \|	instructions. Only one of the conditions should be valid for each pattern, not both. Update tests accordingly. llvm-svn: 267311
*	[RuntimeDyldELF] Handle GOTPCRELX/REX_GOTPCRELX.	Davide Italiano	2016-04-24	1	-1/+5
\| \| \| \|	llvm-svn: 267309
*	[MC/ELF] Implement support for GOTPCRELX/REX_GOTPCRELX.	Davide Italiano	2016-04-24	2	-5/+25
\| \| \| \| \| \| \| \| \|	The option to control the emission of the new relocations is -relax-relocations (blatantly copied from GNU as). It can't be enabled by default because it breaks relatively recent versions of ld.bfd/ld.gold (late 2015). llvm-svn: 267307
*	Store and emit original name in combined index	Mehdi Amini	2016-04-23	2	-25/+77
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As discussed in D18298, some local globals can't be renamed/promoted (because they have a section, or because they are referenced from inline assembly). To be able to detect naming collision, we need to keep around the "GUID" using their original name without taking the linkage into account. Reviewers: tejohnson Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19454 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267304
*	Always traverse GlobalVariable initializer when computing the export list	Mehdi Amini	2016-04-23	1	-21/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We are always importing the initializer for a GlobalVariable. So if a GlobalVariable is in the export-list, we pull in any refs as well. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19102 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 267303
*	DebugInfo: Change DIBuilder to make distinct DIGlobalVariables	Duncan P. N. Exon Smith	2016-04-23	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A long overdue change to make DIGlobalVariable distinct. Much like DISubprogram definitions (changed in r246098), it isn't logical to unique DIGlobalVariable definitions from two different compile units. (Longer-term, we should also find a way to reverse the link between GlobalVariable and DIGlobalVariable, and between DIGlobalVariable and DICompileUnit, so that debug info to do with optimized-out globals disappears. Admittedly it's harder than with Function/DISubprogram, since global variables may be constant-folded and the debug info should still describe that somehow.) llvm-svn: 267301
*	[MC/ELF] Pass Fixup to getRelocType64.	Davide Italiano	2016-04-23	1	-2/+3
\| \| \| \| \| \|	In preparation for other changes. llvm-svn: 267300
*	BitcodeReader: Avoid std::vector with non-movable types from r267296	Duncan P. N. Exon Smith	2016-04-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	r267298 didn't quite fix the build errors. Use SmallVector instead of std::vector, the latter of which I think is trying to maintain a strong exception safety guarantee. http://lab.llvm.org:8011/builders/lldb-amd64-ninja-freebsd11/builds/6228 llvm-svn: 267299
*	BitcodeReader: Avoid non-moving std::piecewise_construct from r267296	Duncan P. N. Exon Smith	2016-04-23	1	-3/+3
\| \| \| \| \| \| \| \| \|	Not exactly sure why the host tries to use a copy constructor here, but it's easy enough to work around it. http://lab.llvm.org:8011/builders/lldb-amd64-ninja-freebsd11/builds/6227 llvm-svn: 267298
*	DebugInfo: Remove MDString-based type references	Duncan P. N. Exon Smith	2016-04-23	11	-365/+279
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Eliminate DITypeIdentifierMap and make DITypeRef a thin wrapper around DIType*. It is no longer legal to refer to a DICompositeType by its 'identifier:', and DIBuilder no longer retains all types with an 'identifier:' automatically. Aside from the bitcode upgrade, this is mainly removing logic to resolve an MDString-based reference to an actualy DIType. The commits leading up to this have made the implicit type map in DICompileUnit's 'retainedTypes:' field superfluous. This does not remove DITypeRef, DIScopeRef, DINodeRef, and DITypeRefArray, or stop using them in DI-related metadata. Although as of this commit they aren't serving a useful purpose, there are patchces under review to reuse them for CodeView support. The tests in LLVM were updated with deref-typerefs.sh, which is attached to the thread "[RFC] Lazy-loading of debug info metadata": http://lists.llvm.org/pipermail/llvm-dev/2016-April/098318.html llvm-svn: 267296
*	replace duplicated static functions for profile metadata access with ↵	Sanjay Patel	2016-04-23	3	-52/+28
\| \| \| \| \| \|	BranchInst member function; NFCI llvm-svn: 267295
*	Revert "[AArch64] Fix optimizeCondBranch logic."	Renato Golin	2016-04-23	1	-5/+5
\| \| \| \| \| \|	This reverts commit r267206, as it broke self-hosting on AArch64. llvm-svn: 267294
*	improve documentation comments; NFC	Sanjay Patel	2016-04-23	2	-135/+10
\| \| \| \|	llvm-svn: 267292
*	[CodeGen] When promoting CTTZ operations to larger type, don't insert a ↵	Craig Topper	2016-04-23	1	-9/+11
\| \| \| \| \| \|	select to detect if the input is zero to return the original size instead of the extended size. Instead just set the first bit in the zero extended part. llvm-svn: 267280
*	BitcodeWriter: Emit uniqued subgraphs after all distinct nodes	Duncan P. N. Exon Smith	2016-04-23	2	-1/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since forward references for uniqued node operands are expensive (and those for distinct node operands are cheap due to DistinctMDOperandPlaceholder), minimize forward references in uniqued node operands. Moreover, guarantee that when a cycle is broken by a distinct node, none of the uniqued nodes have any forward references. In ValueEnumerator::EnumerateMetadata, enumerate uniqued node subgraphs first, delaying distinct nodes until all uniqued nodes have been handled. This guarantees that uniqued nodes only have forward references when there is a uniquing cycle (since r267276 changed ValueEnumerator::organizeMetadata to partition distinct nodes in front of uniqued nodes as a post-pass). Note that a single uniqued subgraph can hit multiple distinct nodes at its leaves. Ideally these would themselves be emitted in post-order, but this commit doesn't attempt that; I think it requires an extra pass through the edges, which I'm not convinced is worth it (since DistinctMDOperandPlaceholder makes forward references quite cheap between distinct nodes). I've added two testcases: - test/Bitcode/mdnodes-distinct-in-post-order.ll is just like test/Bitcode/mdnodes-in-post-order.ll, except with distinct nodes instead of uniqued ones. This confirms that, in the absence of uniqued nodes, distinct nodes are still emitted in post-order. - test/Bitcode/mdnodes-distinct-nodes-break-cycles.ll is the minimal example where a naive post-order traversal would cause one uniqued node to forward-reference another. IOW, it's the motivating test. llvm-svn: 267278
*	Avoid MSVC failure with default arguments in lambdas from r267270	Duncan P. N. Exon Smith	2016-04-23	1	-7/+10
\| \| \| \| \| \|	http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/11700 llvm-svn: 267277
*	BitcodeWriter: Emit distinct nodes before uniqued nodes	Duncan P. N. Exon Smith	2016-04-23	1	-6/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When an operand of a distinct node hasn't been read yet, the reader can use a DistinctMDOperandPlaceholder. This is much cheaper than forward referencing from a uniqued node. Change ValueEnumerator::organizeMetadata to partition distinct nodes and uniqued nodes to reduce the overhead of cycles broken by distinct nodes. Mehdi measured this for me; this removes most of the RAUW from the importing step of -flto=thin, even after a WIP patch that removes string-based DITypeRefs (introducing many more cycles to the metadata graph). llvm-svn: 267276
*	Address comments.	Teresa Johnson	2016-04-23	1	-243/+238
\| \| \| \|	llvm-svn: 267274
*	Refactor bitcode writer into classes (NFC)	Teresa Johnson	2016-04-23	1	-681/+823
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As discussed in on the mailing list yesterday, I have refactored BitcodeWriter.cpp to use classes to manage the bitcode writing process, instead of passing around long lists of parameters between static functions. See: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098610.html I created a parent BitcodeWriter class to own the BitstreamWriter, write the header, and contain the main entry point into the writing process. There are two derived classes, one for writing a module and one for writing a combined index file (for ThinLTO), which manage the writing process specific to those bitcode file types. I also changed the functions to conform to LLVM coding standards (lowercase function name first letter). The only two routines that still start with an uppercase letter are the two external interfaces, which can be fixed as a follow-on (I wanted to keep this round just within BitcodeWriter.cpp). Reviewers: dexonsmith, joker.eph Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19447 llvm-svn: 267273
*	Avoid ternery statement to please g++ after r267270, NFC	Duncan P. N. Exon Smith	2016-04-23	1	-1/+3
\| \| \| \| \| \|	http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/36074 llvm-svn: 267272
*	ValueEnumerator: Use std::find_if, NFC	Duncan P. N. Exon Smith	2016-04-23	2	-23/+8
\| \| \| \| \| \| \| \|	Mehdi's pattern recognition pulled this one out. This is cleaner with std::find_if than with the strange helper function that took an iterator by reference and updated it. llvm-svn: 267271
*	BitcodeReader: Avoid referencing unresolved nodes from distinct ones	Duncan P. N. Exon Smith	2016-04-23	2	-6/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Each reference to an unresolved MDNode is expensive, since the RAUW support in MDNode uses a separate allocation and side map. Since a distinct MDNode doesn't require its operands on creation (unlike uniuqed nodes, there's no need to check for structural equivalence), use nullptr for any of its unresolved operands. Besides reducing the burden on MDNode maps, this can avoid allocating temporary MDNodes in the first place. We need some way to track operands. Invent DistinctMDOperandPlaceholder for this purpose, which is a Metadata subclass that holds an ID and points at its single user. DistinctMDOperandPlaceholder::replaceUseWith is just like RAUW, but its name highlights that there is only ever exactly one use. There is no support for moving (or, obviously, copying) these. Move support would be possible but expensive; leaving it unimplemented prevents user error. In the BitcodeReader I originally considered allocating on a BumpPtrAllocator and keeping a vector of pointers to them, and then I realized that std::deque implements exactly this. A couple of obvious follow-ups: - Change ValueEnumerator to emit distinct nodes first to take more advantage of this optimization. (How convenient... I think I might have a couple of patches for this.) - Change DIBuilder and its consumers (like CGDebugInfo in clang) to use something like this when constructing debug info in the first place. llvm-svn: 267270
*	BitcodeReader: Consistently use IsDistinct, NFC	Duncan P. N. Exon Smith	2016-04-23	1	-33/+52
\| \| \| \| \| \| \| \|	Consistently use the IsDistinct variable and start relying on it in GET_OR_DISTINCT. This change has NFC, but prepares for using IsDistinct to optimize the behaviour of the getMD() and getMDOrNull() helpers. llvm-svn: 267268
*	BitcodeReader: Use getMD/getMDOrNull helpers consistently, almost NFC	Duncan P. N. Exon Smith	2016-04-23	2	-11/+5
\| \| \| \| \| \| \| \| \| \|	The only functionality change was removing an error check from the BitcodeReader (and an assertion from DILocation::getImpl) that is already caught by Verifier::visitDILocation. The Verifier is a better place for this anyway, and being inconsistent with other subclasses of MDNode isn't serving anyone. llvm-svn: 267267
*	[Hexagon] Set ctlz_zero_undef/cttz_zero_undef to Expand so LegalizeDAG will ↵	Craig Topper	2016-04-23	3	-16/+8
\| \| \| \| \| \|	convert them to ctlz/cttz. Remove the now unneccessary isel patterns. NFC llvm-svn: 267266
*	[NVPTX] Set ctlz_zero_undef to Expand so LegalizeDAG will convert it to ↵	Craig Topper	2016-04-23	2	-10/+3
\| \| \| \| \| \|	ctlz. Remove the now unneccessary isel patterns. NFC llvm-svn: 267265
*	[WebAssembly] Set ctlz_zero_undef/cttz_zero_undef to Expand so LegalizeDAG ↵	Craig Topper	2016-04-23	2	-7/+1
\| \| \| \| \| \|	will convert them to ctlz/cttz. Remove the now unneccessary isel patterns. NFC llvm-svn: 267264
*	Style fix in Core.h / Core.cpp. NFC	Amaury Sechet	2016-04-23	1	-14/+10
\| \| \| \|	llvm-svn: 267257