bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Replace MachineRegisterInfo::isSSA() with a MachineFunctionProperty	Derek Schuff	2016-04-04	2	-17/+15
\| \| \| \| \| \| \| \| \|	Use the MachineFunctionProperty mechanism to indicate whether a MachineFunction is in SSA form instead of a custom method on MachineRegisterInfo. NFC Differential Revision: http://reviews.llvm.org/D18574 llvm-svn: 265318
*	Revert r265309 and r265312 because they caused some errors I need to ↵	Wei Mi	2016-04-04	10	-715/+527
\| \| \| \| \| \|	investigate. llvm-svn: 265317
*	Add MachineFunctionProperty checks for AllVRegsAllocated for target passes	Derek Schuff	2016-04-04	44	-6/+209
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds the same checks that were added in r264593 to all target-specific passes that run after register allocation. Reviewers: qcolombet Subscribers: jyknight, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18525 llvm-svn: 265313
*	Fix unused var warning caused by r265309.	Wei Mi	2016-04-04	1	-3/+3
\| \| \| \|	llvm-svn: 265312
*	Replace analyzeSiblingValues with new algorithm to fix its compile	Wei Mi	2016-04-04	10	-526/+714
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	time issue. The patch is to solve PR17409 and its duplicates. analyzeSiblingValues is an N x N complexity algorithm where N is the number of siblings generated by reg splitting. Although it causes a significant compile time issue when N is large, it is also important for performance since it removes redundant spills and enables rematerialization. To solve the compile time issue, the patch removes analyzeSiblingValues and replaces it with lower cost alternatives containing two parts. The first part creates a new spill hoisting method in postOptimization of register allocation. It does spill hoisting at once after all the spills are generated instead of inside every instance of selectOrSplit. The second part queries the define expr of the original register for rematerialization and keeps it always available during register allocation even if it is already dead. It deletes those dead instructions only in postOptimization. With the two parts in the patch, it can remove analyzeSiblingValues without sacrificing performance. Differential Revision: http://reviews.llvm.org/D15302 llvm-svn: 265309
*	[mips] Range check simm32 and fold MIPS16's imm32 into simm32.	Daniel Sanders	2016-04-04	3	-39/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: At this point we should be able to enable IAS by default for O32 without breaking check-all, or recursion. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18439 llvm-svn: 265302
*	[SystemZ] Add compare-and-branch instructions to MC	Ulrich Weigand	2016-04-04	2	-21/+106
\| \| \| \| \| \| \| \| \| \| \| \|	This adds MC support for fused compare + indirect branch instructions, i.e. CRB, CGRB, CLRB, CLGRB, CIB, CGIB, CLIB, CLGIB. They aren't actually generated yet -- this is preparation for their use for conditional returns in the next iteration of D17339. Author: koriakin Differential Revision: http://reviews.llvm.org/D18742 llvm-svn: 265296
*	[SystemZ] Support ATOMIC_FENCE	Ulrich Weigand	2016-04-04	5	-0/+40
\| \| \| \| \| \| \| \| \| \| \|	A cross-thread sequentially consistent fence should be lowered into z/Architecture's BCR serialization instruction, instead of causing a fatal error in the back-end. Author: bryanpkc Differential Revision: http://reviews.llvm.org/D18644 llvm-svn: 265292
*	[SystemZ] Support llvm.frameaddress/llvm.returnaddress intrinsics	Ulrich Weigand	2016-04-04	3	-2/+64
\| \| \| \| \| \| \| \| \| \| \|	Enable the SystemZ back-end to lower FRAMEADDR and RETURNADDR, which previously would cause the back-end to crash. Currently, only a frame count of zero is supported. Author: bryanpkc Differential Revision: http://reviews.llvm.org/D18514 llvm-svn: 265291
*	AVX-512: Truncating store for i1 vectors	Elena Demikhovsky	2016-04-04	1	-1/+62
\| \| \| \| \| \| \| \| \|	Implemented truncstore for KNL and skylake-avx512. Covered vectors from v2i1 to v64i1. We save the value in bits (not in bytes) - v32i1 is saved in 4 bytes. Differential Revision: http://reviews.llvm.org/D18740 llvm-svn: 265283
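The "bits, not bytes" layout mentioned above can be pictured with a small standalone sketch. It is not the X86 backend code: `packMaskBits` is a hypothetical helper, and placing lane 0 in the least significant bit of the first byte is an assumption made for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of LLVM): pack one boolean per vector lane
// into a bit-per-lane byte buffer, so 32 lanes occupy 4 bytes.
// Assumption: lane 0 maps to the least significant bit of byte 0.
std::vector<std::uint8_t> packMaskBits(const std::vector<bool> &lanes) {
  // Round the lane count up to a whole number of bytes.
  std::vector<std::uint8_t> bytes((lanes.size() + 7) / 8, 0);
  for (std::size_t i = 0; i < lanes.size(); ++i) {
    if (lanes[i])
      bytes[i / 8] |= static_cast<std::uint8_t>(1u << (i % 8));
  }
  return bytes;
}
```

Under these assumptions a 32-lane mask yields a 4-byte buffer, matching the v32i1 size quoted in the commit message; a v2i1 mask would still take one byte.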
*	ValueMapper: Remove old FIXMEs; almost NFC	Duncan P. N. Exon Smith	2016-04-04	1	-21/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove a few old FIXMEs from the original commit of the Metadata/Value split in r223802. These are commented out assertions to the effect that calls between mapValue and mapMetadata never return nullptr. (The only behaviour change is that Mapper::mapSimpleMetadata memoizes the nullptr return.) When I originally rewrote the mapping code, I thought we could be stricter in the new metadata hierarchy and never return nullptr when RF_NullMapMissingGlobalValues was off. It's still not entirely clear to me why these assertions failed (a few months ago, I had a theory that I forgot to write down, but that's helping no one). Understood or not, I no longer see how these commented-out assertions would be useful. I'm relegating them to the annals of source control before making significant changes to ValueMapper.cpp. llvm-svn: 265282
*	IR: Lazily create ReplaceableMetadataImpl on MDNode	Duncan P. N. Exon Smith	2016-04-03	1	-31/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	RAUW support on MDNode usually requires an extra allocation for ReplaceableMetadataImpl. This is only strictly necessary if there are tracking references to the MDNode. Make the construction of ReplaceableMetadataImpl lazy, so that we don't get allocations if we don't need them. Since MDNode::isResolved now checks MDNode::isTemporary and MDNode::NumUnresolved instead of whether a ReplaceableMetadataImpl is allocated, the internal changes are intrusive (at various internal checkpoints, isResolved now has a different answer). However, there should be no real functionality change here; just slightly lazier allocation behaviour. The external semantics should be identical. llvm-svn: 265279
*	Various style fixes in Core.h/Core.cpp. NFC	Amaury Sechet	2016-04-03	1	-7/+7
\| \| \| \|	llvm-svn: 265277
*	ValueMapper: Disallow metadata mapping recursion through mapValue	Duncan P. N. Exon Smith	2016-04-03	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds an assertion to maintain the property from r265273. When Mapper::mapSimpleMetadata calls Mapper::mapValue, it should not find its way back to mapMetadataImpl. This guarantees that mapSimpleMetadata is not involved in any recursion. Since Mapper::mapValue calls out to arbitrary materializers, we need to save a bit on the ValueMap to make this assertion effective. There should be no functionality change here. This co-recursion should already have been impossible. llvm-svn: 265276
*	Work around MSVC failure from r265273	Duncan P. N. Exon Smith	2016-04-03	1	-0/+10
\| \| \| \| \| \|	http://lab.llvm.org:8011/builders/sanitizer-windows/builds/19726 llvm-svn: 265275
*	[X86] Removed duplicate code.	Simon Pilgrim	2016-04-03	1	-5/+5
\| \| \| \|	llvm-svn: 265274
*	ValueMapper: Avoid recursion in mapSimplifiedMetadata, NFC	Duncan P. N. Exon Smith	2016-04-03	1	-9/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The main change is to delay materializing GlobalValue initializers from Mapper::mapValue until Mapper::~Mapper. This effectively removes all recursion from mapSimplifiedMetadata, as promised in r265270. mapSimplifiedMetadata calls mapValue for ConstantAsMetadata nodes to find the mapped constant, and now it shouldn't be possible for mapValue to indirectly re-invoke mapMetadata. I'll add an assertion to that effect in a follow-up (separated so that the assertion can easily be reverted independently, if it comes to that). This is a step toward a broader goal: converting Mapper::mapMetadataImpl from a recursive to an iterative algorithm. When a BlockAddress points at a BasicBlock inside an unmaterialized function body, we need to delay it until the function body is materialized in Mapper::~Mapper. This commit creates a temporary BasicBlock and returns a new BlockAddress, then RAUWs the BasicBlock once it is known. This situation should be extremely rare since a BlockAddress is usually used from within the function it's referencing (and BlockAddress itself is rare). There should be no observable functionality change. llvm-svn: 265273
*	[CodeGenPrepare] Fix r265264 (again).	Peter Zotov	2016-04-03	1	-3/+3
\| \| \| \| \| \| \|	Don't require TLI for SinkCmpExpression, like it wasn't before r265264. llvm-svn: 265271
*	ValueMapper: Split out mapSimpleMetadata, NFC	Duncan P. N. Exon Smith	2016-04-03	1	-4/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Split out a helper for mapping metadata without operands. This is any metadata that is not an MDNode, and any MDNode where the answer is known without looking at operands. Through some weird twists, this function is co-recursive: mapSimpleMetadata => MapValue => materializeInitFor => linkFunctionBody => RemapInstructions => MapMetadata => mapSimpleMetadata I plan to break the recursion in a follow-up. llvm-svn: 265270
*	ValueMapper: Introduce Mapper helper class, NFC	Duncan P. N. Exon Smith	2016-04-03	1	-85/+101
\| \| \| \| \| \| \|	Remove a bunch of boilerplate from ValueMapper.cpp by using a new file-local class called Mapper. llvm-svn: 265268
*	[X86][SSE] Support for MOVMSK signbit extraction instructions	Simon Pilgrim	2016-04-03	5	-45/+32
\| \| \| \| \| \| \| \| \| \|	Add support for lowering with the MOVMSK instruction to extract vector element signbits to a GPR. This is an early step towards more optimal handling of vector comparison results. Differential Revision: http://reviews.llvm.org/D18741 llvm-svn: 265266
*	[CodeGenPrepare] Fix r265264.	Peter Zotov	2016-04-03	1	-3/+3
\| \| \| \| \| \| \|	The case where there was no TargetLowering was not handled, leading to null pointer dereferences. llvm-svn: 265265
*	[CodeGenPrepare] Avoid sinking soft-FP comparisons	Peter Zotov	2016-04-03	1	-5/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Sinking comparisons in CGP can undo the job of hoisting them done earlier by LICM, and soft-FP makes this an expensive mistake. A common pattern that produces floating point comparisons uniform over a loop is an explicit check for division by zero. If the divisor is hoisted out of the loop, the comparison can also be, but hoisting the function that unwinds is never legal, since it may cause side effects in the loop body prior to the unwinding to not be executed. Differential Revision: http://reviews.llvm.org/D18744 llvm-svn: 265264
*	[X86] Tidied up X86ISD instruction nodes. NFCI.	Simon Pilgrim	2016-04-03	1	-50/+59
\| \| \| \| \| \| \| \|	Tidied up comments, stripped trailing whitespace, split apart nodes that aren't related. No change in ordering although there is definitely some scope for it. llvm-svn: 265263
*	Mark some FP intrinsics as safe to speculatively execute	Peter Zotov	2016-04-03	1	-4/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Floating point intrinsics in LLVM are generally not speculatively executed, since most of them are defined to behave the same as libm functions, which set errno. However, the only error that can happen when executing ceil, floor, nearbyint, rint and round libm functions per POSIX.1-2001 is -ERANGE, and that requires the maximum value of the exponent to be smaller than the number of mantissa bits, which is not the case with any of the floating point types supported by LLVM. The trunc and copysign functions never set errno per POSIX.1-2001. Differential Revision: http://reviews.llvm.org/D18643 llvm-svn: 265262
*	AVX-512: Load and Extended Load for i1 vectors	Elena Demikhovsky	2016-04-03	2	-10/+122
\| \| \| \| \| \| \| \| \| \|	Implemented load+{sign\|zero}_extend for i1 vectors Fixed failures in i1 vector load. Covered loading of v2i1, v4i1, v8i1, v16i1, v32i1, v64i1 vectors for KNL and SKX. Differential Revision: http://reviews.llvm.org/D18737 llvm-svn: 265259
*	[SimplifyLibCalls] Garbage collect dead code.	Davide Italiano	2016-04-03	1	-28/+7
\| \| \| \| \| \| \| \| \| \|	We already skip optimizations if the return value of printf() is used, so CI->use_empty() is always true. Differential Revision: http://reviews.llvm.org/D18656 llvm-svn: 265253
*	[lanai] Fix for LanaiDelaySlotFiller and LanaiMCInstLower.cpp	Jacques Pienaar	2016-04-03	3	-104/+87
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: * Fix to stop delay slot filler from inserting SP modifying instructions in the newly expanded call/return instructions. * In LowerSymbol the outermost type was not LanaiMCExpr if there was a binary expression * Remove printExpr in LanaiInstPrinter Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18734 llvm-svn: 265251
*	[mips][microMIPS] Revert commits r264245 and r264248.	Zoran Jovanovic	2016-04-02	11	-106/+51
\| \| \| \| \| \| \|	Commit r264245 was the reason for failing tests in LLVM test suite. Commit r264248 depends on the first one. llvm-svn: 265249
*	AArch64: support .cpu directive	Saleem Abdulrasool	2016-04-02	1	-0/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for the AArch64 .cpu directive. This is a slightly involved directive since the parameter is actually a variable encoded string. The general structure is: <cpu>[[+-]<feature>]* We now map some of the supported string names for features for internal representation of feature flags. If we encounter one which we do not support, bail out as we cannot validate the assembly any longer. Resolves PR27010. llvm-svn: 265240
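A minimal sketch of splitting a string of the shape `<cpu>[[+-]<feature>]*` is given here for orientation. It is not the AArch64 assembly parser from this commit: `CpuDirective` and `parseCpuDirective` are hypothetical names, and the sketch takes the grammar literally, so a CPU name that itself contains `+` or `-` would be split incorrectly. It also does no validation of feature names, whereas the commit bails out on unsupported ones.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical result type, not an LLVM data structure.
struct CpuDirective {
  std::string cpu;
  // (feature name, enabled?) pairs in the order they were written.
  std::vector<std::pair<std::string, bool>> features;
};

// Split "<cpu>[[+-]<feature>]*" into the CPU name and its feature toggles.
// Assumption: neither the CPU name nor a feature name contains '+' or '-'.
CpuDirective parseCpuDirective(const std::string &text) {
  CpuDirective result;
  std::size_t pos = text.find_first_of("+-");
  // With no modifier present, pos is npos and substr returns the whole text.
  result.cpu = text.substr(0, pos);
  while (pos != std::string::npos) {
    const bool enable = text[pos] == '+';
    const std::size_t next = text.find_first_of("+-", pos + 1);
    const std::size_t len =
        (next == std::string::npos) ? std::string::npos : next - pos - 1;
    result.features.emplace_back(text.substr(pos + 1, len), enable);
    pos = next;
  }
  return result;
}
```

For an input such as `generic+crc-crypto`, the sketch reports the CPU `generic`, with `crc` enabled and `crypto` disabled.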
*	Linker: Split mapUnneededSubprograms into two; almost NFC	Duncan P. N. Exon Smith	2016-04-02	1	-11/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Split the loop through compile units in mapUnneededSubprograms in two. First, visit imported entities to ensure that we've visited all needed subprograms. Second, visit subprograms, and drop the ones we don't need. Hypothetically this protects against a subprogram from one compile unit being referenced from an imported entity in a different compile unit. I don't think that's valid IR (a debug info expert could confirm), but I think the refactor makes the code more clear. llvm-svn: 265233
*	Remove redundant assertion after cast, NFC	Duncan P. N. Exon Smith	2016-04-02	1	-1/+0
\| \| \| \|	llvm-svn: 265232
*	Linker: Avoid unnecessary work when moving named metadata	Duncan P. N. Exon Smith	2016-04-02	1	-17/+11
\| \| \| \| \| \| \| \| \| \| \|	IRLinker::mapUnneededSubprograms has to be sure that any "needed" subprograms get linked in. Rather than traversing through imported entities using llvm::getSubprogram, call MapMetadata. The latter memoizes the result in the ValueMap (sharing work with IRLinker::linkNamedMDNodes proper), and makes the local SmallPtrSet redundant. llvm-svn: 265231
*	Rename FunctionIndex into GlobalValueIndex to reflect the recent changes (NFC)	Mehdi Amini	2016-04-02	1	-21/+23
\| \| \| \| \| \| \| \|	The index used to contain only Function, but now contains GlobalValue in general. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265230
*	Linker: Remove IRMover::isMetadataUnneeded indirection; almost NFC	Duncan P. N. Exon Smith	2016-04-02	2	-51/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of checking live during MapMetadata whether a subprogram is needed, seed the ValueMap with `nullptr` up-front. There is a small hypothetical functionality change. Previously, calling MapMetadataOp on a node whose "scope:" chain led to an unneeded subprogram would return nullptr. However, if that were ever called, then the subprogram would be needed; a situation that the IRMover is supposed to avoid a priori! Besides cleaning up the code a little, this restores a nice property: MapMetadataOp returns the same as MapMetadata. llvm-svn: 265229
*	ValueMapper: Add support for seeding metadata with nullptr	Duncan P. N. Exon Smith	2016-04-02	2	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	Support seeding a ValueMap with nullptr for Metadata entries, a situation I didn't consider in the Metadata/Value split. I added a ValueMapper::getMappedMD accessor that returns an Optional<Metadata*> with the mapped (possibly null) metadata. IRMover needs to use this to avoid modifying the map when it's checking for unneeded subprograms. I updated a call from bugpoint since I find the new code clearer. llvm-svn: 265228
*	Bitcode: Try to emit metadata in function blocks	Duncan P. N. Exon Smith	2016-04-02	3	-39/+231
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Whenever metadata is only referenced by a single function, emit the metadata just in that function block. This should improve lazy-loading by reducing the amount of metadata in the global block. For now, this should catch all DILocations, and anything else that happens to be referenced only by a single function. It's also a first step toward a couple of possible future directions (which this commit does not implement): 1. Some debug info metadata is only referenced from compile units and individual functions. If we can drop the link from the compile unit, this optimization will get more powerful. 2. Any uniqued metadata that isn't referenced globally can in theory be emitted in every function block that references it (trading off bitcode size and full-parse time vs. lazy-load time). Note: this assumes the new BitcodeReader error checking from r265223. The metadata stored in function blocks gets purged after parsing each function, which means unresolved forward references will get lost. Since all the global metadata should have already been resolved by the time we get to the function metadata blocks we just need to check for that case. (If for some reason we need to handle bitcode that fails the checks in r265223, the fix is to store about-to-be-dropped unresolved nodes in MetadataList::shrinkTo until they can be handled successfully by a future call to MetadataList::tryToResolveCycles.) llvm-svn: 265226
*	Fix doxygen comments from r265224, NFC	Duncan P. N. Exon Smith	2016-04-02	1	-2/+2
\| \| \| \|	llvm-svn: 265225
*	BitcodeWriter: Further unify function metadata, NFC	Duncan P. N. Exon Smith	2016-04-02	3	-12/+17
\| \| \| \| \| \| \| \| \| \| \| \| \|	Further unify the handling of function-local metadata with global metadata, by exposing the same interface in ValueEnumerator. Both contexts use the same accessors: - getMDStrings(): get the strings for this block. - getNonMDStrings(): get the non-strings for this block. A future commit will start adding strings to the function-block. llvm-svn: 265224
*	BitcodeReader: Check for unresolved function metadata	Duncan P. N. Exon Smith	2016-04-02	1	-2/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	A follow-up commit will start using function metadata blocks more heavily. This commit adds some error checking to confirm that metadata is fully resolved before (and after) materializing each function. This is valid even when reading very old bitcode from before the metadata/value split. The global metadata block always came before the function blocks. However, in case somehow this causes a regression (i.e., an old LLVM did produce such bitcode after all) I'm committing separately. llvm-svn: 265223
*	Reverts r265219.	Mehdi Amini	2016-04-02	1	-15/+15
\| \| \| \| \| \| \|	Unintentionally committed... time to call the day off! From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265221
*	Fix "warning: variable 'XX' set but not used" in release build (variable ↵	Mehdi Amini	2016-04-02	2	-2/+2
\| \| \| \| \| \| \|	used in assertion, NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265220
*	wip	Mehdi Amini	2016-04-02	1	-15/+15
\| \| \| \| \|	From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265219
*	constify GlobalValue::getGUID() and GlobalValue::getGlobalIdentifier() (NFC)	Mehdi Amini	2016-04-02	1	-1/+1
\| \| \| \| \|	From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265217
*	Revert "ThinLTO: add module caching handling."	Mehdi Amini	2016-04-02	1	-95/+1
\| \| \| \| \| \| \|	This reverts commit r265214, unintentionally committed. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265216
*	Create a typedef GlobalValue::GUID for uint64_t and RAUW (NFC)	Mehdi Amini	2016-04-02	4	-28/+31
\| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This should make the code more readable, especially all the map declarations. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18721 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265215
*	ThinLTO: add module caching handling.	Mehdi Amini	2016-04-02	1	-1/+95
\| \| \| \| \| \| \| \| \| \| \|	Reviewers: tejohnson Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D18494 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265214
*	80 lines column after renaming "shouldDiscardValueNames" (NFC)	Mehdi Amini	2016-04-02	1	-1/+3
\| \| \| \| \|	From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265212
*	Rename Context::discardValueNames() to shouldDiscardValueNames() (NFC)	Mehdi Amini	2016-04-02	3	-3/+3
\| \| \| \| \| \| \|	Suggested by Sean Silva. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265211
*	Add Cache Pruning support	Mehdi Amini	2016-04-02	2	-0/+131
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Incremental LTO will use a cache to store object files. This patch handles the pruning part of the cache, exposing a few knobs: - Pruning interval: the implementation keeps a "timestamp" file in the directory and will scan it only after a given interval since the last modification of the timestamp file. This is for performance: we don't want to scan the folder continuously. - Entry expiration: this is the time after which a file that hasn't been used is removed from the cache. - Maximum size: expressed as a percentage of the available disk space, it helps avoid blowing up the disk space. http://reviews.llvm.org/D18422 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265209
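The three knobs described above can be read as three independent checks. The sketch below is not the code added by this commit: `PruningPolicy`, `shouldScan`, `isExpired` and `exceedsSizeLimit` are hypothetical names, the filesystem access is left out, and comparing the cache size directly against the available bytes is an assumption about how the percentage is applied.

```cpp
#include <chrono>
#include <cstdint>

using Clock = std::chrono::system_clock;

// Hypothetical policy struct mirroring the three knobs in the commit message.
struct PruningPolicy {
  std::chrono::seconds interval;        // minimum time between directory scans
  std::chrono::seconds expiration;      // unused entries older than this go away
  unsigned maxPercentOfAvailableSpace;  // cache size cap, in percent
};

// Scan only if the timestamp file was last modified at least `interval` ago.
bool shouldScan(const PruningPolicy &policy, Clock::time_point timestampMTime,
                Clock::time_point now) {
  return now - timestampMTime >= policy.interval;
}

// An entry expires once it has gone unused for longer than `expiration`.
bool isExpired(const PruningPolicy &policy, Clock::time_point lastUse,
               Clock::time_point now) {
  return now - lastUse > policy.expiration;
}

// Assumption: the cap is a plain percentage of the currently available bytes.
// Dividing first avoids overflowing the multiplication for large disks.
bool exceedsSizeLimit(const PruningPolicy &policy, std::uint64_t cacheBytes,
                      std::uint64_t availableBytes) {
  return cacheBytes > availableBytes / 100 * policy.maxPercentOfAvailableSpace;
}
```

Keeping the checks as pure functions of timestamps and sizes makes the policy easy to test without touching a real cache directory; a caller would supply the modification and access times it reads from disk.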