bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Remove unused function	David Blaikie	2014-11-01	1	-3/+0
\| \| \| \|	llvm-svn: 221037
*	And... fix the build some more.	David Blaikie	2014-11-01	1	-1/+1
\| \| \| \|	llvm-svn: 221036
*	Just iterate the DwarfCompileUnits rather than trying to filter them out of ↵	David Blaikie	2014-11-01	1	-49/+46
\| \| \| \| \| \|	the list of all units. llvm-svn: 221034
*	Add '*' to auto variable that is a pointer, as per the coding conventions.	David Blaikie	2014-11-01	1	-1/+1
\| \| \| \|	llvm-svn: 221033
*	Add show and merge tools for sample PGO profiles.	Diego Novillo	2014-11-01	3	-57/+57
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch extends the 'show' and 'merge' commands in llvm-profdata to handle sample PGO formats. Using the 'merge' command it is now possible to convert one sample PGO format to another. The only format that is currently not working is 'gcc'. I still need to implement support for it in lib/ProfileData. The changes in the sample profile support classes are needed for the merge operation. Reviewers: bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6065 llvm-svn: 221032
*	Add DwarfCompileUnit::getSkeleton that returns DwarfCompileUnit* to avoid ↵	David Blaikie	2014-11-01	2	-3/+6
\| \| \| \| \| \|	having to cast from DwarfUnit* on every call. llvm-svn: 221031
*	Temporarily revert r220777 to sort out build bot breakage.	Adrian Prantl	2014-11-01	1	-10/+10
\| \| \| \| \| \|	"[x86] Simplify vector selection if condition value type matches vselect value type and true value is all ones or false value is all zeros." llvm-svn: 221028
*	IR: MDNode => Value: Instruction::getAllMetadata()	Duncan P. N. Exon Smith	2014-11-01	4	-14/+13
\| \| \| \| \| \| \|	Change `Instruction::getAllMetadata()` to modify a vector of `Value` instead of `MDNode` and update call sites. This is part of PR21433. llvm-svn: 221027
*	IR: MDNode => Value: Instruction::getMetadata()	Duncan P. N. Exon Smith	2014-11-01	21	-81/+78
\| \| \| \| \| \| \| \| \| \|	Change `Instruction::getMetadata()` to return `Value` as part of PR21433. Update most callers to use `Instruction::getMDNode()`, which wraps the result in a `cast_or_null<MDNode>`. llvm-svn: 221024
*	IR: MDNode => Value: Add Instruction::getMDNode()	Duncan P. N. Exon Smith	2014-10-31	1	-0/+8
\| \| \| \| \| \| \| \| \| \|	Add `Instruction::getMDNode()` that casts to `MDNode` before changing `Instruction::getMetadata()` to return `Value`. This avoids adding `cast_or_null<MDNode>` boiler-plate throughout the code. Part of PR21433. llvm-svn: 221023
*	Revert "R600: Add missing file to CMakeLists.txt"	Reid Kleckner	2014-10-31	1	-1/+0
\| \| \| \| \| \| \| \|	This reverts commit r220998. It should've been reverted with the other change. llvm-svn: 221021
*	Revert "R600: Make sure to inline all internal functions"	Reid Kleckner	2014-10-31	3	-81/+0
\| \| \| \| \| \| \| \| \|	This reverts commit r220996. It introduced layering violations causing link errors in many configurations. llvm-svn: 221020
*	Work around bugs in MSVC "14" CTP 3's conversion logic	Reid Kleckner	2014-10-31	5	-8/+13
\| \| \| \| \| \| \| \| \| \|	It appears to ignore or find ambiguous MachineInstrBuilder's conversion operators that allow conversion to MachineInstr* and MachineBasicBlock::bundle_iterator. As a workaround, add an explicit way to get the MachineInstr. llvm-svn: 221017
*	Refactor duplicated code in liking GlobalValues.	Rafael Espindola	2014-10-31	1	-245/+128
\| \| \| \| \| \| \| \| \|	There is quiet a bit of logic that is common to any GlobalValue but was duplicated for Functions, GlobalVariables and GlobalAliases. While at it, merge visibility even when comdats are used, fixing pr21415. llvm-svn: 221014
*	Sink some of DwarfDebug::collectDeadVariables down into DwarfCompileUnit.	David Blaikie	2014-10-31	4	-20/+25
\| \| \| \|	llvm-svn: 221010
*	Correctly update dom-tree after loop vectorizer.	Michael Zolotukhin	2014-10-31	1	-1/+1
\| \| \| \|	llvm-svn: 221009
*	Sink most of DwarfDebug::constructAbstractSubprogramScopeDIE into ↵	David Blaikie	2014-10-31	3	-14/+13
\| \| \| \| \| \|	DwarfCompileUnit llvm-svn: 221005
*	R600: Add IPO to the list of required libraries	Tom Stellard	2014-10-31	1	-1/+1
\| \| \| \|	llvm-svn: 221004
*	[Object] Modify OwningBinary's interface to separate inspection from ownership.	Lang Hames	2014-10-31	2	-4/+7
\| \| \| \| \| \| \| \|	The getBinary and getBuffer method now return ordinary pointers of appropriate const-ness. Ownership is transferred by calling takeBinary(), which returns a pair of the Binary and a MemoryBuffer. llvm-svn: 221003
*	R600: Add missing file to CMakeLists.txt	Tom Stellard	2014-10-31	1	-0/+1
\| \| \| \|	llvm-svn: 220998
*	R600: Don't promote allocas when one of the users is a ptrtoint instruction	Tom Stellard	2014-10-31	1	-6/+19
\| \| \| \| \| \| \| \|	We need to figure out how to track ptrtoint values all the way until result is converted back to a pointer in order to correctly rewrite the pointer type. llvm-svn: 220997
*	R600: Make sure to inline all internal functions	Tom Stellard	2014-10-31	3	-0/+81
\| \| \| \| \| \|	Function calls aren't supported yet. llvm-svn: 220996
*	IR: Instruction::setMetadata() should use cast_or_null	Duncan P. N. Exon Smith	2014-10-31	1	-1/+1
\| \| \| \| \| \| \| \| \|	Not sure why this assertion didn't fire locally [1], but in r220994 `Instruction::setMetadata()` should be using `cast_or_null`. [1]: http://lab.llvm.org:8011/builders/llvm-hexagon-elf/builds/12327 llvm-svn: 220995
*	IR: MDNode => Value: Instruction::setMetadata()	Duncan P. N. Exon Smith	2014-10-31	1	-6/+9
\| \| \| \| \| \| \|	Change `Instruction::setMetadata()` API to accept `Value` instead of `MDNode`. Part of PR21433. llvm-svn: 220994
*	[PowerPC] Initial VSX intrinsic support, with min/max for vector double	Bill Schmidt	2014-10-31	1	-6/+18
\| \| \| \| \| \| \| \| \| \| \| \| \|	Now that we have initial support for VSX, we can begin adding intrinsics for programmer access to VSX instructions. This patch adds basic support for VSX intrinsics in general, and tests it by implementing intrinsics for minimum and maximum for the vector double data type. The LLVM portion of this is quite straightforward. There is a companion patch for Clang. llvm-svn: 220988
*	[AArch64] Check Dest Register Liveness in CondOpt pass.	Chad Rosier	2014-10-31	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Our internal test reveals such case should not be transformed: cmp x17, #3 b.lt .LBB10_15 ... subs x12, x12, #1 b.gt .LBB10_1 where x12 is a liveout, becomes: cmp x17, #2 b.le .LBB10_15 ... subs x12, x12, #2 b.ge .LBB10_1 Unable to provide test case as it's difficult to reproduce on community branch. http://reviews.llvm.org/D6048 Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org>! llvm-svn: 220987
*	[asan] do not treat inline asm calls as indirect calls	Kostya Serebryany	2014-10-31	1	-1/+3
\| \| \| \|	llvm-svn: 220985
*	[CodeGenPrepare] Move extractelement close to store if they can be combined.	Quentin Colombet	2014-10-31	3	-1/+411
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds an optimization in CodeGenPrepare to move an extractelement right before a store when the target can combine them. The optimization may promote any scalar operations to vector operations in the way to make that possible. Context Some targets use different register files for both vector and scalar operations. This means that transitioning from one domain to another may incur copy from one register file to another. These copies are not coalescable and may be expensive. For example, according to the scheduling model, on cortex-A8 a vector to GPR move is 20 cycles. Motivating Example Let us consider an example: define void @foo(<2 x i32>* %addr1, i32* %dest) { %in1 = load <2 x i32>* %addr1, align 8 %extract = extractelement <2 x i32> %in1, i32 1 %out = or i32 %extract, 1 store i32 %out, i32* %dest, align 4 ret void } As it is, this IR generates the following assembly on armv7: vldr d16, [r0] @vector load vmov.32 r0, d16[1] @ cross-register-file copy: 20 cycles orr r0, r0, #1 @ scalar bitwise or str r0, [r1] @ scalar store bx lr Whereas we could generate much faster code: vldr d16, [r0] @ vector load vorr.i32 d16, #0x1 @ vector bitwise or vst1.32 {d16[1]}, [r1:32] @ vector extract + store bx lr Half of the computation made in the vector is useless, but this allows to get rid of the expensive cross-register-file copy. Proposed Solution To avoid this cross-register-copy penalty, we promote the scalar operations to vector operations. The penalty will be removed if we manage to promote the whole chain of computation in the vector domain. Currently, we do that only when the chain of computation ends by a store and the target is able to combine an extract with a store. Stores are the most likely candidates, because other instructions produce values that would need to be promoted and so, extracted as some point[1]. Moreover, this is customary that targets feature stores that perform a vector extract (see AArch64 and X86 for instance). The proposed implementation relies on the TargetTransformInfo to decide whether or not it is beneficial to promote a chain of computation in the vector domain. Unfortunately, this interface is rather inaccurate for this level of details and although this optimization may be beneficial for X86 and AArch64, the inaccuracy will lead to the optimization being too aggressive. Basically in TargetTransformInfo, everything that is legal has a cost of 1, whereas, even if a vector type is legal, usually a vector operation is slightly more expensive than its scalar counterpart. That will lead to too many promotions that may not be counter balanced by the saving of the cross-register-file copy. For instance, on AArch64 this penalty is just 4 cycles. For now, the optimization is just enabled for ARM prior than v8, since those processors have a larger penalty on cross-register-file copies, and the scope is limited to basic blocks. Because of these two factors, we limit the effects of the inaccuracy. Indeed, I did not want to build up a fancy cost model with block frequency and everything on top of that. [1] We can imagine targets that can combine an extractelement with other instructions than just stores. If we want to go into that direction, the current interfaces must be augmented and, moreover, I think this becomes a global isel problem. Differential Revision: http://reviews.llvm.org/D5921 <rdar://problem/14170854> llvm-svn: 220978
*	[asan] fix caller-calee instrumentation to emit new cache for every call site	Kostya Serebryany	2014-10-31	1	-4/+4
\| \| \| \|	llvm-svn: 220973
*	Update the non-pthreads fallback for RWMutex on Unix	David Blaikie	2014-10-31	1	-6/+6
\| \| \| \| \| \| \| \| \| \|	Tested this by #if 0'ing out the pthreads implementation, which indicated that this fallback was not currently compiling successfully and applying this patch resolves that. Patch by Andy Chien. llvm-svn: 220969
*	Correct assert text from r220923	David Blaikie	2014-10-31	1	-1/+1
\| \| \| \| \| \|	Noticed in post-commit review by Adrian Prantl. llvm-svn: 220967
*	Mark a few variables const. NFC.	Rafael Espindola	2014-10-31	1	-9/+11
\| \| \| \|	llvm-svn: 220964
*	[AArch64] CondOpt pass is missing FCMP instructions when searching backward for	Chad Rosier	2014-10-31	1	-0/+11
\| \| \| \| \| \| \| \| \|	a CMP which defines the flags used by B.CC. http://reviews.llvm.org/D6047 Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org>! llvm-svn: 220961
*	[SCEV] Improve Scalar Evolution's use of no {un,}signed wrap flags	Bradley Smith	2014-10-31	1	-6/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In a case where we have a no {un,}signed wrap flag on the increment, if RHS - Start is constant then we can avoid inserting a max operation bewteen the two, since we can statically determine which is greater. This allows us to unroll loops such as: void testcase3(int v) { for (int i=v; i<=v+1; ++i) f(i); } llvm-svn: 220960
*	[PowerPC] Load BlockAddress values from the TOC in 64-bit SVR4 code	Ulrich Weigand	2014-10-31	4	-10/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since block address values can be larger than 2GB in 64-bit code, they cannot be loaded simply using an @l / @ha pair, but instead must be loaded from the TOC, just like GlobalAddress, ConstantPool, and JumpTable values are. The commit also fixes a bug in PPCLinuxAsmPrinter::doFinalization where temporary labels could not be used as TOC values, since code would attempt (and fail) to use GetOrCreateSymbol to create a symbol of the same name as the temporary label. llvm-svn: 220959
*	Object, COFF: Cleanup symbol type code, improve binutils compatibility	David Majnemer	2014-10-31	1	-44/+75
\| \| \| \| \| \| \|	Do a better job classifying symbols. This increases the consistency between the COFF handling code and the ELF side of things. llvm-svn: 220952
*	Move definition closer to use. NFC.	Rafael Espindola	2014-10-31	1	-3/+3
\| \| \| \|	llvm-svn: 220949
*	PR20557: Fix the bug that bogus cpu parameter crashes llc on AArch64 backend.	Hao Liu	2014-10-31	1	-1/+5
\| \| \| \| \| \|	Initial patch by Oleg Ranevskyy. llvm-svn: 220945
*	[SelectionDAG] When scalarizing trunc, don't assert for legal operands.	Ahmed Bougacha	2014-10-30	1	-1/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	r212242 introduced a legalizer hook, originally to let AArch64 widen v1i{32,16,8} rather than scalarize, because the legalizer expected, when scalarizing the result of a conversion operation, to already have scalarized the operands. On AArch64, v1i64 is legal, so that commit ensured operations such as v1i32 = trunc v1i64 wouldn't assert. It did that by choosing to widen v1 types whenever possible. However, v1i1 types, for which there's no legal widened type, would still trigger the assert. This commit fixes that, by only scalarizing a trunc's result when the operand has already been scalarized, and introducing an extract_elt otherwise. This is similar to r205625. Fixes PR20777. llvm-svn: 220937
*	Speculative fix for Windows build after r220932	Hans Wennborg	2014-10-30	1	-0/+5
\| \| \| \|	llvm-svn: 220936
*	Fix incorrect invariant check in DAG Combine	Louis Gerbarg	2014-10-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Earlier this summer I fixed an issue where we were incorrectly combining multiple loads that had different constraints such alignment, invariance, temporality, etc. Apparently in one case I made copt paste error and swapped alignment and invariance. Tests included. rdar://18816719 llvm-svn: 220933
*	Removing the static initializer in ManagedStatic.cpp by using llvm_call_once ↵	Chris Bieneman	2014-10-30	5	-4/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to initialize the ManagedStatic mutex. Summary: This patch adds an llvm_call_once which is a wrapper around std::call_once on platforms where it is available and devoid of bugs. The patch also migrates the ManagedStatic mutex to be allocated using llvm_call_once. These changes are philosophically equivalent to the changes added in r219638, which were reverted due to a hang on Win32 which was the result of a bug in the Windows implementation of std::call_once. Reviewers: aaron.ballman, chapuni, chandlerc, rnk Reviewed By: rnk Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D5922 llvm-svn: 220932
*	Fix the merging of the constantness of declarations.	Rafael Espindola	2014-10-30	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The langref says: LLVM explicitly allows declarations of global variables to be marked constant, even if the final definition of the global is not. This capability can be used to enable slightly better optimization of the program, but requires the language definition to guarantee that optimizations based on the ‘constantness’ are valid for the translation units that do not include the definition. Given that definition, when merging two declarations, we have to drop constantness if of of them is not marked contant, since the Module without the constant marker might not have the necessary guarantees. llvm-svn: 220927
*	Add handling for range metadata in ValueTracking isKnownNonZero	Philip Reames	2014-10-30	1	-0/+29
\| \| \| \| \| \| \| \| \| \| \|	If we load from a location with range metadata, we can use information about the ranges of the loaded value for optimization purposes. This helps to remove redundant checks and canonicalize checks for other optimization passes. This particular patch checks whether a value is known to be non-zero from the range metadata. Currently, these tests are against InstCombine. In theory, all of these should be InstSimplify since we're not inserting any new instructions. Moving the code may follow in a separate change. Reviewed by: Hal Differential Revision: http://reviews.llvm.org/D5947 llvm-svn: 220925
*	PR21408: Workaround the appearance of duplicate variables due to problems ↵	David Blaikie	2014-10-30	1	-1/+6
\| \| \| \| \| \|	when inlining two calls to the same function from the same call site. llvm-svn: 220923
*	Fix Twine corruption problem with diagnostics.	Diego Novillo	2014-10-30	1	-2/+1
\| \| \| \| \| \| \|	This fixes the autobuilders I broke with a recent patch. Thanks echristo and dblaikie for beating me with a clue stick. llvm-svn: 220918
*	Add profile writing capabilities for sampling profiles.	Diego Novillo	2014-10-30	5	-40/+381
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch finishes up support for handling sampling profiles in both text and binary formats. The new binary format uses uleb128 encoding to represent numeric values. This makes profiles files about 25% smaller. The profile writer class can write profiles in the existing text and the new binary format. In subsequent patches, I will add the capability to read (and perhaps write) profiles in the gcov format used by GCC. Additionally, I will be adding support in llvm-profdata to manipulate sampling profiles. There was a bit of refactoring needed to separate some code that was in the reader files, but is actually common to both the reader and writer. The new test checks that reading the same profile encoded as text or raw, produces the same results. Reviewers: bogner, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6000 llvm-svn: 220915
*	[AVX512] Added VBROADCAST{SS/SD} encoding for VL subset.	Robert Khasanov	2014-10-30	1	-26/+51
\| \| \| \| \| \| \|	Refactored through AVX512_maskable llvm-svn: 220908
*	[dfsan] New calling convention for custom functions with variadic arguments.	Peter Collingbourne	2014-10-30	1	-9/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The previous calling convention prevented custom functions from being able to access argument labels unless it knew how many variadic arguments there were, and of which type. This restriction made it impossible to correctly model functions in the printf family, as it is legal to pass more arguments than required to those functions. We now pass arguments in the following order: non-vararg arguments labels for non-vararg arguments [if vararg function, pointer to array of labels for vararg arguments] [if non-void function, pointer to label for return value] vararg arguments Differential Revision: http://reviews.llvm.org/D6028 llvm-svn: 220906
*	Untabify.	NAKAMURA Takumi	2014-10-29	1	-2/+2
\| \| \| \|	llvm-svn: 220884