bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	[ValueTracking] Improve isImpliedCondition for conditions with matching operands.	Chad Rosier	2016-04-19	2	-0/+507
	This patch improves SimplifyCFG to catch cases like: if (a < b) { if (a > b) <- known to be false unreachable; } Phabricator Revision: http://reviews.llvm.org/D18905 llvm-svn: 266767
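The implication described in the commit above can be modelled in a few lines. This is a minimal sketch, not the ValueTracking code: the names `Pred`, `Implied`, `truthMask` and `impliedCondition` are hypothetical, and it assumes both comparisons are signed and use the same operands in the same order.

```cpp
#include <cassert>

// Hypothetical model, not LLVM's API: signed predicates over the same (a, b).
enum class Pred { LT, LE, GT, GE, EQ, NE };
enum class Implied { True, False, Unknown };

// Bit 0: predicate holds when a < b; bit 1: when a == b; bit 2: when a > b.
unsigned truthMask(Pred p) {
  switch (p) {
    case Pred::LT: return 1;
    case Pred::LE: return 3;
    case Pred::GT: return 4;
    case Pred::GE: return 6;
    case Pred::EQ: return 2;
    case Pred::NE: return 5;
  }
  return 0;
}

// Given that `outer` is known to hold, decide whether `inner` must be true,
// must be false, or is still undetermined.
Implied impliedCondition(Pred outer, Pred inner) {
  unsigned o = truthMask(outer);
  unsigned i = truthMask(inner);
  if ((o & ~i) == 0) return Implied::True;   // every ordering allowed by outer satisfies inner
  if ((o & i) == 0) return Implied::False;   // no ordering allowed by outer satisfies inner
  return Implied::Unknown;
}
```

With this model, an outer `a < b` makes an inner `a > b` known false, which is the case quoted in the commit message.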
*	[InstCombine][X86] Added extra tests introduced for D17490	Simon Pilgrim	2016-04-19	4	-0/+578
	llvm-svn: 266732
*	[InstCombine][X86] Regenerate SSE combine tests as part of setup for D17490	Simon Pilgrim	2016-04-19	6	-468/+581
	Regenerated with utils/update_test_checks.py llvm-svn: 266731
*	ARM: use a pseudo-instruction for cmpxchg at -O0.	Tim Northover	2016-04-18	3	-3/+3
	The fast register-allocator cannot cope with inter-block dependencies without spilling. This is fine for ldrex/strex loops coming from atomicrmw instructions where any value produced within a block is dead by the end, but not for cmpxchg. So we lower a cmpxchg at -O0 via a pseudo-inst that gets expanded after regalloc. Fortunately this is at -O0 so we don't have to care about performance. This simplifies the various axes of expansion considerably: we assume a strong seq_cst operation and ensure ordering via the always-present DMB instructions rather than v8 acquire/release instructions. Should fix the 32-bit part of PR25526. llvm-svn: 266679
*	[ValueTracking] Correct lit test comments. NFC.	Chad Rosier	2016-04-18	1	-2/+2
	llvm-svn: 266657
*	Revert "Replace the use of MaxFunctionCount module flag"	Eric Liu	2016-04-18	2	-38/+16
	This reverts commit r266477. That commit introduces a cyclic dependency: it makes "Analysis" depend on "ProfileData", while "ProfileData" depends on "Object", which depends on "BitCode", which depends on "Analysis". llvm-svn: 266619
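The dependency chain named in the revert above can be checked mechanically. The following is a minimal sketch using a depth-first search over a hand-written graph; the edges are copied from the commit message, while `Graph`, `visit`, `hasCycle` and `dependenciesFromCommit` are hypothetical names and not part of the LLVM build system.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

using Graph = std::map<std::string, std::vector<std::string>>;

enum class Mark { InProgress, Done };

// Depth-first search; reaching a node that is still in progress means we
// followed a back edge, i.e. the graph has a cycle.
bool visit(const Graph &g, const std::string &node,
           std::map<std::string, Mark> &marks) {
  auto seen = marks.find(node);
  if (seen != marks.end())
    return seen->second == Mark::InProgress;
  marks[node] = Mark::InProgress;
  auto deps = g.find(node);
  if (deps != g.end())
    for (const std::string &dep : deps->second)
      if (visit(g, dep, marks))
        return true;
  marks[node] = Mark::Done;
  return false;
}

bool hasCycle(const Graph &g) {
  std::map<std::string, Mark> marks;
  for (const auto &entry : g)
    if (visit(g, entry.first, marks))
      return true;
  return false;
}

// Library edges as listed in the commit message; the flag toggles the edge
// that the reverted commit added.
Graph dependenciesFromCommit(bool withAnalysisToProfileData) {
  Graph g;
  if (withAnalysisToProfileData)
    g["Analysis"].push_back("ProfileData");
  g["ProfileData"].push_back("Object");
  g["Object"].push_back("BitCode");
  g["BitCode"].push_back("Analysis");
  return g;
}
```

Removing the single edge from "Analysis" to "ProfileData" is enough to break the cycle in this model, which matches the effect of the revert.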
*	[ARM] AArch32 v8 NEON is still not IEEE-754 compliant	Renato Golin	2016-04-18	1	-14/+8
	llvm-svn: 266603
*	Fix a typo in rL265762	Sanjoy Das	2016-04-17	1	-0/+12
	I accidentally replaced `mayBeOverridden` with `!isInterposable`. Remove the negation and add a test case that would've caught this. Many thanks to Håkan Hjort for spotting this! llvm-svn: 266551
*	ThinLTO: Make aliases explicit in the summary	Mehdi Amini	2016-04-16	1	-1/+1
	To be able to work accurately on the reference graph when making decisions about internalizing, promoting, renaming, etc., we need to have the alias information explicit. Differential Revision: http://reviews.llvm.org/D18836 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266517
*	[cfi] Support explicit sections for functions in cfi-icall.	Evgeniy Stepanov	2016-04-15	1	-0/+26
	Allow explicit section for indirectly called functions in cfi-icall. Jumptables for functions in the same type class must be contiguous, so they always go to the default text section. Fixes PR25079. llvm-svn: 266486
*	Convert this sample-based-profiling testcase to use a NoDebug CU.	Adrian Prantl	2016-04-15	1	-4/+1
	llvm-svn: 266481
*	Replace the use of MaxFunctionCount module flag	Easwaran Raman	2016-04-15	2	-16/+38
	Adds an interface to get ProfileSummary for a module and makes InlineCost use ProfileSummary to get max function count. Differential Revision: http://reviews.llvm.org/D18622 llvm-svn: 266477
*	ARM: don't try to hoist constant RHS out of a division.	Tim Northover	2016-04-15	1	-0/+45
	Divisions by a constant can be converted into multiplies which are usually cheaper, but this isn't possible if the constant gets separated (particularly in loops). Fix this by telling ConstantHoisting that the immediate in a DIV is cheap. I considered making the check generic, but neither AArch64 (strangely) nor x86 showed any benefit on the tests I had. llvm-svn: 266464
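To illustrate why the divisor has to stay visible as a constant, here is a minimal sketch of the division-to-multiply rewrite for one specific case, unsigned 32-bit division by 10. The function name `divideBy10` is hypothetical; the multiplier 0xCCCCCCCD is ceil(2^35 / 10), and this shows the general technique rather than the code the ARM backend emits.

```cpp
#include <cassert>
#include <cstdint>

// Unsigned 32-bit division by the constant 10 as a multiply and a shift.
// The multiplier and shift amount can only be precomputed when the divisor
// is a known constant at the point of the division.
std::uint32_t divideBy10(std::uint32_t x) {
  const std::uint64_t multiplier = 0xCCCCCCCDull; // ceil(2^35 / 10)
  return static_cast<std::uint32_t>((x * multiplier) >> 35);
}
```

If the constant were hoisted into a register defined elsewhere, the division would see an opaque value and this rewrite would no longer apply.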
*	[InstCombine] Don't transform compares of calls to functions named fabs{f,l,}	David Majnemer	2016-04-15	1	-0/+12
	InstCombine wants to optimize compares of calls to fabs with zero. However, we didn't have the necessary legality checking to verify that the function call had the same behavior as fabs. llvm-svn: 266452
*	[PR27284] Reverse the ownership between DICompileUnit and DISubprogram.	Adrian Prantl	2016-04-15	90	-315/+245
	Currently each Function points to a DISubprogram and DISubprogram has a scope field. For member functions the scope is a DICompositeType. DIScopes point to the DICompileUnit to facilitate type uniquing. Distinct DISubprograms (with isDefinition: true) are not part of the type hierarchy and cannot be uniqued. This change removes the subprograms list from DICompileUnit and instead adds a pointer to the owning compile unit to distinct DISubprograms. This would make it easy for ThinLTO to strip unneeded DISubprograms and their transitively referenced debug info. Motivation ---------- Materializing DISubprograms is currently the most expensive operation when doing a ThinLTO build of clang. We want the DISubprogram to be stored in a separate Bitcode block (or the same block as the function body) so we can avoid having to expensively deserialize all DISubprograms together with the global metadata. If a function has been inlined into another subprogram we need to store a reference to the block containing the inlined subprogram. Attached to https://llvm.org/bugs/show_bug.cgi?id=27284 is a python script that updates LLVM IR testcases to the new format. http://reviews.llvm.org/D19034 <rdar://problem/25256815> llvm-svn: 266446
*	[SimplifyCFG] propagate branch metadata when creating select (PR27344)	Sanjay Patel	2016-04-15	1	-1/+1
	This is almost identical to: http://reviews.llvm.org/rL264527 This doesn't solve PR27344; it just allows the profile weights to survive. To solve the bug, we need to use the profile weights in the backend. llvm-svn: 266442
*	[SimplifyCFG] add metadata to show failure to propagate (PR27344)	Sanjay Patel	2016-04-15	1	-7/+10
	llvm-svn: 266435
*	Move divergent-target test into CodeGen/NVPTX because it requires an NVPTX target.	Justin Lebar	2016-04-15	1	-22/+0
	llvm-svn: 266403
*	[Speculation] Add a SpeculativeExecution mode where the pass does nothing unless TTI::hasBranchDivergence() is true.	Justin Lebar	2016-04-15	1	-0/+22
	Summary: This lets us add this pass to the IR pass manager unconditionally; it will simply not do anything on targets without branch divergence. Reviewers: tra Subscribers: llvm-commits, jingyue, rnk, chandlerc Differential Revision: http://reviews.llvm.org/D18625 llvm-svn: 266398
*	[test] Require 'asserts' for a test which uses -debug-only	Vedant Kumar	2016-04-14	1	-0/+1
	Without this line, bots which run check-all on Release compilers will break. llvm-svn: 266386
*	[AliasSetTracker] Correctly handle changing the size of an entry	Michael Kuperstein	2016-04-14	1	-0/+33
	If the size of an AST entry changes, we also need to make sure we perform necessary alias set merges, as the new size may overlap pointers in other sets. We happen to run into this with memset, because memset allows an entry for an i8* pointer to have a decidedly non-i8 size. This fixes PR27262. Differential Revision: http://reviews.llvm.org/D18939 llvm-svn: 266381
*	[ARM] Adding IEEE-754 SIMD detection to loop vectorizer	Renato Golin	2016-04-14	1	-0/+335
	Some SIMD implementations are not IEEE-754 compliant, for example ARM's NEON. This patch teaches the loop vectorizer to only allow transformations of loops that either contain no floating-point operations or have enough allowance flags supporting lack of precision (e.g. -ffast-math, Darwin). For that, the target description now has a method which tells us if the vectorizer is allowed to handle FP math without falling into unsafe representations, plus a check on every FP instruction in the candidate loop to check for the safety flags. This commit makes LLVM behave like GCC with respect to ARM NEON support, but it stops short of fixing the underlying problem: sub-normals. Neither GCC nor LLVM has a flag for allowing sub-normal operations. Before this patch, GCC only allows it using unsafe-math flags and LLVM allows it by default with no way to turn it off (short of not using NEON at all). As a first step, we push this change to make it safe and in sync with GCC. The second step is to discuss a new sub-normals flag in both communities and come up with a common solution. The third step is to improve the FastMath flags in LLVM to encode sub-normals and use those flags to restrict NEON FP. Fixes PR16275. llvm-svn: 266363
*	[InstCombine] remove constant by inverting compare + logic (PR27105)	Sanjay Patel	2016-04-14	1	-0/+23
	https://llvm.org/bugs/show_bug.cgi?id=27105 We can check if all bits outside of a constant mask are set with a single constant. As noted in the bug report, although this form should be considered the canonical IR, backends may want to transform this into an 'andn' / 'andc' comparison against zero because that could be a single machine instruction. Differential Revision: http://reviews.llvm.org/D18842 llvm-svn: 266362
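The bit-level identity behind the commit above can be written out directly. This is a minimal sketch of two equivalent ways to test that every bit outside a mask is set; the function names are hypothetical, and which of the two forms InstCombine treats as input and which as canonical output is an assumption not spelled out in the message.

```cpp
#include <cassert>
#include <cstdint>

// Form A: OR in the mask and compare against all-ones. With a constant
// mask this uses two distinct constants (the mask and -1).
bool outsideMaskAllSetViaOr(std::uint32_t x, std::uint32_t mask) {
  return (x | mask) == 0xFFFFFFFFu;
}

// Form B: AND with the inverted mask and compare against the same value.
// With a constant mask, ~mask is the only constant needed.
bool outsideMaskAllSetViaAnd(std::uint32_t x, std::uint32_t mask) {
  return (x & ~mask) == ~mask;
}
```

Both functions return true exactly when `x` has a one in every bit position where `mask` has a zero, so they agree for all inputs.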
*	Update discriminator assignment algorithm to handle nested call correctly.	Dehao Chen	2016-04-14	1	-0/+50
	Summary: Add discriminator for nested call correctly. Reviewers: davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19127 llvm-svn: 266354
*	Revert "Support arbitrary addrspace pointers in masked load/store intrinsics"	Adam Nemet	2016-04-14	3	-117/+42
	This reverts commit r266086. It breaks the LTO build of gcc in SPEC2000. llvm-svn: 266282
*	ARM: override cost function to re-enable ConstantHoisting (& fix it).	Tim Northover	2016-04-13	2	-0/+49
	At some point, ARM stopped getting any benefit from ConstantHoisting because the pass called a different variant of getIntImmCost. Reimplementing the correct variant revealed some problems, however: + ConstantHoisting was modifying switch statements. This is simply invalid, the cases must remain integer constants no matter the notional cost. + ConstantHoisting was mangling alloca instructions in the entry block. These should be handled by FrameLowering, so constants actually have a cost of 0. Worse, the resulting bitcasts meant they became dynamic allocas. rdar://25707382 llvm-svn: 266260
*	Test case for r265852.	Easwaran Raman	2016-04-13	1	-0/+19
	llvm-svn: 266237
*	[PGO] Remove redundant VP instrumentation	Betul Buyukkurt	2016-04-13	1	-0/+19
	LLVM optimization passes may reduce a profiled target expression to a constant. Removing runtime calls at such instrumentation points would help speed up the runtime of the instrumented program. llvm-svn: 266229
*	Revert "Make aliases explicit in the summary"	Mehdi Amini	2016-04-13	1	-1/+1
	Inadvertently committed... This reverts commit e618ec93786d99df2ddf280ad2d5e02f5516cecf. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266215
*	Make aliases explicit in the summary	Mehdi Amini	2016-04-13	1	-1/+1
	Summary: To be able to work accurately on the reference graph when making decisions about internalizing, promoting, renaming, etc., we need to have the alias information explicit. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18836 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266214
*	Simplify strlen to a subtraction for certain cases.	David L Kreitzer	2016-04-13	1	-7/+5
	Patch by Li Huang (li1.huang@intel.com) Differential Revision: http://reviews.llvm.org/D18230 llvm-svn: 266200
*	Calculate __builtin_object_size when pointer depends on a condition	Petar Jovanovic	2016-04-13	2	-0/+148
	This patch fixes the calculation of __builtin_object_size when the pointer depends on a condition. Before this patch, the compiler did not know how to calculate the object size when it found a condition that could not be eliminated. This patch enables the calculation even when the condition cannot be eliminated, by choosing the minimum or maximum value as the result from the condition. Whether the minimum or maximum is chosen depends on the second argument of the __builtin_object_size function. Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D18438 llvm-svn: 266193
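The min/max selection described above can be modelled without calling the builtin. This is a minimal sketch under one assumption taken from the GCC documentation of `__builtin_object_size`: bit 1 of the second argument (types 2 and 3) asks for the minimum remaining size, and a clear bit 1 (types 0 and 1) asks for the maximum. The function `objectSizeForCondition` is a hypothetical name, not LLVM code.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Model of the result when a pointer is one of two objects depending on a
// condition that cannot be folded away. sizeIfTrue / sizeIfFalse are the
// sizes known for each arm; `type` is the second builtin argument.
std::size_t objectSizeForCondition(std::size_t sizeIfTrue,
                                   std::size_t sizeIfFalse, int type) {
  const bool wantMinimum = (type & 2) != 0; // assumed meaning of bit 1
  return wantMinimum ? std::min(sizeIfTrue, sizeIfFalse)
                     : std::max(sizeIfTrue, sizeIfFalse);
}
```

For a pointer that refers to either an 8-byte or a 16-byte buffer, this model yields 16 for type 0 and 8 for type 2.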
*	[InstCombine] We folded an fcmp to an i1 instead of a vector of i1	David Majnemer	2016-04-13	2	-3/+14
	Remove an ad-hoc transform in InstCombine and replace it with more general machinery (ValueTracking, InstructionSimplify and VectorUtils). This fixes PR27332. llvm-svn: 266175
*	AMDGPU: Remove leftover ShaderType attributes in tests	Matt Arsenault	2016-04-13	1	-1/+1
	llvm-svn: 266155
*	[x86, InstCombine] fix masked load pass-through operand to be a zero vector	Sanjay Patel	2016-04-12	1	-11/+11
	This bug was introduced with: http://reviews.llvm.org/rL262269 AVX masked loads are specified to set vector lanes to zero when the high bit of the mask element for that lane is zero: "If the mask is 0, the corresponding data element is set to zero in the load form of these instructions, and unmodified in the store form." --Intel manual Differential Revision: http://reviews.llvm.org/D19017 llvm-svn: 266148
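The lane behaviour quoted from the Intel manual can be emulated in scalar code. This is a minimal sketch for a four-lane float load, assuming 32-bit mask elements whose high bit selects the lane; `maskedLoadZeroing` is a hypothetical name and the code does not use any AVX intrinsic.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Scalar emulation of a zeroing masked load: a lane is read from memory only
// when the high (sign) bit of its mask element is set; otherwise it is zero.
std::array<float, 4> maskedLoadZeroing(const float *src,
                                       const std::array<std::int32_t, 4> &mask) {
  std::array<float, 4> result{}; // value-initialised, so unselected lanes are 0.0f
  for (std::size_t lane = 0; lane < result.size(); ++lane)
    if (mask[lane] < 0) // negative means the high bit is set
      result[lane] = src[lane];
  return result;
}
```

This is why the pass-through operand of the equivalent generic masked load has to be a zero vector: any other pass-through value would change the contents of the unselected lanes.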
*	Add a pass to name anonymous/nameless function	Mehdi Amini	2016-04-12	1	-0/+27
	Summary: For correct handling of aliases to nameless functions, we need to be able to refer to them through a GUID in the summary. Here we name them using a hash of the non-private global names in the module. Reviewers: tejohnson Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18883 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266132
*	Move summary creation out of llvm-as into opt	Mehdi Amini	2016-04-12	4	-8/+8
	Summary: Let's keep llvm-as "dumb": it converts textual IR to bitcode. This commit removes the dependency from llvm-as to libLLVMAnalysis. We'll add back summary in llvm-as if we get to a textual representation for it at some point. In the meantime, opt seems like a better place for that. Reviewers: tejohnson Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19032 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266131
*	Add __atomic_* lowering to AtomicExpandPass.	James Y Knight	2016-04-12	2	-0/+259
	(Recommit of r266002, with r266011, r266016, and not accidentally including an extra unused/uninitialized element in LibcallRoutineNames) AtomicExpandPass can now lower atomic load, atomic store, atomicrmw, and cmpxchg instructions to __atomic_* library calls, when the target doesn't support atomics of a given size. This is the first step towards moving all atomic lowering from clang into llvm. When all is done, the behavior of __sync_* builtins, __atomic_* builtins, and C11 atomics will be unified. Previously LLVM would pass everything through to the ISelLowering code. There, unsupported atomic instructions would turn into __sync_* library calls. Because of that behavior, Clang currently avoids emitting llvm IR atomic instructions when this would happen, and emits __atomic_* library functions itself, in the frontend. This change makes LLVM able to emit __atomic_* libcalls, and thus will eventually allow clang to depend on LLVM to do the right thing. It is advantageous to do the new lowering to atomic libcalls in AtomicExpandPass, before ISel time, because it's important that all atomic operations for a given size either lower to __atomic_* libcalls (which may use locks), or native instructions which won't. No mixing and matching. At the moment, this code is enabled only for SPARC, as a demonstration. The next commit will expand support to all of the other targets. Differential Revision: http://reviews.llvm.org/D18200 llvm-svn: 266115
*	Support arbitrary addrspace pointers in masked load/store intrinsics	Artur Pilipenko	2016-04-12	3	-42/+117
	This is a resubmission of the 263158 change. This patch fixes the problem which occurs when loop-vectorize tries to use @llvm.masked.load/store intrinsic for a non-default addrspace pointer. It fails with "Calling a function with a bad signature!" assertion in CallInst constructor because it tries to pass a non-default addrspace pointer to the pointer argument which has default addrspace. The fix is to add the pointer type as another overloaded type to @llvm.masked.load/store intrinsics. Reviewed By: reames Differential Revision: http://reviews.llvm.org/D17270 llvm-svn: 266086
*	This reverts commit r266002, r266011 and r266016.	Rafael Espindola	2016-04-12	2	-259/+0
	They broke the msan bot. Original message: Add __atomic_* lowering to AtomicExpandPass. AtomicExpandPass can now lower atomic load, atomic store, atomicrmw, and cmpxchg instructions to __atomic_* library calls, when the target doesn't support atomics of a given size. This is the first step towards moving all atomic lowering from clang into llvm. When all is done, the behavior of __sync_* builtins, __atomic_* builtins, and C11 atomics will be unified. Previously LLVM would pass everything through to the ISelLowering code. There, unsupported atomic instructions would turn into __sync_* library calls. Because of that behavior, Clang currently avoids emitting llvm IR atomic instructions when this would happen, and emits __atomic_* library functions itself, in the frontend. This change makes LLVM able to emit __atomic_* libcalls, and thus will eventually allow clang to depend on LLVM to do the right thing. It is advantageous to do the new lowering to atomic libcalls in AtomicExpandPass, before ISel time, because it's important that all atomic operations for a given size either lower to __atomic_* libcalls (which may use locks), or native instructions which won't. No mixing and matching. At the moment, this code is enabled only for SPARC, as a demonstration. The next commit will expand support to all of the other targets. Differential Revision: http://reviews.llvm.org/D18200 llvm-svn: 266062
*	Add the allocsize attribute to LLVM.	George Burgess IV	2016-04-12	2	-0/+170
	`allocsize` is a function attribute that allows users to request that LLVM treat arbitrary functions as allocation functions. This patch makes LLVM accept the `allocsize` attribute, and makes `@llvm.objectsize` recognize said attribute. The review for this was split into two patches for ease of reviewing: D18974 and D14933. As promised on the revisions, I'm landing both patches as a single commit. Differential Revision: http://reviews.llvm.org/D14933 llvm-svn: 266032
*	MergeFunctions: test alloca better	JF Bastien	2016-04-12	1	-7/+35
	r237193 fixed handling of alloca size / align in MergeFunctions, but only tested one and didn't follow FunctionComparator::cmpOperations's usual comparison pattern. It also didn't update Instruction.cpp:haveSameSpecialState which I'll do separately. llvm-svn: 266022
*	ThinLTO renaming: use module hash instead of position in the summary	Mehdi Amini	2016-04-11	2	-9/+9
	This is more robust to changes in the link ordering. Differential Revision: http://reviews.llvm.org/D18946 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266018
*	[safestack] Add canary to unsafe stack frames	Evgeniy Stepanov	2016-04-11	3	-0/+71
	Add StackProtector to SafeStack. This adds limited protection against data corruption in the caller frame. Current implementation treats all stack protector levels as -fstack-protector-all. llvm-svn: 266004
*	Add __atomic_* lowering to AtomicExpandPass.	James Y Knight	2016-04-11	2	-0/+259
	AtomicExpandPass can now lower atomic load, atomic store, atomicrmw, and cmpxchg instructions to __atomic_* library calls, when the target doesn't support atomics of a given size. This is the first step towards moving all atomic lowering from clang into llvm. When all is done, the behavior of __sync_* builtins, __atomic_* builtins, and C11 atomics will be unified. Previously LLVM would pass everything through to the ISelLowering code. There, unsupported atomic instructions would turn into __sync_* library calls. Because of that behavior, Clang currently avoids emitting llvm IR atomic instructions when this would happen, and emits __atomic_* library functions itself, in the frontend. This change makes LLVM able to emit __atomic_* libcalls, and thus will eventually allow clang to depend on LLVM to do the right thing. It is advantageous to do the new lowering to atomic libcalls in AtomicExpandPass, before ISel time, because it's important that all atomic operations for a given size either lower to __atomic_* libcalls (which may use locks), or native instructions which won't. No mixing and matching. At the moment, this code is enabled only for SPARC, as a demonstration. The next commit will expand support to all of the other targets. Differential Revision: http://reviews.llvm.org/D18200 llvm-svn: 266002
*	[DebugInfo/Test] Add CU as required.	Davide Italiano	2016-04-11	1	-0/+2
	llvm-svn: 265999
*	[LoopUtils, LV] Fix PR27246 (first-order recurrences)	Matthew Simpson	2016-04-11	1	-0/+41
	This patch ensures that when we detect first-order recurrences, we reject a phi node if its previous value is also a phi node. During vectorization the initial and previous values of the recurrence are shuffled together to create the value for the current iteration. However, phi nodes are not widened like other instructions. This fixes PR27246. Differential Revision: http://reviews.llvm.org/D18971 llvm-svn: 265983
*	[DebugInfo] Fix even more tests to include DICompileunit.	Davide Italiano	2016-04-11	4	-0/+8
	llvm-svn: 265980
*	Fix missing DICompileUnits in testcases	Adrian Prantl	2016-04-11	2	-5/+7
	llvm-svn: 265974
*	[InstCombine] consolidate tests for related bugs	Sanjay Patel	2016-04-11	2	-26/+32
	llvm-svn: 265973