bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Reapply r237453 with a fix for the test timeouts.	James Molloy	2015-05-16	1	-0/+84
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The test timeouts were due to instcombine fighting itself. Regression test added. Original log message: Canonicalize min/max expressions correctly. This patch introduces a canonical form for min/max idioms where one operand is extended or truncated. This often happens when the other operand is a constant. For example: %1 = icmp slt i32 %a, i32 0 %2 = sext i32 %a to i64 %3 = select i1 %1, i64 %2, i64 0 Would now be canonicalized into: %1 = icmp slt i32 %a, i32 0 %2 = select i1 %1, i32 %a, i32 0 %3 = sext i32 %2 to i64 This builds upon a patch posted by David Majenemer (https://www.marc.info/?l=llvm-commits&m=143008038714141&w=2). That pass passively stopped instcombine from ruining canonical patterns. This patch additionally actively makes instcombine canonicalize too. Canonicalization of expressions involving a change in type from int->fp or fp->int are not yet implemented. llvm-svn: 237520
*	[MemCpyOpt] Turn memcpy from just-memset'd source into memset.	Ahmed Bougacha	2015-05-16	3	-2/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There's no point in copying around constants, so, when all else fails, we can still transform memcpy of memset into two independent memsets. To quote the example, we can turn: memset(dst1, c, dst1_size); memcpy(dst2, dst1, dst2_size); into: memset(dst1, c, dst1_size); memset(dst2, c, dst2_size); When dst2_size <= dst1_size. Like r235232 for copy constructors, this can occur in move constructors. Differential Revision: http://reviews.llvm.org/D9682 llvm-svn: 237506
*	Remove dead code in testcase. NFC.	Ahmed Bougacha	2015-05-16	1	-4/+0
\| \| \| \|	llvm-svn: 237501
*	[PPC64] Add vector pack/unpack support from ISA 2.07	Bill Schmidt	2015-05-16	4	-0/+131
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds support for the following new instructions in the Power ISA 2.07: vpksdss vpksdus vpkudus vpkudum vupkhsw vupklsw These instructions are available through the vec_packs, vec_packsu, vec_unpackh, and vec_unpackl built-in interfaces. These are lane-sensitive instructions, so the built-ins have different implementations for big- and little-endian, and the instructions must be marked as killing the vector swap optimization for now. The first three instructions perform saturating pack operations. The fourth performs a modulo pack operation, which means it can be represented with a vector shuffle, and conversely the appropriate vector shuffles may cause this instruction to be generated. The other instructions are only generated via built-in support for now. Appropriate tests have been added. There is a companion patch to clang for the rest of this support. llvm-svn: 237499
*	[BitcodeReader] Don't allow INSERTVAL/EXTRACTVAL with 0 indices	Filipe Cabecinhas	2015-05-16	3	-0/+10
\| \| \| \| \| \| \| \|	This would trigger an assertion later. Bug found with AFL fuzz. llvm-svn: 237494
*	Add a speculative execution pass	Jingyue Wu	2015-05-15	1	-0/+195
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is a pass for speculative execution of instructions for simple if-then (triangle) control flow. It's aimed at GPUs, but could perhaps be used in other contexts. Enabling this pass gives us a 1.0% geomean improvement on Google benchmark suites, with one benchmark improving 33%. Credit goes to Jingyue Wu for writing an earlier version of this pass. Patched by Bjarke Roune. Test Plan: This patch adds a set of tests in test/Transforms/SpeculativeExecution/spec.ll The pass is controlled by a flag which defaults to having the pass not run. Reviewers: eliben, dberlin, meheff, jingyue, hfinkel Reviewed By: jingyue, hfinkel Subscribers: majnemer, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D9360 llvm-svn: 237459
*	Revert "Canonicalize min/max expressions correctly."	James Molloy	2015-05-15	1	-73/+0
\| \| \| \| \| \| \|	This reverts r237453 - it was causing timeouts on some bots. Reverting while I investigate (it's probably InstCombine fighting itself...) llvm-svn: 237458
*	[SLSR] handle (B \| i) * S	Jingyue Wu	2015-05-15	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Consider (B \| i) * S as (B + i) * S if B and i have no bits set in common. Test Plan: @or in slsr-mul.ll Reviewers: broune, meheff Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9788 llvm-svn: 237456
*	Mark SMIN/SMAX/UMIN/UMAX nodes as legal and add patterns for them.	James Molloy	2015-05-15	1	-0/+96
\| \| \| \| \| \| \|	The new [SU]{MIN,MAX} SDNodes can be lowered directly to instructions for most NEON datatypes - the big exclusion being v2i64. llvm-svn: 237455
*	Canonicalize min/max expressions correctly.	James Molloy	2015-05-15	1	-0/+73
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch introduces a canonical form for min/max idioms where one operand is extended or truncated. This often happens when the other operand is a constant. For example: %1 = icmp slt i32 %a, i32 0 %2 = sext i32 %a to i64 %3 = select i1 %1, i64 %2, i64 0 Would now be canonicalized into: %1 = icmp slt i32 %a, i32 0 %2 = select i1 %1, i32 %a, i32 0 %3 = sext i32 %2 to i64 This builds upon a patch posted by David Majenemer (https://www.marc.info/?l=llvm-commits&m=143008038714141&w=2). That pass passively stopped instcombine from ruining canonical patterns. This patch additionally actively makes instcombine canonicalize too. Canonicalization of expressions involving a change in type from int->fp or fp->int are not yet implemented. llvm-svn: 237453
*	[llvm-readobj] Teach llvm-readobj to print PT_MIPS_ABIFLAGS program header	Simon Atanasyan	2015-05-15	2	-5/+17
\| \| \| \|	llvm-svn: 237451
*	NFC - Test case invokes llc on a file rather than redirected from a file.	Nemanja Ivanovic	2015-05-15	1	-1/+1
\| \| \| \| \| \| \| \|	This has caused some local failures. Updating the test case to be more like the majority of the similar test cases. Committing on behalf of Hubert Tong (hstong@ca.ibm.com). llvm-svn: 237449
*	[DependenceAnalysis] Fix for PR21585: collectUpperBound triggers asserts	James Molloy	2015-05-15	1	-0/+105
\| \| \| \| \| \| \| \| \| \|	collectUpperBound hits an assertion when the back edge count is wider then the desired type. If that happens, truncate the backedge count. Patch by Philip Pfaffe! llvm-svn: 237439
*	[mips] [IAS] Fix expansion of negative 32-bit immediates for LI/DLI.	Toma Tabacu	2015-05-15	2	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: To maintain compatibility with GAS, we need to stop treating negative 32-bit immediates as 64-bit values when expanding LI/DLI. This currently happens because of sign extension. To do this we need to choose the 32-bit value expansion for values which use their upper 33 bits only for sign extension (i.e. no 0's, only 1's). Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8662 llvm-svn: 237428
*	[PlaceSafepoints] Fix a bug that came in with rL236672.	Sanjoy Das	2015-05-15	1	-0/+42
\| \| \| \| \| \| \| \|	Transfer the calling convention from the invoke being replaced by PlaceStatepoints to the new invoke to gc.statepoint created. Add a test case that would have caught this issue. llvm-svn: 237414
*	[PlaceSafepoints] Fix a bug that came in with rL236672.	Sanjoy Das	2015-05-15	1	-0/+42
\| \| \| \| \| \| \| \| \|	rL236672 would generate all invoke statepoints with deopt args set to a list containing the single element "0", instead of an empty list. Also add a test case that would have caught this. llvm-svn: 237413
*	Fix the check strings in a test case committed in r212455.	Akira Hatanaka	2015-05-15	1	-2/+2
\| \| \| \| \| \| \|	The access size (8, in this case) was missing in the function name that was being checked. llvm-svn: 237410
*	[ValueTracking] refactor: extract method haveNoCommonBitsSet	Jingyue Wu	2015-05-14	1	-0/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Extract method haveNoCommonBitsSet so that we don't have to duplicate this logic in InstCombine and SeparateConstOffsetFromGEP. This patch also makes SeparateConstOffsetFromGEP more precise by passing DominatorTree to computeKnownBits. Test Plan: value-tracking-domtree.ll that tests ValueTracking indeed leverages dominating conditions Reviewers: broune, meheff, majnemer Reviewed By: majnemer Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D9734 llvm-svn: 237407
*	Add another InstCombine pass after LoopUnroll.	Wei Mi	2015-05-14	1	-0/+85
\| \| \| \| \| \| \| \|	This is to cleanup some redundency generated by LoopUnroll pass. Such redundency may not be cleaned up by existing passes after LoopUnroll. Differential Revision: http://reviews.llvm.org/D9777 llvm-svn: 237395
*	[Hexagon] Generate hardware loop for a vectorized loop	Brendon Cahoon	2015-05-14	1	-0/+93
\| \| \| \| \| \| \| \| \|	The induction variable in the vectorized loop wasn't recognized properly, so a hardware loop wasn't generated. Differential Revision: http://reviews.llvm.org/D9722 llvm-svn: 237388
*	[ConstantFolding] Fix wrong folding of intrinsic 'convert.from.fp16'.	Andrea Di Biagio	2015-05-14	1	-0/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Function 'ConstantFoldScalarCall' (in ConstantFolding.cpp) works under the wrong assumption that a call to 'convert.from.fp16' returns a value of type 'float'. However, intrinsic 'convert.from.fp16' can be overloaded; for example, we can call 'convert.from.fp16.f64' to convert from half to double; etc. Before this patch, the following example would have triggered an assertion failure in opt (with -constprop): ``` define double @foo() { entry: %0 = call double @llvm.convert.from.fp16.f64(i16 0) ret double %0 } ``` This patch fixes the problem in ConstantFolding.cpp. When folding a call to convert.from.fp16, we perform a different kind of conversion based on the call return type. Added test 'Transform/ConstProp/convert-from-fp16.ll'. Differential Revision: http://reviews.llvm.org/D9771 llvm-svn: 237377
*	[Hexagon] Remove dead constant assignment in hardware loop pass	Brendon Cahoon	2015-05-14	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \|	After converting a loop to a hardware loop, the pass should remove any unnecessary instructions from the old compare-and-branch code. This patch removes a dead constant assignment that was used in the compare instruction. Differential Revision: http://reviews.llvm.org/D9720 llvm-svn: 237373
*	[mips] [IAS] Enforce .set nomacro.	Toma Tabacu	2015-05-14	2	-0/+79
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When used, ".set nomacro" causes warning messages to be reported when we expand pseudo-instructions to multiple machine instructions. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9564 llvm-svn: 237366
*	[Hexagon] Check for underflow/wrap in hardware loop pass	Brendon Cahoon	2015-05-14	4	-0/+198
\| \| \| \| \| \| \| \|	If the loop trip count may underflow or wrap, the compiler should not generate a hardware loop since the trip count will be incorrect. llvm-svn: 237365
*	[mips] [IAS] Emit .set macro/nomacro.	Toma Tabacu	2015-05-14	2	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9563 llvm-svn: 237363
*	[mips] Do not place users of $ra in the delay slot of call instructions.	Vasileios Kalintiris	2015-05-14	1	-0/+183
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: When we are trying to fill the delay slot of a call instruction, we must avoid filler instructions that use the $ra register. This fixes the test MultiSource/Applications/JM/lencod when we enable the forward delay slot filler. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9670 llvm-svn: 237362
*	Re-apply r237247 - [AArch64] Codegen VMAX/VMIN for safe math cases	Artyom Skrobov	2015-05-14	1	-3/+31
\| \| \| \| \| \|	No longer breaks SPEC2000/2006 llvm-svn: 237361
*	New Loop Distribution pass	Adam Nemet	2015-05-14	6	-0/+484
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This implements the initial version as was proposed earlier this year (http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-January/080462.html). Since then Loop Access Analysis was split out from the Loop Vectorizer and was made into a separate analysis pass. Loop Distribution becomes the second user of this analysis. The pass is off by default and can be enabled with -enable-loop-distribution. There is currently no notion of profitability; if there is a loop with dependence cycles, the pass will try to split them off from other memory operations into a separate loop. I decided to remove the control-dependence calculation from this first version. This and the issues with the PDT are actively discussed so it probably makes sense to treat it separately. Right now I just mark all terminator instruction required which keeps identical CFGs for each distributed loop. This seems to be working pretty well for 456.hmmer where even though there is an empty if-then block in the distributed loop initially, it gets completely removed. The pass keeps DominatorTree and LoopInfo updated. I've tested this with -loop-distribute-verify with the testsuite where we distribute ~90 loops. SimplifyLoop is violated in some cases and I have a FIXME covering this. Reviewers: hfinkel, nadav, aschwaighofer Reviewed By: aschwaighofer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8831 llvm-svn: 237358
*	[mips] [IAS] Warn when LA is used with a 64-bit symbol.	Toma Tabacu	2015-05-14	2	-17/+9
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9295 llvm-svn: 237356
*	AVX-512: Added i1 type handling for calling conventions.	Elena Demikhovsky	2015-05-14	1	-1/+68
\| \| \| \| \| \| \| \|	i1 type is a legal type on AVX-512 and can be passed as parameter or return value. i1 is promoted to i8 on return and to i32 for call arguments (i8 is also promoted to i32 here). The result code is similar to the previous X86 targets, where i1 is allways promoted to i8. llvm-svn: 237350
*	Don't omit the constant when computing a cross-section relative relocation.	Andy Ayers	2015-05-14	1	-0/+97
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9692 llvm-svn: 237327
*	[CodeGen] Use standard -not gnueabi- naming for f16 libcalls on Darwin.	Ahmed Bougacha	2015-05-14	1	-3/+3
\| \| \| \| \| \| \| \|	Other targets probably should as well. Since r237161, compiler-rt has both, but I don't see why anything other than gnueabi would use a gnueabi naming scheme. llvm-svn: 237324
*	YAML: Implement block scalar parsing.	Alex Lorenz	2015-05-13	11	-37/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit implements the parsing of YAML block scalars. Some code existed for it before, but it couldn't parse block scalars. This commit adds a new yaml node type to represent the block scalar values. This commit also deletes the 'spec-09-27' and 'spec-09-28' tests as they are identical to the test file 'spec-09-26'. This commit introduces 3 new utility functions to the YAML scanner class: `skip_s_space`, `advanceWhile` and `consumeLineBreakIfPresent`. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D9503 llvm-svn: 237314
*	[X86] Fix PR23271 - RIP-relative decoding bug in disassembler.	Douglas Katzman	2015-05-13	1	-0/+38
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9110 llvm-svn: 237310
*	InstrProf: Fix display of large numbers in llvm-cov	Justin Bogner	2015-05-13	4	-37/+37
\| \| \| \| \| \| \| \|	llvm-cov was truncating numbers that were larger than a particular fixed width, which is as confusing as it is useless. Instead, we use engineering notation with SI prefix for magnitude. llvm-svn: 237307
*	[PlaceSafepoints] New attributes for patchable statepoints.	Sanjoy Das	2015-05-13	1	-0/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch teaches the PlaceSafepoints pass about two `CallSite` function attributes: * "statepoint-id": if the string value of this attribute can be parsed as an integer, then it is propagated to the ID parameter of the statepoint created. * "statepoint-num-patch-bytes": if the string value of this attribute can be parsed as an integer, then it is propagated to the `num patch bytes` parameter of the statepoint created. This change intentionally does not assert on a malformed value for these attributes, given that they're not "official" attributes. Reviewers: reames, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9735 llvm-svn: 237286
*	[NaryReassociate] avoid running forever	Jingyue Wu	2015-05-13	1	-0/+11
\| \| \| \| \| \| \| \| \|	Avoid running forever by checking we are not reassociating an expression into the same form. Tested with @avoid_infinite_loops in nary-add.ll llvm-svn: 237269
*	[Hexagon] Generate loop1 instruction for nested loops	Brendon Cahoon	2015-05-13	1	-0/+68
\| \| \| \| \| \| \| \|	loop1 is for the outer loop and loop0 is for the inner loop. Differential Revision: http://reviews.llvm.org/D9680 llvm-svn: 237266
*	Add function entry counts from sample profiles.	Diego Novillo	2015-05-13	2	-0/+27
\| \| \| \| \| \| \| \| \| \|	This patch uses the new function profile metadata "function_entry_count" to annotate entry counts from sample profiles. In a sampling profile, the total samples collected at the function entry are an approximation for the number of times that function was invoked. llvm-svn: 237265
*	Add function entry count metadata.	Diego Novillo	2015-05-13	2	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This adds three Function methods to handle function entry counts: setEntryCount() and getEntryCount(). Entry counts are stored under the MD_prof metadata node with the name "function_entry_count". They are unsigned 64 bit values set by profilers (instrumentation and sample profiler changes coming up). Added documentation for new profile metadata and tests. Reviewers: dexonsmith, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9628 llvm-svn: 237260
*	[Hexagon] Generate hardware loop when loop has a critical edge	Brendon Cahoon	2015-05-13	1	-0/+58
\| \| \| \| \| \| \| \| \|	The hardware loop pass should try to generate a hardware loop instruction when the original loop has a critical edge. Differential Revision: http://reviews.llvm.org/D9678 llvm-svn: 237258
*	[mips][microMIPSr6] Implement CLO and CLZ instructions	Jozef Kolek	2015-05-13	2	-0/+6
\| \| \| \| \| \| \| \|	This patch implements CLO and CLZ instructions using mapping. Differential Revision: http://reviews.llvm.org/D8553 llvm-svn: 237257
*	Revert r237247 - [AArch64] Codegen VMAX/VMIN.. as it is causing failures in ↵	Silviu Baranga	2015-05-13	1	-22/+1
\| \| \| \| \| \|	SPEC2000/2006 llvm-svn: 237256
*	[mips] [IAS] Unify common functionality of LA and LI.	Toma Tabacu	2015-05-13	3	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: A side-effect of this is that LA gains proper handling of unsigned and positive signed 16-bit immediates and more accurate error messages. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9290 llvm-svn: 237255
*	[AArch64] Codegen VMAX/VMIN for safe math cases	Artyom Skrobov	2015-05-13	1	-1/+22
\| \| \| \|	llvm-svn: 237247
*	Reverting r237234, "Use std::bitset for SubtargetFeatures"	Michael Kuperstein	2015-05-13	1	-1/+1
\| \| \| \| \| \| \|	The buildbots are still not satisfied. MIPS and ARM are failing (even though at least MIPS was expected to pass). llvm-svn: 237245
*	[mips] [IAS] Merge the micromips-expressions.s test into expr1.s. NFC.	Toma Tabacu	2015-05-13	2	-49/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Also did some minor reformatting in the resulting test. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9702 llvm-svn: 237242
*	[DebugInfo] Debug locations for constant SD nodes	Sergey Dmitrouk	2015-05-13	6	-6/+150
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Several updates for [DebugInfo] Add debug locations to constant SD nodes (r235989). Includes: * re-enabling the change (disabled recently); * missing change for FP constants; * resetting debug location of constant node if it's used more than at one place to prevent emission of wrong locations in case of coalesced constants; * a couple of additional tests. Now all look ups in CSEMap are wrapped by additional method. Comment in D9084 suggests that debug locations aren't useful for "target constants", so there might be one more change related to this API (namely, dropping debug locations for getTarget*Constant methods). Differential Revision: http://reviews.llvm.org/D9604 llvm-svn: 237237
*	Use std::bitset for SubtargetFeatures	Michael Kuperstein	2015-05-13	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Previously, subtarget features were a bitfield with the underlying type being uint64_t. Since several targets (X86 and ARM, in particular) have hit or were very close to hitting this bound, switching the features to use a bitset. No functional change. The first two times this was committed (r229831, r233055), it caused several buildbot failures. At least some of the ARM and MIPS ones were due to gcc/binutils issues, and should now be fixed. llvm-svn: 237234
*	AVX-512: fixed a bug in encoding of VPSRAQ instrcution,	Elena Demikhovsky	2015-05-13	1	-1/+914
\| \| \| \| \| \|	added a bunch of encoding tests. llvm-svn: 237232