bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Propagate Undef in llvm.cos Intrinsic	Sanjoy Das	2016-04-08	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The llvm cos intrinsic currently does not propagate undef's. This change transforms cos(undef) to null value or 0. There are 2 test cases added as well. Patch by Anna Thomas! Reviewers: sanjoy Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D18863 llvm-svn: 265825
*	Revert r265817	Colin LeMahieu	2016-04-08	61	-105/+104
\| \| \| \| \| \|	lld tests need to be addressed. llvm-svn: 265822
*	[llvm-objdump] Printing hex instead of dec by default	Colin LeMahieu	2016-04-08	60	-103/+104
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D18770 llvm-svn: 265817
*	[SystemZ] Support conditional sibling calls via BRCL	Ulrich Weigand	2016-04-08	1	-0/+369
\| \| \| \| \| \| \| \| \| \| \| \|	This adds a conditional variant of CallJG instruction, CallBRCL. It can be used for conditional sibling calls. Unfortunately, due to IfCvt limitations, it only really works well for functions without arguments. Author: koriakin Differential Revision: http://reviews.llvm.org/D18864 llvm-svn: 265814
*	[AArch64] Add a test case for the default mapping of RegBankSelect.	Quentin Colombet	2016-04-08	1	-0/+49
\| \| \| \|	llvm-svn: 265811
*	[InstCombine] Fix miscompile in FoldSPFofSPF	David Majnemer	2016-04-08	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \|	We had a select of a cast of a select but attempted to replace the outer select with the inner select dispite their incompatible types. Patch by Anton Korobeynikov! This fixes PR27236. llvm-svn: 265805
*	[MIR] Teach the parser how to deal with register banks.	Quentin Colombet	2016-04-08	1	-1/+1
\| \| \| \|	llvm-svn: 265802
*	[LoopVectorize] Register cloned assumptions	David Majnemer	2016-04-08	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \|	InstCombine cannot effectively remove redundant assumptions without them registered in the assumption cache. The vectorizer can create identical assumptions but doesn't register them with the cache, resulting in slower compile times because InstCombine tries to reason about a lot more assumptions. Fix this by registering the cloned assumptions. llvm-svn: 265800
*	[ARM] Enable SMLAW[B\|T] and SMLUW[B\|T] instruction selection	Sam Parker	2016-04-08	2	-26/+105
\| \| \| \| \| \| \| \| \| \|	Added ISelDAGToDAG functions to enable selection of the smlawb, smlawt, smulwb and smulwt instructions for the ARM backend. Also updated the smul CodeGen test and removed the smulw one. Differential Revision: http://reviews.llvm.org/D18892 llvm-svn: 265793
*	Revert r265547 "Recommit r265309 after fixed an invalid memory reference bug ↵	Hans Wennborg	2016-04-08	5	-199/+518
\| \| \| \| \| \| \| \| \| \| \| \| \|	happened" It caused PR27275: "ARM: Bad machine code: Using an undefined physical register" Also reverting the following commits that were landed on top: r265610 "Fix the compare-clang diff error introduced by r265547." r265639 "Fix the sanitizer bootstrap error in r265547." r265657 "InlineSpiller.cpp: Escap \@ in r265547. [-Wdocumentation]" llvm-svn: 265790
*	[X86][SSE] Added 32-bit tests for vector lzcnt/tzcnt tests	Simon Pilgrim	2016-04-08	2	-0/+607
\| \| \| \| \| \|	v2i64 tests are particularly bad on 32-bit targets. llvm-svn: 265789
*	Re-commit [SCEV] Introduce a guarded backedge taken count and use it in LAA ↵	Silviu Baranga	2016-04-08	3	-1/+277
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and LV This re-commits r265535 which was reverted in r265541 because it broke the windows bots. The problem was that we had a PointerIntPair which took a pointer to a struct allocated with new. The problem was that new doesn't provide sufficient alignment guarantees. This pattern was already present before r265535 and it just happened to work. To fix this, we now separate the PointerToIntPair from the ExitNotTakenInfo struct into a pointer and a bool. Original commit message: Summary: When the backedge taken codition is computed from an icmp, SCEV can deduce the backedge taken count only if one of the sides of the icmp is an AddRecExpr. However, due to sign/zero extensions, we sometimes end up with something that is not an AddRecExpr. However, we can use SCEV predicates to produce a 'guarded' expression. This change adds a method to SCEV to get this expression, and the SCEV predicate associated with it. In HowManyGreaterThans and HowManyLessThans we will now add a SCEV predicate associated with the guarded backedge taken count when the analyzed SCEV expression is not an AddRecExpr. Note that we only do this as an alternative to returning a 'CouldNotCompute'. We use new feature in Loop Access Analysis and LoopVectorize to analyze and transform more loops. Reviewers: anemet, mzolotukhin, hfinkel, sanjoy Subscribers: flyingforyou, mcrosier, atrick, mssimpso, sanjoy, mzolotukhin, llvm-commits Differential Revision: http://reviews.llvm.org/D17201 llvm-svn: 265786
*	CXX_FAST_TLS calling convention: performance improvement for PPC64	Chuang-Yu Cheng	2016-04-08	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the same change on PPC64 as r255821 on AArch64. I have even borrowed his commit message. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allcoator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given machine function and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. Author: Tom Jablin (tjablin) Reviewers: hfinkel kbarton cycheng http://reviews.llvm.org/D17533 llvm-svn: 265781
*	[llvm-c] Expose LLVMContextGetDiagnostic{Handler,Context}	Jeroen Ketema	2016-04-08	2	-0/+8
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D18820 llvm-svn: 265773
*	[mips][microMIPS] Add CodeGen support for ADD, ADDIU, ADDU and DADD* ↵	Zlatko Buljan	2016-04-08	3	-8/+362
\| \| \| \| \| \| \| \|	instructions Differential Revision: http://reviews.llvm.org/D16454 llvm-svn: 265772
*	[IFUNC] Fix ifunc-asm.ll test	Dmitry Polukhin	2016-04-08	2	-3/+18
\| \| \| \| \| \| \|	It seems that llc cannot be called used in assembler tests so test that checks asm for particular target needs to be moved to codegen. llvm-svn: 265770
*	[AMDGPU] Add some VI disassembler tests missing from previous autogeneration ↵	Valery Pykhtin	2016-04-08	1	-0/+66
\| \| \| \| \| \|	due to different filecheck prefix. NFC. llvm-svn: 265769
*	Reapply "ValueMapper: Treat LocalAsMetadata more like function-local Values"	Duncan P. N. Exon Smith	2016-04-08	1	-0/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r265765, reapplying r265759 after changing a call from LocalAsMetadata::get to ValueAsMetadata::get (and adding a unit test). When a local value is mapped to a constant (like "i32 %a" => "i32 7"), the new debug intrinsic operand may no longer be pointing at a local. http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/19020/ The previous coommit message follows: -- This is a partial re-commit -- maybe more of a re-implementation -- of r265631 (reverted in r265637). This makes RF_IgnoreMissingLocals behave (almost) consistently between the Value and the Metadata hierarchy. In particular: - MapValue returns nullptr or "metadata !{}" for missing locals in MetadataAsValue/LocalAsMetadata bridging paris, depending on the RF_IgnoreMissingLocals flag. - MapValue doesn't memoize LocalAsMetadata-related results. - MapMetadata no longer deals with LocalAsMetadata or RF_IgnoreMissingLocals at all. (This wasn't in r265631 at all, but I realized during testing it would make the patch simpler with no loss of generality.) r265631 went too far, making both functions universally ignore RF_IgnoreMissingLocals. This broke building (e.g.) compiler-rt. Reassociate (and possibly other passes) don't currently maintain dominates-use invariants for metadata operands, resulting in IR like this: define void @foo(i32 %arg) { call void @llvm.some.intrinsic(metadata i32 %x) %x = add i32 1, i32 %arg } If the inliner chooses to inline @foo into another function, then RemapInstruction will call `MapValue(metadata i32 %x)` and assert that the return is not nullptr. I've filed PR27273 to add a Verifier check and fix the underlying problem in the optimization passes. As a workaround, return `!{}` instead of nullptr for unmapped LocalAsMetadata when RF_IgnoreMissingLocals is unset. Otherwise, match the behaviour of r265631. Original commit message: ValueMapper: Make LocalAsMetadata match function-local Values Start treating LocalAsMetadata similarly to function-local members of the Value hierarchy in MapValue and MapMetadata. - Don't memoize them. - Return nullptr if they are missing. This also cleans up ConstantAsMetadata to stop listening to the RF_IgnoreMissingLocals flag. llvm-svn: 265768
*	Revert "ValueMapper: Treat LocalAsMetadata more like function-local Values"	Duncan P. N. Exon Smith	2016-04-08	1	-49/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r265759, since even this limited version breaks some bots: http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/3311 http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/17696 This also reverts r265761 "ValueMapper: Unduplicate RF_NoModuleLevelChanges check, NFC", since I had trouble separating it from r265759. llvm-svn: 265765
*	Don't IPO over functions that can be de-refined	Sanjoy Das	2016-04-08	7	-1/+166
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fixes PR26774. If you're aware of the issue, feel free to skip the "Motivation" section and jump directly to "This patch". Motivation: I define "refinement" as discarding behaviors from a program that the optimizer has license to discard. So transforming: ``` void f(unsigned x) { unsigned t = 5 / x; (void)t; } ``` to ``` void f(unsigned x) { } ``` is refinement, since the behavior went from "if x == 0 then undefined else nothing" to "nothing" (the optimizer has license to discard undefined behavior). Refinement is a fundamental aspect of many mid-level optimizations done by LLVM. For instance, transforming `x == (x + 1)` to `false` also involves refinement since the expression's value went from "if x is `undef` then { `true` or `false` } else { `false` }" to "`false`" (by definition, the optimizer has license to fold `undef` to any non-`undef` value). Unfortunately, refinement implies that the optimizer cannot assume that the implementation of a function it can see has all of the behavior an unoptimized or a differently optimized version of the same function can have. This is a problem for functions with comdat linkage, where a function can be replaced by an unoptimized or a differently optimized version of the same source level function. For instance, FunctionAttrs cannot assume a comdat function is actually `readnone` even if it does not have any loads or stores in it; since there may have been loads and stores in the "original function" that were refined out in the currently visible variant, and at the link step the linker may in fact choose an implementation with a load or a store. As an example, consider a function that does two atomic loads from the same memory location, and writes to memory only if the two values are not equal. The optimizer is allowed to refine this function by first CSE'ing the two loads, and the folding the comparision to always report that the two values are equal. Such a refined variant will look like it is `readonly`. However, the unoptimized version of the function can still write to memory (since the two loads //can// result in different values), and selecting the unoptimized version at link time will retroactively invalidate transforms we may have done under the assumption that the function does not write to memory. Note: this is not just a problem with atomics or with linking differently optimized object files. See PR26774 for more realistic examples that involved neither. This patch: This change introduces a new set of linkage types, predicated as `GlobalValue::mayBeDerefined` that returns true if the linkage type allows a function to be replaced by a differently optimized variant at link time. It then changes a set of IPO passes to bail out if they see such a function. Reviewers: chandlerc, hfinkel, dexonsmith, joker.eph, rnk Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18634 llvm-svn: 265762
*	DwarfDebug: Support floating point constants in location lists.	Adrian Prantl	2016-04-08	2	-66/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch closes a gap in the DWARF backend that caused LLVM to drop debug info for floating point variables that were constant for part of their scope. Floating point constants are emitted as one or more DW_OP_constu joined via DW_OP_piece. This fixes a regression caught by the LLDB testsuite that I introduced in r262247 when we stopped blindly expanding the range of singular DBG_VALUEs to span the entire scope and started to emit location lists with accurate ranges instead. Also deletes a now-impossible testcase (debug-loc-empty-entries). <rdar://problem/25448338> llvm-svn: 265760
*	ValueMapper: Treat LocalAsMetadata more like function-local Values	Duncan P. N. Exon Smith	2016-04-08	1	-0/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a partial re-commit -- maybe more of a re-implementation -- of r265631 (reverted in r265637). This makes RF_IgnoreMissingLocals behave (almost) consistently between the Value and the Metadata hierarchy. In particular: - MapValue returns nullptr or "metadata !{}" for missing locals in MetadataAsValue/LocalAsMetadata bridging paris, depending on the RF_IgnoreMissingLocals flag. - MapValue doesn't memoize LocalAsMetadata-related results. - MapMetadata no longer deals with LocalAsMetadata or RF_IgnoreMissingLocals at all. (This wasn't in r265631 at all, but I realized during testing it would make the patch simpler with no loss of generality.) r265631 went too far, making both functions universally ignore RF_IgnoreMissingLocals. This broke building (e.g.) compiler-rt. Reassociate (and possibly other passes) don't currently maintain dominates-use invariants for metadata operands, resulting in IR like this: define void @foo(i32 %arg) { call void @llvm.some.intrinsic(metadata i32 %x) %x = add i32 1, i32 %arg } If the inliner chooses to inline @foo into another function, then RemapInstruction will call `MapValue(metadata i32 %x)` and assert that the return is not nullptr. I've filed PR27273 to add a Verifier check and fix the underlying problem in the optimization passes. As a workaround, return `!{}` instead of nullptr for unmapped LocalAsMetadata when RF_IgnoreMissingLocals is unset. Otherwise, match the behaviour of r265631. Original commit message: ValueMapper: Make LocalAsMetadata match function-local Values Start treating LocalAsMetadata similarly to function-local members of the Value hierarchy in MapValue and MapMetadata. - Don't memoize them. - Return nullptr if they are missing. This also cleans up ConstantAsMetadata to stop listening to the RF_IgnoreMissingLocals flag. llvm-svn: 265759
*	[IR/Verifier] Fix (yet another) crash.	Davide Italiano	2016-04-08	1	-0/+10
\| \| \| \| \| \| \|	We need to check that if we reference a retainedType from DICompileUnit we're actually referencing a DICompositeType. llvm-svn: 265752
*	Do not select EhPad BB in MachineBlockPlacement when there is regular BB to ↵	Amaury Sechet	2016-04-07	5	-63/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	schedule Summary: EHPad BB are not entered the classic way and therefor do not need to be placed after their predecessors. This patch make sure EHPad BB are not chosen amongst successors to form chains, and are selected as last resort when selecting the best candidate. EHPad are scheduled in reverse probability order in order to have them flow into each others naturally. Reviewers: chandlerc, majnemer, rafael, MatzeB, escha, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17625 llvm-svn: 265726
*	AMDGPU/SI: Implement atomic load/store for i32 and i64	Jan Vesely	2016-04-07	1	-0/+178
\| \| \| \| \| \| \| \| \| \|	Standard load/store instructions with GLC bit set. Reviewers: tstellardAMD, arsenm Differential Revision: http://reviews.llvm.org/D18760 llvm-svn: 265709
*	AMDGPU/SI: Add latency for export instructions	Tom Stellard	2016-04-07	1	-5/+4
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm, nhaehnle Subscribers: nhaehnle, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18599 llvm-svn: 265708
*	[X86][SSE] Added bitmask pattern shuffle tests	Simon Pilgrim	2016-04-07	2	-0/+195
\| \| \| \| \| \|	Based on OR(AND(MASK,V0),AND(~MASK,V1)) style patterns llvm-svn: 265697
*	[X86]: Fix for PR27251.	Kevin B. Smith	2016-04-07	1	-2/+4
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D18850 llvm-svn: 265690
*	[SystemZ] Implement conditional returns	Ulrich Weigand	2016-04-07	73	-729/+742
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Return is now considered a predicable instruction, and is converted to a newly-added CondReturn (which maps to BCR to %r14) instruction by the if conversion pass. Also, fused compare-and-branch transform knows about conditional returns, emitting the proper fused instructions for them. This transform triggers on a lot of tests, hence the huge diffstat. The changes are mostly jX to br %r14 -> bXr %r14. Author: koriakin Differential Revision: http://reviews.llvm.org/D17339 llvm-svn: 265689
*	[GVN] Fix handling of sub-byte types in big-endian mode	Ulrich Weigand	2016-04-07	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When GVN wants to re-interpret an already available value in a smaller type, it needs to right-shift the value on big-endian systems to ensure the correct bytes are accessed. The shift value is the difference of the sizes of the two types. This is correct as long as both types occupy multiples of full bytes. However, when one of them is a sub-byte type like i1, this no longer holds true: we still need to shift, but only to access the correct byte. Accessing bits within the byte requires no shift in either endianness; e.g. an i1 resides in the least-significant bit of its containing byte on both big- and little-endian systems. Therefore, the appropriate shift value to be used is the difference of the storage sizes of the two types. This is already handled correctly in one place where such a shift takes place (GetStoreValueForLoad), but is incorrect in two other places: GetLoadValueForLoad and CoerceAvailableValueToLoadType. This patch changes both places to use the storage size as well. Differential Revision: http://reviews.llvm.org/D18662 llvm-svn: 265684
*	[PPC] Enable transformations in PPCPassConfig::addIRPasses at O2	Ehsan Amiri	2016-04-07	29	-345/+171
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	http://reviews.llvm.org/D18562 A large number of testcases has been modified so they pass after this test. One testcase is deleted, because I realized even after undoing the original change that was committed with this testcase, the testcase still passes. So I removed it. The change to one other testcase (test/CodeGen/PowerPC/pr25802.ll) is an arbitrary change to keep it passing. Given the original intention of the testcase, and the fact that fixing it will require some time to change the testcase, we concluded that this quick change will be enough. llvm-svn: 265683
*	[AMDGPU] fix readlane/readfirstlane src vgpr operand type.	Valery Pykhtin	2016-04-07	2	-2/+5
\| \| \| \| \| \| \| \| \|	For VGPR_32 operand disassembler expects a VGPR register encoded as 0..255 (enum8 src operand). readfirstlane/readline actually has enum9 operand and this change fixes VGPR_32 to VS_32 (enum9 encoding). Differential Revision: http://reviews.llvm.org/D18696 llvm-svn: 265670
*	Fix test/Assembler/ifunc-asm.ll test on hexagon-elf bots	Dmitry Polukhin	2016-04-07	1	-2/+0
\| \| \| \| \| \| \|	Temporary disable llc test, it seems that such test should be in some other directory. llvm-svn: 265669
*	[GCC] Attribute ifunc support in llvm	Dmitry Polukhin	2016-04-07	3	-0/+81
\| \| \| \| \| \| \| \| \| \| \|	This patch add support for GCC attribute((ifunc("resolver"))) for targets that use ELF as object file format. In general ifunc is a special kind of function alias with type @gnu_indirect_function. Patch for Clang http://reviews.llvm.org/D15524 Differential Revision: http://reviews.llvm.org/D15525 llvm-svn: 265667
*	[X86][SSE] Add support for VZEXT constant folding	Simon Pilgrim	2016-04-07	3	-6/+3
\| \| \| \|	llvm-svn: 265646
*	[AMDGPU] llvm-objdump: Minimal HSA Code Object disassembler support.	Valery Pykhtin	2016-04-07	2	-0/+77
\| \| \| \| \| \| \| \| \| \| \| \|	Reenable reverted r265550 with endianness issue fixed. Variables of endian-aware types such as ulittle32_t should be explicitly casted to their natural equivalent types before passing it as vararg to printf like functions (format in my case). Added lit config file depending on AMDGPU target as the testcase uses assembler. Differential revision: http://reviews.llvm.org/D16998 llvm-svn: 265645
*	[X86] Reuse EFLAGS and form LOCKed ops when only user is SETCC.	Ahmed Bougacha	2016-04-07	1	-24/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Re-apply r265450 which caused PR27245 and was reverted in r265559 because of a wrong generalization: the fetch_and_add->add_and_fetch combine only works in specific, but pretty common, cases: (icmp slt x, 0) -> (icmp sle (add x, 1), 0) (icmp sge x, 0) -> (icmp sgt (add x, 1), 0) (icmp sle x, 0) -> (icmp slt (sub x, 1), 0) (icmp sgt x, 0) -> (icmp sge (sub x, 1), 0) Original Message: We only generate LOCKed versions of add/sub when the result is unused. It often happens that the result is used, but only by a comparison. We can optimize those out by reusing EFLAGS, which lets us use the proper instructions, instead of having to fallback to LXADD. Instead of doing this as an MI peephole (as we do for the other non-LOCKed (really, non-MR) forms), do it in ISel. It becomes quite tricky later. This also makes it eventually possible to stop expanding and/or/xor if the only user is an icmp (also see D18141). This uses the LOCK ISD opcodes added by r262244. Differential Revision: http://reviews.llvm.org/D17633 llvm-svn: 265636
*	[X86] Refresh and tweak EFLAGS reuse tests. NFC.	Ahmed Bougacha	2016-04-07	1	-51/+82
\| \| \| \| \| \|	The non-1 and EQ/NE tests were misguided. llvm-svn: 265635
*	Re-commit r265039 "[X86] Merge adjacent stack adjustments in ↵	Hans Wennborg	2016-04-07	8	-17/+75
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	eliminateCallFramePseudoInstr (PR27140)" Third time's the charm? The previous attempt (r265345) caused ASan test failures on X86, as broken CFI caused stack traces to not work. This version of the patch makes sure not to merge with stack adjustments that have CFI, and to not add merged instructions' offests to the CFI about to be generated. This is already covered by the lit tests; I just got the expectations wrong previously. llvm-svn: 265623
*	[sancov] enabling coverage edge pruning by default.	Mike Aizatsky	2016-04-06	3	-9/+2
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D18844 llvm-svn: 265615
*	Thread Expected<...> up from createMachOObjectFile() to allow llvm-objdump ↵	Kevin Enderby	2016-04-06	2	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to produce a real error message Produce the first specific error message for a malformed Mach-O file describing the problem instead of the generic message for object_error::parse_failed of "Invalid data was encountered while parsing the file”. Many more good error messages will follow after this first one. This is built on Lang Hames’ great work of adding the ’Error' class for structured error handling and threading Error through MachOObjectFile construction. And making createMachOObjectFile return Expected<...> . So to to get the error to the llvm-obdump tool, I changed the stack of these methods to also return Expected<...> : object::ObjectFile::createObjectFile() object::SymbolicFile::createSymbolicFile() object::createBinary() Then finally in ParseInputMachO() in MachODump.cpp the error can be reported and the specific error message can be printed in llvm-objdump and can be seen in the existing test case for the existing malformed binary but with the updated error message. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now use of errorToErrorCode() and errorOrToExpected() are used where the callers are yet to be converted. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: “// TODO: Actually report errors helpfully” and a call something like consumeError(ObjOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. Note there is one fix also needed to lld/COFF/InputFiles.cpp that goes along with this that I will commit right after this. So expect lld not to built after this commit and before the next one. llvm-svn: 265606
*	[LoopUnroll] Fix the way we update DT after complete unrolling.	Michael Zolotukhin	2016-04-06	1	-0/+53
\| \| \| \| \| \| \| \|	Updating dominators for exit-blocks of the unrolled loops is not enough, as shown in PR27157. The proper way is to update dominators for all dominance-children of original loop blocks. llvm-svn: 265605
*	[PPC] Use VSX/FP Facility integer load when an integer load's only users are ↵	Ehsan Amiri	2016-04-06	1	-0/+83
\| \| \| \| \| \| \| \| \| \| \|	conversion to FP http://reviews.llvm.org/D18405 When the integer value loaded is never used directly as integer we should use VSX or Floating Point Facility integer loads and avoid extra direct move llvm-svn: 265593
*	regenerate checks	Sanjay Patel	2016-04-06	1	-556/+828
\| \| \| \|	llvm-svn: 265591
*	AMDGPU: Add a shader calling convention	Nicolai Haehnle	2016-04-06	81	-511/+431
\| \| \| \| \| \| \| \| \| \| \|	This makes it possible to distinguish between mesa shaders and other kernels even in the presence of compute shaders. Patch By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Differential Revision: http://reviews.llvm.org/D18559 llvm-svn: 265589
*	[IRVerifier] Don't crash on invalid DIFile inside DISubprogram.	Davide Italiano	2016-04-06	1	-0/+10
\| \| \| \| \| \| \|	r265515, this time with the correct fix. file inside DISubprogram is not mandatory. llvm-svn: 265586
*	[gold] Save bitcode for module partitions (save-temps + split codegen).	Evgeniy Stepanov	2016-04-06	1	-0/+6
\| \| \| \|	llvm-svn: 265583
*	Revert r265450 "[X86] Reuse EFLAGS and form LOCKed ops when only user is SETCC."	Hans Wennborg	2016-04-06	1	-5/+15
\| \| \| \| \| \|	It caused ASan 32-bit tests to hang (PR27245). llvm-svn: 265559
*	LoopUnroll: only allow non-modulo Partial unrolling when Runtime=true	Fiona Glaser	2016-04-06	1	-1/+1
\| \| \| \| \| \|	Patch by Evgeny Stupachenko <evstupac@gmail.com>. llvm-svn: 265558
*	Revert "[AMDGPU] llvm-objdump: Minimal HSA Code Object disassembler support."	Valery Pykhtin	2016-04-06	1	-75/+0
\| \| \| \| \| \|	This reverts commit r265550. There're problems with endianness on dumping instruction bytes. Need to find out how to use support::ulittle32_t type properly. llvm-svn: 265554