bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[IR] Delete unused Argument::removeAttr overload	Reid Kleckner	2017-04-28	1	-9/+0
\| \| \| \| \| \|	It doesn't make sense to remove an AttributeList from an argument. llvm-svn: 301663
*	Clean up DIExpression::prependDIExpr a little. (NFC)	Adrian Prantl	2017-04-28	4	-21/+15
\| \| \| \|	llvm-svn: 301662
*	Bitcode: Do not remove empty summary entries when reading a per-module summary.	Peter Collingbourne	2017-04-28	2	-22/+0
\| \| \| \| \| \| \| \|	This became no longer necessary after D19462 landed, and will be incompatible with an upcoming change to the summary data structures that changes how we represent references. llvm-svn: 301660
*	[APInt] Add clearSignBit method. Use it and setSignBit in a few places. NFCI	Craig Topper	2017-04-28	4	-6/+6
\| \| \| \|	llvm-svn: 301656
*	[LazyValueInfo] Fix typo in comment. NFC	Craig Topper	2017-04-28	1	-1/+1
\| \| \| \|	llvm-svn: 301655
*	[ValueTracking] Use APInt::isSubsetOf and APInt::intersects. NFC	Craig Topper	2017-04-28	1	-2/+2
\| \| \| \|	llvm-svn: 301654
*	[bpf] add bigendian support to disassembler	Alexei Starovoitov	2017-04-28	1	-7/+19
\| \| \| \| \| \| \| \| \|	. swap 4-bit register encoding, 16-bit offset and 32-bit imm to support big endian archs . add a test Reported-by: David S. Miller <davem@davemloft.net> Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 301653
*	[InlineCost] Improve the cost heuristic for Switch	Jun Bum Lim	2017-04-28	5	-115/+137
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The motivation example is like below which has 13 cases but only 2 distinct targets ``` lor.lhs.false2: ; preds = %if.then switch i32 %Status, label %if.then27 [ i32 -7012, label %if.end35 i32 -10008, label %if.end35 i32 -10016, label %if.end35 i32 15000, label %if.end35 i32 14013, label %if.end35 i32 10114, label %if.end35 i32 10107, label %if.end35 i32 10105, label %if.end35 i32 10013, label %if.end35 i32 10011, label %if.end35 i32 7008, label %if.end35 i32 7007, label %if.end35 i32 5002, label %if.end35 ] ``` which is compiled into a balanced binary tree like this on AArch64 (similar on X86) ``` .LBB853_9: // %lor.lhs.false2 mov w8, #10012 cmp w19, w8 b.gt .LBB853_14 // BB#10: // %lor.lhs.false2 mov w8, #5001 cmp w19, w8 b.gt .LBB853_18 // BB#11: // %lor.lhs.false2 mov w8, #-10016 cmp w19, w8 b.eq .LBB853_23 // BB#12: // %lor.lhs.false2 mov w8, #-10008 cmp w19, w8 b.eq .LBB853_23 // BB#13: // %lor.lhs.false2 mov w8, #-7012 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_14: // %lor.lhs.false2 mov w8, #14012 cmp w19, w8 b.gt .LBB853_21 // BB#15: // %lor.lhs.false2 mov w8, #-10105 add w8, w19, w8 cmp w8, #9 // =9 b.hi .LBB853_17 // BB#16: // %lor.lhs.false2 orr w9, wzr, #0x1 lsl w8, w9, w8 mov w9, #517 and w8, w8, w9 cbnz w8, .LBB853_23 .LBB853_17: // %lor.lhs.false2 mov w8, #10013 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_18: // %lor.lhs.false2 mov w8, #-7007 add w8, w19, w8 cmp w8, #2 // =2 b.lo .LBB853_23 // BB#19: // %lor.lhs.false2 mov w8, #5002 cmp w19, w8 b.eq .LBB853_23 // BB#20: // %lor.lhs.false2 mov w8, #10011 cmp w19, w8 b.eq .LBB853_23 b .LBB853_3 .LBB853_21: // %lor.lhs.false2 mov w8, #14013 cmp w19, w8 b.eq .LBB853_23 // BB#22: // %lor.lhs.false2 mov w8, #15000 cmp w19, w8 b.ne .LBB853_3 ``` However, the inline cost model estimates the cost to be linear with the number of distinct targets and the cost of the above switch is just 2 InstrCosts. The function containing this switch is then inlined about 900 times. This change use the general way of switch lowering for the inline heuristic. It etimate the number of case clusters with the suitability check for a jump table or bit test. Considering the binary search tree built for the clusters, this change modifies the model to be linear with the size of the balanced binary tree. The model is off by default for now : -inline-generic-switch-cost=false This change was originally proposed by Haicheng in D29870. Reviewers: hans, bmakam, chandlerc, eraman, haicheng, mcrosier Reviewed By: hans Subscribers: joerg, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D31085 llvm-svn: 301649
*	Move variable local to where ita used. NFCI.	Simon Pilgrim	2017-04-28	1	-1/+1
\| \| \| \|	llvm-svn: 301646
*	Memory intrinsic value profile optimization: Avoid divide by 0	Teresa Johnson	2017-04-28	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Skip memops if the total value profiled count is 0, we can't correctly scale up the counts and there is no point anyway. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32624 llvm-svn: 301645
*	[DAGCombiner] Add ComputeNumSignBits vector demanded elements support to ↵	Simon Pilgrim	2017-04-28	1	-1/+39
\| \| \| \| \| \| \| \|	ASHR and INSERT_VECTOR_ELT (reapplied) Reapplied r299221 after fix for nondeterminism in ThinLTO builder (rL301599), with extra check for implicit truncation of inserted element. llvm-svn: 301644
*	[ARM] GlobalISel: fixup r301632	Diana Picus	2017-04-28	1	-42/+0
\| \| \| \| \| \| \|	Actually remove ARMInstructionSelector.h... Forgot to stage the removal in the previous commit. llvm-svn: 301633
*	[ARM] GlobalISel: Get rid of ARMInstructionSelector.h. NFC.	Diana Picus	2017-04-28	3	-3/+31
\| \| \| \| \| \| \| \| \|	Declare the ARMInstructionSelector in an anonymous namespace, to make it more in line with the other targets which were migrated to this in r299637 in order to avoid TableGen'erated headers being included in non-GlobalISel builds. llvm-svn: 301632
*	[DWARF] - Fix mistype in dump output of pub* tables. NFC.	George Rimar	2017-04-28	1	-1/+1
\| \| \| \| \| \| \|	There was a garbage character in output introduced by myself in r290040 "[DWARF] - Introduce DWARFDebugPubTable class for dumping pub* sections." llvm-svn: 301631
*	[DebugInfo][X86] Improve X86 Optimize LEAs handling of debug values.	Andrew Ng	2017-04-28	3	-54/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a follow up to the fix in r298360 to improve the handling of debug values when redundant LEAs are removed. The fix in r298360 effectively discarded the debug values. This patch now attempts to preserve the debug values by using the DWARF DW_OP_stack_value operation via prependDIExpr. Moved functions appendOffset and prependDIExpr from Local.cpp to DebugInfoMetadata.cpp and made them available as static member functions of DIExpression. Differential Revision: https://reviews.llvm.org/D31604 llvm-svn: 301630
*	[WebAssembly] Update calls to computeKnownBits after the changes from r301620.	Craig Topper	2017-04-28	2	-5/+6
\| \| \| \| \| \|	I didn't realize WebAssembly wasn't a default build target so I missed that changes were needed. llvm-svn: 301629
*	[X86][NFC] Refactor RepMovsRepeats in preparation for D32481.	Clement Courbet	2017-04-28	1	-49/+45
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D32583 llvm-svn: 301628
*	[ValueTracking] Convert computeKnownBitsFromRangeMetadata to use KnownBits ↵	Craig Topper	2017-04-28	2	-10/+9
\| \| \| \| \| \|	struct. llvm-svn: 301626
*	[EarlyCSE] Mark the condition of assume intrinsic as true	Max Kazantsev	2017-04-28	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	EarlyCSE should not just ignore assumes. It should use the fact that its condition is true for all dominated instructions. Reviewers: sanjoy, reames, apilipenko, anna, skatkov Reviewed By: reames, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32482 llvm-svn: 301625
*	[EarlyCSE] Remove guards with conditions known to be true	Max Kazantsev	2017-04-28	1	-3/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a condition is calculated only once, and there are multiple guards on this condition, we should be able to remove all guards dominated by the first of them. This patch allows EarlyCSE to try to find the condition of a guard among the known values, and if it is true, remove the guard. Otherwise we keep the guard and mark its condition as 'true' for future consideration. Reviewers: sanjoy, reames, apilipenko, skatkov, anna, dberlin Reviewed By: reames, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32476 llvm-svn: 301623
*	[SelectionDAG] Use KnownBits struct in DAG's computeKnownBits and ↵	Craig Topper	2017-04-28	28	-729/+660
\| \| \| \| \| \| \| \| \| \| \| \|	simplifyDemandedBits This patch replaces the separate APInts for KnownZero/KnownOne with a single KnownBits struct. This is similar to what was done to ValueTracking's version recently. This is largely a mechanical transformation from KnownZero to Known.Zero. Differential Revision: https://reviews.llvm.org/D32569 llvm-svn: 301620
*	[SelectionDAG] Use various APInt methods to reduce temporary APInt creation	Craig Topper	2017-04-28	6	-39/+30
\| \| \| \| \| \|	This patch uses various APInt methods to reduce the number of temporary APInts. These were all found while working through converting SelectionDAG's computeKnownBits to also use the KnownBits struct recently added to the ValueTracking version. llvm-svn: 301618
*	Remove unnecessary semicolon	Sanjoy Das	2017-04-28	1	-1/+1
\| \| \| \| \| \|	This shows up as a -Wpendatic error on GCC. llvm-svn: 301616
*	[StackMaps] Increase the size of the "location size" field	Sanjoy Das	2017-04-28	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In some cases LLVM (especially the SLP vectorizer) will create vectors that are 256 bytes (or larger). Given that this is intentional[0] is likely to get more common, this patch updates the StackMap binary format to deal with the spill locations for said vectors. This change also bumps the stack map version from 2 to 3. [0]: https://reviews.llvm.org/D32533#738350 Reviewers: reames, kavon, skatkov, javed.absar Subscribers: mcrosier, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D32629 llvm-svn: 301615
*	[APInt] Use inplace shift methods where possible. NFCI	Craig Topper	2017-04-28	10	-29/+29
\| \| \| \|	llvm-svn: 301612
*	[llvm-pdbdump] Allow printing only a portion of a stream.	Zachary Turner	2017-04-28	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \|	When dumping raw data from a stream, you might know the offset of a certain record you're interested in, as well as how long that record is. Previously, you had to dump the entire stream and wade through the bytes to find the interesting record. This patch allows you to specify an offset and length on the command line, and it will only dump the requested range. llvm-svn: 301607
*	[SROA] Fix nondeterminism exposed by Simon's r299221.	Davide Italiano	2017-04-27	1	-14/+12
\| \| \| \| \| \| \|	Use a SmallSetSetVector instead of a SmallPtrSet as iterating over the latter is not stable ('<' relies on addresses). llvm-svn: 301599
*	[InstCombine] fix matcher to bind to specific operand (PR32830)	Sanjay Patel	2017-04-27	1	-1/+1
\| \| \| \| \| \| \|	Matching any random value would be very wrong: https://bugs.llvm.org/show_bug.cgi?id=32830 llvm-svn: 301594
*	[asan] Fix dead stripping of globals on Linux.	Evgeniy Stepanov	2017-04-27	3	-39/+138
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use a combination of !associated, comdat, @llvm.compiler.used and custom sections to allow dead stripping of globals and their asan metadata. Sometimes. Currently this works on LLD, which supports SHF_LINK_ORDER with sh_link pointing to the associated section. This also works on BFD, which seems to treat comdats as all-or-nothing with respect to linker GC. There is a weird quirk where the "first" global in each link is never GC-ed because of the section symbols. At this moment it does not work on Gold (as in the globals are never stripped). This is a second re-land of r298158. This time, this feature is limited to -fdata-sections builds. llvm-svn: 301587
*	[asan] Put ctor/dtor in comdat.	Evgeniy Stepanov	2017-04-27	1	-9/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. This is a second re-land of r298756. This time with a flag to disable the whole thing to avoid a bug in the gold linker: https://sourceware.org/bugzilla/show_bug.cgi?id=19002 llvm-svn: 301586
*	[PM/LoopUnswitch] Introduce a new, simpler loop unswitch pass.	Chandler Carruth	2017-04-27	6	-1/+641
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, this pass only focuses on trivial loop unswitching. At that reduced problem it remains significantly better than the current loop unswitch: - Old pass is worse than cubic complexity. New pass is (I think) linear. - New pass is much simpler in its design by focusing on full unswitching. (See below for details on this). - New pass doesn't carry state for thresholds between pass iterations. - New pass doesn't carry state for correctness (both miscompile and infloop) between pass iterations. - New pass produces substantially better code after unswitching. - New pass can handle more trivial unswitch cases. - New pass doesn't recompute the dominator tree for the entire function and instead incrementally updates it. I've ported all of the trivial unswitching test cases from the old pass to the new one to make sure that major functionality isn't lost in the process. For several of the test cases I've worked to improve the precision and rigor of the CHECKs, but for many I've just updated them to handle the new IR produced. My initial motivation was the fact that the old pass carried state in very unreliable ways between pass iterations, and these mechansims were incompatible with the new pass manager. However, I discovered many more improvements to make along the way. This pass makes two very significant assumptions that enable most of these improvements: 1) Focus on full unswitching -- that is, completely removing whatever control flow construct is being unswitched from the loop. In the case of trivial unswitching, this means removing the trivial (exiting) edge. In non-trivial unswitching, this means removing the branch or switch itself. This is in opposition to partial unswitching where some part of the unswitched control flow remains in the loop. Partial unswitching only really applies to switches and to folded branches. These are very similar to full unrolling and partial unrolling. The full form is an effective canonicalization, the partial form needs a complex cost model, cannot be iterated, isn't canonicalizing, and should be a separate pass that runs very late (much like unrolling). 2) Leverage LLVM's Loop machinery to the fullest. The original unswitch dates from a time when a great deal of LLVM's loop infrastructure was missing, ineffective, and/or unreliable. As a consequence, a lot of complexity was added which we no longer need. With these two overarching principles, I think we can build a fast and effective unswitcher that fits in well in the new PM and in the canonicalization pipeline. Some of the remaining functionality around partial unswitching may not be relevant today (not many test cases or benchmarks I can find) but if they are I'd like to add support for them as a separate layer that runs very late in the pipeline. Purely to make reviewing and introducing this code more manageable, I've split this into first a trivial-unswitch-only pass and in the next patch I'll add support for full non-trivial unswitching against a fixed threshold, exactly like full unrolling. I even plan to re-use the unrolling thresholds, as these are incredibly similar cost tradeoffs: we're cloning a loop body in order to end up with simplified control flow. We should only do that when the total growth is reasonably small. One of the biggest changes with this pass compared to the previous one is that previously, each individual trivial exiting edge from a switch was unswitched separately as a branch. Now, we unswitch the entire switch at once, with cases going to the various destinations. This lets us unswitch multiple exiting edges in a single operation and also avoids numerous extremely bad behaviors, where we would introduce 1000s of branches to test for thousands of possible values, all of which would take the exact same exit path bypassing the loop. Now we will use a switch with 1000s of cases that can be efficiently lowered into a jumptable. This avoids relying on somehow forming a switch out of the branches or getting horrible code if that fails for any reason. Another significant change is that this pass actively updates the CFG based on unswitching. For trivial unswitching, this is actually very easy because of the definition of loop simplified form. Doing this makes the code coming out of loop unswitch dramatically more friendly. We still should run loop-simplifycfg (at the least) after this to clean up, but it will have to do a lot less work. Finally, this pass makes much fewer attempts to simplify instructions based on the unswitch. Something like loop-instsimplify, instcombine, or GVN can be used to do increasingly powerful simplifications based on the now dominating predicate. The old simplifications are things that something like loop-instsimplify should get today or a very, very basic loop-instcombine could get. Keeping that logic separate is a big simplifying technique. Most of the code in this pass that isn't in the old one has to do with achieving specific goals: - Updating the dominator tree as we go - Unswitching all cases in a switch in a single step. I think it is still shorter than just the trivial unswitching code in the old pass despite having this functionality. Differential Revision: https://reviews.llvm.org/D32409 llvm-svn: 301576
*	[GlobalOpt] Correctly update metadata when localizing a global.	Eli Friedman	2017-04-27	1	-1/+3
\| \| \| \| \| \| \| \| \|	Just calling dropAllReferences leaves pointers to the ConstantExpr behind, so we would eventually crash with a null pointer dereference. Differential Revision: https://reviews.llvm.org/D32551 llvm-svn: 301575
*	Memory intrinsic value profile optimization: Improve debug output (NFC)	Teresa Johnson	2017-04-27	1	-9/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Misc improvements to debug output. Fix a couple typos and also dump the value profile before we make any profitability checks. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32607 llvm-svn: 301574
*	Use a pointer type for target frame indices during statepoint lowering	Sanjoy Das	2017-04-27	2	-7/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The type of the target frame index is intptr, not the type of the value we're going to store into it. Without this change we crash in the attached test case when trying to type-legalize a TargetFrameIndex. Patchpoint lowering types the target frame index as intptr as well. Reviewers: reames, bogner, arsenm Subscribers: arsenm, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D32256 llvm-svn: 301566
*	Refactor DynamicLibrary so searching for a symbol will have a defined order and	Frederich Munch	2017-04-27	5	-283/+362
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	libraries are properly unloaded when llvm_shutdown is called. Summary: This was mostly affecting usage of the JIT, where storing the library handles in a set made iteration unordered/undefined. This lead to disagreement between the JIT and native code as to what the address and implementation of particularly on Windows with stdlib functions: JIT: putenv_s("TEST", "VALUE") // called msvcrt.dll, putenv_s JIT: getenv("TEST") -> "VALUE" // called msvcrt.dll, getenv Native: getenv("TEST") -> NULL // called ucrt.dll, getenv Also fixed is the issue of DynamicLibrary::getPermanentLibrary(0,0) on Windows not giving priority to the process' symbols as it did on Unix. Reviewers: chapuni, v.g.vassilev, lhames Reviewed By: lhames Subscribers: danalbert, srhines, mgorny, vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D30107 llvm-svn: 301562
*	[PartialInlining]: Improve partial inlining to handle complex conditions	Xinliang David Li	2017-04-27	1	-41/+243
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D32249 llvm-svn: 301561
*	[CodeView] Isolate Debug Info Fragments into standalone classes.	Zachary Turner	2017-04-27	8	-120/+200
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously parsing of these were all grouped together into a single master class that could parse any type of debug info fragment. With writing forthcoming, the complexity of each individual fragment is enough to warrant them having their own classes so that reading and writing of each fragment type can be grouped together, but isolated from the code for reading and writing other fragment types. In doing so, I found a place where parsing code was duplicated for the FileChecksums fragment, across llvm-readobj and the CodeView library, and one of the implementations had a bug. Now that the codepaths are merged, the bug is resolved. Differential Revision: https://reviews.llvm.org/D32547 llvm-svn: 301557
*	[Support] Make BinaryStreamArray extractors stateless.	Zachary Turner	2017-04-27	1	-3/+2
\| \| \| \| \| \| \|	Instead, we now pass a context memeber through the extraction process. llvm-svn: 301556
*	Rename some PDB classes.	Zachary Turner	2017-04-27	16	-235/+260
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We have a lot of very similarly named classes related to dealing with module debug info. This patch has NFC, it just renames some classes to be more descriptive (albeit slightly more to type). The mapping from old to new class names is as follows: Old \| New ModInfo \| DbiModuleDescriptor ModuleSubstream \| ModuleDebugFragment ModStream \| ModuleDebugStream With the corresponding Builder classes renamed accordingly. Differential Revision: https://reviews.llvm.org/D32506 llvm-svn: 301555
*	[AMDGPU] DPP: add support for GFX9	Sam Kolton	2017-04-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Reviewers: artem.tamazov Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D32588 llvm-svn: 301551
*	Fix typo and place comment close to its target	Krzysztof Parzyszek	2017-04-27	1	-5/+6
\| \| \| \| \| \| \| \|	Patch by Wei-Ren Chen. Differential Revision: https://reviews.llvm.org/D32594 llvm-svn: 301546
*	[mips][microMIPS] Adding code size reduction pass for MicroMIPS	Zoran Jovanovic	2017-04-27	4	-0/+338
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Author: milena.vujosevic.janicic Reviewers: sdardis The code implements size reduction pass for MicroMIPS. Load and store instructions are examined and transformed, if possible. lw32 instruction is transformed into 16-bit instruction lwsp sw32 instruction is transformed into 16-bit instruction swsp Arithmetic instrcutions are examined and transformed, if possible. addu32 instruction is transformed into 16-bit instruction addu16 subu32 instruction is transformed into 16-bit instruction subu16 Differential Revision: https://reviews.llvm.org/D15144 llvm-svn: 301540
*	[SystemZ] Remove incorrect assert in SystemZTTIImpl	Jonas Paulsson	2017-04-27	1	-1/+0
\| \| \| \| \| \| \| \| \|	In getCmpSelInstrCost(), CondTy may actually be scalar while ValTy is a vector when LoopVectorizer is the caller. Therefore the assert that CondTy must be a vector type if ValTy is was wrong and is now removed. Review: Ulrich Weigand llvm-svn: 301533
*	[ARM] GlobalISel: Fix extended stack operands	Diana Picus	2017-04-27	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix a crash when trying to extend a value passed as a sign- or zero-extended stack parameter. The cause of the crash was that we were setting the size of the loaded value to 32 bits, and then tyring to extend again to 32 bits. This patch addresses the issue by also introducing a G_TRUNC after the load. This will leave the unused bits to their original values set by the caller, while being consistent about the types. For values that are not extended, we just use a smaller load. llvm-svn: 301531
*	[llvm-dwarfdump] - Change format for .gdb_index dump.	George Rimar	2017-04-27	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It is useful to output size of ranges when address ranges section of .gdb_index is dumped. It helps to compare outputs produced by different linkers, for example. In that case address ranges can look very different, when they are the same at fact. Difference comes from different low address because of different address of .text. Differential revision: https://reviews.llvm.org/D32492 llvm-svn: 301527
*	[GlobalISel][X86] handle not symmetric G_COPY	Igor Breger	2017-04-27	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: handle not symmetric G_COPY Reviewers: zvi, guyblank Reviewed By: guyblank Subscribers: rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D32420 llvm-svn: 301523
*	[CodeGen][NFC] Rename 'Src' to 'Val'.	Clement Courbet	2017-04-27	1	-7/+7
\| \| \| \| \| \| \| \| \|	'Src' looks like it was borrowed from memcpy, 'Val' makes more sense for memset and is consistent with naming within the function. Differential Revision: https://reviews.llvm.org/D32580 llvm-svn: 301521
*	Use accessors for ValueHandleBase::V; NFC	Sanjoy Das	2017-04-27	1	-12/+12
\| \| \| \| \| \| \| \| \| \| \|	This changes code that touches ValueHandleBase::V to go through getValPtr and (newly added) setValPtr. This functionality will be used later, but also seemed like a generally good cleanup. I also renamed the field to Val, but that's just to make it obvious that I fixed all the uses. llvm-svn: 301518
*	[Metadata] Fix typos in comments. NFC	Craig Topper	2017-04-27	1	-2/+2
\| \| \| \|	llvm-svn: 301517
*	[InstCombine] Use APInt bit counting methods to avoid a temporary APInt. NFC	Craig Topper	2017-04-27	1	-6/+6
\| \| \| \|	llvm-svn: 301516