bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[SelectionDAG] Simplify the ISD::SIGN_EXTEND/ZERO_EXTEND handling to use ↵	Craig Topper	2017-10-12	1	-25/+11
\| \| \| \| \| \|	less temporary APInts by counting bits instead. NFCI llvm-svn: 315628
*	Implement custom lowering for ISD::CTTZ_ZERO_UNDEF and ISD::CTTZ.	Wei Ding	2017-10-12	1	-4/+15
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D37348 llvm-svn: 315610
*	[dump] Remove NDEBUG from test to enable dump methods [NFC]	Don Hinton	2017-10-12	3	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Add LLVM_FORCE_ENABLE_DUMP cmake option, and use it along with LLVM_ENABLE_ASSERTIONS to set LLVM_ENABLE_DUMP. Remove NDEBUG and only use LLVM_ENABLE_DUMP to enable dump methods. Move definition of LLVM_ENABLE_DUMP from config.h to llvm-config.h so it'll be picked up by public headers. Differential Revision: https://reviews.llvm.org/D38406 llvm-svn: 315590
*	Revert r307036 because of PR34919.	Wei Mi	2017-10-12	1	-92/+0
\| \| \| \|	llvm-svn: 315540
*	[DAGCombiner] convert insertelement of bitcasted vector into shuffle	Sanjay Patel	2017-10-11	1	-3/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Eg: insert v4i32 V, (v2i16 X), 2 --> shuffle v8i16 V', X', {0,1,2,3,8,9,6,7} This is a generalization of the IR fold in D38316 to handle insertion into a non-undef vector. We may want to abandon that one if we can't find value in squashing the more specific pattern sooner. We're using the existing legal shuffle target hook to avoid AVX512 horror with vXi1 shuffles. There may be room for improvement in the shuffle lowering here, but that would be follow-up work. Differential Revision: https://reviews.llvm.org/D38388 llvm-svn: 315460
*	[TargetLowering] Correctly track NumFixedArgs field of CallLoweringInfo	Alex Bradbury	2017-10-11	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The NumFixedArgs field of CallLoweringInfo is used by TargetLowering::LowerCallTo to determine whether a given argument is passed using the vararg calling convention or not (specifically, to set IsFixed for each ISD::OutputArg). Firstly, CallLoweringInfo::setLibCallee and CallLoweringInfo::setCallee both incorrectly set NumFixedArgs based on the _previous_ args list. Secondly, TargetLowering::LowerCallTo failed to increment NumFixedArgs when modifying the argument list so a pointer is passed for the return value. If your backend uses the IsFixed property or directly accesses NumFixedArgs, it is _possible_ this change could result in codegen changes (although the previous behaviour would have been incorrect). No such cases have been identified during code review for any in-tree architecture. Differential Revision: https://reviews.llvm.org/D37898 llvm-svn: 315457
*	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-10-10	2	-65/+109
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 315380
*	[DAGCombine] Fix for shuffle to vector extend for non power 2 vectors	David Stuttard	2017-10-10	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: See https://llvm.org/PR33743 for more details It seems that for non-power of 2 vector sizes, the algorithm can produce non-matching sizes for input and result causing an assert. This usually isn't a problem as the isAnyExtend check will weed these out, but in some cases (most often with lots of undefined values for the mask indices) it can pass this check for non power of 2 vectors. Adding in an extra check that ensures that bit size will match for the result and input (as required) Subscribers: nhaehnle Differential Revision: https://reviews.llvm.org/D35241 llvm-svn: 315307
*	Rename OptimizationDiagnosticInfo.* to OptimizationRemarkEmitter.*	Adam Nemet	2017-10-09	1	-1/+1
\| \| \| \| \| \| \|	Sync it up with the name of the class actually defined here. This has been bothering me for a while... llvm-svn: 315249
*	[DAG] combine assertsexts around a trunc	Sanjay Patel	2017-10-09	1	-10/+10
\| \| \| \| \| \| \|	This was a suggested follow-up to: D37017 / https://reviews.llvm.org/rL313577 llvm-svn: 315206
*	Remove unused variables. No functionality change.	Benjamin Kramer	2017-10-08	2	-2/+1
\| \| \| \|	llvm-svn: 315185
*	[SelectionDAG} Use KnownBits::isUnknown and hasConflict. NFC	Craig Topper	2017-10-07	1	-8/+8
\| \| \| \|	llvm-svn: 315154
*	Minor refactoring regarding Cast::isNoopCast(), NFC	Mikael Holmen	2017-10-05	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: FastISel::hasTrivialKill() was the only user of the "IntPtrTy" version of Cast::isNoopCast(). According to review comments in D37894 we could instead use the "DataLayout" version of the method, and thus get rid of the "IntPtrTy" versions of isNoopCast() completely. With the above done, the remaining isNoopCast() could then be simplified a bit more. Reviewers: arsenm Reviewed By: arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D38497 llvm-svn: 314969
*	Revert r314886 "[X86] Improvement in CodeGen instruction selection for LEAs ↵	Hans Wennborg	2017-10-04	1	-11/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(re-applying post required revision changes.)" It broke the Chromium / SQLite build; see PR34830. > Summary: > 1/ Operand folding during complex pattern matching for LEAs has been > extended, such that it promotes Scale to accommodate similar operand > appearing in the DAG. > e.g. > T1 = A + B > T2 = T1 + 10 > T3 = T2 + A > For above DAG rooted at T3, X86AddressMode will no look like > Base = B , Index = A , Scale = 2 , Disp = 10 > > 2/ During OptimizeLEAPass down the pipeline factorization is now performed over LEAs > so that if there is an opportunity then complex LEAs (having 3 operands) > could be factored out. > e.g. > leal 1(%rax,%rcx,1), %rdx > leal 1(%rax,%rcx,2), %rcx > will be factored as following > leal 1(%rax,%rcx,1), %rdx > leal (%rdx,%rcx) , %edx > > 3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops, > thus avoiding creation of any complex LEAs within a loop. > > Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy > > Reviewed By: lsaba > > Subscribers: jmolloy, spatel, igorb, llvm-commits > > Differential Revision: https://reviews.llvm.org/D35014 llvm-svn: 314919
*	[X86] Improvement in CodeGen instruction selection for LEAs (re-applying ↵	Jatin Bhateja	2017-10-04	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	post required revision changes.) Summary: 1/ Operand folding during complex pattern matching for LEAs has been extended, such that it promotes Scale to accommodate similar operand appearing in the DAG. e.g. T1 = A + B T2 = T1 + 10 T3 = T2 + A For above DAG rooted at T3, X86AddressMode will no look like Base = B , Index = A , Scale = 2 , Disp = 10 2/ During OptimizeLEAPass down the pipeline factorization is now performed over LEAs so that if there is an opportunity then complex LEAs (having 3 operands) could be factored out. e.g. leal 1(%rax,%rcx,1), %rdx leal 1(%rax,%rcx,2), %rcx will be factored as following leal 1(%rax,%rcx,1), %rdx leal (%rdx,%rcx) , %edx 3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops, thus avoiding creation of any complex LEAs within a loop. Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy Reviewed By: lsaba Subscribers: jmolloy, spatel, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D35014 llvm-svn: 314886
*	[DebugInfo] Handle endianness when moving debug info for split integer ↵	Bjorn Pettersson	2017-10-03	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	values (reapplied) Summary: Take the target's endianness into account when splitting the debug information in DAGTypeLegalizer::SetExpandedInteger. This patch fixes so that, for big-endian targets, the fragment expression corresponding to the high part of a split integer value is placed at offset 0, in order to correctly represent the memory address order. I have attached a PPC32 reproducer where the resulting DWARF pieces for a 64-bit integer were incorrectly reversed. Original patch was reverted due to using -stop-after=isel in the test case (but that is only working when AMDGPU target is included in the llc build). The test case has now been updated to use -stop-before=expand-isel-pseudos instead. Patch by: dstenb Reviewers: JDevlieghere, aprantl, dblaikie Reviewed By: JDevlieghere, aprantl, dblaikie Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D38172 llvm-svn: 314781
*	ISel type legalization: add debug messages. NFCI.	Sjoerd Meijer	2017-10-03	1	-168/+196
\| \| \| \| \| \| \| \| \| \|	This adds some more debug messages to the type legalizer and functions like PromoteNode, ExpandNode, ExpandLibCall in an attempt to make the debug messages a little bit more informative and useful. Differential Revision: https://reviews.llvm.org/D38450 llvm-svn: 314773
*	[PowerPC] Revert r314666.	Tim Shen	2017-10-02	1	-7/+2
\| \| \| \| \| \| \| \| \|	See https://reviews.llvm.org/D38172. I tried to XFAIL it, but sometimes XPASS triggers the bot. Simply revert it. llvm-svn: 314739
*	Remove trailing whitespace to trigger re-cmaking	Michael Liao	2017-10-02	1	-1/+1
\| \| \| \|	llvm-svn: 314728
*	Eliminate ftrunc if source is know to be rounded	Stanislav Mekhanoshin	2017-10-02	1	-0/+13
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D38421 llvm-svn: 314688
*	[Debug info] Handle endianness when moving debug info for split integer values	Bjorn Pettersson	2017-10-02	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Take the target's endianness into account when splitting the debug information in DAGTypeLegalizer::SetExpandedInteger. This patch fixes so that, for big-endian targets, the fragment expression corresponding to the high part of a split integer value is placed at offset 0, in order to correctly represent the memory address order. I have attached a PPC32 reproducer where the resulting DWARF pieces for a 64-bit integer were incorrectly reversed. Patch by: dstenb Reviewers: JDevlieghere, aprantl, dblaikie Reviewed By: JDevlieghere, aprantl, dblaikie Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D38172 llvm-svn: 314666
*	CodeGen: Fix pointer info in expandUnalignedLoad/Store	Yaxun Liu	2017-09-29	1	-12/+21
\| \| \| \| \| \| \| \| \| \| \| \|	Currently expandUnalignedLoad/Store uses place holder pointer info for temporary memory operand in stack, which does not have correct address space. This causes unaligned private double16 load/store to be lowered to flat_load instead of buffer_load for amdgcn target. This fixes failures of OpenCL conformance test basic/vload_private/vstore_private on target amdgcn---amdgizcl. Differential Revision: https://reviews.llvm.org/D35361 llvm-svn: 314566
*	[DAGCombiner] Fix an off-by-one error in vector logic	George Burgess IV	2017-09-28	1	-2/+2
\| \| \| \| \| \| \| \| \|	Without this, we could end up trying to get the Nth (0-indexed) element from a subvector of size N. Differential Revision: https://reviews.llvm.org/D37880 llvm-svn: 314380
*	[CodeGen] Fix some Clang-tidy modernize-use-default-member-init and Include ↵	Eugene Zelenko	2017-09-27	4	-180/+238
\| \| \| \| \| \|	What You Use warnings; other minor fixes (NFC). llvm-svn: 314363
*	[SelectionDAG] Make NewSDValueDbgMsg print target specific nodes correctly ↵	Craig Topper	2017-09-27	1	-12/+12
\| \| \| \| \| \|	by passing in the SelectionDAG. llvm-svn: 314271
*	[SelectionDAG] Teach simplifyDemandedBits to handle shifts by constant splat ↵	Craig Topper	2017-09-25	1	-62/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vectors This teach simplifyDemandedBits to handle constant splat vector shifts. This required changing some uses of getZExtValue to getLimitedValue since we can't rely on legalization using getShiftAmountTy for the shift amount. I believe there may have been a bug in the ((X << C1) >>u ShAmt) handling where we didn't check if the inner shift was too large. I've fixed that here. I had to add new patterns to ARM because the zext/sext the patterns were trying to look for got turned into an any_extend with this patch. Happy to split that out too, but not sure how to test without this change. Differential Revision: https://reviews.llvm.org/D37665 llvm-svn: 314139
*	[DebugInfo] Sort the SDDbgValue list before assuming it is in IR order	Reid Kleckner	2017-09-25	1	-9/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This code iterates the 'Orders' vector in parallel with the DbgValue list, emitting all DBG_VALUEs that occurred between the last IR order insertion point and the next insertion point. This assumes the SDDbgValue list is sorted in IR order, which it usually is. However, it is not sorted when a node with a debug value is replaced with another one. When this happens, TransferDbgValues is called, and the new value is added to the end of the list. The problem can be solved by stably sorting the list by IR order. Reviewers: aprantl, Ka-Ka Reviewed By: aprantl Subscribers: MatzeB, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D38197 llvm-svn: 314114
*	Use {} instead of make_pair and an iterator for the insertion point, NFC	Reid Kleckner	2017-09-25	1	-5/+6
\| \| \| \|	llvm-svn: 314113
*	[CodeGen] Fix some Clang-tidy modernize-use-bool-literals and Include What ↵	Eugene Zelenko	2017-09-21	3	-163/+200
\| \| \| \| \| \|	You Use warnings; other minor fixes (NFC). llvm-svn: 313941
*	[DAGCombiner] Slightly simplify some code by using APInt::isMask() and ↵	Craig Topper	2017-09-21	1	-3/+3
\| \| \| \| \| \| \| \|	countTrailingOnes instead of getting active bits and checking if all the bits below that make a mask. At least for the 64-bit and less case, we should be able to determine if we even have a mask without counting any bits. This also removes the need to explicitly check for 0 active bits, isMask will return false for 0. llvm-svn: 313908
*	Re-land r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of ↵	Reid Kleckner	2017-09-21	1	-19/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	llvm.dbg.declare" The fix is to avoid invalidating our insertion point in replaceDbgDeclare: Builder.insertDeclare(NewAddress, DIVar, DIExpr, Loc, InsertBefore); + if (DII == InsertBefore) + InsertBefore = &std::next(InsertBefore->getIterator()); DII->eraseFromParent(); I had to write a unit tests for this instead of a lit test because the use list order matters in order to trigger the bug. The reduced C test case for this was: void useit(int); static inline void inlineme() { int x[2]; useit(x); } void f() { inlineme(); inlineme(); } llvm-svn: 313905
*	[SelectionDAG] Pick correct frame index in LowerArguments	Bjorn Pettersson	2017-09-21	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: SelectionDAGISel::LowerArguments is associating arguments with frame indices (FuncInfo->setArgumentFrameIndex). That information is later on used by EmitFuncArgumentDbgValue to create DBG_VALUE instructions that denotes that a variable can be found on the stack. I discovered that for our (big endian) out-of-tree target the association created by SelectionDAGISel::LowerArguments sometimes is wrong. I've seen this happen when a 64-bit value is passed on the stack. The argument will occupy two stack slots (frame index X, and frame index X+1). The fault is that a call to setArgumentFrameIndex is associating the 64-bit argument with frame index X+1. The effect is that the debug information (DBG_VALUE) will point at the least significant part of the arguement on the stack. When printing the argument in a debugger I will get the wrong value. I managed to create a test case for PowerPC that seems to show the same kind of problem. The bugfix will look at the datalayout, taking endianness into account when examining a BUILD_PAIR node, assuming that the least significant part is in the first operand of the BUILD_PAIR. For big endian targets we should use the frame index from the second operand, as the most significant part will be stored at the lower address (using the highest frame index). Reviewers: bogner, rnk, hfinkel, sdardis, aprantl Reviewed By: aprantl Subscribers: nemanjai, aprantl, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D37740 llvm-svn: 313901
*	[DAGCombiner] Remove duplicate code from visitZERO_EXTEND	Craig Topper	2017-09-21	1	-14/+0
\| \| \| \| \| \| \| \|	This exact block of code exists right below. Differential Revision: https://reviews.llvm.org/D38122 llvm-svn: 313891
*	Revert r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of ↵	Daniel Jasper	2017-09-21	1	-29/+19
\| \| \| \| \| \| \| \| \| \| \|	llvm.dbg.declare" .. as well as the two subsequent changes r313826 and r313875. This leads to segfaults in combination with ASAN. Will forward repro instructions to the original author (rnk). llvm-svn: 313876
*	[SelectionDAG] Replace a flag that can never be true with an assert.	Craig Topper	2017-09-21	1	-3/+2
\| \| \| \|	llvm-svn: 313847
*	[SelectionDAG] Use APInt::getActivebits instead of Bitwidth - leading zeros.	Craig Topper	2017-09-20	1	-1/+1
\| \| \| \|	llvm-svn: 313839
*	[IR] Add llvm.dbg.addr, a control-dependent version of llvm.dbg.declare	Reid Kleckner	2017-09-20	1	-19/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This implements the design discussed on llvm-dev for better tracking of variables that live in memory through optimizations: http://lists.llvm.org/pipermail/llvm-dev/2017-September/117222.html This is tracked as PR34136 llvm.dbg.addr is intended to be produced and used in almost precisely the same way as llvm.dbg.declare is today, with the exception that it is control-dependent. That means that dbg.addr should always have a position in the instruction stream, and it will allow passes that optimize memory operations on local variables to insert llvm.dbg.value calls to reflect deleted stores. See SourceLevelDebugging.rst for more details. The main drawback to generating DBG_VALUE machine instrs is that they usually cause LLVM to emit a location list for DW_AT_location. The next step will be to teach DwarfDebug.cpp how to recognize more DBG_VALUE ranges as not needing a location list, and possibly start setting DW_AT_start_offset for variables whose lifetimes begin mid-scope. Reviewers: aprantl, dblaikie, probinson Subscribers: eraman, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D37768 llvm-svn: 313825
*	CodeGen: support SwiftError SwiftCC on Windows x64	Saleem Abdulrasool	2017-09-20	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Add support for passing SwiftError through a register on the Windows x64 calling convention. This allows the use of swifterror attributes on parameters which is used by the swift front end for the `Error` parameter. This partially enables building the swift standard library for Windows x86_64. llvm-svn: 313791
*	CodeGen: use range based for loops (NFC)	Saleem Abdulrasool	2017-09-19	1	-6/+1
\| \| \| \| \| \| \|	Simplify the RPOT traversal by using a range based for loop for the iterator dereference. llvm-svn: 313687
*	[DAGCombiner] fold assertzexts separated by trunc	Sanjay Patel	2017-09-18	1	-2/+25
\| \| \| \| \| \| \| \| \| \| \| \| \|	If we have an AssertZext of a truncated value that has already been AssertZext'ed, we can assert on the wider source op to improve the zext-y knowledge: assert (trunc (assert X, i8) to iN), i1 --> trunc (assert X, i1) to iN This moves a fold from being Mips-specific to general combining, and x86 shows improvements. Differential Revision: https://reviews.llvm.org/D37017 llvm-svn: 313577
*	[DAG, x86] allow store merging before and after legalization (PR34217)	Sanjay Patel	2017-09-18	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	rL310710 allowed store merging to occur after legalization to catch stores that are created late, but this exposes a logic hole seen in PR34217: https://bugs.llvm.org/show_bug.cgi?id=34217 We will miss merging stores if the target lowers vector extracts into target-specific operations. This patch allows store merging to occur both before and after legalization if the target chooses to get maximum merging. I don't think the potential regressions in the other tests are relevant. The tests are for correctness of weird IR constructs rather than perf tests, and I think those are still correct. Differential Revision: https://reviews.llvm.org/D37987 llvm-svn: 313564
*	[SelectionDAG] Add BITCAST handling to ComputeNumSignBits for splatted sign ↵	Simon Pilgrim	2017-09-18	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \|	bits. For cases where we are BITCASTing to vectors of smaller elements, then if the entire source was a splatted sign (src's NumSignBits == SrcBitWidth) we can say that the dst's NumSignBit == DstBitWidth, as we're just splitting those sign bits across multiple elements. We could generalize this but at the moment the only use case I have is to peek through bitcasts to vector comparison results. Differential Revision: https://reviews.llvm.org/D37849 llvm-svn: 313543
*	Revert r313343 "[X86] PR32755 : Improvement in CodeGen instruction selection ↵	Hans Wennborg	2017-09-15	1	-11/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for LEAs." This caused PR34629: asserts firing when building Chromium. It also broke some buildbots building test-suite as reported on the commit thread. > Summary: > 1/ Operand folding during complex pattern matching for LEAs has been > extended, such that it promotes Scale to accommodate similar operand > appearing in the DAG. > e.g. > T1 = A + B > T2 = T1 + 10 > T3 = T2 + A > For above DAG rooted at T3, X86AddressMode will no look like > Base = B , Index = A , Scale = 2 , Disp = 10 > > 2/ During OptimizeLEAPass down the pipeline factorization is now performed over LEAs > so that if there is an opportunity then complex LEAs (having 3 operands) > could be factored out. > e.g. > leal 1(%rax,%rcx,1), %rdx > leal 1(%rax,%rcx,2), %rcx > will be factored as following > leal 1(%rax,%rcx,1), %rdx > leal (%rdx,%rcx) , %edx > > 3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops, > thus avoiding creation of any complex LEAs within a loop. > > Reviewers: lsaba, RKSimon, craig.topper, qcolombet > > Reviewed By: lsaba > > Subscribers: spatel, igorb, llvm-commits > > Differential Revision: https://reviews.llvm.org/D35014 llvm-svn: 313376
*	[X86] PR32755 : Improvement in CodeGen instruction selection for LEAs.	Jatin Bhateja	2017-09-15	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: 1/ Operand folding during complex pattern matching for LEAs has been extended, such that it promotes Scale to accommodate similar operand appearing in the DAG. e.g. T1 = A + B T2 = T1 + 10 T3 = T2 + A For above DAG rooted at T3, X86AddressMode will no look like Base = B , Index = A , Scale = 2 , Disp = 10 2/ During OptimizeLEAPass down the pipeline factorization is now performed over LEAs so that if there is an opportunity then complex LEAs (having 3 operands) could be factored out. e.g. leal 1(%rax,%rcx,1), %rdx leal 1(%rax,%rcx,2), %rcx will be factored as following leal 1(%rax,%rcx,1), %rdx leal (%rdx,%rcx) , %edx 3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops, thus avoiding creation of any complex LEAs within a loop. Reviewers: lsaba, RKSimon, craig.topper, qcolombet Reviewed By: lsaba Subscribers: spatel, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D35014 llvm-svn: 313343
*	Remove usages of deprecated std::unary_function and std::binary_function.	Benjamin Kramer	2017-09-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	These are removed in C++17. We still have some users of unary_function::argument_type, so just spell that typedef out. No functionality change intended. Note that many of the argument types are actually wrong :) llvm-svn: 313287
*	[DAGCombine] (shl (or x, c1), c2) -> (or (shl x, c2), c1 << c2)	Simon Pilgrim	2017-09-14	1	-2/+4
\| \| \| \| \| \| \| \| \| \|	We already have a combine for this pattern when the input to shl is add, so we just need to enable the transformation when the input is or. Original patch by @tstellar Differential Revision: https://reviews.llvm.org/D19325 llvm-svn: 313251
*	[SelectionDAG] ComputeNumSignBits - cleanup ROTL/ROTR wrapping to match ↵	Simon Pilgrim	2017-09-14	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	DAGCombine etc. Use RotAmt.urem(VTBits) instead of AND(RotAmt, VTBits - 1) TBH I don't expect non-power-of-2 types to be created, but it makes the logic clearer and matches what we do in other rotation combines. llvm-svn: 313245
*	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; ↵	Eugene Zelenko	2017-09-13	2	-7/+8
\| \| \| \| \| \|	other minor fixes (NFC). llvm-svn: 313194
*	[SelectionDAG] Remove a check for type being a vector type after calling ↵	Craig Topper	2017-09-11	1	-2/+0
\| \| \| \| \| \| \| \|	getShiftAmountTy. NFCI getShiftAmountTy already returns the vector type when called for vectors. llvm-svn: 312924
*	Fixed a bug in splitting Scatter operation in the Type Legalizer.	Elena Demikhovsky	2017-09-11	1	-7/+6
\| \| \| \| \| \| \| \| \|	After the split of the Scatter operation, the order of the new instructions is well defined - Lo goes before Hi. Otherwise the semantic of Scatter (from LSB to MSB) is broken. I'm chaining 2 nodes to prevent reordering. Differential Revision https://reviews.llvm.org/D37670 llvm-svn: 312894