bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SelectionDAG] Update Loop info after splitting critical edges.	Davide Italiano	2017-06-17	1	-6/+9
\| \| \| \| \| \|	The analysis is expected to be preserved by SelectionDAG. llvm-svn: 305621
*	[SelectionDAG] Use APInt::isSubsetOf. NFC	Craig Topper	2017-06-16	2	-4/+4
\| \| \| \|	llvm-svn: 305606
*	[SelectionDAG] Use APInt::isNullValue/isOneValue. NFC	Craig Topper	2017-06-16	2	-5/+5
\| \| \| \|	llvm-svn: 305605
*	[TargetLowering] Use ConstantSDNode::isOne and getSExtValue instead of ↵	Craig Topper	2017-06-16	1	-6/+6
\| \| \| \| \| \|	getting the underlying APInt first. NFC llvm-svn: 305604
*	[Atomics] Rename and change prototype for atomic memcpy intrinsic	Daniel Neilson	2017-06-16	1	-14/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Background: http://lists.llvm.org/pipermail/llvm-dev/2017-May/112779.html This change is to alter the prototype for the atomic memcpy intrinsic. The prototype itself is being changed to more closely resemble the semantics and parameters of the llvm.memcpy intrinsic -- to ease later combination of the llvm.memcpy and atomic memcpy intrinsics. Furthermore, the name of the atomic memcpy intrinsic is being changed to make it clear that it is not a generic atomic memcpy, but specifically a memcpy is unordered atomic. Reviewers: reames, sanjoy, efriedma Reviewed By: reames Subscribers: mzolotukhin, anna, llvm-commits, skatkov Differential Revision: https://reviews.llvm.org/D33240 llvm-svn: 305558
*	Revert "[DAG] Allow truncated and extend memory operations in Store Merge. ↵	Ahmed Bougacha	2017-06-15	1	-21/+10
\| \| \| \| \| \| \| \|	NFCI." This reverts commit r305468, as it caused PR33475. llvm-svn: 305527
*	Fold variable into assert.	Benjamin Kramer	2017-06-15	1	-2/+1
\| \| \| \| \| \|	Silences an unused variable warning in Release builds. llvm-svn: 305488
*	ISel: Fix FastISel of swifterror values	Arnold Schwaighofer	2017-06-15	3	-14/+125
\| \| \| \| \| \| \| \| \| \| \| \|	The code assumed that we process instructions in basic block order. FastISel processes instructions in reverse basic block order. We need to pre-assign virtual registers before selecting otherwise we get def-use relationships wrong. This only affects code with swifterror registers. rdar://32659327 llvm-svn: 305484
*	[DAG] As StoreMerge now generates only legal nodes remove unecessary guard ↵	Nirav Dave	2017-06-15	1	-4/+2
\| \| \| \| \| \|	when run post-legalization NFCI. llvm-svn: 305477
*	[DAG] Defer Pre/Post IndexStore merge to after mergestore. NFCI.	Nirav Dave	2017-06-15	1	-4/+4
\| \| \| \| \| \| \| \|	In preparation for doing storemerge post-legalization, reorder visitSTORE passes to move pre/post-index combining after store merge. Reordered passes other than store merge are unaffected. llvm-svn: 305473
*	[DAG] Allow truncated and extend memory operations in Store Merge. NFCI.	Nirav Dave	2017-06-15	1	-10/+21
\| \| \| \| \| \| \| \|	As all store merges checks are based on the memory operation performed, allow use of truncated stores and extended loads as valid input candidates for merging. llvm-svn: 305468
*	[DAG] Make MergeStores generate legalized stores. NFCI.	Nirav Dave	2017-06-15	1	-4/+21
\| \| \| \| \| \| \|	Realized merged stores as truncstores if store will be realized as such by legalization. llvm-svn: 305467
*	[DAG] Use correct size for truncated store merge of load. NFCI.	Nirav Dave	2017-06-15	1	-2/+2
\| \| \| \| \| \| \|	Avoid non-legal memory ops by checking correct size when merging stores of loads into a extload-truncstore pair. llvm-svn: 305466
*	[mips] Fix multiprecision arithmetic.	Simon Dardis	2017-06-14	1	-4/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For multiprecision arithmetic on MIPS, rather than using ISD::ADDE / ISD::ADDC, get SelectionDAG to break down the operation into ISD::ADDs and ISD::SETCCs. For MIPS, only the DSP ASE has a carry flag, so in the general case it is not useful to directly support ISD::{ADDE, ADDC, SUBE, SUBC} nodes. Also improve the generation code in such cases for targets with TargetLoweringBase::ZeroOrOneBooleanContent by directly using the result of the comparison node rather than using it in selects. Similarly for ISD::SUBE / ISD::SUBC. Address optimization breakage by moving the generation of MIPS specific integer multiply-accumulate nodes to before legalization. This revolves PR32713 and PR33424. Thanks to Simonas Kazlauskas and Pirama Arumuga Nainar for reporting the issue! Reviewers: slthakur Differential Revision: https://reviews.llvm.org/D33494 llvm-svn: 305389
*	[SelectionDAG] Allow sin/cos -> sincos optimization on GNU triples w/ just ↵	Geoff Berry	2017-06-12	1	-14/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	-fno-math-errno Summary: This change enables the sin(x) cos(x) -> sincos(x) optimization on GNU target triples. This optimization was being inhibited when -ffast-math wasn't set because sincos in GLibC does not set errno, while sin and cos do. However, this optimization will only run if the attributes on the sin/cos calls include readnone, which is how clang represents the fact that it doesn't care about the errno values set by these functions (via the -fno-math-errno flag). Reviewers: hfinkel, bogner Subscribers: mcrosier, javed.absar, llvm-commits, paul.redmond Differential Revision: https://reviews.llvm.org/D32921 llvm-svn: 305204
*	[DAG] add helper to bind memop chains; NFCI	Sanjay Patel	2017-06-12	2	-15/+19
\| \| \| \| \| \| \| \| \| \|	This step is just intended to reduce code duplication rather than change any functionality. A follow-up would be to replace PPCTargetLowering::spliceIntoChain() usage with this new helper. Differential Revision: https://reviews.llvm.org/D33649 llvm-svn: 305192
*	[DAGCombine] Make sure we check the ResNo from UADDO before combining	Amaury Sechet	2017-06-11	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: UADDO has 2 result, and one must check the result no before doing any kind of combine. Without it, the transform is invalid. Reviewers: joerg Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34088 llvm-svn: 305162
*	SelectionDAG: Remove deleted nodes from legalized set to avoid clash with ↵	Zvi Rackover	2017-06-09	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	newly created nodes Summary: During DAG legalization loop in SelectionDAG::Legalize(), bookkeeping of the SDNodes that were already legalized is implemented with SmallPtrSet (LegalizedNodes). This kind of set stores only pointers to objects, not the objects themselves. Unfortunately, if SDNode is deleted during legalization for some reason, LegalizedNodes set is not informed about this fact. This wouldn’t be so bad, if SelectionDAG wouldn’t reuse space deallocated after deletion of unused nodes, for creation of new ones. Because of this, new nodes, created during legalization often can have pointers identical to ones that have been previously legalized, added to the LegalizedNodes set, and deleted afterwards. This in turn causes, that newly created nodes, sharing the same pointer as deleted old ones, are present in LegalizedNodes already at the moment of creation, so we never call Legalize on them. The fix facilitates the fact, that DAG notifies listeners about each modification. I have registered DAGNodeDeletedListener inside SelectionDAG::Legalize, with a callback function that removes any pointer of any deleted SDNode from the LegalizedNodes set. With this modification, LegalizeNodes set does not contain pointers to nodes that were deleted, so newly created nodes can always be inserted to it, even if they share pointers with old deleted nodes. Patch by pawel.szczerbuk@intel.com The issue this patch addresses causes failures in an out-of-tree target, and i was not able to create a reproducer for an in-tree target, hence there is no test-case. Reviewers: delena, spatel, RKSimon, hfinkel, davide, qcolombet Reviewed By: delena Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33891 llvm-svn: 305084
*	Reland "[SelectionDAG] Enable target specific vector scalarization of calls ↵	Simon Dardis	2017-06-09	3	-64/+185
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and returns" By target hookifying getRegisterType, getNumRegisters, getVectorBreakdown, backends can request that LLVM to scalarize vector types for calls and returns. The MIPS vector ABI requires that vector arguments and returns are passed in integer registers. With SelectionDAG's new hooks, the MIPS backend can now handle LLVM-IR with vector types in calls and returns. E.g. 'call @foo(<4 x i32> %4)'. Previously these cases would be scalarized for the MIPS O32/N32/N64 ABI for calls and returns if vector types were not legal. If vector types were legal, a single 128bit vector argument would be assigned to a single 32 bit / 64 bit integer register. By teaching the MIPS backend to inspect the original types, it can now implement the MIPS vector ABI which requires a particular method of scalarizing vectors. Previously, the MIPS backend relied on clang to scalarize types such as "call @foo(<4 x float> %a) into "call @foo(i32 inreg %1, i32 inreg %2, i32 inreg %3, i32 inreg %4)". This patch enables the MIPS backend to take either form for vector types. The previous version of this patch had a "conditional move or jump depends on uninitialized value". Reviewers: zoran.jovanovic, jaydeep, vkalintiris, slthakur Differential Revision: https://reviews.llvm.org/D27845 llvm-svn: 305083
*	Prevent RemoveDeadNodes from deleted already deleted node.	Nirav Dave	2017-06-09	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This prevents against assertion errors like PR32659 which occur from a replacement deleting a node after it's been added to the list argument of RemoveDeadNodes. The specific failure from PR32659 does not currently happen, but it is still potentially possible. The underlying cause is that the callers of the change dfunction builds up a list of nodes to delete after having moved their uses and it possible that a move of a later node will cause a previously deleted nodes to be deleted. Reviewers: bkramer, spatel, davide Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33731 llvm-svn: 305070
*	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-06-07	1	-6/+9
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 304954
*	[DAG] Improve Store Merge candidate pruning. NFC.	Nirav Dave	2017-06-07	1	-3/+15
\| \| \| \| \| \| \| \| \|	When considering merging stores values are the results of loads only consider stores whose values come from loads from the same base. This fixes much of the longer compile times in PR33330. llvm-svn: 304934
*	[DAG] Move SelectionDAG::isCommutativeBinOp to TargetLowering.	Simon Pilgrim	2017-06-07	3	-7/+7
\| \| \| \| \| \| \| \|	This will allow commutation of target-specific DAG nodes in future patches Differential Revision: https://reviews.llvm.org/D33882 llvm-svn: 304911
*	[DAG] remove duplicated code for isOnlyUsedInZeroEqualityComparison(); NFCI	Sanjay Patel	2017-06-06	1	-15/+1
\| \| \| \|	llvm-svn: 304822
*	Sort the remaining #include lines in include/... and lib/....	Chandler Carruth	2017-06-06	9	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787
*	[llvm] Remove double semicolons	Mandeep Singh Grang	2017-06-06	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: craig.topper, arsenm, mehdi_amini Reviewed By: mehdi_amini Subscribers: mehdi_amini, wdng, nhaehnle, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33924 llvm-svn: 304767
*	[SelectionDAG] Update the dominator after splitting critical edges.	Davide Italiano	2017-06-05	1	-5/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Running `llc -verify-dom-info` on the attached testcase results in a crash in the verifier, due to a stale dominator tree. i.e. DominatorTree is not up to date! Computed: =============================-------------------------------- Inorder Dominator Tree: [1] %safe_mod_func_uint8_t_u_u.exit.i.i.i {0,7} [2] %lor.lhs.false.i61.i.i.i {1,2} [2] %safe_mod_func_int8_t_s_s.exit.i.i.i {3,6} [3] %safe_div_func_int64_t_s_s.exit66.i.i.i {4,5} Actual: =============================-------------------------------- Inorder Dominator Tree: [1] %safe_mod_func_uint8_t_u_u.exit.i.i.i {0,9} [2] %lor.lhs.false.i61.i.i.i {1,2} [2] %safe_mod_func_int8_t_s_s.exit.i.i.i {3,8} [3] %safe_div_func_int64_t_s_s.exit66.i.i.i {4,5} [3] %safe_mod_func_int8_t_s_s.exit.i.i.i.lor.lhs.false.i61.i.i.i_crit_edge {6,7} This is because in `SelectionDAGIsel` we split critical edges without updating the corresponding dominator for the function (and we claim in `MachineFunctionPass::getAnalysisUsage()` that the domtree is preserved). We could either stop preserving the domtree in `getAnalysisUsage` or tell `splitCriticalEdge()` to update it. As the second option is easy to implement, that's the one I chose. Differential Revision: https://reviews.llvm.org/D33800 llvm-svn: 304742
*	[DAGCombine] Fix unchecked calls to DAGCombiner::*ExtPromoteOperand	Sanjay Patel	2017-06-05	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Other calls to DAGCombiner::*PromoteOperand check the result, but here it could cause an assertion in getNode. Falling back to any extend in this case instead of failing outright seems correct to me. No test case because: The failure was triggered by an out of tree backend. In order to trigger it, a backend would need to overload TargetLowering::IsDesirableToPromoteOp to return true for a type for which ISD::SIGN_EXTEND_INREG is marked illegal. In tree, only X86 overloads and sometimes returns true for MVT::i16 yet it marks setOperationAction(ISD::SIGN_EXTEND_INREG, MVT::i16 , Legal);. Patch by Jacob Young! Differential Revision: https://reviews.llvm.org/D33633 llvm-svn: 304723
*	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC.	Galina Kistanova	2017-06-03	1	-0/+1
\| \| \| \|	llvm-svn: 304635
*	[CodeGen] Fix Windows builds which treat warnings as errors, broken in r304621.	Eugene Zelenko	2017-06-03	1	-1/+1
\| \| \| \|	llvm-svn: 304627
*	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-06-03	1	-59/+62
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 304621
*	[Statepoint] Be consistent about using deopt naming [NFCI]	Philip Reames	2017-06-02	1	-1/+1
\| \| \| \| \| \|	We'd called this "vm state" in the early days, but have long since standardized on calling it "deopt" in line with the operand bundle tag. Fix a few cases we'd missed. llvm-svn: 304607
*	[TargetLowering] fix formatting; NFC	Sanjay Patel	2017-06-02	1	-2/+1
\| \| \| \|	llvm-svn: 304569
*	nits in TargetLowering.cpp . NFC	Amaury Sechet	2017-06-02	1	-13/+20
\| \| \| \|	llvm-svn: 304532
*	[SelectionDAG] Get rid of recursion in findNonImmUse	Max Kazantsev	2017-06-02	1	-20/+26
\| \| \| \| \| \| \| \| \| \| \| \|	The recursive implementation of findNonImmUse may overflow stack on extremely long use chains. This patch replaces it with an equivalent iterative implementation. Reviewed By: bogner Differential Revision: https://reviews.llvm.org/D33775 llvm-svn: 304522
*	[SDAG] Fix CombineTo ordering in visitZERO_EXTEND and visitSIGN_EXTEND	Nirav Dave	2017-06-01	1	-15/+8
\| \| \| \| \| \| \| \| \| \| \| \|	Reorder CombineTo Calls to prevent references to stale/deleted SDNodes which caused undue assertions. Reviewers: dbabokin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D31625 llvm-svn: 304460
*	DAG: Remove pointless type check	Matt Arsenault	2017-06-01	1	-1/+1
\| \| \| \| \| \|	These are only integer operations. llvm-svn: 304417
*	Only generate addcarry node when it is legal.	Amaury Sechet	2017-06-01	1	-7/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is a problem uncovered by stage2 testing. ADDCARRY end up being generated on target that do not support it. The patch that introduced the problem has other patches layed on top of it, so we want to fix the issue rather than revert it to avoid creating a lor of churn. A regression test will be added shortly, but this is committed as this in order to get the build back to green promptly. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33770 llvm-svn: 304409
*	Do not legalize large setcc with setcce, introduce setcccarry and do it with ↵	Amaury Sechet	2017-06-01	4	-24/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	usubo/setcccarry. Summary: This is a continuation of the work started in D29872 . Passing the carry down as a value rather than as a glue allows for further optimizations. Introducing setcccarry makes the use of addc/subc unecessary and we can start the removal process. This patch only introduce the optimization strictly required to get the same level of optimization as was available before nothing more. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33374 llvm-svn: 304404
*	[DAGCombine] Refactor common addcarry pattern.	Amaury Sechet	2017-06-01	1	-0/+35
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This pattern is no very useful per se, but it exposes optimization for toehr patterns that wouldn't kick in otherwize. It's very common and worth optimizing for. Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32756 llvm-svn: 304402
*	[DAGCombine] (add/uaddo X, Carry) -> (addcarry X, 0, Carry)	Amaury Sechet	2017-06-01	1	-0/+49
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This enables further transforms. Depends on D32916 Reviewers: jyknight, nemanjai, mkuper, spatel, RKSimon, zvi, bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32925 llvm-svn: 304401
*	[ScheduleDAG] Deal with already scheduled loads in ScheduleDAG.	Nirav Dave	2017-05-31	1	-128/+150
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If we attempt to unfold an SUnit in ScheduleDAG that results in finding an already scheduled load, we must should abort the unfold as it will not improve scheduling. This fixes PR32610. Reviewers: jmolloy, sunfish, bogner, spatel Subscribers: llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D32911 llvm-svn: 304321
*	[DAG] Avoid use of stale store.	Nirav Dave	2017-05-31	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Correct references to alignment of store which may be deleted in a previous iteration of merge. Instead use first store that would be merged. Corrects pr33172's use-after-poison caught by ASan. Reviewers: spatel, hfinkel, RKSimon Reviewed By: RKSimon Subscribers: thegameg, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33686 llvm-svn: 304299
*	[SelectionDAG] Remove special case for ISD::FPOWI from the strict FP ↵	Craig Topper	2017-05-30	1	-4/+0
\| \| \| \| \| \| \| \|	intrinsic handling. This code was compensating for FPOWI defaulting to Legal and many targets not changing it to Expand. This was fixed in r304215 to default to Expand so this special handling should no longer be necessary. llvm-svn: 304221
*	[SelectionDAG] Set ISD::FPOWI to Expand by default	Craig Topper	2017-05-30	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently FPOWI defaults to Legal and LegalizeDAG.cpp turns Legal into Expand for this opcode because Legal is a "lie". This patch changes the default for this opcode to Expand and removes the hack from LegalizeDAG.cpp. It also removes all the code in the targets that set this opcode to Expand themselves since they can just rely on the default. Reviewers: spatel, RKSimon, efriedma Reviewed By: RKSimon Subscribers: jfb, dschuff, sbc100, jgravelle-google, nemanjai, javed.absar, andrew.w.kaylor, llvm-commits Differential Revision: https://reviews.llvm.org/D33530 llvm-svn: 304215
*	[DAGCombiner] fix load narrowing transform to exclude loads with extension	Sanjay Patel	2017-05-29	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	The extending load possibility was missed in: https://reviews.llvm.org/rL304072 We might want to handle this cases as a follow-up, but bailing out for now to avoid miscompiling. llvm-svn: 304153
*	[DAGCombiner] use narrow load to avoid vector extract	Sanjay Patel	2017-05-27	1	-0/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we have (extract_subvector(load wide vector)) with no other users, that can just be (load narrow vector). This is intentionally conservative. Follow-ups may loosen the one-use constraint to account for the extract cost or just remove the one-use check. The memop chain updating is based on code that already exists multiple times in x86 lowering, so that should be pulled into a helper function as a follow-up. Background: this is a potential improvement noticed via regressions caused by making x86's peekThroughBitcasts() not loop on consecutive bitcasts (see comments in D33137). Differential Revision: https://reviews.llvm.org/D33578 llvm-svn: 304072
*	Make helper functions static. NFC.	Benjamin Kramer	2017-05-26	1	-5/+6
\| \| \| \|	llvm-svn: 304029
*	[DAGCombiner] use narrow vector ops to eliminate concat/extract (PR32790)	Sanjay Patel	2017-05-26	1	-0/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the best case: extract (binop (concat X1, X2), (concat Y1, Y2)), N --> binop XN, YN ...we kill all of the extract/concat and just have narrow binops remaining. If only one of the binop operands is amenable, this transform is still worthwhile because we kill some of the extract/concat. Optional bitcasting makes the code more complicated, but there doesn't seem to be a way to avoid that. The TODO about extending to more than bitwise logic is there because we really will regress several x86 tests including madd, psad, and even a plain integer-multiply-by-2 or shift-left-by-1. I don't think there's anything fundamentally wrong with this patch that would cause those regressions; those folds are just missing or brittle. If we extend to more binops, I found that this patch will fire on at least one non-x86 regression test. There's an ARM NEON test in test/CodeGen/ARM/coalesce-subregs.ll with a pattern like: t5: v2f32 = vector_shuffle<0,3> t2, t4 t6: v1i64 = bitcast t5 t8: v1i64 = BUILD_VECTOR Constant:i64<0> t9: v2i64 = concat_vectors t6, t8 t10: v4f32 = bitcast t9 t12: v4f32 = fmul t11, t10 t13: v2i64 = bitcast t12 t16: v1i64 = extract_subvector t13, Constant:i32<0> There was no functional change in the codegen from this transform from what I could see though. For the x86 test changes: 1. PR32790() is the closest call. We don't reduce the AVX1 instruction count in that case, but we improve throughput. Also, on a core like Jaguar that double-pumps 256-bit ops, there's an unseen win because two 128-bit ops have the same cost as the wider 256-bit op. SSE/AVX2/AXV512 are not affected which is expected because only AVX1 has the extract/concat ops to match the pattern. 2. do_not_use_256bit_op() is the best case. Everyone wins by avoiding the concat/extract. Related bug for IR filed as: https://bugs.llvm.org/show_bug.cgi?id=33026 3. The SSE diffs in vector-trunc-math.ll are just scheduling/RA, so nothing real AFAICT. 4. The AVX1 diffs in vector-tzcnt-256.ll are all the same pattern: we reduced the instruction count by one in each case by eliminating two insert/extract while adding one narrower logic op. https://bugs.llvm.org/show_bug.cgi?id=32790 Differential Revision: https://reviews.llvm.org/D33137 llvm-svn: 303997
*	[DAG] Move legal type checks in store merge to be checked only	Nirav Dave	2017-05-26	1	-2/+4
\| \| \| \| \| \|	on non-legal cases. NFC. llvm-svn: 303994