bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Prevent construction of cycle in DAG store merge	Nirav Dave	2016-03-25	1	-35/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When merging stores in DAGCombiner, add check to ensure that no dependenices exist that would cause the construction of a cycle in our DAG. This may happen if one store has a data dependence on another instruction (e.g. a load) which itself has a (chain) dependence on another store being merged. These stores cannot be merged safely and doing so results in a cycle that is discovered in LegalizeDAG. This test is only done in cases where Antialias analysis is used (UseAA) as non-AA store merge candidates will be merged logically after all loads which have been checked to not alias. Reviewers: ahatanak, spatel, niravd, arsenm, hfinkel, tstellarAMD, jyknight Subscribers: llvm-commits, tberghammer, danalbert, srhines Differential Revision: http://reviews.llvm.org/D18336 llvm-svn: 264461
*	[SelectionDAG] Ensure constant folded legalized vector element types are ↵	Simon Pilgrim	2016-03-22	1	-1/+1
\| \| \| \| \| \| \| \|	compatible with the BUILD_VECTOR type Found during fuzz testing - 32-bit x86 targets were legalizing a <2 x i1> compare result to <2 x i32> when <2 x i64> was expected. llvm-svn: 264085
*	[DAG] use !isUndef() ; NFCI	Sanjay Patel	2016-03-14	1	-6/+4
\| \| \| \|	llvm-svn: 263453
*	[DAG] use isUndef() ; NFCI	Sanjay Patel	2016-03-14	1	-33/+29
\| \| \| \|	llvm-svn: 263448
*	Re-apply "SelectionDAG: Store SDNode operands in an ArrayRecycler"	Justin Bogner	2016-03-08	1	-143/+118
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This re-applies r262886 with a fix for 32 bit platforms that have 8 byte pointer alignment, effectively reverting r262892. Original Message: Currently some SDNode operands are malloc'd, some are stored inline in subclasses of SDNode, and some are thrown into a BumpPtrAllocator. This scheme is complex, inconsistent, and makes refactoring SDNodes fairly difficult. Instead, we can allocate all of the operands using an ArrayRecycler that wraps a BumpPtrAllocator. This keeps the cache locality when iterating operands, improves locality when iterating SDNodes without looking at operands, and vastly simplifies the ownership semantics. It also means we stop overallocating SDNodes by 2-3x and will make it simpler to fix the rampant undefined behaviour we have in how we mutate SDNodes from one kind to another (See llvm.org/pr26808). This is NFC other than the changes in memory behaviour, and I ran some LNT tests to make sure this didn't hurt compile time. Not many tests changed: there were a couple of 1-2% regressions reported, but there were more improvements (of up to 4%) than regressions. llvm-svn: 262902
*	Revert "SelectionDAG: Store SDNode operands in an ArrayRecycler"	Justin Bogner	2016-03-08	1	-118/+143
\| \| \| \| \| \| \| \| \|	Looks like the largest SDNode is different between 32 and 64 bit now, so this is breaking 32 bit bots. Reverting while I figure out a fix. This reverts r262886. llvm-svn: 262892
*	SelectionDAG: Store SDNode operands in an ArrayRecycler	Justin Bogner	2016-03-08	1	-143/+118
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently some SDNode operands are malloc'd, some are stored inline in subclasses of SDNode, and some are thrown into a BumpPtrAllocator. This scheme is complex, inconsistent, and makes refactoring SDNodes fairly difficult. Instead, we can allocate all of the operands using an ArrayRecycler that wraps a BumpPtrAllocator. This keeps the cache locality when iterating operands, improves locality when iterating SDNodes without looking at operands, and vastly simplifies the ownership semantics. It also means we stop overallocating SDNodes by 2-3x and will make it simpler to fix the rampant undefined behaviour we have in how we mutate SDNodes from one kind to another (See llvm.org/pr26808). This is NFC other than the changes in memory behaviour, and I ran some LNT tests to make sure this didn't hurt compile time. Not many tests changed: there were a couple of 1-2% regressions reported, but there were more improvements (of up to 4%) than regressions. llvm-svn: 262886
*	SelectionDAG: Use correctly sized allocation functions for SDNodes	Justin Bogner	2016-03-02	1	-116/+86
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The placement new calls here were all calling the allocation function in RecyclingAllocator/Recycler for SDNode, instead of the function for the specific subclass we were constructing. Since this particular allocator always overallocates it more or less worked, but would hide what we're actually doing from any memory tools. Also, if you tried to change this allocator so something like a BumpPtrAllocator or MallocAllocator, the compiler would crash horribly all the time. Part of llvm.org/PR26808. llvm-svn: 262500
*	SelectionDAG: Use correct addrspace when lowering memcpy	Matt Arsenault	2016-02-22	1	-9/+16
\| \| \| \| \| \| \| \| \| \| \|	This was causing assertions later from using the wrong pointer size with LDS operations. getOptimalMemOpType should also have address space arguments later. This avoids assertions in existing tests exposed by a future commit. llvm-svn: 261580
*	ADT: Remove == and != comparisons between ilist iterators and pointers	Duncan P. N. Exon Smith	2016-02-21	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	I missed == and != when I removed implicit conversions between iterators and pointers in r252380 since they were defined outside ilist_iterator. Since they depend on getNodePtrUnchecked(), they indirectly rely on UB. This commit removes all uses of these operators. (I'll delete the operators themselves in a separate commit so that it can be easily reverted if necessary.) There should be NFC here. llvm-svn: 261498
*	[SelectionDAG] change getConstant() to use the input SDLoc when building ↵	Sanjay Patel	2016-02-11	1	-5/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	splat vectors The code change is simple enough: instead of attaching an anonymous SDLoc to splatted vector constants, use the scalar constant's existing SDLoc since that is what is passed into getConstant() as a param. But this changes instruction scheduling, so I'll explain why that happens. The motivation for this patch starts near: http://reviews.llvm.org/rL258833 ...x86's getZeroVector() could be similarly cleaned up and I thought it would be 'NFC'. But when I made that change locally, several x86 codegen tests wiggled. It turns out that the lack of SDLoc consistency in getConstant() changes the way ScheduleDAGRRList behaves. This is because the SDLoc contains 'IROrder' and some DAG scheduler algorithms use IROrder for tie-breaking. Differential Revision: http://reviews.llvm.org/D16972 llvm-svn: 260582
*	[SelectionDAG] make getMemBasePlusOffset() accessible; NFCI	Sanjay Patel	2016-02-09	1	-12/+9
\| \| \| \| \| \| \| \| \|	I reinvented this functionality in http://reviews.llvm.org/D16828 because it was hidden away as a static function. The changes in x86 are not based on a complete audit. I suspect there are other possible uses there, and there are almost certainly more potential users in other targets. llvm-svn: 260295
*	[SelectionDAG] Fix CombineToPreIndexedLoadStore O(n^2) behavior	Tim Shen	2016-02-03	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch consists of two parts: a performance fix in DAGCombiner.cpp and a correctness fix in SelectionDAG.cpp. The test case tests the bug that's uncovered by the performance fix, and fixed by the correctness fix. The performance fix keeps the containers required by the hasPredecessorHelper (which is a lazy DFS) and reuse them. Since hasPredecessorHelper is called in a loop, the overall efficiency reduced from O(n^2) to O(n), where n is the number of SDNodes. The correctness fix keeps iterating the neighbor list even if it's time to early return. It will return after finishing adding all neighbors to Worklist, so that no neighbors are discarded due to the original early return. llvm-svn: 259691
*	Rename TargetSelectionDAGInfo into SelectionDAGTargetInfo and move it to ↵	Benjamin Kramer	2016-01-27	1	-1/+1
\| \| \| \| \| \| \| \|	CodeGen/ It's a SelectionDAG thing, not a Target thing. llvm-svn: 258939
*	tidy up; NFC	Sanjay Patel	2016-01-26	1	-9/+9
\| \| \| \|	llvm-svn: 258838
*	fix formatting; NFC	Sanjay Patel	2016-01-26	1	-2/+1
\| \| \| \|	llvm-svn: 258825
*	[SelectionDAG] Use the correct return type for memcpy, memmove, and memset.	Dan Gohman	2016-01-25	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	When generating calls to memcpy, memmove, and memset, use void* as the return type rather than void, to match the standard signatures for these functions. This has no practical effect for most targets, since the return values of these calls aren't being used anyway, and most calling conventions tolerate this kind of mismatch. However, this change will help support future optimizations to utilize the return value to avoid holding the argument value live across a call. llvm-svn: 258691
*	[SelectionDAG] Generalised the CONCAT_VECTORS creation to support ↵	Simon Pilgrim	2016-01-23	1	-10/+12
\| \| \| \| \| \|	BUILD_VECTOR and UNDEF folding. llvm-svn: 258646
*	[SelectionDAG] Fold more offsets into GlobalAddresses	Dan Gohman	2016-01-22	1	-2/+46
\| \| \| \| \| \| \| \|	This reapplies r258296 and r258366, and also fixes an existing bug in SelectionDAG.cpp's isMemSrcFromString, neglecting to account for the offset in a GlobalAddressSDNode, which is uncovered by those patches. llvm-svn: 258482
*	Revert "[SelectionDAG] Fold more offsets into GlobalAddresses"	Reid Kleckner	2016-01-22	1	-43/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts r258296 and the follow up r258366. With this change, we miscompiled the following program on Windows: #include <string> #include <iostream> static const char kData[] = "asdf jkl;"; int main() { std::string s(kData + 3, sizeof(kData) - 3); std::cout << s << '\n'; } llvm-svn: 258465
*	[SelectionDAG] Fix constant offset folding to avoid commuting ↵	Dan Gohman	2016-01-20	1	-2/+3
\| \| \| \| \| \| \| \| \|	non-commutative operators. This fixes a miscompile in MultiSource/Benchmarks/MiBench/consumer-lame introduced in r258296. llvm-svn: 258366
*	[SelectionDAG] Fold more offsets into GlobalAddresses	Dan Gohman	2016-01-20	1	-0/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SelectionDAG previously missed opportunities to fold constants into GlobalAddresses in several areas. For example, given `(add (add GA, c1), y)`, it would often reassociate to `(add (add GA, y), c1)`, missing the opportunity to create `(add GA+c, y)`. This isn't often visible on targets such as X86 which effectively reassociate adds in their complex address-mode folding logic, however it is currently visible on WebAssembly since it currently has very simple address mode folding code that doesn't reassociate anything. This patch fixes this by making SelectionDAG fold offsets into GlobalAddresses at the same times that it folds constants together, so that it doesn't miss any opportunities to perform such folding. Differential Revision: http://reviews.llvm.org/D16090 llvm-svn: 258296
*	[SelectionDAG] CSE nodes with differing SDNodeFlags	Dan Gohman	2016-01-15	1	-22/+22
\| \| \| \| \| \| \| \| \| \| \| \|	In the optimizer (GVN etc.) when eliminating redundant nodes with different flags, the flags are ignored for the purposes of testing for congruence, and then intersected for the purposes of producing a result that supports the union of all the uses. This commit makes SelectionDAG's CSE do the same thing, allowing it to CSE nodes in more cases. This fixes PR26063. Differential Revision: http://reviews.llvm.org/D15957 llvm-svn: 257940
*	[SelectionDAG] Pulled out common code for CONCAT_VECTORS node creation	Simon Pilgrim	2016-01-03	1	-39/+55
\| \| \| \| \| \|	Pulled out the similar CONCAT_VECTORS creation code from the 2/3 operand getNode() calls (to handle all UNDEF and all BUILD_VECTOR cases). Added a similar handler to the general getNode() call as well. llvm-svn: 256709
*	Partially fix memcpy / memset / memmove lowering in SelectionDAG ↵	Manuel Jacob	2015-12-12	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	construction if address space != 0. Summary: Previously SelectionDAGBuilder asserted that the pointer operands of memcpy / memset / memmove intrinsics are in address space < 256. This assert implicitly assumed the X86 backend, where all address spaces < 256 are equivalent to address space 0 from the code generator's point of view. On some targets (R600 and NVPTX) several address spaces < 256 have a target-defined meaning, so this assert made little sense for these targets. This patch removes this wrong assertion and adds extra checks before lowering these intrinsics to library calls. If a pointer operand can't be casted to address space 0 without changing semantics, a fatal error is reported to the user. The new behavior should be valid for all targets that give address spaces != 0 a target-specified meaning (NVPTX, R600, X86). NVPTX lowers big or variable-sized memory intrinsics before SelectionDAG construction. All other memory intrinsics are inlined (the threshold is set very high for this target). R600 doesn't support memcpy / memset / memmove library calls (previously the illegal emission of a call to such library function triggered an error somewhere in the code generator). X86 now emits inline loads and stores for address spaces 256 and 257 up to the same threshold that is used for address space 0 and reports a fatal error otherwise. I call this a "partial fix" because there are still cases that can't be lowered. A fatal error is reported in these cases. Reviewers: arsenm, theraven, compnerd, hfinkel Subscribers: hfinkel, llvm-commits, alex Differential Revision: http://reviews.llvm.org/D7241 llvm-svn: 255441
*	[DAGCombiner] Fix PR25763 - vector comparison constant folding + sign-extension	Simon Pilgrim	2015-12-10	1	-5/+8
\| \| \| \| \| \|	PR25763 demonstrated an issue with D14683 - vector comparison constant folding only works for i1 results, so we need to split off the sign-extension of the result to the required type. Luckily this can be done with the existing type legalization code. llvm-svn: 255289
*	[X86] Part 1 to fix x86-64 fp128 calling convention.	Chih-Hung Hsieh	2015-12-03	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Almost all these changes are conditioned and only apply to the new x86-64 f128 type configuration, which will be enabled in a follow up patch. They are required together to make new f128 work. If there is any error, we should fix or revert them as a whole. These changes should have no impact to current configurations. * Relax type legalization checks to accept new f128 type configuration, whose TypeAction is TypeSoftenFloat, not TypeLegal, but also has TLI.isTypeLegal true. * Relax GetSoftenedFloat to return in some cases f128 type SDValue, which is TLI.isTypeLegal but not "softened" to i128 node. * Allow customized FABS, FNEG, FCOPYSIGN on new f128 type configuration, to generate optimized bitwise operators for libm functions. * Enhance related Lower* functions to handle f128 type. * Enhance DAGTypeLegalizer::run, SoftenFloatResult, and related functions to keep new f128 type in register, and convert f128 operators to library calls. * Fix Combiner, Emitter, Legalizer routines that did not handle f128 type. * Add ExpandConstant to handle i128 constants, ExpandNode to handle ISD::Constant node. * Add one more parameter to getCommonSubClass and firstCommonClass, to guarantee that returned common sub class will contain the specified simple value type. This extra parameter is used by EmitCopyFromReg in InstrEmitter.cpp. * Fix infinite loop in getTypeLegalizationCost when f128 is the value type. * Fix printOperand to handle null operand. * Enhance ISD::BITCAST node to handle f128 constant. * Expand new f128 type for BR_CC, SELECT_CC, SELECT, SETCC nodes. * Enhance X86AsmPrinter to emit f128 values in comments. Differential Revision: http://reviews.llvm.org/D15134 llvm-svn: 254653
*	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC)	Artyom Skrobov	2015-11-25	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085
*	[DAGCombiner] Vector constant folding for comparisons	Simon Pilgrim	2015-11-18	1	-6/+14
\| \| \| \| \| \| \| \| \| \|	This patch adds support for vector constant folding of integer/float comparisons. This requires FoldConstantVectorArithmetic to support scalar constant operands (in this case ISD::CONDCASE). In future we should be able to support other scalar constant types as necessary (and possibly start calling FoldConstantVectorArithmetic for all node creations) Differential Revision: http://reviews.llvm.org/D14683 llvm-svn: 253504
*	add a SelectionDAG method to check if no common bits are set in two nodes; NFCI	Sanjay Patel	2015-11-09	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was suggested in: http://reviews.llvm.org/D13956 and is a follow-on to: http://reviews.llvm.org/rL252515 http://reviews.llvm.org/rL252519 This lets us remove logically equivalent/duplicated code from DAGCombiner and X86ISelDAGToDAG. A corresponding function for IR instructions already exists in ValueTracking. llvm-svn: 252539
*	[SelectionDAG] Use existing constant nodes instead of recreating them. NFC.	Simon Pilgrim	2015-11-03	1	-9/+6
\| \| \| \|	llvm-svn: 251990
*	[ValueTracking] Use !range metadata more aggressively in KnownBits	Sanjoy Das	2015-10-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Teach `computeKnownBitsFromRangeMetadata` to use `!range` metadata more aggressively. Reviewers: majnemer, nlewycky, jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14100 llvm-svn: 251487
*	[SelectionDAG] Don't inspect !range metadata for extended loads	Sanjoy Das	2015-10-28	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Don't call `computeKnownBitsFromRangeMetadata` for extended loads -- this can cause a mismatch between the width of the !range metadata and the width of the APInt's accumulating `KnownZero` (and `KnownOne` in the future). This isn't a problem now, but will be after a future change. Note: this can be made more aggressive in the future. Reviewers: nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14107 llvm-svn: 251486
*	[DAGCombiner] Tidy up ConstantFP commutation. NFCI	Simon Pilgrim	2015-10-24	1	-37/+21
\| \| \| \| \| \|	Move ConstantFP canonicalization of commutative instructions to start of 2-op node creation (matches integer) - simplifies constant folding code. llvm-svn: 251203
*	Restore the original behavior of SelectionDAG::getTargetIndex().	Owen Anderson	2015-10-19	1	-1/+1
\| \| \| \| \| \|	It looks like an extra negation snuck in as apart of restoring it. llvm-svn: 250726
*	Put back SelectionDAG::getTargetIndex.	Benjamin Kramer	2015-10-19	1	-0/+18
\| \| \| \| \| \| \|	While technically this is untested dead code, it has out-of-tree users. This reverts a part of r250434. llvm-svn: 250717
*	Use SDValue bool check. NFCI.	Simon Pilgrim	2015-10-18	1	-2/+2
\| \| \| \|	llvm-svn: 250653
*	Move one-use variable inside test. NFC.	Simon Pilgrim	2015-10-18	1	-2/+1
\| \| \| \|	llvm-svn: 250651
*	[DAG] Ensure vector constant folding uses correct scalar undef types	Simon Pilgrim	2015-10-17	1	-2/+2
\| \| \| \| \| \|	Minor fix to D13665 found during post-commit review. llvm-svn: 250616
*	[SelectionDAG] Remove dead code. NFC.	Benjamin Kramer	2015-10-15	1	-40/+0
\| \| \| \| \| \|	Carefully selected parts without deleting graph stuff and dumping methods. llvm-svn: 250434
*	SelectionDAG: Remove implicit ilist iterator conversions, NFC	Duncan P. N. Exon Smith	2015-10-13	1	-5/+5
\| \| \| \|	llvm-svn: 250214
*	[SelectionDAG] Add common vector constant folding helper function	Simon Pilgrim	2015-10-12	1	-38/+90
\| \| \| \| \| \| \| \| \| \| \| \|	We have a number of functions that implement constant folding of vectors (unary and binary ops) in near identical manners (and the differences don't appear to be critical). This patch introduces a common implementation (SelectionDAG::FoldConstantVectorArithmetic) and calls this in both the unary and binary op cases. After this initial patch I intend to begin enabling vector constant folding for a wider number of opcodes in SelectionDAG::getNode(). Differential Revision: http://reviews.llvm.org/D13665 llvm-svn: 250118
*	[ARM] Promote helper function to SelectionDAG.	Chad Rosier	2015-10-07	1	-0/+19
\| \| \| \| \| \| \| \| \|	I'll be using the function in a similar combine for AArch64. The helper was also improved to handle undef values. Part of http://reviews.llvm.org/D13442 llvm-svn: 249572
*	[DAGCombiner] Merge SIGN_EXTEND_INREG vector constant folding methods. NCI.	Simon Pilgrim	2015-10-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	visitSIGN_EXTEND_INREG calls SelectionDAG::getNode to constant fold scalar constants but handles vector constants itself, despite getNode being capable of dealing with them. This required a minor change to the getNode implementation to actually deal with cases where the scalars of a BUILD_VECTOR were wider integers than the vector type - which was the only extra ability of the visitSIGN_EXTEND_INREG implementation. No codegen intended and all existing tests remain the same. llvm-svn: 249236
*	Remove roundingMode argument in APFloat::mod	Stephen Canon	2015-09-21	1	-1/+1
\| \| \| \| \| \|	Because mod is always exact, this function should have never taken a rounding mode argument. The actual implementation still has issues, which I'll look at resolving in a subsequent patch. llvm-svn: 248195
*	SelectionDAG: Use InsertNode for EntryNode	Matthias Braun	2015-09-21	1	-2/+2
\| \| \| \| \| \|	This fixes problems where two nodes have persistent debug id 0 assigned. llvm-svn: 248182
*	SelectionDAG: Introduce PersistentID to SDNode for assert builds.	Matthias Braun	2015-09-18	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This gives us more human readable numbers to identify nodes in debug dumps. Before: 0x7fcbd9700160: ch = EntryToken 0x7fcbd985c7c8: i64 = Register %RAX ... 0x7fcbd9700160: <multiple use> 0x7fcbd985c578: i64,ch = MOV64rm 0x7fcbd985c6a0, 0x7fcbd985cc68, 0x7fcbd985c200, 0x7fcbd985cd90, 0x7fcbd985ceb8, 0x7fcbd9700160<Mem:LD8[@foo]> [ORD=2] 0x7fcbd985c8f0: ch,glue = CopyToReg 0x7fcbd9700160, 0x7fcbd985c7c8, 0x7fcbd985c578 [ORD=3] 0x7fcbd985c7c8: <multiple use> 0x7fcbd985c8f0: <multiple use> 0x7fcbd985c8f0: <multiple use> 0x7fcbd985ca18: ch = RETQ 0x7fcbd985c7c8, 0x7fcbd985c8f0, 0x7fcbd985c8f0:1 [ORD=3] Now: t0: ch = EntryToken t5: i64 = Register %RAX ... t0: <multiple use> t3: i64,ch = MOV64rm t10, t12, t11, t13, t14, t0<Mem:LD8[@foo]> [ORD=2] t6: ch,glue = CopyToReg t0, t5, t3 [ORD=3] t5: <multiple use> t6: <multiple use> t6: <multiple use> t7: ch = RETQ t5, t6, t6:1 [ORD=3] Differential Revision: http://reviews.llvm.org/D12564 llvm-svn: 248010
*	propagate fast-math-flags on DAG nodes	Sanjay Patel	2015-09-16	1	-15/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	After D10403, we had FMF in the DAG but disabled by default. Nick reported no crashing errors after some stress testing, so I enabled them at r243687. However, Escha soon notified us of a bug not covered by any in-tree regression tests: if we don't propagate the flags, we may fail to CSE DAG nodes because differing FMF causes them to not match. There is one test case in this patch to prove that point. This patch hopes to fix or leave a 'TODO' for all of the in-tree places where we create nodes that are FMF-capable. I did this by putting an assert in SelectionDAG.getNode() to find any FMF-capable node that was being created without FMF ( D11807 ). I then ran all regression tests and test-suite and confirmed that everything passes. This patch exposes remaining work to get DAG FMF to be fully functional: (1) add the flags to non-binary nodes such as FCMP, FMA and FNEG; (2) add the flags to intrinsics; (3) use the flags as conditions for transforms rather than the current global settings. Differential Revision: http://reviews.llvm.org/D12095 llvm-svn: 247815
*	[SelectionDAG] Swap commutative binops before constant-based folding	Hal Finkel	2015-09-06	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In searching for a fix for the underlying code-quality bug highlighted by r246937 (that SDAG simplification can lead to us generating an ISD::OR node with a constant zero LHS), I ran across this: We generically canonicalize commutative binary-operation nodes in SDAG getNode so that, if only one operand is a constant, it will be on the RHS. However, we were doing this only after a bunch of constant-based simplification checks that all assume this canonical form (that any constant will be on the RHS). Moving the operand-swapping canonicalization prior to these checks seems like the right thing to do (and, as it turns out, causes SDAG to completely fold away the computation in test/CodeGen/ARM/2012-11-14-subs_carry.ll, just like InstCombine would do). llvm-svn: 246938
*	SelectionDAG: add missing ComputeSignBits case for SELECT_CC	Fiona Glaser	2015-08-29	1	-0/+5
\| \| \| \| \| \|	Identical to SELECT, just with different operand numbers. llvm-svn: 246366