bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	PowerPC: Remove implicit ilist iterator conversions, NFC	Duncan P. N. Exon Smith	2015-10-20	5	-27/+23
\| \| \| \|	llvm-svn: 250787
*	[PowerPC] Fix invalid lxvdsx optimization (PR25157)	Bill Schmidt	2015-10-14	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	PR25157 identifies a bug where a load plus a vector shuffle is incorrectly converted into an LXVDSX instruction. That optimization is only valid if the load is of a doubleword, and in the noted case, it was not. This corrects that problem. Joint patch with Eric Schweitz, who provided the bugpoint-reduced test case. llvm-svn: 250324
*	Vector element extraction without stack operations on Power 8	Nemanja Ivanovic	2015-10-09	3	-23/+369
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: http://reviews.llvm.org/D12032 This patch builds onto the patch that provided scalar to vector conversions without stack operations (D11471). Included in this patch: - Vector element extraction for all vector types with constant element number - Vector element extraction for v16i8 and v8i16 with variable element number - Removal of some unnecessary COPY_TO_REGCLASS operations that ended up unnecessarily moving things around between registers Not included in this patch (will be in upcoming patch): - Vector element extraction for v4i32, v4f32, v2i64 and v2f64 with variable element number - Vector element insertion for variable/constant element number Testing is provided for all extractions. The extractions that are not implemented yet are just placeholders. llvm-svn: 249822
*	PowerPC: Don't use getNextNode() for insertion point	Duncan P. N. Exon Smith	2015-10-08	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	Stop using `getNextNode()` to create an insertion point for machine instructions (at least, in this one place). Instead, use an iterator. As a drive-by, clean up dump statements to use iterator logic. The `getNextNode()` interface isn't actually supposed to work for insertion points; it's supposed to return `nullptr` if this is the last node. It's currently broken and will "happen" to work, but if we ever fix the function, we'll get some strange failures. llvm-svn: 249758
*	Fix pr24486.	Rafael Espindola	2015-10-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This extends the work done in r233995 so that now getFragment (in addition to getSection) also works for variable symbols. With that the existing logic to decide if a-b can be computed works even if a or b are variables. Given that, the expression evaluation can avoid expanding variables as aggressively and that in turn lets the relocation code see the original variable. In order for this to work with the asm streamer, there is now a dummy fragment per section. It is used to assign a section to a symbol when no other fragment exists. This patch is a joint work by Maxim Ostapenko andy myself. llvm-svn: 249303
*	[PowerPC] Disable shrink wrapping	Hal Finkel	2015-09-30	1	-2/+2
\| \| \| \| \| \| \|	Shrink wrapping is causing a self-hosting failure on PPC64/Linux. Disable for now until the problem can be fixed. llvm-svn: 248924
*	Addition of interfaces the BE to conform to Table A-2 of ELF V2 ABI V1.1	Nemanja Ivanovic	2015-09-29	2	-31/+43
\| \| \| \| \| \| \| \| \|	This patch corresponds to review: http://reviews.llvm.org/D13191 Back end portion of the fifth round of additions to altivec.h. llvm-svn: 248809
*	Improved the interface of methods commuting operands, improved X86-FMA3 ↵	Andrew Kaylor	2015-09-28	2	-12/+27
\| \| \| \| \| \| \| \| \| \|	mem-folding&coalescing. Patch by Slava Klochkov (vyacheslav.n.klochkov@intel.com) Differential Revision: http://reviews.llvm.org/D11370 llvm-svn: 248735
*	MachineBasicBlock: Factor out common code into isReturnBlock()	Matthias Braun	2015-09-25	2	-3/+3
\| \| \| \|	llvm-svn: 248617
*	[SCEV] Introduce ScalarEvolution::getOne and getZero.	Sanjoy Das	2015-09-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: It is fairly common to call SE->getConstant(Ty, 0) or SE->getConstant(Ty, 1); this change makes such uses a little bit briefer. I've refactored the call sites I could find easily to use getZero / getOne. Reviewers: hfinkel, majnemer, reames Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D12947 llvm-svn: 248362
*	Prune trailing whitespaces.	NAKAMURA Takumi	2015-09-22	3	-17/+17
\| \| \| \|	llvm-svn: 248265
*	Untabify.	NAKAMURA Takumi	2015-09-22	8	-40/+39
\| \| \| \|	llvm-svn: 248264
*	Reformat blank lines.	NAKAMURA Takumi	2015-09-22	5	-36/+24
\| \| \| \|	llvm-svn: 248263
*	Reformat comment lines.	NAKAMURA Takumi	2015-09-22	4	-22/+23
\| \| \| \|	llvm-svn: 248262
*	Reformat.	NAKAMURA Takumi	2015-09-22	6	-112/+103
\| \| \| \|	llvm-svn: 248261
*	Fix utf8 chars.	NAKAMURA Takumi	2015-09-22	1	-1/+1
\| \| \| \|	llvm-svn: 248259
*	[Machine Combiner] Refactor machine reassociation code to be target-independent.	Chad Rosier	2015-09-21	2	-233/+9
\| \| \| \| \| \| \| \| \| \|	No functional change intended. Patch by Haicheng Wu <haicheng@codeaurora.org>! http://reviews.llvm.org/D12887 PR24522 llvm-svn: 248164
*	constify the Function parameter to the TTI creation callback and	Eric Christopher	2015-09-16	2	-3/+4
\| \| \| \| \| \|	propagate to all callers/users/etc. llvm-svn: 247864
*	propagate fast-math-flags on DAG nodes	Sanjay Patel	2015-09-16	1	-6/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	After D10403, we had FMF in the DAG but disabled by default. Nick reported no crashing errors after some stress testing, so I enabled them at r243687. However, Escha soon notified us of a bug not covered by any in-tree regression tests: if we don't propagate the flags, we may fail to CSE DAG nodes because differing FMF causes them to not match. There is one test case in this patch to prove that point. This patch hopes to fix or leave a 'TODO' for all of the in-tree places where we create nodes that are FMF-capable. I did this by putting an assert in SelectionDAG.getNode() to find any FMF-capable node that was being created without FMF ( D11807 ). I then ran all regression tests and test-suite and confirmed that everything passes. This patch exposes remaining work to get DAG FMF to be fully functional: (1) add the flags to non-binary nodes such as FCMP, FMA and FNEG; (2) add the flags to intrinsics; (3) use the flags as conditions for transforms rather than the current global settings. Differential Revision: http://reviews.llvm.org/D12095 llvm-svn: 247815
*	Revert r247692: Replace Triple with a new TargetTuple in MCTargetDesc/* and ↵	Daniel Sanders	2015-09-15	8	-37/+35
\| \| \| \| \| \| \| \|	related. NFC. Eric has replied and has demanded the patch be reverted. llvm-svn: 247702
*	Re-commit r247683: Replace Triple with a new TargetTuple in MCTargetDesc/* ↵	Daniel Sanders	2015-09-15	8	-35/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and related. NFC. Summary: This is the first patch in the series to migrate Triple's (which are ambiguous) to TargetTuple's (which aren't). For the moment, TargetTuple simply passes all requests to the Triple object it holds. Once it has replaced Triple, it will start to implement the interface in a more suitable way. This change makes some changes to the public C++ API. In particular, InitMCSubtargetInfo(), createMCRelocationInfo(), and createMCSymbolizer() now take TargetTuples instead of Triples. The other public C++ API's have been left as-is for the moment to reduce patch size. This commit also contains a trivial patch to clang to account for the C++ API change. Thanks go to Pavel Labath for fixing LLDB for me. Reviewers: rengolin Subscribers: jyknight, dschuff, arsenm, rampitec, danalbert, srhines, javed.absar, dsanders, echristo, emaste, jholewinski, tberghammer, ted, jfb, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10969 llvm-svn: 247692
*	Revert r247684 - Replace Triple with a new TargetTuple ...	Daniel Sanders	2015-09-15	8	-37/+35
\| \| \| \| \| \|	LLDB needs to be updated in the same commit. llvm-svn: 247686
*	Replace Triple with a new TargetTuple in MCTargetDesc/* and related. NFC.	Daniel Sanders	2015-09-15	8	-35/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is the first patch in the series to migrate Triple's (which are ambiguous) to TargetTuple's (which aren't). For the moment, TargetTuple simply passes all requests to the Triple object it holds. Once it has replaced Triple, it will start to implement the interface in a more suitable way. This change makes some changes to the public C++ API. In particular, InitMCSubtargetInfo(), createMCRelocationInfo(), and createMCSymbolizer() now take TargetTuples instead of Triples. The other public C++ API's have been left as-is for the moment to reduce patch size. This commit also contains a trivial patch to clang to account for the C++ API change. Reviewers: rengolin Subscribers: jyknight, dschuff, arsenm, rampitec, danalbert, srhines, javed.absar, dsanders, echristo, emaste, jholewinski, tberghammer, ted, jfb, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10969 llvm-svn: 247683
*	Fix namespace indentation and missing blank lines before 'public:' in ↵	Daniel Sanders	2015-09-15	1	-12/+13
\| \| \| \| \| \| \| \| \| \|	*MCAsmInfo.h. NFC. This is to reduce noise in a following commit. Also fixes a couple missing spaces before the reference operator. llvm-svn: 247679
*	PPCFrameLowering::emitEpilogue(): Avoid manipulating MBBI on iterator end.	NAKAMURA Takumi	2015-09-11	1	-1/+1
\| \| \| \| \| \|	It caused crash in MachineInstr::hasPropertyInBundle() since r247237. llvm-svn: 247395
*	Pass BranchProbability/BlockMass by value instead of const& as they are ↵	Cong Hou	2015-09-10	2	-7/+5
\| \| \| \| \| \|	small. NFC. llvm-svn: 247357
*	Enable the shrink wrapping optimization for PPC64.	Kit Barton	2015-09-10	3	-77/+89
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The changes in this patch are as follows: 1. Modify the emitPrologue and emitEpilogue methods to work properly when the prologue and epilogue blocks are not the first/last blocks in the function 2. Fix a bug in PPCEarlyReturn optimization caused by an empty entry block in the function 3. Override the runShrinkWrap PredicateFtor (defined in TargetMachine) to check whether shrink wrapping should run: Shrink wrapping will run on PPC64 (Little Endian and Big Endian) unless -enable-shrink-wrap=false is specified on command line A new test case, ppc-shrink-wrapping.ll was created based on the existing shrink wrapping tests for x86, arm, and arm64. Phabricator review: http://reviews.llvm.org/D11817 llvm-svn: 247237
*	[PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible	Chandler Carruth	2015-09-09	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	with the new pass manager, and no longer relying on analysis groups. This builds essentially a ground-up new AA infrastructure stack for LLVM. The core ideas are the same that are used throughout the new pass manager: type erased polymorphism and direct composition. The design is as follows: - FunctionAAResults is a type-erasing alias analysis results aggregation interface to walk a single query across a range of results from different alias analyses. Currently this is function-specific as we always assume that aliasing queries are within a function. - AAResultBase is a CRTP utility providing stub implementations of various parts of the alias analysis result concept, notably in several cases in terms of other more general parts of the interface. This can be used to implement only a narrow part of the interface rather than the entire interface. This isn't really ideal, this logic should be hoisted into FunctionAAResults as currently it will cause a significant amount of redundant work, but it faithfully models the behavior of the prior infrastructure. - All the alias analysis passes are ported to be wrapper passes for the legacy PM and new-style analysis passes for the new PM with a shared result object. In some cases (most notably CFL), this is an extremely naive approach that we should revisit when we can specialize for the new pass manager. - BasicAA has been restructured to reflect that it is much more fundamentally a function analysis because it uses dominator trees and loop info that need to be constructed for each function. All of the references to getting alias analysis results have been updated to use the new aggregation interface. All the preservation and other pass management code has been updated accordingly. The way the FunctionAAResultsWrapperPass works is to detect the available alias analyses when run, and add them to the results object. This means that we should be able to continue to respect when various passes are added to the pipeline, for example adding CFL or adding TBAA passes should just cause their results to be available and to get folded into this. The exception to this rule is BasicAA which really needs to be a function pass due to using dominator trees and loop info. As a consequence, the FunctionAAResultsWrapperPass directly depends on BasicAA and always includes it in the aggregation. This has significant implications for preserving analyses. Generally, most passes shouldn't bother preserving FunctionAAResultsWrapperPass because rebuilding the results just updates the set of known AA passes. The exception to this rule are LoopPass instances which need to preserve all the function analyses that the loop pass manager will end up needing. This means preserving both BasicAAWrapperPass and the aggregating FunctionAAResultsWrapperPass. Now, when preserving an alias analysis, you do so by directly preserving that analysis. This is only necessary for non-immutable-pass-provided alias analyses though, and there are only three of interest: BasicAA, GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is preserved when needed because it (like DominatorTree and LoopInfo) is marked as a CFG-only pass. I've expanded GlobalsAA into the preserved set everywhere we previously were preserving all of AliasAnalysis, and I've added SCEVAA in the intersection of that with where we preserve SCEV itself. One significant challenge to all of this is that the CGSCC passes were actually using the alias analysis implementations by taking advantage of a pretty amazing set of loop holes in the old pass manager's analysis management code which allowed analysis groups to slide through in many cases. Moving away from analysis groups makes this problem much more obvious. To fix it, I've leveraged the flexibility the design of the new PM components provides to just directly construct the relevant alias analyses for the relevant functions in the IPO passes that need them. This is a bit hacky, but should go away with the new pass manager, and is already in many ways cleaner than the prior state. Another significant challenge is that various facilities of the old alias analysis infrastructure just don't fit any more. The most significant of these is the alias analysis 'counter' pass. That pass relied on the ability to snoop on AA queries at different points in the analysis group chain. Instead, I'm planning to build printing functionality directly into the aggregation layer. I've not included that in this patch merely to keep it smaller. Note that all of this needs a nearly complete rewrite of the AA documentation. I'm planning to do that, but I'd like to make sure the new design settles, and to flesh out a bit more of what it looks like in the new pass manager first. Differential Revision: http://reviews.llvm.org/D12080 llvm-svn: 247167
*	Fix the PPC CTR Loop pass to look for calls to the intrinsics that	Eric Christopher	2015-09-08	1	-0/+6
\| \| \| \| \| \|	read CTR and count them as reading the CTR. llvm-svn: 247083
*	[PowerPC] Don't commute trivial rlwimi instructions	Hal Finkel	2015-09-06	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	To commute a trivial rlwimi instructions (meaning one with a full mask and zero shift), we'd need to ability to form an all-zero mask (instead of an all-one mask) using rlwimi. We can't represent this, however, and we'll miscompile code if we try. The code quality problem that this highlights (that SDAG simplification can lead to us generating an ISD::OR node with a constant zero LHS) will be fixed as a follow-up. Fixes PR24719. llvm-svn: 246937
*	[PowerPC] Fix and(or(x, c1), c2) -> rlwimi generation	Hal Finkel	2015-09-05	1	-3/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	PPCISelDAGToDAG has a transformation that generates a rlwimi instruction from an input pattern that looks like this: and(or(x, c1), c2) but the associated logic does not work if there are bits that are 1 in c1 but 0 in c2 (these are normally canonicalized away, but that can't happen if the 'or' has other users. Make sure we abort the transformation if such bits are discovered. Fixes PR24704. llvm-svn: 246900
*	[PowerPC] Enable interleaved-access vectorization	Hal Finkel	2015-09-04	2	-1/+43
\| \| \| \| \| \| \| \| \| \| \|	This adds a basic cost model for interleaved-access vectorization (and a better default for shuffles), and enables interleaved-access vectorization by default. The relevant difference from the default cost model for interleaved-access vectorization, is that on PPC, the shuffles that end up being used are much cheaper than modeling the process with insert/extract pairs (which are quite expensive, especially on older cores). llvm-svn: 246824
*	[PowerPC] Always use aggressive interleaving on the A2	Hal Finkel	2015-09-03	1	-0/+7
\| \| \| \| \| \| \| \|	On the A2, with an eye toward QPX unaligned-load merging, we should always use aggressive interleaving. It is generally superior to only using concatenation unrolling. llvm-svn: 246819
*	[PowerPC] Try harder to find a base+offset when looking for consecutive accesses	Hal Finkel	2015-09-03	1	-7/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When forming permutation-based unaligned vector loads, we need to know whether it is valid to read ahead of the requested address by a full vector length. Doing so is more efficient (and allows for more CSE with later loads), but could trigger a page fault if invalid. To determine validity, we look for other loads in the same block that access the relevant address range. The relevant point here is that we need to do this as part of the process of forming permutation-based vector loads, and this happens quite early in the SDAG pipeline - specifically before many of the address calculations are fully canonicalized. As a result, we need to try harder to recognize base+offset address computations, because they still might appear as chain of adds (base+offset+offset, for example). To account for this, we'll look through chains of adds, accumulating the constant offsets. llvm-svn: 246813
*	[PowerPC] Include the permutation cost for unaligned vector loads	Hal Finkel	2015-09-03	1	-8/+12
\| \| \| \| \| \| \| \| \| \|	Pre-P8, when we generate code for unaligned vector loads (for Altivec and QPX types), even when accounting for the combining that takes place for multiple consecutive such loads, there is at least one load instructions and one permutation for each load. Make sure the cost reported reflects the cost of the permutes as well. llvm-svn: 246807
*	[PowerPC] Compute the MMO offset for an unaligned load with signed arithmetic	Hal Finkel	2015-09-03	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	If you compute the MMO offset using unsigned arithmetic, you end up with a large positive offset instead of a small negative one. In theory, this could cause bad instruction-scheduling decisions later. I noticed this by inspection from the debug output, and using that for the regression test is the best I can do right now. llvm-svn: 246805
*	[PowerPC] Cleanup cost model for unaligned vector loads/stores	Hal Finkel	2015-09-02	1	-22/+37
\| \| \| \| \| \| \| \| \| \|	I'm adding a regression test to better cover code generation for unaligned vector loads and stores, but there's no functional change to the code generation here. There is an improvement to the cost model for unaligned vector loads and stores, mostly for QPX (for which we were not previously accounting for the permutation-based loads), and the cost model implementation is cleaner. llvm-svn: 246712
*	[PowerPC] Don't always consider P8Altivec-only masks in LowerVECTOR_SHUFFLE	Hal Finkel	2015-09-02	1	-6/+8
\| \| \| \| \| \| \| \| \| \| \|	LowerVECTOR_SHUFFLE needs to decide whether to pass a vector shuffle off to the TableGen-generated matching code, and it does this by testing the same predicates used by the TableGen files. Unfortunately, when we added new P8Altivec-only predicates, we started universally testing them in LowerVECTOR_SHUFFLE, and if then matched when targeting a system prior to a P8, we'd end up with a selection failure. llvm-svn: 246675
*	[EH] Handle non-Function personalities like unknown personalities	Reid Kleckner	2015-08-31	1	-7/+7
\| \| \| \| \| \| \| \| \|	Also delete and simplify a lot of MachineModuleInfo code that used to be needed to handle personalities on landingpads. Now that the personality is on the LLVM Function, we no longer need to track it this way on MMI. Certainly it should not live on LandingPadInfo. llvm-svn: 246478
*	[PowerPC] Fixup SELECT_CC (and SETCC) patterns with i1 comparison operands	Hal Finkel	2015-08-30	4	-5/+168
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There were really two problems here. The first was that we had the truth tables for signed i1 comparisons backward. I imagine these are not very common, but if you have: setcc i1 x, y, LT this has the '0 1' and the '1 0' results flipped compared to: setcc i1 x, y, ULT because, in the signed case, '1 0' is really '-1 0', and the answer is not the same as in the unsigned case. The second problem was that we did not have patterns (at all) for the unsigned comparisons select_cc nodes for i1 comparison operands. This was the specific cause of PR24552. These had to be added (and a missing Altivec promotion added as well) to make sure these function for all types. I've added a bunch more test cases for these patterns, and there are a few FIXMEs in the test case regarding code-quality. Fixes PR24552. llvm-svn: 246400
*	[MIR Serialization] static -> static const in ↵	Hal Finkel	2015-08-30	1	-2/+2
\| \| \| \| \| \| \| \| \|	getSerializable*MachineOperandTargetFlags Make the arrays 'static const' instead of just 'static'. Post-commit review comment from Roman Divacky on IRC. NFC. llvm-svn: 246376
*	[PowerPC/MIR Serialization] Target flags serialization support	Hal Finkel	2015-08-30	2	-0/+41
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for MIR serialization of PowerPC-specific operand target flags (based on the generic infrastructure added in r244185 and r245383). I won't even pretend that this is good test coverage, but this includes the regression test associated with r246372. Adding an MIR test for that fix is far superior to adding an IR-level test because particular instruction-scheduling decisions are necessary in order to expose the bug, and using an MIR test we can start the pipeline post-scheduling. llvm-svn: 246373
*	[PowerPC] Don't assume ADDISdtprelHA's source is r3	Hal Finkel	2015-08-30	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \|	Even through ADDISdtprelHA generally has r3 as its source register, it is possible for the instruction scheduler to move things around such that some other register is the source. We need to print the actual source register, not always r3. Fixes PR24394. The test case will come in a follow-up commit because it depends on MIR target-flags parsing. llvm-svn: 246372
*	[PowerPC] Remove unnecessary braces in PPCVSXFMAMutate	Hal Finkel	2015-08-26	1	-3/+3
\| \| \| \| \| \|	Address Eric's post-commit review of r245741. NFC. llvm-svn: 246121
*	FastISel: Use finishCondBranch() for ARM,Mips,PowerPC FastISel	Matthias Braun	2015-08-26	1	-2/+1
\| \| \| \| \| \|	Note that after this change branch probabilities are preserved now. llvm-svn: 245998
*	[PowerPC] PPCVSXFMAMutate should ignore trivial-copy addends	Hal Finkel	2015-08-24	1	-5/+8
\| \| \| \| \| \| \| \|	We might end up with a trivial copy as the addend, and if so, we should ignore the corresponding FMA instruction. The trivial copy can be coalesced away later, so there's nothing to do here. We should not, however, assert. Fixes PR24544. llvm-svn: 245907
*	[PPC64LE] Fix PR24546 - Swap optimization and debug values	Bill Schmidt	2015-08-24	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	This patch fixes PR24546, which demonstrates a segfault during the VSX swap removal pass. The problem is that debug value instructions were not excluded from the list of instructions to be analyzed for webs of related computation. I've added the test case from the PR as a crash test in test/CodeGen/PowerPC. llvm-svn: 245862
*	[PowerPC] PPCVSXFMAMutate should not segfault on undef input registers	Hal Finkel	2015-08-21	1	-0/+5
\| \| \| \| \| \| \| \| \| \|	When PPCVSXFMAMutate would look at the input addend register, it would get its input value number. This would fail, however, if the register was undef, causing a segfault. Don't segfault (just skip such FMA instructions). Fixes the test case from PR24542 (although that may have been over-reduced). llvm-svn: 245741
*	[PowerPC] Fix value type on XVCMPEQDP for v2f64 comparisons	Hal Finkel	2015-08-20	1	-3/+4
\| \| \| \| \| \| \| \| \|	XVCMPEQDP is used for VSX v2f64 equality comparisons, but the value type needs to be v2i64 (as that's the corresponding SETCC type). Fixes PR24225. llvm-svn: 245535
*	[PowerPC] Fix the int2fp(fp2int(x)) DAGCombine to ignore ppc_fp128	Hal Finkel	2015-08-20	1	-0/+3
\| \| \| \| \| \| \| \|	This DAGCombine was creating custom SDAG nodes with an illegal ppc_fp128 operand type because it was triggering on f64/f32 int2fp(fp2int(ppc_fp128 x)), but shouldn't (it should only apply to f32/f64 types). The result was a crash. llvm-svn: 245530