| author | Chandler Carruth <chandlerc@gmail.com> | 2014-08-05 18:45:49 +0000 | 
|---|---|---|
| committer | Chandler Carruth <chandlerc@gmail.com> | 2014-08-05 18:45:49 +0000 | 
| commit | a746239be3092c7b27d7e9b3cc81bd612bf641d5 (patch) | |
| tree | 1014b3600dac09aec51489ccb0bd8e3bdcebf8eb /llvm/lib/Target/X86/X86ISelLowering.cpp | |
| parent | 70db9d4d725647d682bbd4fba7fd0caf4d29b715 (diff) | |
[x86] Fix a crasher due to shuffles which cancel each other out and add
a test case.

We also miscompile this test case which is showing a serious flaw in the
single-input v8i16 shuffle code. I've left the specific instruction
checks FIXME-ed out until I can address the bug in the single-input
code, but I wanted to separate out a significant functionality change to
produce correct code from a very simple and targeted crasher fix.

The miscompile problem stems from keeping track of inputs by value
rather than by index. As a consequence of doing this, we can't reliably
update those inputs because they might swap and we can't detect this
without copying the mask.

The blend code now uses indices for the input lists and this seems
strictly better. It also should make it easier to sort things and do
other cleanups. I think the time has come to simplify The Great Lambda
here.

llvm-svn: 214914
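
The by-value versus by-index problem described in the message can be seen in a toy setting. The sketch below is not the LLVM code: `remapByValue` and `remapByIndex` are hypothetical helpers over a plain `std::vector<int>` mask, written only to show why updating entries by value goes wrong when two inputs swap, and why recording positions first avoids it.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper: rewrite mask entries by *value*, one input at a time.
// If A and B swap (NewA == B and NewB == A), the second pass also rewrites
// the entries the first pass just produced, so the two inputs collapse.
std::vector<int> remapByValue(std::vector<int> Mask, int A, int NewA, int B,
                              int NewB) {
  std::replace(Mask.begin(), Mask.end(), A, NewA);
  std::replace(Mask.begin(), Mask.end(), B, NewB);
  return Mask;
}

// Hypothetical helper: record the *positions* of each input before touching
// the mask, then write through those positions. A swap is handled correctly
// because no entry is looked up by value after it has been changed.
std::vector<int> remapByIndex(std::vector<int> Mask, int A, int NewA, int B,
                              int NewB) {
  std::vector<std::size_t> APos, BPos;
  for (std::size_t I = 0; I != Mask.size(); ++I) {
    if (Mask[I] == A)
      APos.push_back(I);
    else if (Mask[I] == B)
      BPos.push_back(I);
  }
  for (std::size_t I : APos)
    Mask[I] = NewA;
  for (std::size_t I : BPos)
    Mask[I] = NewB;
  return Mask;
}
```

With the mask `{2, 5, 2, 7}` and a swap of inputs 2 and 5, the by-value version yields `{2, 2, 2, 7}` while the by-index version yields `{5, 2, 5, 7}`. Making the by-value version safe would require keeping an untouched copy of the mask to consult, which matches the cost the message mentions.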
Diffstat (limited to 'llvm/lib/Target/X86/X86ISelLowering.cpp')
| -rw-r--r-- | llvm/lib/Target/X86/X86ISelLowering.cpp | 17 | 
1 files changed, 11 insertions, 6 deletions
    diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
    index a3cd02f44b2..f88d6f8b2fd 100644
    --- a/llvm/lib/Target/X86/X86ISelLowering.cpp
    +++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
    @@ -19419,7 +19419,11 @@ static bool combineRedundantHalfShuffle(SDValue N, MutableArrayRef<int> Mask,
         // We fell out of the loop without finding a viable combining instruction.
         return false;
     
    -  // Record the old value to use in RAUW-ing.
    +  // Combine away the bottom node as its shuffle will be accumulated into
    +  // a preceding shuffle.
    +  DCI.CombineTo(N.getNode(), N.getOperand(0), /*AddTo*/ true);
    +
    +  // Record the old value.
       SDValue Old = V;
     
       // Merge this node's mask and our incoming mask (adjusted to account for all
    @@ -19430,12 +19434,13 @@ static bool combineRedundantHalfShuffle(SDValue N, MutableArrayRef<int> Mask,
       V = DAG.getNode(V.getOpcode(), DL, MVT::v8i16, V.getOperand(0),
                       getV4X86ShuffleImm8ForMask(Mask, DAG));
     
    -  // Replace N with its operand as we're going to combine that shuffle away.
    -  DAG.ReplaceAllUsesWith(N, N.getOperand(0));
    +  // Check that the shuffles didn't cancel each other out. If not, we need to
    +  // combine to the new one.
    +  if (Old != V)
    +    // Replace the combinable shuffle with the combined one, updating all users
    +    // so that we re-evaluate the chain here.
    +    DCI.CombineTo(Old.getNode(), V, /*AddTo*/ true);
     
    -  // Replace the combinable shuffle with the combined one, updating all users
    -  // so that we re-evaluate the chain here.
    -  DCI.CombineTo(Old.getNode(), V, /*AddTo*/ true);
       return true;
     }
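
As a rough illustration of what it means for two shuffles to cancel, the standalone sketch below composes two 4-lane masks and checks whether the result is the identity. It is not taken from the patch: `Mask4`, `composeMasks`, `isIdentity`, and `encodeImm8` are names invented here, and the encoding shown is the usual two-bits-per-lane immediate used by `PSHUFD`-style instructions. Note that the patch itself guards by comparing nodes (`Old != V`) rather than by inspecting masks, so this only shows the mask arithmetic behind the situation.

```cpp
#include <array>
#include <cstdint>

// Hypothetical alias: a 4-lane shuffle mask where Result[i] = Source[Mask[i]].
using Mask4 = std::array<int, 4>;

// Compose two shuffles: Inner is applied to the source first, then Outer is
// applied to Inner's result, so lane i ends up reading Source[Inner[Outer[i]]].
inline Mask4 composeMasks(const Mask4 &Inner, const Mask4 &Outer) {
  return {Inner[Outer[0]], Inner[Outer[1]], Inner[Outer[2]], Inner[Outer[3]]};
}

// True when the mask leaves every lane in place.
inline bool isIdentity(const Mask4 &M) {
  for (int I = 0; I != 4; ++I)
    if (M[I] != I)
      return false;
  return true;
}

// Pack a 4-lane mask into an 8-bit immediate, two bits per lane, lane 0 in
// the low bits. Assumes every entry is in the range [0, 3].
inline std::uint8_t encodeImm8(const Mask4 &M) {
  return static_cast<std::uint8_t>(M[0] | (M[1] << 2) | (M[2] << 4) |
                                   (M[3] << 6));
}
```

For example, the pair-swapping mask `{1, 0, 3, 2}` (immediate `0xB1`) composed with itself gives `{0, 1, 2, 3}` (immediate `0xE4`), the identity. A combine that merges such a pair has to cope with a merged result that does no further work.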

