bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[LegacyPassManager] Remove TargetMachine constructors	Francis Visoiu Mistrih	2017-05-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This provides a new way to access the TargetMachine through TargetPassConfig, as a dependency. The patterns replaced here are: * Passes handling a null TargetMachine call `getAnalysisIfAvailable<TargetPassConfig>`. * Passes not handling a null TargetMachine `addRequired<TargetPassConfig>` and call `getAnalysis<TargetPassConfig>`. * MachineFunctionPasses now use MF.getTarget(). * Remove all the TargetMachine constructors. * Remove INITIALIZE_TM_PASS. This fixes a crash when running `llc -start-before prologepilog`. PEI needs StackProtector, which gets constructed without a TargetMachine by the pass manager. The StackProtector pass doesn't handle the case where there is no TargetMachine, so it segfaults. Related to PR30324. Differential Revision: https://reviews.llvm.org/D33222 llvm-svn: 303360
*	CodeGen: Power: Add lowering for shifts of v1i128.	Kyle Butt	2017-05-17	2	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \|	When legalizing vector operations on vNi128, they will be split to v1i128 because that is a legal type on ppc64, but then the compiler will crash in selection dag because it fails to select for these operations. This patch fixes shift operations. Logical shift right and left shift can be performed in the vector unit, but algebraic shift right requires being split. Differential Revision: https://reviews.llvm.org/D32774 llvm-svn: 303307
*	[PPC] Properly update register save area offsets	Krzysztof Parzyszek	2017-05-17	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \| \|	The variables MinGPR/MinG8R were not updated properly when resetting the offsets, which in the included testcase lead to saving the CR register in the same location as R30. This fixes another issue reported in PR26519. Differential Revision: https://reviews.llvm.org/D33017 llvm-svn: 303257
*	[PPC] Lower load acquire/seq_cst trailing fence to cmp + bne + isync.	Tim Shen	2017-05-16	5	-8/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes pr32392. The lowering pipeline is: llvm.ppc.cfence in IR -> PPC::CFENCE8 in isel -> Actual instructions in expandPostRAPseudo. The reason why expandPostRAPseudo is chosen is because previous passes are likely eliminating instructions like cmpw 3, 3 (early CSE) and bne- 7, .+4 (some branch pass(s)). Differential Revision: https://reviews.llvm.org/D32763 llvm-svn: 303205
*	[PPC] Move the combine "a << (b % (sizeof(a) * 8)) -> (PPCshl a, b)" to the ↵	Tim Shen	2017-05-12	3	-17/+108
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	backend. NFC. Summary: Eli pointed out that it's unsafe to combine the shifts to ISD::SHL etc., because those are not defined for b > sizeof(a) * 8, even after some of the combiners run. However, PPCISD::SHL defines that behavior (as the instructions themselves). Move the combination to the backend. The tests in shift_mask.ll still pass. Reviewers: echristo, hfinkel, efriedma, iteratee Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D33076 llvm-svn: 302937
*	[PPC] Change the register constraint of the first source operand of ↵	Guozhi Wei	2017-05-11	2	-1/+18
\| \| \| \| \| \| \| \| \| \| \| \|	instruction mtvsrdd to g8rc_nox0 According to Power ISA V3.0 document, the first source operand of mtvsrdd is constant 0 if r0 is specified. So the corresponding register constraint should be g8rc_nox0. This bug caused wrong output generated by 401.bzip2 when -mcpu=power9 and fdo are specified. Differential Revision: https://reviews.llvm.org/D32880 llvm-svn: 302834
*	[PowerPC] Eliminate integer compare instructions - vol. 1	Nemanja Ivanovic	2017-05-11	5	-5/+284
\| \| \| \| \| \| \| \| \| \| \| \| \|	This patch is the first in a series of patches to provide code gen for doing compares in GPRs when the compare result is required in a GPR. It adds the infrastructure to select GPR sequences for i1->i32 and i1->i64 extensions. This first patch handles equality comparison on i32 operands with the result sign or zero extended. Differential Revision: https://reviews.llvm.org/D31847 llvm-svn: 302810
*	[Atomic] Remove IsStore/IsLoad in the interface, and pass the instruction ↵	Tim Shen	2017-05-09	2	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instead. NFC. Now both emitLeadingFence and emitTrailingFence take the instruction itself, instead of taking IsLoad/IsStore pairs. Instruction::mayReadFromMemory and Instrucion::mayWriteToMemory are used for determining those two booleans. The instruction argument is also useful for later D32763, in emitTrailingFence. For emitLeadingFence, it seems to have cleaner interface with the proposed change. Differential Revision: https://reviews.llvm.org/D32762 llvm-svn: 302539
*	Add extra operand to CALLSEQ_START to keep frame part set up previously	Serge Pavlov	2017-05-09	4	-18/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Using arguments with attribute inalloca creates problems for verification of machine representation. This attribute instructs the backend that the argument is prepared in stack prior to CALLSEQ_START..CALLSEQ_END sequence (see http://llvm.org/docs/InAlloca.htm for details). Frame size stored in CALLSEQ_START in this case does not count the size of this argument. However CALLSEQ_END still keeps total frame size, as caller can be responsible for cleanup of entire frame. So CALLSEQ_START and CALLSEQ_END keep different frame size and the difference is treated by MachineVerifier as stack error. Currently there is no way to distinguish this case from actual errors. This patch adds additional argument to CALLSEQ_START and its target-specific counterparts to keep size of stack that is set up prior to the call frame sequence. This argument allows MachineVerifier to calculate actual frame size associated with frame setup instruction and correctly process the case of inalloca arguments. The changes made by the patch are: - Frame setup instructions get the second mandatory argument. It affects all targets that use frame pseudo instructions and touched many files although the changes are uniform. - Access to frame properties are implemented using special instructions rather than calls getOperand(N).getImm(). For X86 and ARM such replacement was made previously. - Changes that reflect appearance of additional argument of frame setup instruction. These involve proper instruction initialization and methods that access instruction arguments. - MachineVerifier retrieves frame size using method, which reports sum of frame parts initialized inside frame instruction pair and outside it. The patch implements approach proposed by Quentin Colombet in https://bugs.llvm.org/show_bug.cgi?id=27481#c1. It fixes 9 tests failed with machine verifier enabled and listed in PR27481. Differential Revision: https://reviews.llvm.org/D32394 llvm-svn: 302527
*	[KnownBits] Add wrapper methods for setting and clear all bits in the ↵	Craig Topper	2017-05-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	underlying APInts in KnownBits. This adds routines for reseting KnownBits to unknown, making the value all zeros or all ones. It also adds methods for querying if the value is zero, all ones or unknown. Differential Revision: https://reviews.llvm.org/D32637 llvm-svn: 302262
*	[PPC] When restoring R30 (PIC base pointer), mark it as <def>	Krzysztof Parzyszek	2017-05-04	1	-2/+1
\| \| \| \| \| \| \| \| \|	This happened on the PPC32/SVR4 path and was discovered when building FreeBSD on PPC32. It was a typo-class error in the frame lowering code. This fixes PR26519. llvm-svn: 302183
*	[PowerPC, DAGCombiner] Fold a << (b % (sizeof(a) * 8)) back to a single ↵	Tim Shen	2017-05-03	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instruction Summary: This is the corresponding llvm change to D28037 to ensure no performance regression. Reviewers: bogner, kbarton, hfinkel, iteratee, echristo Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D28329 llvm-svn: 301990
*	[PowerPC] Emit VMX loads/stores for aligned ops to avoid adding swaps on LE	Nemanja Ivanovic	2017-05-02	2	-6/+23
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes PR30730. This is a re-commit of a pulled commit. The commit was pulled because some software projects contained uses of Altivec vectors that violated alignment requirements. Known issues have now been fixed. Committing on behalf of Lei Huang. Differential Revision: https://reviews.llvm.org/D26861 llvm-svn: 301892
*	Generalize the specialized flag-carrying SDNodes by moving flags into SDNode.	Amara Emerson	2017-05-01	1	-5/+5
\| \| \| \| \| \| \| \|	This removes BinaryWithFlagsSDNode, and flags are now all passed by value. Differential Revision: https://reviews.llvm.org/D32527 llvm-svn: 301803
*	[SelectionDAG] Use KnownBits struct in DAG's computeKnownBits and ↵	Craig Topper	2017-04-28	3	-37/+33
\| \| \| \| \| \| \| \| \| \| \| \|	simplifyDemandedBits This patch replaces the separate APInts for KnownZero/KnownOne with a single KnownBits struct. This is similar to what was done to ValueTracking's version recently. This is largely a mechanical transformation from KnownZero to Known.Zero. Differential Revision: https://reviews.llvm.org/D32569 llvm-svn: 301620
*	Move value type list from TargetRegisterClass to TargetRegisterInfo	Krzysztof Parzyszek	2017-04-24	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31937 llvm-svn: 301234
*	Revert r301231: Accidentally committed stale files	Krzysztof Parzyszek	2017-04-24	1	-2/+2
\| \| \| \| \| \|	I forgot to commit local changes before commit. llvm-svn: 301232
*	Move value type list from TargetRegisterClass to TargetRegisterInfo	Krzysztof Parzyszek	2017-04-24	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D31937 llvm-svn: 301231
*	Move size and alignment information of regclass to TargetRegisterInfo	Krzysztof Parzyszek	2017-04-24	1	-9/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	1. RegisterClass::getSize() is split into two functions: - TargetRegisterInfo::getRegSizeInBits(const TargetRegisterClass &RC) const; - TargetRegisterInfo::getSpillSize(const TargetRegisterClass &RC) const; 2. RegisterClass::getAlignment() is replaced by: - TargetRegisterInfo::getSpillAlignment(const TargetRegisterClass &RC) const; This will allow making those values depend on subtarget features in the future. Differential Revision: https://reviews.llvm.org/D31783 llvm-svn: 301221
*	Re-commit r301040 "X86: Don't emit zero-byte functions on Windows"	Hans Wennborg	2017-04-21	2	-3/+3
\| \| \| \| \| \| \| \| \|	In addition to the original commit, tighten the condition for when to pad empty functions to COFF Windows. This avoids running into problems when targeting e.g. Win32 AMDGPU, which caused test failures when this was committed initially. llvm-svn: 301047
*	Revert r301040 "X86: Don't emit zero-byte functions on Windows"	Hans Wennborg	2017-04-21	2	-3/+3
\| \| \| \| \| \|	This broke almost all bots. Reverting while fixing. llvm-svn: 301041
*	X86: Don't emit zero-byte functions on Windows	Hans Wennborg	2017-04-21	2	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Empty functions can lead to duplicate entries in the Guard CF Function Table of a binary due to multiple functions sharing the same RVA, causing the kernel to refuse to load that binary. We had a terrific bug due to this in Chromium. It turns out we were already doing this for Mach-O in certain situations. This patch expands the code for that in AsmPrinter::EmitFunctionBody() and renames TargetInstrInfo::getNoopForMachoTarget() to simply getNoop() since it seems it was used for not just Mach-O anyway. Differential Revision: https://reviews.llvm.org/D32330 llvm-svn: 301040
*	Fix use-after-frees on memory allocated in a Recycler.	Benjamin Kramer	2017-04-20	1	-2/+2
\| \| \| \| \| \| \| \|	This will become asan errors once the patch lands that poisons the memory after free. The x86 change is a hack, but I don't see how to solve this properly at the moment. llvm-svn: 300867
*	Distinguish between code pointer size and DataLayout::getPointerSize() in ↵	Konstantin Zhuravlyov	2017-04-17	1	-2/+2
\| \| \| \| \| \|	DWARF info generation llvm-svn: 300463
*	[SystemZ] TargetTransformInfo cost functions implemented.	Jonas Paulsson	2017-04-12	2	-7/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	getArithmeticInstrCost(), getShuffleCost(), getCastInstrCost(), getCmpSelInstrCost(), getVectorInstrCost(), getMemoryOpCost(), getInterleavedMemoryOpCost() implemented. Interleaved access vectorization enabled. BasicTTIImpl::getCastInstrCost() improved to check for legal extending loads, in which case the cost of the z/sext instruction becomes 0. Review: Ulrich Weigand, Renato Golin. https://reviews.llvm.org/D29631 llvm-svn: 300052
*	[PowerPC] multiply-with-overflow might use the CTR register	Hal Finkel	2017-04-11	1	-9/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Check the legality of ISD::[US]MULO to see whether Intrinsic::[us]mul_with_overflow will legalize into a function call (and, thus, will use the CTR register). Fixes PR32485. Patch by Tim Neumann! Differential Revision: https://reviews.llvm.org/D31790 llvm-svn: 299910
*	Get the TOC save offset off of PPCFrameLowering rather than a separate copy ↵	Eric Christopher	2017-04-10	1	-1/+1
\| \| \| \| \| \|	of the same data. llvm-svn: 299887
*	Remove the default subtarget from the Power port. It's unnecessary and ↵	Eric Christopher	2017-04-06	2	-4/+1
\| \| \| \| \| \|	harmful if used. llvm-svn: 299726
*	[DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to ↵	Sanjay Patel	2017-04-05	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	bitwise logic+setcc (PR32401) This is a generic combine enabled via target hook to reduce icmp logic as discussed in: https://bugs.llvm.org/show_bug.cgi?id=32401 It's likely that other targets will want to enable this hook for scalar transforms, and there are probably other patterns that can use bitwise logic to reduce comparisons. Note that we are missing an IR canonicalization for these patterns, and we will probably prefer the pair-of-compares form in IR (shorter, more likely to fold). Differential Revision: https://reviews.llvm.org/D31483 llvm-svn: 299542
*	Add MCContext argument to MCAsmBackend::applyFixup for error reporting	Alex Bradbury	2017-04-05	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A number of backends (AArch64, MIPS, ARM) have been using MCContext::reportError to report issues such as out-of-range fixup values in their TgtAsmBackend. This is great, but because MCContext couldn't easily be threaded through to the adjustFixupValue helper function from its usual callsite (applyFixup), these backends ended up adding an MCContext* argument and adding another call to applyFixup to processFixupValue. Adding an MCContext parameter to applyFixup makes this unnecessary, and even better - applyFixup can take a reference to MCContext rather than a potentially null pointer. Differential Revision: https://reviews.llvm.org/D30264 llvm-svn: 299529
*	[DAGCombiner] Add vector demanded elements support to ↵	Simon Pilgrim	2017-03-31	2	-0/+2
\| \| \| \| \| \| \| \| \| \|	computeKnownBitsForTargetNode Follow up to D25691, this sets up the plumbing necessary to support vector demanded elements support in known bits calculations in target nodes. Differential Revision: https://reviews.llvm.org/D31249 llvm-svn: 299201
*	Temporarily revert "[PPC] In PPCBoolRetToInt change the bool value to i64 if ↵	Eric Christopher	2017-03-31	3	-37/+19
\| \| \| \| \| \| \| \|	the target is ppc64" as it's causing test failures, I've given Carrot a testcase offline. This reverts commit r298955. llvm-svn: 299153
*	Spelling mistakes in comments. NFCI.	Simon Pilgrim	2017-03-30	1	-1/+1
\| \| \| \| \| \|	Based on corrections mentioned in patch for clang for PR27635 llvm-svn: 299072
*	[PPC] In PPCBoolRetToInt change the bool value to i64 if the target is ppc64	Guozhi Wei	2017-03-28	3	-19/+37
\| \| \| \| \| \| \| \| \| \|	In PPCBoolRetToInt bool value is changed to i32 type. On ppc64 it may introduce an extra zero extension for the return value. This patch changes the integer type to i64 to avoid the zero extension on ppc64. This patch fixed PR32442. Differential Revision: https://reviews.llvm.org/D31407 llvm-svn: 298955
*	Remove an oddly unnecessary temporary.	Eric Christopher	2017-03-27	1	-2/+1
\| \| \| \|	llvm-svn: 298888
*	Kill some trailing whitespace to make some new changes a bit easier.	Eric Christopher	2017-03-23	1	-12/+12
\| \| \| \|	llvm-svn: 298637
*	Make library calls sensitive to regparm module flag (Fixes PR3997).	Nirav Dave	2017-03-18	1	-4/+3
\| \| \| \| \| \| \| \| \| \|	Reviewers: mkuper, rnk Subscribers: mehdi_amini, jyknight, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D27050 llvm-svn: 298179
*	Remove getArgumentList() in favor of arg_begin(), args(), etc	Reid Kleckner	2017-03-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Users often call getArgumentList().size(), which is a linear way to get the number of function arguments. arg_size(), on the other hand, is constant time. In general, the fact that arguments are stored in an iplist is an implementation detail, so I've removed it from the Function interface and moved all other users to the argument container APIs (arg_begin(), arg_end(), args(), arg_size()). Reviewed By: chandlerc Differential Revision: https://reviews.llvm.org/D31052 llvm-svn: 298010
*	Test commit.	Hiroshi Inoue	2017-03-16	1	-1/+1
\| \| \| \|	llvm-svn: 297959
*	[PowerPC][Altivec] Add mfvrd and mffprd extended mnemonic	Nemanja Ivanovic	2017-03-15	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \|	mfvrd and mffprd are both alias to mfvrsd. This patch enables correct parsing of the aliases, but we still emit a mfvrsd. Committing on behalf of brunoalr (Bruno Rosa). Differential Revision: https://reviews.llvm.org/D29177 llvm-svn: 297849
*	Revert "Revert "[PowerPC][ELFv2ABI] Allocate parameter area on-demand to ↵	Tim Shen	2017-03-08	1	-5/+43
\| \| \| \| \| \| \| \| \| \| \| \| \|	reduce stack frame size"" After inspection, it's an UB in our code base. Someone cast a var-arg function pointer to a non-var-arg one. :/ Re-commit r296771 to continue testing on the patch. Sorry for the trouble! llvm-svn: 297256
*	Revert "[PowerPC][ELFv2ABI] Allocate parameter area on-demand to reduce ↵	Tim Shen	2017-03-07	1	-43/+5
\| \| \| \| \| \| \| \| \| \| \|	stack frame size" This reverts commit r296771. We found some wide spread test failures internally. I'm working on a testcase. Politely revert the patch in the mean time. :) llvm-svn: 297124
*	[PowerPC] Fix failure with STBRX when store is narrower than the bswap	Nemanja Ivanovic	2017-03-06	1	-2/+5
\| \| \| \| \| \| \| \| \| \| \|	Fixes a crash caused by r296811 by truncating the input of the STBRX node when the bswap is wider than i32. Fixes https://bugs.llvm.org/show_bug.cgi?id=32140 Differential Revision: https://reviews.llvm.org/D30615 llvm-svn: 297001
*	[DAGCombiner] allow transforming (select Cond, C +/- 1, C) to (add(ext Cond), C)	Sanjay Patel	2017-03-04	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	select Cond, C +/- 1, C --> add(ext Cond), C -- with a target hook. This is part of the ongoing process to obsolete D24480. The motivation is to canonicalize to select IR in InstCombine whenever possible, so we need to have a way to undo that easily in codegen. PowerPC is an obvious winner for this kind of transform because it has fast and complete bit-twiddling abilities but generally lousy conditional execution perf (although this might have changed in recent implementations). x86 also sees some wins, but the effect is limited because these transforms already mostly exist in its target-specific combineSelectOfTwoConstants(). The fact that we see any x86 changes just shows that that code is a mess of special-case holes. We may be able to remove some of that logic now. My guess is that other targets will want to enable this hook for most cases. The likely follow-ups would be to add value type and/or the constants themselves as parameters for the hook. As the tests in select_const.ll show, we can transform any select-of-constants to math/logic, but the general transform for any 2 constants needs one more instruction (multiply or 'and'). ARM is one target that I think may not want this for most cases. I see infinite loops there because it wants to use selects to enable conditionally executed instructions. Differential Revision: https://reviews.llvm.org/D30537 llvm-svn: 296977
*	Make TargetInstrInfo::isPredicable take a const reference, NFC	Krzysztof Parzyszek	2017-03-03	2	-2/+2
\| \| \| \|	llvm-svn: 296901
*	[PPC] Fix code generation for bswap(int32) followed by store16	Guozhi Wei	2017-03-02	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes pr32063. Current code in PPCTargetLowering::PerformDAGCombine can transform bswap store into a single PPCISD::STBRX instruction. but it doesn't consider the case that the operand size of bswap may be larger than store size. When it occurs, we need 2 modifications, 1 For the last operand of PPCISD::STBRX, we should not use DAG.getValueType(N->getOperand(1).getValueType()), instead we should use cast<StoreSDNode>(N)->getMemoryVT(). 2 Before PPCISD::STBRX, we need to shift the original operand of bswap to the right side. Differential Revision: https://reviews.llvm.org/D30362 llvm-svn: 296811
*	[PowerPC][ELFv2ABI] Allocate parameter area on-demand to reduce stack frame size	Nemanja Ivanovic	2017-03-02	1	-5/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch reduces the stack frame size by not allocating the parameter area if it is not required. In the current implementation LowerFormalArguments_64SVR4 already handles the parameter area, but LowerCall_64SVR4 does not (when calculating the stack frame size). What this patch does is make LowerCall_64SVR4 consistent with LowerFormalArguments_64SVR4. Committing on behalf of Hiroshi Inoue. Differential Revision: https://reviews.llvm.org/D29881 llvm-svn: 296771
*	vec perm can go down either pipeline on P8.	Eric Christopher	2017-02-26	1	-1/+1
\| \| \| \| \| \|	No observable changes, spotted while looking at the scheduling description. llvm-svn: 296277
*	[PowerPC] Use subfic instruction for subtract from immediate	Nemanja Ivanovic	2017-02-24	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \|	Provide a 64-bit pattern to use SUBFIC for subtracting from a 16-bit immediate. The corresponding pattern already exists for 32-bit integers. Committing on behalf of Hiroshi Inoue. Differential Revision: https://reviews.llvm.org/D29387 llvm-svn: 296144
*	[PowerPC] Use rldicr instruction for AND with an immediate if possible	Nemanja Ivanovic	2017-02-24	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \|	Emit clrrdi (extended mnemonic for rldicr) for AND-ing with masks that clear bits from the right hand size. Committing on behalf of Hiroshi Inoue. Differential Revision: https://reviews.llvm.org/D29388 llvm-svn: 296143