bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[APInt] Remove unnecessary min with BitWidth from countTrailingOnesSlowCase.	Craig Topper	2017-04-22	1	-1/+2
\| \| \| \| \| \|	The unused upper bits are guaranteed to be 0 so we don't need to worry about accidentally counting them. llvm-svn: 301091
*	[APInt] Add WORD_MAX constant and use it instead of UINT64_MAX. NFC	Craig Topper	2017-04-22	1	-15/+15
\| \| \| \|	llvm-svn: 301069
*	[APInt] Add compare/compareSigned methods that return -1, 0, 1. Reimplement ↵	Craig Topper	2017-04-21	1	-35/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	slt/ult and friends using them Currently sle and ule have to call slt/ult and eq to get the proper answer. This results in extra code for both calls and additional scans of multiword APInts. This patch replaces slt/ult with a compareSigned/compare that can return -1, 0, or 1 so we can cover all the comparison functions with a single call. While I was there I removed the activeBits calls and other checks at the start of the slow part of ult. Both of the activeBits calls potentially scan through each of the APInts separately. I can't imagine that's any better than just scanning them in parallel and doing the compares. Now we just share the code with tcCompare. These changes seem to be good for about a 7-8k reduction on the size of the opt binary on my local x86-64 build. Differential Revision: https://reviews.llvm.org/D32339 llvm-svn: 300995
*	[APInt] Add isSubsetOf method that can check if one APInt is a subset of ↵	Craig Topper	2017-04-20	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	another without creating temporary APInts This question comes up in many places in SimplifyDemandedBits. This makes it easy to ask without allocating additional temporary APInts. The BitVector class provides a similar functionality through its (IMHO badly named) test(const BitVector&) method. Though its output polarity is reversed. I've provided one example use case in this patch. I plan to do more as a follow up. Differential Revision: https://reviews.llvm.org/D32258 llvm-svn: 300851
*	[APInt] Implement APInt::intersects without creating a temporary APInt in ↵	Craig Topper	2017-04-20	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the multiword case Summary: This is a simple question we should be able to answer without creating a temporary to hold the AND result. We can also get an early out as soon as we find a word that intersects. Reviewers: RKSimon, hans, spatel, davide Reviewed By: hans, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32253 llvm-svn: 300812
*	[APInt] Implement operator==(uint64_t) similar to ugt/ult(uint64_t) to ↵	Craig Topper	2017-04-19	1	-8/+0
\| \| \| \| \| \|	remove one of the out of line EqualsSlowCase methods. llvm-svn: 300799
*	[APInt] Move the 'return *this' from the slow cases of assignment operators ↵	Craig Topper	2017-04-19	1	-10/+7
\| \| \| \| \| \| \| \|	inline. We should let the compiler see that the fast/slow cases both return *this. I don't think we chain assignments together very often so this shouldn't matter much. llvm-svn: 300715
*	[APInt] Inline the single word case of lshrInPlace similar to what we do for ↵	Craig Topper	2017-04-18	1	-9/+1
\| \| \| \| \| \|	<<=. llvm-svn: 300577
*	[APInt] Use lshrInPlace to replace lshr where possible	Craig Topper	2017-04-18	1	-4/+4
\| \| \| \| \| \| \| \| \| \|	This patch uses lshrInPlace to replace code where the object that lshr is called on is being overwritten with the result. This adds an lshrInPlace(const APInt &) version as well. Differential Revision: https://reviews.llvm.org/D32155 llvm-svn: 300566
*	[APInt] Cleanup the reverseBits slow case a little.	Craig Topper	2017-04-18	1	-6/+4
\| \| \| \| \| \|	Use lshrInPlace. Use single bit extract and operator\|=(uint64_t) to avoid a few temporary APInts. llvm-svn: 300527
*	[APInt] Make operator<<= shift in place. Improve the implementation of ↵	Craig Topper	2017-04-18	1	-78/+24
\| \| \| \| \| \|	tcShiftLeft and use it to implement operator<<=. llvm-svn: 300526
*	[APInt] Merge the multiword code from lshrInPlace and tcShiftRight into a ↵	Craig Topper	2017-04-17	1	-68/+25
\| \| \| \| \| \| \| \| \| \| \| \|	single implementation This merges the two different multiword shift right implementations into a single version located in tcShiftRight. lshrInPlace now calls tcShiftRight for the multiword case. I retained the memmove fast path from lshrInPlace and used a memset for the zeroing. The for loop is basically tcShiftRight's implementation with the zeroing and the intra-shift of 0 removed. Differential Revision: https://reviews.llvm.org/D32114 llvm-svn: 300503
*	[APInt] Fix a bug in lshr by a value more than 64 bits above the bit width.	Craig Topper	2017-04-16	1	-1/+1
\| \| \| \| \| \|	This was throwing an assert because we determined the intra-word shift amount by subtracting the size of the full word shift from the total shift amount. But we failed to account for the fact that we clipped the full word shifts by total words first. To fix this just calculate the intra-word shift as the remainder of dividing by bits per word. llvm-svn: 300405
*	Remove all allocation and divisions from GreatestCommonDivisor	Richard Smith	2017-04-13	1	-70/+82
\| \| \| \| \| \| \| \| \| \| \|	Switch from Euclid's algorithm to Stein's algorithm for computing GCD. This avoids the (expensive) APInt division operation in favour of bit operations. Remove all memory allocation from within the GCD loop by tweaking our `lshr` implementation so it can operate in-place. Differential Revision: https://reviews.llvm.org/D31968 llvm-svn: 300252
*	[APInt] Reorder fields to avoid a hole in the middle of the class	Craig Topper	2017-04-13	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: APInt is currently implemented with an unsigned BitWidth field first and then a uint_64/pointer union. Due to the 64-bit size of the union there is a hole after the bitwidth. Putting the union first allows the class to be packed. Making it 12 bytes instead of 16 bytes. An APSInt goes from 20 bytes to 16 bytes. This shows a 4k reduction on the size of the opt binary on my local x86-64 build. So this enables some other improvement to the code as well. Reviewers: dblaikie, RKSimon, hans, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D32001 llvm-svn: 300171
*	[APInt] Generalize the implementation of tcIncrement to support adding a ↵	Craig Topper	2017-04-13	1	-64/+40
\| \| \| \| \| \|	full 'word' by introducing tcAddPart. Use this to support tcIncrement, operator++ and operator+=(uint64_t). Do the same for subtract. NFCI. llvm-svn: 300169
*	[APInt] Make use of whichWord and maskBit to simplify some code. NFC	Craig Topper	2017-04-02	1	-5/+3
\| \| \| \|	llvm-svn: 299342
*	[APInt] Add a public typedef for the internal type of APInt use it instead ↵	Craig Topper	2017-04-02	1	-81/+81
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	of integerPart. Make APINT_BITS_PER_WORD and APINT_WORD_SIZE public. This patch is one step to attempt to unify the main APInt interface and the tc functions used by APFloat. This patch adds a WordType to APInt and uses that in all the tc functions. I've added temporary typedefs to APFloat to alias it to integerPart to keep the patch size down. I'll work on removing that in a future patch. In future patches I hope to reuse the tc functions to implement some of the main APInt functionality. I may remove APINT_ from BITS_PER_WORD and WORD_SIZE constants so that we don't have the repetitive APInt::APINT_ externally. Differential Revision: https://reviews.llvm.org/D31523 llvm-svn: 299341
*	[X86] Use tcAdd/tcSubtract to implement the slow case of operator+=/operator-=.	Craig Topper	2017-04-02	1	-33/+3
\| \| \| \|	llvm-svn: 299326
*	[APInt] Combine declaration and initialization. NFC	Craig Topper	2017-04-02	1	-6/+2
\| \| \| \|	llvm-svn: 299325
*	[APInt] Simplify some code by using operator+=(uint64_t) instead of doing a ↵	Craig Topper	2017-04-02	1	-7/+2
\| \| \| \| \| \|	more complex assignment into a temporary APInt just to use the APInt operator+=. llvm-svn: 299324
*	[APInt] Fix typo in comment. NFC	Craig Topper	2017-04-02	1	-1/+1
\| \| \| \|	llvm-svn: 299323
*	[APInt] Use conditional operator to simplify some code. NFC	Craig Topper	2017-04-01	1	-4/+1
\| \| \| \|	llvm-svn: 299320
*	[APInt] Implement flipAllBitsSlowCase with tcComplement. NFCI	Craig Topper	2017-04-01	1	-2/+1
\| \| \| \|	llvm-svn: 299319
*	[APInt] Fix indentation. NFC	Craig Topper	2017-04-01	1	-8/+8
\| \| \| \|	llvm-svn: 299318
*	[APInt] Implement AndAssignSlowCase using tcAnd. Do the same for Or and Xor. ↵	Craig Topper	2017-04-01	1	-9/+3
\| \| \| \| \| \|	NFCI llvm-svn: 299317
*	[APInt] Allow GreatestCommonDivisor to take rvalue inputs efficiently. Use ↵	Craig Topper	2017-04-01	1	-6/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	moves instead of copies in the loop. Summary: GreatestComonDivisor currently makes a copy of both its inputs. Then in the loop we do one move and two copies, plus any allocation the urem call does. This patch changes it to take its inputs by value so that we can do a move of any rvalue inputs instead of copying. Then in the loop we do 3 move assignments and no copies. This way the only possible allocations we have in the loop is from the urem call. Reviewers: dblaikie, RKSimon, hans Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31572 llvm-svn: 299314
*	[APInt] Remove the mul/urem/srem/udiv/sdiv functions from the APIntOps ↵	Craig Topper	2017-04-01	1	-1/+1
\| \| \| \| \| \|	namespace. Replace the few usages with calls to the class methods. NFC llvm-svn: 299292
*	[APInt] Rewrite getLoBits in a way that will do one less memory allocation ↵	Craig Topper	2017-03-31	1	-3/+4
\| \| \| \| \| \|	in the multiword case. Rewrite getHiBits to use the class method version of lshr instead of the one in APIntOps. NFCI llvm-svn: 299243
*	[APInt] Reformat tc functions to put opening curly braces on the end of the ↵	Craig Topper	2017-03-28	1	-104/+45
\| \| \| \| \| \|	previous line. NFC llvm-svn: 298900
*	[APInt] Remove an anonymous namespace around static functions. NFC	Craig Topper	2017-03-28	1	-35/+33
\| \| \| \|	llvm-svn: 298899
*	[APInt] Combine variable declaration and initialization where possible in ↵	Craig Topper	2017-03-28	1	-76/+36
\| \| \| \| \| \|	the tc functions. NFCI llvm-svn: 298898
*	[APInt] Use 'unsigned' instead of 'unsigned int' in the interface to the ↵	Craig Topper	2017-03-28	1	-36/+36
\| \| \| \| \| \|	APInt tc functions. This is more consistent with the rest of the codebase. NFC llvm-svn: 298897
*	[APInt] Move the single word cases of the bitwise operators inline.	Craig Topper	2017-03-28	1	-18/+3
\| \| \| \|	llvm-svn: 298894
*	[APInt] Move operator=(uint64_t) inline as its pretty simple and is often ↵	Craig Topper	2017-03-27	1	-10/+0
\| \| \| \| \| \| \| \|	used with small constants that the compiler can optimize. While there recognize that we only need to clearUnusedBits on the single word case. llvm-svn: 298881
*	[APInt] Move operator&=(uint64_t) inline and use memset to clear the upper ↵	Craig Topper	2017-03-27	1	-12/+0
\| \| \| \| \| \| \| \|	words. This method is pretty new and probably isn't use much in the code base so this should have a negligible size impact. The OR and XOR operators are already inline. llvm-svn: 298870
*	[APInt] Move the >64 bit case for flipAllBits out of line.	Craig Topper	2017-03-27	1	-0/+5
\| \| \| \| \| \|	This is more consistent with what we do for other operations. This shrinks the opt binary on my build by ~72k. llvm-svn: 298858
*	[APInt] Don't initialize VAL to 0 in APInt constructors. Push it down to the ↵	Craig Topper	2017-03-20	1	-2/+5
\| \| \| \| \| \| \| \| \| \|	initSlowCase and other init methods. I'm not sure if zeroing VAL before writing pVal is really necessary, but at least one other place did it in code. But by taking the store out of line, this reduces the opt binary by about 20k on my local x86-64 build. llvm-svn: 298233
*	[APInt] Add APInt::insertBits() method to insert an APInt into a larger APInt	Simon Pilgrim	2017-03-10	1	-0/+59
\| \| \| \| \| \| \| \| \| \| \| \|	We currently have to insert bits via a temporary variable of the same size as the target with various shift/mask stages, resulting in further temporary variables, all of which require the allocation of memory for large APInts (MaskSizeInBits > 64). This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::insertBits() helper method which avoids the temporary memory allocation and masks/inserts the raw bits directly into the target. Differential Revision: https://reviews.llvm.org/D30780 llvm-svn: 297458
*	Fixed typos in comments. NFCI.	Simon Pilgrim	2017-03-09	1	-6/+6
\| \| \| \|	llvm-svn: 297379
*	[APInt] Add rvalue reference support to and, or, xor operations to allow ↵	Craig Topper	2017-03-07	1	-25/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	their memory allocation to be reused when possible This extends an earlier change that did similar for add and sub operations. With this first patch we lose the fastpath for the single word case as operator&= and friends don't support it. This can be added there if we think that's important. I had to change some functions in the APInt class since the operator overloads were moved out of the class and can't be used inside the class now. The getBitsSet change collides with another outstanding patch to implement it with setBits. But I didn't want to make this patch dependent on that series. I've also removed the Or, And, Xor functions which were rarely or never used. I already commited two changes to remove the only uses of Or that existed. Differential Revision: https://reviews.llvm.org/D30612 llvm-svn: 297121
*	[APInt] Add setLowBits/setHighBits methods to APInt.	Craig Topper	2017-03-07	1	-29/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There are quite a few places in the code base that do something like the following to set the high or low bits in an APInt. KnownZero \|= APInt::getHighBitsSet(BitWidth, BitWidth - 1); For BitWidths larger than 64 this creates a short lived APInt with malloced storage. I think it might even call malloc twice. Its better to just provide methods that can set the necessary bits without the temporary APInt. I'll update usages that benefit in a separate patch. Reviewers: majnemer, MatzeB, davide, RKSimon, hans Reviewed By: hans Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30525 llvm-svn: 297111
*	[APInt] Optimize APInt creation from uint64_t	Craig Topper	2017-03-01	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This patch moves the clearUnusedBits calls into the two different initialization paths for APInt from a uint64_t. This allows the compiler to better optimize the clearing of the unused bits for the single word case. And it puts the clearing for the multi word case into the initSlowCase function to save code. In the common case of initializing with 0 this allows the clearing to be completely optimized out for the single word case. On my local x86 build this is showing a ~45kb reduction in the size of the opt binary. Reviewers: RKSimon, hans, majnemer, davide, MatzeB Reviewed By: hans Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30486 llvm-svn: 296677
*	[APInt] Add APInt::extractBits() method to extract APInt subrange (reapplied)	Simon Pilgrim	2017-02-25	1	-0/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current pattern for extract bits in range is typically: Mask.lshr(BitOffset).trunc(SubSizeInBits); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable. This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation. Differential Revision: https://reviews.llvm.org/D30336 llvm-svn: 296272
*	Revert: r296141 [APInt] Add APInt::extractBits() method to extract APInt ↵	Simon Pilgrim	2017-02-24	1	-32/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	subrange The current pattern for extract bits in range is typically: Mask.lshr(BitOffset).trunc(SubSizeInBits); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable. This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation. Differential Revision: https://reviews.llvm.org/D30336 llvm-svn: 296147
*	[APInt] Add APInt::extractBits() method to extract APInt subrange	Simon Pilgrim	2017-02-24	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current pattern for extract bits in range is typically: Mask.lshr(BitOffset).trunc(SubSizeInBits); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable. This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation. Differential Revision: https://reviews.llvm.org/D30336 llvm-svn: 296141
*	[APInt] Add APInt::setBits() method to set all bits in range	Simon Pilgrim	2017-02-24	1	-0/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current pattern for setting bits in range is typically: Mask \|= APInt::getBitsSet(MaskSizeInBits, LoPos, HiPos); Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation memory for the temporary variable. This is one of the key compile time issues identified in PR32037. This patch adds the APInt::setBits() helper method which avoids the temporary memory allocation completely, this first implementation uses setBit() internally instead but already significantly reduces the regression in PR32037 (~10% drop). Additional optimization may be possible. I investigated whether there is need for APInt::clearBits() and APInt::flipBits() equivalents but haven't seen these patterns to be particularly common, but reusing the code would be trivial. Differential Revision: https://reviews.llvm.org/D30265 llvm-svn: 296102
*	Strip trailing whitespace.	Simon Pilgrim	2017-02-23	1	-8/+8
\| \| \| \|	llvm-svn: 295989
*	[APInt] Fix rotl/rotr when the shift amount is greater than the total bit width.	Joey Gouly	2017-02-07	1	-2/+15
\| \| \| \| \|	Review: https://reviews.llvm.org/D27749 llvm-svn: 294295
*	[APInt] Add integer API bor bitwise operations.	Amaury Sechet	2017-02-03	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: As per title. I ran into that limitation of the API doing some other work, so I though that'd be a nice addition. Reviewers: jroelofs, compnerd, majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29503 llvm-svn: 294063