bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] Add BLSI to isUseDefConvertible.	Craig Topper	2019-06-20	1	-4/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: BLSI sets the C flag is the input is not zero. So if its followed by a TEST of the input where only the Z flag is consumed, we can replace it with the opposite check of the C flag. We should be able to do the same for BLSMSK and BLSR, but the naive test case for those is being optimized to a subo by CodeGenPrepare. Reviewers: spatel, RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63589 llvm-svn: 363957
*	[X86] Add test cases showing missed opportunities to use the C flag from the ↵	Craig Topper	2019-06-20	1	-0/+65
\| \| \| \| \| \|	BLSI instruction to avoid a TEST instruction llvm-svn: 363909
*	[DAGCombiner][X86] Fold (not (neg X)) -> (add X, -1)	Craig Topper	2019-06-04	1	-17/+10
\| \| \| \| \| \| \| \| \| \|	This is a special case of a more general transform (not (sub Y, X)) -> (add X, ~Y). InstCombine knows the general form. I've restricted to the special case to fix the motivating case PR42118. I tried handling any case where Y was constant, but got some changes on some Mips tests that I couldn't quickly prove where beneficial. Fixes PR42118 Differential Revision: https://reviews.llvm.org/D62828 llvm-svn: 362533
*	[X86] Add test cases for 32 and 64 bit versions of PR42118. NFC	Craig Topper	2019-06-03	1	-0/+81
\| \| \| \|	llvm-svn: 362457
*	Revert r362451 "foo" and r362452 "[X86] Add test cases for 32 and 64 bit ↵	Craig Topper	2019-06-03	1	-81/+0
\| \| \| \| \| \| \| \|	versions of PR42118. NFC" I failed to squash these properly llvm-svn: 362453
*	[X86] Add test cases for 32 and 64 bit versions of PR42118. NFC	Craig Topper	2019-06-03	1	-10/+17
\| \| \| \|	llvm-svn: 362452
*	foo	Craig Topper	2019-06-03	1	-0/+74
\| \| \| \|	llvm-svn: 362451
*	[X86] Add some missing blsr patterns	Gabor Buella	2019-01-27	1	-8/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The add+and sequence followed by a branch can happen e.g. when looping over the set bits of an integer: ``` while (x != 0) { func(x & ~x); x &= x - 1; } ``` Reviewed By: ctopper Differential Revision: https://reviews.llvm.org/D57296 llvm-svn: 352306
*	[NFC][X86] Add a few more blsr test cases	Gabor Buella	2019-01-27	1	-0/+100
\| \| \| \|	llvm-svn: 352305
*	[X86] Return false from hasAndNotCompare if the comparision value is a constant.	Craig Topper	2018-12-23	1	-2/+2
\| \| \| \| \| \|	We won't end up using an ANDN instruction in this case so we should generate the same code we do for pre-BMI targets. llvm-svn: 350018
*	[X86] Add isel patterns to match BMI/TBMI instructions when lowering has ↵	Craig Topper	2018-12-21	1	-8/+3
\| \| \| \| \| \| \| \| \| \| \| \|	turned the root nodes into one of the flag producing binops. This fixes the patterns that have or/and as a root. 'and' is handled differently since thy usually have a CMP wrapped around them. I had to look for uses of the CF flag because all these nodes have non-standard CF flag behavior. A real or/xor would always clear CF. In practice we shouldn't be using the CF flag from these nodes as far as I know. Differential Revision: https://reviews.llvm.org/D55813 llvm-svn: 349962
*	[X86] Don't allow optimizeCompareInstr to replace a CMP with BEXTR if the ↵	Craig Topper	2018-12-21	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	sign flag is used. The BEXTR instruction documents the SF bit as undefined. The TBM BEXTR instruction has the same issue, but I'm not sure how to test it. With the control being an immediate we can determine the sign bit is 0 or the BEXTR would have been removed. Fixes PR40060 Differential Revision: https://reviews.llvm.org/D55807 llvm-svn: 349956
*	[X86] Don't match TESTrr from (cmp (and X, Y), 0) during isel. Defer to post ↵	Craig Topper	2018-12-19	1	-18/+6
\| \| \| \| \| \| \| \| \| \| \| \|	processing The (cmp (and X, Y) 0) pattern is greedy and ends up forming a TESTrr and consuming the and when it might be better to use one of the BMI/TBM like BLSR or BLSI. This patch moves removes the pattern from isel and adds a post processing check to combine TESTrr+ANDrr into just a TESTrr. With this patch we are able to select the BMI/TBM instructions, but we'll also emit a TESTrr when the result is compared to 0. In many cases the peephole pass will be able to use optimizeCompareInstr to remove the TEST, but its probably not perfect. Differential Revision: https://reviews.llvm.org/D55870 llvm-svn: 349661
*	[X86] Add test cases to show isel failing to match BMI blsmsk/blsi/blsr when ↵	Craig Topper	2018-12-18	1	-7/+363
\| \| \| \| \| \| \| \|	the flag result is used. A similar things happen to TBM instructions which we already have tests for. llvm-svn: 349450
*	[X86] Add test case for PR40060. NFC	Craig Topper	2018-12-18	1	-0/+32
\| \| \| \|	llvm-svn: 349441
*	[X86] Disable BMI BEXTR in X86DAGToDAGISel::matchBEXTRFromAnd unless we're ↵	Craig Topper	2018-09-30	1	-27/+58
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	on compiling for a CPU with single uop BEXTR Summary: This function turns (X >> C1) & C2 into a BMI BEXTR or TBM BEXTRI instruction. For BMI BEXTR we have to materialize an immediate into a register to feed to the BEXTR instruction. The BMI BEXTR instruction is 2 uops on Intel CPUs. It looks like on SKL its one port 0/6 uop and one port 1/5 uop. Despite what Agner's tables say. I know one of the uops is a regular shift uop so it would have to go through the port 0/6 shifter unit. So that's the same or worse execution wise than the shift+and which is one 0/6 uop and one 0/1/5/6 uop. The move immediate into register is an additional 0/1/5/6 uop. For now I've limited this transform to AMD CPUs which have a single uop BEXTR. If may also might make sense if we can fold a load or if the and immediate is larger than 32-bits and can't be encoded as a sign extended 32-bit value or if LICM or CSE can hoist the move immediate and share it. But we'd need to look more carefully at that. In the regression I looked at it doesn't look load folding or large immediates were occurring so the regression isn't caused by the loss of those. So we could try to be smarter here if we find a compelling case. Reviewers: RKSimon, spatel, lebedev.ri, andreadb Reviewed By: RKSimon Subscribers: llvm-commits, andreadb, RKSimon Differential Revision: https://reviews.llvm.org/D52570 llvm-svn: 343399
*	[X86] Handle COPYs of physregs better (regalloc hints)	Simon Pilgrim	2018-09-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Enable enableMultipleCopyHints() on X86. Original Patch by @jonpa: While enabling the mischeduler for SystemZ, it was discovered that for some reason a test needed one extra seemingly needless COPY (test/CodeGen/SystemZ/call-03.ll). The handling for that is resulted in this patch, which improves the register coalescing by providing not just one copy hint, but a sorted list of copy hints. On SystemZ, this gives ~12500 less register moves on SPEC, as well as marginally less spilling. Instead of improving just the SystemZ backend, the improvement has been implemented in common-code (calculateSpillWeightAndHint(). This gives a lot of test failures, but since this should be a general improvement I hope that the involved targets will help and review the test updates. Differential Revision: https://reviews.llvm.org/D38128 llvm-svn: 342578
*	[NFC][X86][AArch64] Reorganize/cleanup BZHI test patterns	Roman Lebedev	2018-06-06	1	-653/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In D47428, i propose to choose the `~(-(1 << nbits))` as the canonical form of low-bit-mask formation. As it is seen from these tests, there is a reason for that. AArch64 currently better handles `~(-(1 << nbits))`, but not the more traditional `(1 << nbits) - 1` (sic!). The other way around for X86. It would be much better to canonicalize. It would seem that there is too much tests, but this is most of all the auto-generated possible variants of C code that one would expect for BZHI to be formed, and then manually cleaned up a bit. So this should be pretty representable, which somewhat good coverage... Related links: https://bugs.llvm.org/show_bug.cgi?id=36419 https://bugs.llvm.org/show_bug.cgi?id=37603 https://bugs.llvm.org/show_bug.cgi?id=37610 https://rise4fun.com/Alive/idM Reviewers: javed.absar, craig.topper, RKSimon, spatel Reviewed By: RKSimon Subscribers: kristof.beyls, llvm-commits, RKSimon, craig.topper, spatel Differential Revision: https://reviews.llvm.org/D47452 llvm-svn: 334124
*	[X86][BMI][TBM] Only demand bottom 16-bits of the BEXTR control op (PR34042)	Simon Pilgrim	2018-06-06	1	-2/+1
\| \| \| \| \| \| \| \|	Only the bottom 16-bits of BEXTR's control op are required (0:8 INDEX, 15:8 LENGTH). Differential Revision: https://reviews.llvm.org/D47690 llvm-svn: 334083
*	[X86][BMI1] Test i32 intrinsics on 32/64 bits + branch off i64 tests	Simon Pilgrim	2018-06-03	1	-412/+973
\| \| \| \| \| \| \| \|	Further refactoring will wait until D47452 has landed. Part of ongoing work to ensure we test all intrinsic style tests on 32 and 64 bit targets where possible. llvm-svn: 333841
*	[X86][BMI] Remove CTTZ tests - this is fully covered in clz.ll	Simon Pilgrim	2018-06-03	1	-92/+0
\| \| \| \|	llvm-svn: 333840
*	[X86] Add combine to shrink 64-bit ands when one input is an any_extend and ↵	Craig Topper	2018-02-13	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the other input guarantees upper 32 bits are 0. Summary: This gets the shift case from PR35792. Reviewers: spatel, RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43222 llvm-svn: 325018
*	[X86] Add a blsr test case with a shift from PR35792. NFC	Craig Topper	2018-02-13	1	-0/+13
\| \| \| \| \| \|	The blsr pattern here is missed because the add is shrunk, but the and is not. This leaves an any_extend between them. llvm-svn: 324986
*	[TargetLowering] try to create -1 constant operand for math ops via demanded ↵	Sanjay Patel	2018-02-11	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	bits This reverses instcombine's demanded bits' transform which always tries to clear bits in constants. As noted in PR35792 and shown in the test diffs: https://bugs.llvm.org/show_bug.cgi?id=35792 ...we can do better in codegen by trying to form -1. The x86 sub test shows a missed opportunity. I did investigate changing instcombine's behavior, but it would be more work to change canonicalization in IR. Clearing bits / shrinking constants can allow killing instructions, so we'd have to figure out how to not regress those cases. Differential Revision: https://reviews.llvm.org/D42986 llvm-svn: 324839
*	[x86] add test to show missed BMI isel; NFC	Sanjay Patel	2018-02-06	1	-0/+15
\| \| \| \|	llvm-svn: 324403
*	[X86] Artificially lower the complexity of the scalar ANDN patterns so that ↵	Craig Topper	2018-02-05	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	AND with immediate will match first. This allows the immediate to folded into the and instead of being forced to move into a register. This can sometimes result in shorter encodings since the and can sign extend an immediate. This also allows us to match an and to a movzx after a not. This can cause an extra move if the input to the separate NOT has an additional user which requires a copy before the NOT. llvm-svn: 324260
*	Followup on Proposal to move MIR physical register namespace to '$' sigil.	Puyan Lotfi	2018-01-31	1	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Discussed here: http://lists.llvm.org/pipermail/llvm-dev/2018-January/120320.html In preparation for adding support for named vregs we are changing the sigil for physical registers in MIR to '$' from '%'. This will prevent name clashes of named physical register with named vregs. llvm-svn: 323922
*	[X86] Remove 'NOREX' comment from the printing of _NOREX instructions.	Craig Topper	2018-01-23	1	-2/+2
\| \| \| \| \| \|	Some of the NOREX instructions are used in 32-bit mode making this printing confusing. It also doesn't provide a lot of value since you can see the h-register being used by the instruction. llvm-svn: 323174
*	[CodeGen] Use MachineOperand::print in the MIRPrinter for MO_Register.	Francis Visoiu Mistrih	2017-12-07	1	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Work towards the unification of MIR and debug output by refactoring the interfaces. For MachineOperand::print, keep a simple version that can be easily called from `dump()`, and a more complex one which will be called from both the MIRPrinter and MachineInstr::print. Add extra checks inside MachineOperand for detached operands (operands with getParent() == nullptr). https://reviews.llvm.org/D40836 * find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/kill: ([^ ]+) ([^ ]+)<def> ([^ ]+)/kill: \1 def \2 \3/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/kill: ([^ ]+) ([^ ]+) ([^ ]+)<def>/kill: \1 \2 def \3/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/kill: def ([^ ]+) ([^ ]+) ([^ ]+)<def>/kill: def \1 \2 def \3/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/<def>//g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<kill>/killed \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<imp-use,kill>/implicit killed \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<dead>/dead \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<def[ ],[ ]dead>/dead \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<imp-def[ ],[ ]dead>/implicit-def dead \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<imp-def>/implicit-def \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<imp-use>/implicit \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name ".s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<internal>/internal \1/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" -o -name "*.s" $ -type f -print0 \| xargs -0 sed -i '' -E 's/([^ ]+)<undef>/undef \1/g' llvm-svn: 320022
*	[CodeGen] Unify MBB reference format in both MIR and debug output	Francis Visoiu Mistrih	2017-12-04	1	-72/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, print MBB references as '%bb.5'. The MIR printer prints the IR name of a MBB only for block definitions. * find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" $ -type f -print0 \| xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)->getNumber/" << printMBBReference(\1)/g' find . $ -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" $ -type f -print0 \| xargs -0 sed -i '' -E 's/BB#" << ([a-zA-Z0-9_]+)\.getNumber/" << printMBBReference(\1)/g' * find . $ -name ".txt" -o -name ".s" -o -name ".mir" -o -name ".cpp" -o -name ".h" -o -name ".ll" $ -type f -print0 \| xargs -0 sed -i '' -E 's/BB#([0-9]+)/%bb.\1/g' * grep -nr 'BB#' and fix Differential Revision: https://reviews.llvm.org/D40422 llvm-svn: 319665
*	[CodeGen] Print register names in lowercase in both MIR and debug output	Francis Visoiu Mistrih	2017-11-28	1	-11/+11
\| \| \| \| \| \| \| \| \| \| \|	As part of the unification of the debug format and the MIR format, always print registers as lowercase. * Only debug printing is affected. It now follows MIR. Differential Revision: https://reviews.llvm.org/D40417 llvm-svn: 319187
*	Revert r314249 "Recommit r314151 "[X86] Make all the NOREX CodeGenOnly ↵	Craig Topper	2017-09-27	1	-2/+2
\| \| \| \| \| \| \| \|	instructions into postRA pseudos like the NOREX version of TEST.""" This caused PR34751 llvm-svn: 314339
*	Recommit r314151 "[X86] Make all the NOREX CodeGenOnly instructions into ↵	Craig Topper	2017-09-26	1	-2/+2
\| \| \| \| \| \| \| \|	postRA pseudos like the NOREX version of TEST."" The late MOV8rr_NOREX that caused the crash has been removed. llvm-svn: 314249
*	Revert "[X86] Make all the NOREX CodeGenOnly instructions into postRA ↵	Benjamin Kramer	2017-09-26	1	-2/+2
\| \| \| \| \| \| \| \|	pseudos like the NOREX version of TEST." Makes llc crash. This reverts commit r314151. llvm-svn: 314199
*	[X86] Make all the NOREX CodeGenOnly instructions into postRA pseudos like ↵	Craig Topper	2017-09-25	1	-2/+2
\| \| \| \| \| \|	the NOREX version of TEST. llvm-svn: 314151
*	[X86] Make sure we emit a SUBREG_TO_REG after the MOV32ri when creating a ↵	Craig Topper	2017-09-13	1	-0/+12
\| \| \| \| \| \| \| \|	BEXTR64rr instruction from a shift/and pair. Fixes PR34589. llvm-svn: 313126
*	[X86] Move matching of (and (srl/sra, C), (1<<C) - 1) to BEXTR/BEXTRI ↵	Craig Topper	2017-09-12	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	instruction to custom isel Recognizing this pattern during DAG combine hides information about the 'and' and the shift from other combines. I think it should be recognized at isel so its as late as possible. But it can't be done with table based isel because you need to be able to look at both immediates. This patch moves it to custom isel in X86ISelDAGToDAG.cpp. This does break a couple tests in tbm_patterns because we are now emitting an and_flag node or (cmp and, 0) that we dont' recognize yet. We already had this problem for several other TBM patterns so I think this fine and we can address of them together. I've also fixed a bug where the combine to BEXTR was preventing us from using a trick of zero extending AH to handle extracts of bits 15:8. We might still want to use BEXTR if it enables load folding. But honestly I hope we narrowed the load instead before got to isel. I think we should probably also support matching BEXTR from (srl/srl (and mask << C), C). But that should be a different patch. Differential Revision: https://reviews.llvm.org/D37592 llvm-svn: 313054
*	[X86][BMI] Add BEXTR demanded bits test cases (PR34042)	Simon Pilgrim	2017-08-13	1	-0/+24
\| \| \| \|	llvm-svn: 310802
*	[X86] Use BEXTR/BEXTRI for 64-bit 'and' with a large mask	Craig Topper	2017-08-01	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The 64-bit 'and' with immediate instruction only supports a 32-bit immediate. So for larger constants we have to load the constant into a register first. If the immediate happens to be a mask we can use the BEXTRI instruction to perform the masking. We already do something similar using the BZHI instruction from the BMI2 instruction set. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36129 llvm-svn: 309706
*	[X86] Split bmi.ll into a bmi test and a bmi2 test.	Craig Topper	2017-08-01	1	-150/+163
\| \| \| \| \| \| \| \|	This moves all the bmi2 specific intrinsics to a separate test file and adds a bmi1 only command line to the existing bmi test. This will allow us to see the missed opportunity to use bextr to handle 64-bit 'and' with a large mask. This will be improved in an upcoming patch. llvm-svn: 309700
*	[X86] Add pattern to use bzhi for 64-bit 'and' with a mask when there is a ↵	Craig Topper	2017-07-31	1	-0/+12
\| \| \| \| \| \| \| \|	load involved. We already had a pattern without load, but with a load we were falling back to a regular 'and' due to pattern complexity priority. llvm-svn: 309535
*	[X86] Add more patterns for BZHI isel	Craig Topper	2017-05-09	1	-0/+76
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds more patterns that a reasonable person might write that can be compiled to BZHI. This adds support for (~0U >> (32 - b)) & a; and a << (32 - b) >> (32 - b); This was inspired by the code in APInt::clearUnusedBits. This can pass an index of 32 to the bzhi instruction which a quick test of Haswell hardware shows will not mask any bits. Though the description text in the Intel manual says the "index is saturated to OperandSize-1". The pseudocode in the same manual indicates no bits will be zeroed for this case. I think this is still missing cases where the subtract portion is an 8-bit operation. Differential Revision: https://reviews.llvm.org/D32616 llvm-svn: 302549
*	VirtRegMap: Replace some identity copies with KILL instructions.	Matthias Braun	2016-07-09	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	An identity COPY like this: %AL = COPY %AL, %EAX<imp-def> has no semantic effect, but encodes liveness information: Further users of %EAX only depend on this instruction even though it does not define the full register. Replace the COPY with a KILL instruction in those cases to maintain this liveness information. (This reverts a small part of r238588 but this time adds a comment explaining why a KILL instruction is useful). llvm-svn: 274952
*	Recommit r274692 - [X86] Transform setcc + movzbl into xorl + setcc	Michael Kuperstein	2016-07-07	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \|	xorl + setcc is generally the preferred sequence due to the partial register stall setcc + movzbl suffers from. As a bonus, it also encodes one byte smaller. This fixes PR28146. The original commit tried inserting an 8bit-subreg into a GR32 (not GR32_ABCD) which was not appreciated by fast regalloc on 32-bit. llvm-svn: 274802
*	Revert r274692 to check whether this is what breaks windows selfhost.	Michael Kuperstein	2016-07-07	1	-1/+3
\| \| \| \|	llvm-svn: 274771
*	[X86] Transform setcc + movzbl into xorl + setcc	Michael Kuperstein	2016-07-06	1	-3/+1
\| \| \| \| \| \| \| \| \| \| \|	xorl + setcc is generally the preferred sequence due to the partial register stall setcc + movzbl suffers from. As a bonus, it also encodes one byte smaller. This fixes PR28146. Differential Revision: http://reviews.llvm.org/D21774 llvm-svn: 274692
*	[x86, BMI] add TLI hook for 'andn' and use it to simplify comparisons	Sanjay Patel	2016-05-07	1	-11/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For the sake of minimalism, this patch is x86 only, but I think that at least PPC, ARM, AArch64, and Sparc probably want to do this too. We might want to generalize the hook and pattern recognition for a target like PPC that has a full assortment of negated logic ops (orc, nand). Note that http://reviews.llvm.org/D18842 will cause this transform to trigger more often. For reference, this relates to: https://llvm.org/bugs/show_bug.cgi?id=27105 https://llvm.org/bugs/show_bug.cgi?id=27202 https://llvm.org/bugs/show_bug.cgi?id=27203 https://llvm.org/bugs/show_bug.cgi?id=27328 Differential Revision: http://reviews.llvm.org/D19087 llvm-svn: 268858
*	[CodeGen] When promoting CTTZ operations to larger type, don't insert a ↵	Craig Topper	2016-04-23	1	-58/+3
\| \| \| \| \| \|	select to detect if the input is zero to return the original size instead of the extended size. Instead just set the first bit in the zero extended part. llvm-svn: 267280
*	DAGCombiner: Reduce 64-bit BFE pattern to pattern on 32-bit component	Matt Arsenault	2016-04-21	1	-2/+2
\| \| \| \| \| \| \|	If the extracted bits are restricted to the upper half or lower half, this can be truncated. llvm-svn: 267024
*	[x86] add tests to show potential BMI optimization	Sanjay Patel	2016-04-13	1	-0/+68
\| \| \| \|	llvm-svn: 266243