bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Don't emit apple accelerator tables on non-darwin targets	Pavel Labath	2018-01-17	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Currently -glldb turns on emission of apple tables on all targets, but lldb is only really capable of consuming them on darwin. Furthermore, making lldb consume these tables is not straight-forward because of the differences in how the debug info is distributed on darwin vs. elf targets. The darwin debug model assumes that the debug info (along with accelerator tables) will either remain in the .o files or it will be linked into a dsym bundle by a linker that knows how to merge these tables. In the elf world, all present linkers will simply concatenate these accelerator tables into the shared object. Since the tables are not self-terminating, this renders the tables unusable, as the debugger cannot pry the individual tables apart anymore. It might theoretically be possible to make the tables work with split dwarf, as that is somewhat similar to the apple .o model, but unfortunately right now the combination of -glldb and -gsplit-dwarf produces broken object files. Until these issues are resolved there is no point in emitting the apple tables for these targets. At best, it wastes space; at worst, it breaks compilation and prevents the user from getting other benefits of -glldb. Reviewers: probinson, aprantl, dblaikie Subscribers: emaste, dim, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D41986 llvm-svn: 322633
*	[MC] Fix -stack-size-section on ARM	Sean Eveson	2018-01-17	1	-2/+1
\| \| \| \| \| \| \| \|	Change symbol values in the stack_size section from being 8 bytes, to being a target dependent size. Differential Revision: https://reviews.llvm.org/D42108 llvm-svn: 322619
*	[CodeGen] Skip some instructions that shouldn't affect shrink-wrapping	Francis Visoiu Mistrih	2018-01-16	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \|	r320606 checked for MI.isMetaInstruction which skips all DBG_VALUEs. This also skips IMPLICIT_DEFs and other instructions that may def / read a register. Differential Revision: https://reviews.llvm.org/D42119 llvm-svn: 322584
*	[LiveDebugValues] recognize spilled reg killed in instruction after spill	Petar Jovanovic	2018-01-16	1	-7/+30
\| \| \| \| \| \| \| \| \| \| \|	Current condition for spill instruction recognition in LiveDebugValues does not recognize case when register is spilled and killed in next instruction. Patch by Nikola Prica. Differential Revision: https://reviews.llvm.org/D41226 llvm-svn: 322554
*	[CodeGen] Remove special case of printing subRegIdx from MachineInstr::print	Francis Visoiu Mistrih	2018-01-16	1	-3/+0
\| \| \| \| \| \| \|	Support in MachineOperand has been added in r320209. No need to special case this anymore. llvm-svn: 322542
*	[CodeGen][NFC] Correct case for printSubRegIdx	Francis Visoiu Mistrih	2018-01-16	3	-3/+3
\| \| \| \|	llvm-svn: 322541
*	Revert "[DAG] Elide overlapping stores"	Benjamin Kramer	2018-01-15	1	-20/+21
\| \| \| \| \| \| \|	This reverts commit r322085. Internal PPC testing is still showing the same symptoms as when this patch landed the last time. llvm-svn: 322474
*	[MachineOutliner] Move hasAddressTaken check to MachineOutliner.cpp	Jessica Paquette	2018-01-13	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Mostly NFC. Still updating the test though just for completeness. This moves the hasAddressTaken check to MachineOutliner.cpp and replaces it with a per-basic block test rather than a per-function test. The old test was too conservative and was preventing functions in C programs from being outlined even though they were safe to outline. This was mostly a problem in C sources. llvm-svn: 322425
*	[NFC] Change MemIntrinsicInst::setAlignment() to take an unsigned instead of ↵	Daniel Neilson	2018-01-12	2	-7/+10
\| \| \| \| \| \| \| \| \| \| \|	a Constant Summary: In preparation for https://reviews.llvm.org/D41675 this NFC changes this prototype of MemIntrinsicInst::setAlignment() to accept an unsigned instead of a Constant. llvm-svn: 322403
*	[DWARFv5] CodeGen support for MD5 file checksums	Paul Robinson	2018-01-12	5	-45/+48
\| \| \| \| \| \| \| \| \| \|	Pass MD5 checksums through from IR to assembly/object files. After this, getting Clang to compute the MD5 should be the last step to supporting MD5 in the DWARF v5 line table header. Differential Revision: https://reviews.llvm.org/D41926 llvm-svn: 322391
*	[ARM GlobalISel] Legalize G_FMA	Diana Picus	2018-01-12	1	-2/+9
\| \| \| \| \| \| \| \| \| \| \|	For hard float with VFP4, it is legal. Otherwise, we use libcalls. This needs a bit of support in the LegalizerHelper for soft float because we didn't handle G_FMA libcalls yet. The support is trivial, as the only difference between G_FMA and other libcalls that we already handle is that it has 3 input operands rather than just 2. llvm-svn: 322366
*	[CGP] Re-enable Select in complex addressing mode	Serguei Katkov	2018-01-12	1	-1/+1
\| \| \| \| \| \| \| \|	Re-enable Select after a couple of fixes. Differential Revision: https://reviews.llvm.org/D40634 llvm-svn: 322358
*	PeepholeOpt cleanup/refactor; NFC	Matthias Braun	2018-01-11	1	-440/+370
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Less unnecessary use of `auto` - Add early `using RegSubRegPair(AndIdx) =` to avoid countless `TargetInstrInfo::` qualifications. - Use references instead of pointers where possible. - Remove unused parameters. - Rewrite the CopyRewriter class hierarchy: - Pull out uncoalescable copy rewriting functionality into PeepholeOptimizer class. - Use an abstract base class to make it clear that rewriters are independent. - Remove unnecessary \brief in doxygen comments. - Remove unused constructor and method from ValueTracker. - Replace UseAdvancedTracking of ValueTracker with DisableAdvCopyOpt use. llvm-svn: 322325
*	PeepholeOptimizer: Fix for vregs without defs	Matthias Braun	2018-01-11	2	-3/+22
\| \| \| \| \| \| \| \| \| \|	The PeepholeOptimizer would fail for vregs without a definition. If this was caused by an undef operand abort to keep the code simple (so we don't need to add logic everywhere to replicate the undef flag). Differential Revision: https://reviews.llvm.org/D40763 llvm-svn: 322319
*	PeepholeOptimizer: Do not form PHI with subreg arguments	Matthias Braun	2018-01-11	1	-22/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When replacing a PHI the PeepholeOptimizer currently takes the register class of the register at the first operand. This however is not correct if this argument has a subregister index. As there is currently no API to query the register class resulting from applying a subregister index to all registers in a class, we can only abort in these cases and not perform the transformation. This changes findNextSource() to require the end of all copy chains to not use a subregister if there is any PHI in the chain. I had to rewrite the overly complicated inner loop there to have a good place to insert the new check. This fixes https://llvm.org/PR33071 (aka rdar://32262041) Differential Revision: https://reviews.llvm.org/D40758 llvm-svn: 322313
*	dag-combine: Transfer debug information when folding (zext (truncate x))	Adrian Prantl	2018-01-11	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \|	-> (zext (truncate x)) This patch adds debug info support to the dagcombine rule (zext (truncate x)) -> (zext (truncate x)). Differential Revision: https://reviews.llvm.org/D41924 llvm-svn: 322304
*	DAGCombine: Let truncates negate extension through extract-subvector	Zvi Rackover	2018-01-11	1	-0/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Fold cases such as: (v8i8 truncate (v8i32 extract_subvector (v16i32 sext (v16i8 V), Idx))) -> (v8i8 extract_subvector (v16i8 V), Idx) This can be generalized to cases where the truncate and extend do not fully cancel each other out, but it may require querying the target about profitability. Reviewers: RKSimon, craig.topper, spatel, efriedma Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41927 llvm-svn: 322300
*	[VectorLegalizer] Remove broken code in ExpandStore.	Jonas Paulsson	2018-01-11	1	-28/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code that is supposed to "Round odd types to the next pow of two" seems broken and as well completely unused (untested). It also seems that ExpandStore really shouldn't ever change the memory VT, which this in fact does. As a first step in fixing the broken handling of vector stores (of irregular types, e.g. an i1 vector), this code is removed. For discussion, see https://bugs.llvm.org/show_bug.cgi?id=35520. Review: Eli Friedman llvm-svn: 322275
*	[CodeView] Fix the type for a variadic argument	Aaron Smith	2018-01-11	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: - MSVC uses the none type for a variadic argument in CodeView - Add a unit test Reviewers: zturner, llvm-commits Reviewed By: zturner Differential Revision: https://reviews.llvm.org/D41931 llvm-svn: 322257
*	[DWARF][NFC] Overload AsmPrinter::emitDwarfStringOffsets() to take a ↵	Wolfgang Pieb	2018-01-11	1	-3/+4
\| \| \| \| \| \| \| \| \| \|	DwarfStringPoolEntry record. Differential Revision: https://reviews.llvm.org/D41920 llvm-svn: 322250
*	Revert "AArch64: Fix emergency spillslot being out of reach for large ↵	Matthias Braun	2018-01-10	2	-5/+0
\| \| \| \| \| \| \| \| \| \| \| \|	callframes" Revert for now as the testcase is hitting a pre-existing verifier error that manifests as a failure when expensive checks are enabled (or -verify-machineinstrs is used). This reverts commit r322200. llvm-svn: 322231
*	LiveRangeEdit: Inline markDeadRemat() into only user; NFC	Matthias Braun	2018-01-10	1	-1/+1
\| \| \| \| \| \| \|	This function was only called from a single place in which we didn't even need the `if (DeadRemats)` check. llvm-svn: 322230
*	LiveRangeEdit: Simplify code; NFC	Matthias Braun	2018-01-10	1	-12/+14
\| \| \| \| \| \| \| \|	Simplify the code slightly: Instead of creating empty subranges in one case and immediately removing them, do not create them in the first place. llvm-svn: 322226
*	TargetLoweringBase: The ios simulator has no bzero function.	Matthias Braun	2018-01-10	1	-3/+12
\| \| \| \| \| \| \| \|	Make sure I really get back to the behavior before my rewrite in r321035 which turned out not to be completely NFC as I changed the behavior for the ios simulator environment. llvm-svn: 322223
*	[SelectionDAG][X86] Explicitly store the scale in the gather/scatter ISD nodes	Craig Topper	2018-01-10	5	-24/+46
\| \| \| \| \| \| \| \| \| \|	Currently we infer the scale at isel time by analyzing whether the base is a constant 0 or not. If it is we assume scale is 1, else we take it from the element size of the pass thru or stored value. This seems a little weird and I think it makes more sense to make it explicit in the DAG rather than doing tricky things in the backend. Most of this patch is just making sure we copy the scale around everywhere. Differential Revision: https://reviews.llvm.org/D40055 llvm-svn: 322210
*	AArch64: Fix emergency spillslot being out of reach for large callframes	Matthias Braun	2018-01-10	2	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Large callframes (calls with several hundreds or thousands of parameters) could lead to situations in which the emergency spillslot is out of range to be addressed relative to the stack pointer. This commit forces the use of a frame pointer in the presence of large callframes. This commit does several things: - Compute max callframe size at the end of instruction selection. - Add mirFileLoaded target callback. Use it to compute the max callframe size after loading a .mir file when the size wasn't specified in the file. - Let TargetFrameLowering::hasFP() return true if there exists a callframe > 255 bytes. - Always place the emergency spillslot close to FP if we have a frame pointer. - Note that `useFPForScavengingIndex()` would previously return false when a base pointer was available, leading to the emergency spillslot getting allocated late (that's the whole effect of this callback). That made no sense to me so I took this case out: Even though the emergency spillslot is technically not referenced by FP in this case we still want it allocated early. Differential Revision: https://reviews.llvm.org/D40876 llvm-svn: 322200
*	[SelectionDAGBuilder] Chain prefetches less aggressively.	Jonas Paulsson	2018-01-10	1	-7/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	Prefetches used to always be chained between any previous and following memory accesses. The problem with this was that later optimizations, such as folding of a load into the user instruction, got disrupted. This patch relaxes the chaining of prefetches in order to remedy this. Review: Hal Finkel https://reviews.llvm.org/D38886 llvm-svn: 322163
*	[MIR] Repurposing '$' sigil used by external symbols. Replacing with '&'.	Puyan Lotfi	2018-01-10	3	-3/+3
\| \| \| \| \| \| \| \| \| \|	Planning to add support for named vregs. This puts us in a conundrum since physregs are named as well. To rectify this we need to use a sigil other than '%' for physregs in MIR. We've settled on using '$' for physregs but first we must repurpose it from external symbols using it, which is what this commit is all about. We think '&' will have familiar semantics for C/C++ users. llvm-svn: 322146
*	Reland "Emit Function IDs table for Control Flow Guard"	Adrian McCarthy	2018-01-09	4	-0/+110
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adds option /guard:cf to clang-cl and -cfguard to cc1 to emit function IDs of functions that have their address taken into a section named .gfids$y for compatibility with Microsoft's Control Flow Guard feature. The original patch didn't have the lit.local.cfg file that restricts the new test to x86, thus the new test was failing on the non-x86 bots. Differential Revision: https://reviews.llvm.org/D40531 This reverts r322008, which was a revert of r322005. This reverts commit a05b89f9aca70597dc79fe97bc49b50b51f525ba. llvm-svn: 322136
*	[WebAssembly] Add COMDAT support	Sam Clegg	2018-01-09	1	-9/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds COMDAT support to the Wasm object-file format. Spec: https://github.com/WebAssembly/tool-conventions/pull/31 Corresponding LLD change: https://bugs.llvm.org/show_bug.cgi?id=35533, and D40845 Patch by Nicholas Wilson Differential Revision: https://reviews.llvm.org/D40844 llvm-svn: 322135
*	[DWARFv5] MC support for MD5 file checksums	Paul Robinson	2018-01-09	5	-15/+24
\| \| \| \| \| \| \|	Extend .file directive syntax to allow specifying an MD5 checksum for the source file. Emit the checksums in DWARF v5 line tables. llvm-svn: 322134
*	Tidy some grammar in some comments	Eric Christopher	2018-01-09	2	-4/+4
\| \| \| \|	llvm-svn: 322133
*	[SelectionDAG] Fixed f16-from-vector promotion problem	Tim Renouf	2018-01-09	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In the case of an fp_extend of v1f16 to v1f32 where the v1f16 is the result of a bitcast from i16, avoid creating an illegal fp16_to_fp where the input is not a vector and the result is a v1f32. V2: The fix is now to avoid vector scalarization creating a v1->scalar bitcast. Reviewers: srhines, t.p.northover Subscribers: nhaehnle, llvm-commits, dstuttard, t-tye, yaxunl, wdng, kzhuravl, arsenm Differential Revision: https://reviews.llvm.org/D41126 llvm-svn: 322120
*	[CodeGen] Don't print "pred:" and "opt:" in -debug output	Francis Visoiu Mistrih	2018-01-09	3	-15/+9
\| \| \| \| \| \| \| \| \| \|	In -debug output we print "pred:" whenever a MachineOperand is a predicate operand in the instruction descriptor, and "opt:" whenever a MachineOperand is an optional def in the instruction descriptor. Differential Revision: https://reviews.llvm.org/D41870 llvm-svn: 322096
*	[CodeGen] Print frame-setup/destroy flags in -debug output like we do in MIR	Francis Visoiu Mistrih	2018-01-09	1	-15/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently the MachineInstr::print function prints the frame-setup/frame-destroy differently than it does in MIR. Instead of: %x21 = LDR %sp, -16; flags: FrameDestroy print: %x21 = frame-destroy LDR %sp, -16 llvm-svn: 322088
*	[SelectionDAG] lower math intrinsics to finite version of libcalls when ↵	Sanjay Patel	2018-01-09	3	-20/+65
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	possible (PR35672) Ingredients in this patch: 1. Add HANDLE_LIBCALL defs for finite mathlib functions that correspond to LLVM intrinsics. 2. Plumbing to send TargetLibraryInfo down to SelectionDAGLegalize. 3. Relaxed math and library checking in SelectionDAGLegalize::ConvertNodeToLibcall() to choose finite libcalls. There was a bug about determining the availability of the finite calls that should be fixed with: rL322010 Not in this patch: This doesn't resolve the question/bug of clang creating the intrinsic IR in the first place. There's likely follow-up work needed to support the long double variants better. There's room for improvement to reduce the code duplication. Create finite calls that don't originate from a corresponding intrinsic or DAG node? Differential Revision: https://reviews.llvm.org/D41338 llvm-svn: 322087
*	[CodeGen] Don't print register classes in -debug output	Francis Visoiu Mistrih	2018-01-09	1	-37/+0
\| \| \| \| \| \| \| \| \| \|	Since register classes and banks are already printed with the register definition, don't print it at the end of every instruction anymore. This follows MIR in this regard and is another step to the unification of the two formats. llvm-svn: 322086
*	[DAG] Elide overlapping stores	Nirav Dave	2018-01-09	1	-21/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Relanding after fixing handling of pre-indexed memory operations in BaseIndexOffset analysis (r322003). Extend overlapping store elision to handle overwrites of stores by larger stores. Reviewers: craig.topper, rnk, t.p.northover Subscribers: javed.absar, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40969 llvm-svn: 322085
*	[MIR] Add support for the frame-destroy MachineInstr flag	Francis Visoiu Mistrih	2018-01-09	4	-0/+8
\| \| \| \| \| \| \| \| \|	We are printing / parsing the `frame-setup` MachineInstr flag but not the `frame-destroy` one. Differential Revision: https://reviews.llvm.org/D41509 llvm-svn: 322071
*	[CGP] Fix Complex addressing mode for offset	Serguei Katkov	2018-01-09	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the offset differs between two addressing modes, we can continue only if ScaleReg is not set, because we will use it as the merge of the different offsets. It should fix PR35799 and PR35805. Reviewers: john.brawn, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41227 llvm-svn: 322056
*	[MachineOutliner] AArch64: Handle instrs that use SP and will never need fixups	Jessica Paquette	2018-01-09	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit does two things. Firstly, it adds a collection of flags which can be passed along to the target to encode information about the MBB that an instruction lives in to the outliner. Second, it adds some of those flags to the AArch64 outliner in order to add more stack instructions to the list of legal instructions that are handled by the outliner. The two flags added check if - There are calls in the MachineBasicBlock containing the instruction - The link register is available in the entire block If the link register is available and there are no calls, then a stack instruction can always be outlined without fixups, regardless of what it is, since in this case, the outliner will never modify the stack to create a call or outlined frame. The motivation for doing this was checking which instructions are most often missed by the outliner. Instructions like, say %sp<def> = ADDXri %sp, 32, 0; flags: FrameDestroy are very common, but cannot be outlined in the case that the outliner might modify the stack. This commit allows us to outline instructions like this. llvm-svn: 322048
*	[LiveDebugValues] Change condition for block termination recognition	Petar Jovanovic	2018-01-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The last iterator of MBB should be recognized as MBB.end() not as MBB.instr_end() which could return bundled instruction that is not iterable with basic iterator. Patch by Nikola Prica. Differential Revision: https://reviews.llvm.org/D41626 llvm-svn: 322015
*	Revert "Emit Function IDs table for Control Flow Guard"	Adrian McCarthy	2018-01-08	4	-110/+0
\| \| \| \| \| \| \| \| \| \|	The new test fails on the Hexagon bot. Reverting while I investigate. This reverts https://reviews.llvm.org/rL322005 This reverts commit b7e0026b4385180c378edc658ec91a39566f2942. llvm-svn: 322008
*	Emit Function IDs table for Control Flow Guard	Adrian McCarthy	2018-01-08	4	-0/+110
\| \| \| \| \| \| \| \| \| \|	Adds option /guard:cf to clang-cl and -cfguard to cc1 to emit function IDs of functions that have their address taken into a section named .gfids$y for compatibility with Microsoft's Control Flow Guard feature. Differential Revision: https://reviews.llvm.org/D40531 llvm-svn: 322005
*	[DAG] Teach BaseIndexOffset to correctly handle indexed operations	Nirav Dave	2018-01-08	3	-51/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	BaseIndexOffset address analysis incorrectly ignores offsets folded into indexed memory operations causing potential errors in alias analysis of pre-indexed operations. Reviewers: efriedma, RKSimon, hfinkel, jyknight Subscribers: hiraditya, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D41701 llvm-svn: 322003
*	[DAGCombine] Fix for PR35761	Sam Parker	2018-01-08	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \|	I had falsely assumed that constant operands would be operand(1) of the bin ops that may need their constant operand to be masked. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=35761 Differential Revision: https://reviews.llvm.org/D41667 llvm-svn: 321991
*	[DAG] Fix for Bug PR34620 - Allow SimplifyDemandedBits to look through bitcasts	Simon Pilgrim	2018-01-07	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	Allow SimplifyDemandedBits to use TargetLoweringOpt::computeKnownBits to look through bitcasts. This can help simplification in some cases where bitcasts of constants generated during or after legalization can't be folded away, and thus didn't get picked up by SimplifyDemandedBits. This fixes PR34620, where a redundant pand created during legalization from lowering an lshr <16xi8> wasn't being simplified due to the presence of a bitcasted build_vector as an operand. Committed on behalf of @sameconrad (Sam Conrad) Differential Revision: https://reviews.llvm.org/D41643 llvm-svn: 321969
*	[X86] Make v2i1 and v4i1 legal types without VLX	Craig Topper	2018-01-07	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There are a few oddities that occur due to v1i1, v8i1, v16i1 being legal without v2i1 and v4i1 being legal when we don't have VLX. Particularly during legalization of v2i32/v4i32/v2i64/v4i64 masked gather/scatter/load/store. We end up promoting the mask argument to these during type legalization and then have to widen the promoted type to v8iX/v16iX and truncate it to get the element size back down to v8i1/v16i1 to use a 512-bit operation. Since we need to fill the upper bits of the mask we have to fill with 0s at the promoted type. It would be better if we could just have the v2i1/v4i1 types as legal so they don't undergo any promotion. Then we can just widen with 0s directly in a k register. There are no real v4i1/v2i1 instructions anyway. Everything is done on a larger register anyway. This also fixes an issue that we couldn't implement a masked vextractf32x4 from zmm to xmm properly. We now have to support widening more compares to 512-bit to get a mask result out so new tablegen patterns got added. I had to hack the legalizer for widening the operand of a setcc a bit so it didn't try to create a setcc returning v4i32, extract from it, then try to promote it using a sign extend to v2i1. Now we create the setcc with v4i1 if the original setcc's result type is v2i1. Then extract that and don't sign extend it at all. There's definitely room for improvement with some follow up patches. Reviewers: RKSimon, zvi, guyblank Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41560 llvm-svn: 321967
*	[PowerPC] Add an ISD::TRUNCATE to the legalization for ↵	Craig Topper	2018-01-07	1	-13/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ppc_is_decremented_ctr_nonzero Summary: I believe legalization is really expecting that ReplaceNodeResults will return something with the same type as the thing that's being legalized. Ultimately, it uses the output to replace the uses in the DAG so the type should match to make that work. There are two relevant cases here. When crbits are enabled, then i1 is a legal type and getSetCCResultType should return i1. In this case, the truncate will be between i1 and i1 and should be removed (SelectionDAG::getNode does this). Otherwise, getSetCCResultType will be i32 and the legalizer will promote the truncate to be i32 -> i32 which will be similarly removed. With this fixed we can remove some code from PromoteIntRes_SETCC that seemed to only exist to deal with the intrinsic being replaced with a larger type without changing the other operand. With the truncate being used for connectivity this doesn't happen anymore. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: nemanjai, llvm-commits, kbarton Differential Revision: https://reviews.llvm.org/D41654 llvm-svn: 321959
*	[x86, MemCmpExpansion] allow 2 pairs of loads per block (PR33325)	Sanjay Patel	2018-01-06	1	-6/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the last step needed to fix PR33325: https://bugs.llvm.org/show_bug.cgi?id=33325 We're trading branch and compares for loads and logic ops. This makes the code smaller and hopefully faster in most cases. The 24-byte test shows an interesting construct: we load the trailing scalar elements into vector registers and generate the same pcmpeq+movmsk code that we expected for a pair of full vector elements (see the 32- and 64-byte tests). Differential Revision: https://reviews.llvm.org/D41714 llvm-svn: 321934