bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[WebAssembly] Switch to MC for instruction printing.	Dan Gohman	2015-11-12	9	-173/+121
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This encompasses several changes which are all interconnected: - Use the MC framework for printing almost all instructions. - AsmStrings are now live. - This introduces an indirection between LLVM vregs and WebAssembly registers, and a new pass, WebAssemblyRegNumbering, for computing a basic the mapping. This addresses some basic issues with argument registers and unused registers. - The way ARGUMENT instructions are handled no longer generates redundant get_local+set_local for every argument. This also changes the assembly syntax somewhat; most notably, MC's printing use sigils on label names, so those are no longer present, and push/pop now have a sigil to keep them unambiguous. The usage of set_local/get_local/$push/$pop will continue to evolve significantly. This patch is just one step of a larger change. llvm-svn: 252858
*	[TLS on Darwin] use a different mask for tls calls on x86-64.	Manman Ren	2015-11-12	4	-0/+16
\| \| \| \| \| \| \| \| \|	Calls involved in thread-local variable lookup save more registers than normal calls. rdar://problem/23073171 llvm-svn: 252837
*	[ARM] Enable shrink-wrapping by default.	Quentin Colombet	2015-11-11	1	-0/+5
\| \| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 252825
*	[WinEH] Only generate UnwindHelp slot for MSVCXX	Joseph Tremoulet	2015-11-11	1	-12/+12
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Other personalities don't use this special frame slot. Reviewers: majnemer, andrew.w.kaylor, rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14580 llvm-svn: 252778
*	[MIPS] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz()	Sanjay Patel	2015-11-11	2	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MIPS32 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any MIPS32 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: jr $ra clz $2, $4 cttz: addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 jr $ra subu $2, $2, $1 Instead of: ctlz: beqz $4, $BB0_2 addiu $2, $zero, 32 clz $2, $4 $BB0_2: jr $ra nop cttz: beqz $4, $BB1_2 addiu $2, $zero, 32 addiu $1, $4, -1 not $2, $4 and $1, $2, $1 clz $1, $1 addiu $2, $zero, 32 subu $2, $2, $1 $BB1_2: jr $ra nop See D14469 for the larger motivation. Differential Revision: http://reviews.llvm.org/D14500 llvm-svn: 252755
*	Properly fix unused variable in disable-assert builds.	Diego Novillo	2015-11-11	1	-1/+3
\| \| \| \| \| \| \| \|	I missed the side-effects of ParseBFI in my previous attempt (r252748). Thanks dblaikie for the suggestion of adding a void use of the unused variable instead. llvm-svn: 252751
*	Remove unused variable in disable-assert builds. NFC.	Diego Novillo	2015-11-11	1	-2/+1
\| \| \| \|	llvm-svn: 252748
*	Visibly fail if attempting to encode register AH,BH,CH,DH in a REX-prefixed ↵	Douglas Katzman	2015-11-11	1	-0/+7
\| \| \| \| \| \| \| \| \|	instruction. Differential Revision: http://reviews.llvm.org/D13316 Fixes PR25003 llvm-svn: 252743
*	[ARM] Combine BFIs together	James Molloy	2015-11-11	1	-2/+109
\| \| \| \| \| \|	If we have a chain of BFIs, we may be able to combine several together into one merged BFI. We can do this if the "from" bits from one BFI OR'd with the "from" bits from the other BFI form a contiguous range, and the same with the "to" bits. llvm-svn: 252740
*	Silencing nine warnings for "enumeral and non-enumeral type in conditional ↵	Aaron Ballman	2015-11-11	1	-10/+18
\| \| \| \| \| \|	expression"; NFC. llvm-svn: 252728
*	[X86] Replace LEAs with INC/DEC when profitable	Michael Kuperstein	2015-11-11	1	-8/+79
\| \| \| \| \| \| \| \| \|	If possible and profitable, replace lea %reg, 1(%reg) and lea %reg, -1(%reg) with inc %reg and dec %reg respectively. Patch by: anton.nadolsky@intel.com Differential Revision: http://reviews.llvm.org/D14059 llvm-svn: 252722
*	[X86] Fix feature flags on some MMX register instructions that really were ↵	Craig Topper	2015-11-11	1	-2/+13
\| \| \| \| \| \|	introduced with SSE or SSE2. llvm-svn: 252709
*	[X86] Remove redundant MMX isel patterns.	Craig Topper	2015-11-11	1	-4/+0
\| \| \| \|	llvm-svn: 252708
*	[WebAssembly] Support non-legal argument and return types.	Dan Gohman	2015-11-11	3	-79/+127
\| \| \| \|	llvm-svn: 252687
*	[MC] Use LShr for constant evaluation of ">>" on non-arm64 darwin.	Ahmed Bougacha	2015-11-11	1	-4/+0
\| \| \| \| \| \| \|	Follow-up to r235963: this matches other assemblers and is less unexpected (e.g. PR23227). llvm-svn: 252681
*	AMDGPU: Print more fields in comments	Matt Arsenault	2015-11-11	1	-3/+14
\| \| \| \|	llvm-svn: 252677
*	AMDGPU: Remove dead code	Matt Arsenault	2015-11-11	1	-33/+2
\| \| \| \|	llvm-svn: 252675
*	AMDGPU: Set isAllocatable = 0 on VS_32/VS_64	Matt Arsenault	2015-11-11	3	-16/+6
\| \| \| \|	llvm-svn: 252674
*	[WinEH] Insert the MBB for EH_RESTORE after the catchret	Reid Kleckner	2015-11-10	1	-1/+1
\| \| \| \| \| \| \|	Inserting it before the target block could be bad, we might already have a fallthrough edge to it. llvm-svn: 252670
*	[WebAssembly] Remove special cases for things that are no longer special. NFC.	Dan Gohman	2015-11-10	1	-16/+0
\| \| \| \|	llvm-svn: 252656
*	Add PPCMIPeephole.cpp to CMakeLists.txt	Bill Schmidt	2015-11-10	1	-0/+1
\| \| \| \|	llvm-svn: 252654
*	[WebAssembly] Support for floating point min and max.	Dan Gohman	2015-11-10	2	-7/+6
\| \| \| \|	llvm-svn: 252653
*	[PowerPC] Add an MI SSA peephole pass.	Bill Schmidt	2015-11-10	3	-0/+241
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a pass for doing PowerPC peephole optimizations at the MI level while the code is still in SSA form. This allows for easy modifications to the instructions while depending on a subsequent pass of DCE. Both passes are very fast due to the characteristics of SSA. At this time, the only peepholes added are for cleaning up various redundancies involving the XXPERMDI instruction. However, I would expect this will be a useful place to add more peepholes for inefficiencies generated during instruction selection. The pass is placed after VSX swap optimization, as it is best to let that pass remove unnecessary swaps before performing any remaining clean-ups. The utility of these clean-ups are demonstrated by changes to four existing test cases, all of which now have tighter expected code generation. I've also added Eric Schweiz's bugpoint-reduced test from PR25157, for which we now generate tight code. One other test started failing for me, and I've fixed it (test/Transforms/PlaceSafepoints/finite-loops.ll) as well; this is not related to my changes, and I'm not sure why it works before and not after. The problem is that the CHECK-NOT: of "statepoint" from test1 fails because of the "statepoint" in test2, and so forth. Adding a CHECK-LABEL in between keeps the different occurrences of that string properly scoped. llvm-svn: 252651
*	[ARM] add overrides for isCheapToSpeculateCttz() and isCheapToSpeculateCtlz()	Sanjay Patel	2015-11-10	2	-0/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ARM V6T2 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any ARM V6T2 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: clz r0, r0 bx lr cttz: rbit r0, r0 clz r0, r0 bx lr Instead of: ctlz: cmp r0, #0 moveq r0, #32 clzne r0, r0 bx lr cttz: cmp r0, #0 moveq r0, #32 rbitne r0, r0 clzne r0, r0 bx lr This will help solve a general speculation/despeculation problem noted in PR24818: https://llvm.org/bugs/show_bug.cgi?id=24818 Differential Revision: http://reviews.llvm.org/D14469 llvm-svn: 252639
*	[AArch64] add overrides for isCheapToSpeculateCttz() and ↵	Sanjay Patel	2015-11-10	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	isCheapToSpeculateCtlz() AArch64 has instructions for efficient count-leading/trailing-zeros, so this should be considered a cheap operation (and therefore fair game for speculation) for any AArch64 implementation. The net result of allowing this speculation for the regression tests in this patch is that we get this code: ctlz: clz w0, w0 ret cttz: rbit w8, w0 clz w0, w8 ret Instead of: ctlz: cbz w0, .LBB0_2 clz w0, w0 ret .LBB0_2: orr w0, wzr, #0x20 ret cttz: cbz w0, .LBB1_2 rbit w8, w0 clz w0, w8 ret .LBB1_2: orr w0, wzr, #0x20 ret See D14469 for the larger motivation. Differential Revision: http://reviews.llvm.org/D14505 llvm-svn: 252625
*	[X86] Do not try to custom-lower sitofp/fptosi in soft-float mode	Michael Kuperstein	2015-11-10	1	-11/+17
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D14495 llvm-svn: 252621
*	Reapply "[ARM] Combine CMOV into BFI where possible"	James Molloy	2015-11-10	2	-0/+113
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Added fixes for stage2 failures: CMOV is not commutable; commuting the operands results in the condition being flipped! d'oh! Original commit message: If we have a CMOV, OR and AND combination such as: if (x & CN) y \|= CM; And: * CN is a single bit; * All bits covered by CM are known zero in y; Then we can convert this to a sequence of BFI instructions. This will always be a win if CM is a single bit, will always be no worse than the TST & OR sequence if CM is two bits, and for thumb will be no worse if CM is three bits (due to the extra IT instruction). llvm-svn: 252606
*	[PowerPC] Remove redundant code.	Tilmann Scheller	2015-11-10	1	-3/+2
\| \| \| \| \| \| \| \|	The local variable Hi is never being read. Issue identified by the Clang static analyzer. llvm-svn: 252600
*	[AArch64] Fix halfword load merging for big-endian targets	Oliver Stannard	2015-11-10	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \|	For big-endian targets, when we merge two halfword loads into a word load, the order of the halfwords in the loaded value is reversed compared to little-endian, so the load-store optimiser needs to swap the destination registers. This does not affect merging of two word loads, as we use ldp, which treats the memory as two separate 32-bit words. llvm-svn: 252597
*	AVX512 : Implemented encoding and DAG lowering for VMOVHPS/PD and VMOVLPS/PD ↵	Igor Breger	2015-11-10	2	-5/+119
\| \| \| \| \| \| \| \|	instructions. Differential Revision: http://reviews.llvm.org/D14492 llvm-svn: 252592
*	Remove another variable unused in -Asserts build	David Blaikie	2015-11-10	1	-2/+2
\| \| \| \|	llvm-svn: 252582
*	Remove some unused variables to clean up the -Werror build	David Blaikie	2015-11-10	2	-4/+4
\| \| \| \|	llvm-svn: 252580
*	[Hexagon] Adding instruction aliases and tests.	Colin LeMahieu	2015-11-10	2	-0/+464
\| \| \| \|	llvm-svn: 252579
*	Support for emitting inline stack probes	Andy Ayers	2015-11-10	3	-30/+320
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For CoreCLR on Windows, stack probes must be emitted as inline sequences that probe successive stack pages between the current stack limit and the desired new stack pointer location. This implements support for the inline expansion on x64. For in-body alloca probes, expansion is done during instruction lowering. For prolog probes, a stub call is initially emitted during prolog creation, and expanded after epilog generation, to avoid complications that arise when introducing new machine basic blocks during prolog and epilog creation. Added a new test case, modified an existing one to exclude non-x64 coreclr (for now). Add test case Fix tests llvm-svn: 252578
*	[Hexagon] Fixing compound register printing and reenabling more tests.	Colin LeMahieu	2015-11-10	2	-10/+33
\| \| \| \|	llvm-svn: 252574
*	AArch64: add experimental support for address tagging.	Tim Northover	2015-11-10	3	-5/+64
\| \| \| \| \| \| \| \| \| \| \| \| \|	AArch64 has the ability to use the top 8-bits of an "address" for extra information, with the memory subsystem automatically masking them off for loads and stores. When that's happening, we can sometimes skip masks on memory operations in the compiler. However, this requires the host OS and support stack to preserve those bits so it can't be enabled everywhere. In principle iOS 8.0 and above do take the required precautions and but we'll put it under a flag for now. llvm-svn: 252573
*	[WebAssembly] Support 'unreachable' expression	Derek Schuff	2015-11-10	3	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Lower LLVM's 'unreachable' terminator to ISD::TRAP, and lower ISD::TRAP to wasm's 'unreachable' expression. WebAssembly type-checks expressions, but a noreturn function with a return type that doesn't match the context will cause a check failure. So we lower LLVM 'unreachable' to ISD::TRAP and then lower that to WebAssembly's 'unreachable' expression, which typechecks in any context and causes a trap if executed. Differential Revision: http://reviews.llvm.org/D14515 llvm-svn: 252566
*	[Hexagon] Fixing store instructions and reenabling a few more tests.	Colin LeMahieu	2015-11-10	2	-25/+20
\| \| \| \|	llvm-svn: 252561
*	[ARM] Handle t2ADDri in ARMAsmPrinter::EmitUnwindingInstruction.	Akira Hatanaka	2015-11-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes a bug in ARMAsmPrinter::EmitUnwindingInstruction where llvm_unreachable was reached because t2ADDri wasn't handled. Test case provided by Tim Northover. rdar://problem/23270609 http://reviews.llvm.org/D14518 llvm-svn: 252557
*	[Hexagon] Fixing load instruction parsing and reenabling tests.	Colin LeMahieu	2015-11-10	4	-14/+16
\| \| \| \|	llvm-svn: 252555
*	[WinEH] Remove isBarrier from instructions that do not return	Reid Kleckner	2015-11-09	1	-2/+2
\| \| \| \| \| \|	Fixes machine verification failures with David's latest EH change. llvm-svn: 252541
*	add a SelectionDAG method to check if no common bits are set in two nodes; NFCI	Sanjay Patel	2015-11-09	1	-17/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was suggested in: http://reviews.llvm.org/D13956 and is a follow-on to: http://reviews.llvm.org/rL252515 http://reviews.llvm.org/rL252519 This lets us remove logically equivalent/duplicated code from DAGCombiner and X86ISelDAGToDAG. A corresponding function for IR instructions already exists in ValueTracking. llvm-svn: 252539
*	[WinEH] Don't emit CATCHRET from visitCatchPad	David Majnemer	2015-11-09	3	-10/+27
\| \| \| \| \| \| \|	Instead, emit a CATCHPAD node which will get selected to a target specific sequence. llvm-svn: 252528
*	[x86] try harder to match bitwise 'or' into an LEA	Sanjay Patel	2015-11-09	1	-11/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The motivation for this patch starts with the epic fail example in PR18007: https://llvm.org/bugs/show_bug.cgi?id=18007 ...unfortunately, this patch makes no difference for that case, but it solves some simpler cases. We'll get there some day. :) The current 'or' matching code was using computeKnownBits() via isBaseWithConstantOffset() -> MaskedValueIsZero(), but that's an unnecessarily limited use. We can do more by copying the logic in ValueTracking's haveNoCommonBitsSet(), so we can treat the 'or' as if it was an 'add'. There's a TODO comment here because we should lift the bit-checking logic into a helper function, so it's not duplicated in DAGCombiner. An example of the better LEA matching: leal (%rdi,%rdi), %eax andl $1, %esi orl %esi, %eax Becomes: andl $1, %esi leal (%rsi,%rdi,2), %eax Differential Revision: http://reviews.llvm.org/D13956 llvm-svn: 252515
*	[Hexagon] Separating statement to match what clang-format would do.	Colin LeMahieu	2015-11-09	1	-2/+4
\| \| \| \|	llvm-svn: 252513
*	[WinEH] Tweak funclet prologue/epilogue insertion to pass verifier	Reid Kleckner	2015-11-09	1	-5/+8
\| \| \| \| \| \| \| \| \| \|	For some reason we'd never run MachineVerifier on WinEH code, and you explicitly have to ask for it with llc. I added it to a few test cases to get some coverage. Fixes PR25461. llvm-svn: 252512
*	[Hexagon] Fix -Wmicrosoft-enum-value warning with explicit enum type	Reid Kleckner	2015-11-09	1	-1/+1
\| \| \| \|	llvm-svn: 252505
*	don't repeat function names in comments; NFC	Sanjay Patel	2015-11-09	1	-19/+17
\| \| \| \|	llvm-svn: 252502
*	[AArch64] Add UABDL patterns for log2 shuffle.	Charlie Turner	2015-11-09	1	-2/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This matches the sum-of-absdiff patterns emitted by the vectoriser using log2 shuffles. Relies on D14207 to be able to match the `extract_subvector(..., 0)` Reviewers: t.p.northover, jmolloy Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14208 llvm-svn: 252465
*	[AArch64] Handle extract_subvector(..., 0) in ISel.	Charlie Turner	2015-11-09	2	-18/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Lowering this pattern early to an `EXTRACT_SUBREG` was making it impossible to match larger patterns in tblgen that use `extract_subvector(..., 0)` as part of the their input pattern. It seems like there will exist somewhere a better way of specifying this pattern over all relevant register value types, but I didn't manage to find it. Reviewers: t.p.northover, jmolloy Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14207 llvm-svn: 252464