path: root/llvm/lib/Target/X86
* [X86][SSE] Improve recognition of uitofp conversions that can be performed as sitofp
  Simon Pilgrim, 2016-09-18 (1 file, -3/+9)

  With D24253 we can now use SelectionDAG::SignBitIsZero with vector operations.

  This patch uses SelectionDAG::SignBitIsZero to recognise that a zero sign bit
  means that we can use a sitofp instead of a uitofp (which is not directly
  supported on pre-AVX512 hardware). While AVX512 does provide support for
  uitofp, the conversion to sitofp should not cause any regressions.

  Differential Revision: https://reviews.llvm.org/D24343

  llvm-svn: 281852

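  A standalone sketch (plain C++, not the DAG combine itself) of the arithmetic
  fact the patch above relies on: when the sign bit of an integer is known to be
  zero, signed and unsigned conversion to floating point agree, so the cheaper
  sitofp can stand in for uitofp. The sample values are illustrative only.

    #include <cassert>
    #include <cstdint>

    int main() {
      const uint32_t samples[] = {0u, 1u, 12345u, 0x7FFFFFFFu}; // sign bit clear
      for (uint32_t u : samples) {
        int32_t s = static_cast<int32_t>(u); // same bits, viewed as signed
        assert(static_cast<double>(u) == static_cast<double>(s));
      }
      return 0;
    }
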
* [X86][SSE] Improve target shuffle mask extraction
  Simon Pilgrim, 2016-09-17 (1 file, -10/+14)

  Add ability to extract vXi64 'vzext_movl' masks on 32-bit targets.

  llvm-svn: 281834

* Defer asm errors to post-statement failure
  Nirav Dave, 2016-09-16 (1 file, -50/+29)

  Recommitting after fixing AsmParser initialization and X86 inline asm error
  cleanup.

  Allow errors to be deferred and emitted as part of clean up to simplify and
  shorten Assembly parser code. This will allow error messages to be emitted in
  helper functions and be modified by the caller, which has better context.

  As part of this, many minor cleanups to the Parser:
  * Unify parser cleanup on error
  * Add workaround for incorrect return values in ParseDirective instances
  * Tighten checks on error-signifying return values for parser functions and
    fix in-tree TargetParsers to be more consistent with the changes.
  * Fix AArch64 test cases checking for spurious error messages that are now
    fixed.

  These changes should be backwards compatible with current Target Parsers so
  long as the error statuses are correctly returned in the appropriate
  functions.

  Reviewers: rnk, majnemer

  Subscribers: aemerson, jyknight, llvm-commits

  Differential Revision: https://reviews.llvm.org/D24047

  llvm-svn: 281762

* Move the Mangler from the AsmPrinter down to TLOF and clean up the TLOF API accordingly
  Eric Christopher, 2016-09-16 (3 files, -20/+19)

  llvm-svn: 281708

* Remove unused function getMang().
  Eric Christopher, 2016-09-16 (1 file, -3/+0)

  llvm-svn: 281707

* Finish renaming remaining analyzeBranch functions
  Matt Arsenault, 2016-09-14 (2 files, -4/+4)

  llvm-svn: 281535

* Make analyzeBranch family of instruction names consistent
  Matt Arsenault, 2016-09-14 (2 files, -3/+3)

  analyzeBranch was renamed to use lowercase first; rename the related set to
  match.

  llvm-svn: 281506

* AArch64: Use TTI branch functions in branch relaxation
  Matt Arsenault, 2016-09-14 (2 files, -4/+11)

  The main change is to return the code size from InsertBranch/RemoveBranch.

  Patch mostly by Tim Northover.

  llvm-svn: 281505

* [x86] fix formatting; NFC
  Sanjay Patel, 2016-09-14 (1 file, -28/+20)

  llvm-svn: 281504

* [X86][SSE] Improve recognition of i64 sitofp conversions that can be performed as i32 (PR29078)
  Simon Pilgrim, 2016-09-14 (1 file, -0/+17)

  Until AVX512DQ we only support i64/vXi64 sitofp conversion as scalars.

  This patch sees if the sign bit extends far enough that we can truncate to an
  i32 type and then perform sitofp without loss of precision.

  Differential Revision: https://reviews.llvm.org/D24345

  llvm-svn: 281502

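  A standalone sketch (plain C++, not the LLVM patch itself) of the precision
  argument above: if the sign bit of an i64 value extends far enough that the
  value fits in an i32, converting via a truncated i32 loses nothing. The
  helper name and sample values are illustrative only.

    #include <cassert>
    #include <cstdint>
    #include <limits>

    // True when the top 33 bits of x are all copies of the sign bit,
    // i.e. x is representable as int32_t.
    static bool fitsInI32(int64_t x) {
      return x == static_cast<int64_t>(static_cast<int32_t>(x));
    }

    int main() {
      const int64_t samples[] = {0, -1, 42, -123456,
                                 std::numeric_limits<int32_t>::min(),
                                 std::numeric_limits<int32_t>::max()};
      for (int64_t x : samples) {
        assert(fitsInI32(x));
        // i64 sitofp and "truncate to i32, then sitofp" agree exactly.
        assert(static_cast<double>(x) ==
               static_cast<double>(static_cast<int32_t>(x)));
      }
      return 0;
    }
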
* [X86][SSE] Don't use PSHUFD directly - lower with generic shuffle
  Simon Pilgrim, 2016-09-14 (1 file, -17/+1)

  Remove the last user of the old getTargetShuffleNode helpers.

  llvm-svn: 281499

* getVectorElementType().getSizeInBits() -> getScalarSizeInBits() ; NFCI
  Sanjay Patel, 2016-09-14 (2 files, -15/+15)

  llvm-svn: 281495

* getValueType().getSizeInBits() -> getValueSizeInBits() ; NFCI
  Sanjay Patel, 2016-09-14 (1 file, -10/+8)

  llvm-svn: 281493

* getValueType().getScalarSizeInBits() -> getScalarValueSizeInBits() ; NFCI
  Sanjay Patel, 2016-09-14 (2 files, -5/+5)

  llvm-svn: 281490

* getScalarType().getSizeInBits() -> getScalarSizeInBits() ; NFCI
  Sanjay Patel, 2016-09-14 (1 file, -1/+1)

  llvm-svn: 281489

* [X86][SSE] Removed unused getTargetShuffleNode function
  Simon Pilgrim, 2016-09-14 (1 file, -17/+0)

  llvm-svn: 281481

* [X86][SSE] Don't blend vector shifts with MOVSS/MOVSD directly, lower from generic shuffle
  Simon Pilgrim, 2016-09-14 (1 file, -10/+10)

  Shuffle lowering will correctly lower to MOVSS/MOVSD/PBLEND, improving
  commutation opportunities.

  llvm-svn: 281471

* AVX-512: Fixed a bug in kortest.z intrinsic
  Elena Demikhovsky, 2016-09-14 (1 file, -1/+1)

  Lowering was wrong - X86ISD::SETCC node should return i8 type.

  llvm-svn: 281446

* [AVX512BW] Change truncStore action (v16i16->v16i8). It can be legal only with AVX512VL.
  Igor Breger, 2016-09-14 (1 file, -2/+3)

  Differential Revision: http://reviews.llvm.org/D24547

  llvm-svn: 281445

* [X86] Remove the VCVTSI2SD32 with rounding intrinsic. It's not used by clang and not needed since 32-bit integer to double is always exact.
  Craig Topper, 2016-09-14 (1 file, -1/+0)

  llvm-svn: 281442

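  A standalone sketch (plain C++, not the removed intrinsic) of why a rounding
  variant is unnecessary here: a double has a 53-bit significand, so every
  32-bit integer converts exactly and the rounding mode can never be observed.
  The sample values are illustrative only.

    #include <cassert>
    #include <cstdint>
    #include <limits>

    int main() {
      const int32_t samples[] = {0, 1, -1,
                                 std::numeric_limits<int32_t>::min(),
                                 std::numeric_limits<int32_t>::max()};
      for (int32_t x : samples) {
        double d = static_cast<double>(x);
        // Round-tripping recovers the value exactly, i.e. the int -> double
        // conversion never rounded.
        assert(static_cast<int64_t>(d) == static_cast<int64_t>(x));
      }
      return 0;
    }
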
* Revert r281336 (and r281337), it caused PR30372.
  Nico Weber, 2016-09-13 (1 file, -28/+50)

  llvm-svn: 281361

* Defer asm errors to post-statement failure
  Nirav Dave, 2016-09-13 (1 file, -50/+28)

  Recommitting after fixing AsmParser initialization.

  Allow errors to be deferred and emitted as part of clean up to simplify and
  shorten Assembly parser code. This will allow error messages to be emitted in
  helper functions and be modified by the caller, which has better context.

  As part of this, many minor cleanups to the Parser:
  * Unify parser cleanup on error
  * Add workaround for incorrect return values in ParseDirective instances
  * Tighten checks on error-signifying return values for parser functions and
    fix in-tree TargetParsers to be more consistent with the changes.
  * Fix AArch64 test cases checking for spurious error messages that are now
    fixed.

  These changes should be backwards compatible with current Target Parsers so
  long as the error statuses are correctly returned in the appropriate
  functions.

  Reviewers: rnk, majnemer

  Subscribers: aemerson, jyknight, llvm-commits

  Differential Revision: https://reviews.llvm.org/D24047

  llvm-svn: 281336

* AVX-512: Fix for PR28175 - Scalar code optimization.
  Elena Demikhovsky, 2016-09-13 (2 files, -7/+17)

  Optimized (truncate (assertzext x) to i1) and anyext i1 to i8/16/32.
  Optimizing these patterns is one more step towards i1 optimization on
  AVX-512.

  Differential Revision: https://reviews.llvm.org/D24456

  llvm-svn: 281302

* [X86] Remove masked shufpd/shufps intrinsics and autoupgrade to native vector shuffles.
  Craig Topper, 2016-09-13 (1 file, -12/+0)

  They were removed from clang previously but accidentally left in the backend.

  llvm-svn: 281300

* X86: Conditional tail calls should not have isBarrier = 1
  Hans Wennborg, 2016-09-13 (1 file, -18/+31)

  That confuses e.g. machine basic block placement, which then doesn't realize
  that control can fall through a block that ends with a conditional tail call.
  Instead, isBranch=1 should be set.

  Also, mark EFLAGS as used by these instructions.

  llvm-svn: 281281

* Temporarily Revert "[MC] Defer asm errors to post-statement failure" as it's causing errors on the sanitizer bots.
  Eric Christopher, 2016-09-13 (1 file, -28/+50)

  This reverts commit r281249.

  llvm-svn: 281280

* [MC] Defer asm errors to post-statement failure
  Nirav Dave, 2016-09-12 (1 file, -50/+28)

  Allow errors to be deferred and emitted as part of clean up to simplify and
  shorten Assembly parser code. This will allow error messages to be emitted in
  helper functions and be modified by the caller, which has better context.

  As part of this, many minor cleanups to the Parser:
  * Unify parser cleanup on error
  * Add workaround for incorrect return values in ParseDirective instances
  * Tighten checks on error-signifying return values for parser functions and
    fix in-tree TargetParsers to be more consistent with the changes.
  * Fix AArch64 test cases checking for spurious error messages that are now
    fixed.

  These changes should be backwards compatible with current Target Parsers so
  long as the error statuses are correctly returned in the appropriate
  functions.

  Reviewers: rnk, majnemer

  Subscribers: aemerson, jyknight, llvm-commits

  Differential Revision: https://reviews.llvm.org/D24047

  llvm-svn: 281249

* [X86] Copy imp-uses when folding tailcall into conditional branch.
  Ahmed Bougacha, 2016-09-12 (1 file, -1/+1)

  r280832 added 32-bit support for emitting conditional tail-calls, but dropped
  imp-used parameter registers. This went unnoticed until r281113, which added
  64-bit support, as this is only exposed with parameter passing via registers.

  Don't drop the imp-used parameters.

  llvm-svn: 281223

* CodeGen: Give MachineBasicBlock::reverse_iterator a handle to the current MI
  Duncan P. N. Exon Smith, 2016-09-11 (1 file, -5/+4)

  Now that MachineBasicBlock::reverse_instr_iterator knows when it's at the end
  (since r281168 and r281170), implement MachineBasicBlock::reverse_iterator
  directly on top of an ilist::reverse_iterator by adding an IsReverse template
  parameter to MachineInstrBundleIterator. This replaces another
  hard-to-reason-about use of std::reverse_iterator on list iterators, matching
  the changes for ilist::reverse_iterator from r280032 (see the "out of scope"
  section at the end of that commit message).

  MachineBasicBlock::reverse_iterator now has a handle to the current node and
  has obvious invalidation semantics. r280032 has a more detailed explanation
  of how list-style reverse iterators (invalidated when the pointed-at node is
  deleted) are different from vector-style reverse iterators like
  std::reverse_iterator (invalidated on every operation). A great motivating
  example is this commit's changes to lib/CodeGen/DeadMachineInstructionElim.cpp.

  Note: If your out-of-tree backend deletes instructions while iterating on a
  MachineBasicBlock::reverse_iterator or converts between
  MachineBasicBlock::iterator and MachineBasicBlock::reverse_iterator, you'll
  need to update your code in similar ways to r280032. The following table
  might help:

    [Old]                              [New]
    delete &*RI, RE = end()            delete &*RI++
    RI->erase(), RE = end()            RI++->erase()
    reverse_iterator(I)                std::prev(I).getReverse()
    reverse_iterator(I)                ++I.getReverse()
    --reverse_iterator(I)              I.getReverse()
    reverse_iterator(std::next(I))     I.getReverse()
    RI.base()                          std::prev(RI).getReverse()
    RI.base()                          ++RI.getReverse()
    --RI.base()                        RI.getReverse()
    std::next(RI).base()               RI.getReverse()

  (For more details, have a look at r280032.)

  llvm-svn: 281172

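  A standard-library analogy (std::list, not LLVM's ilist) of the pitfall the
  change above removes: std::reverse_iterator stores only its base(), which
  points one element past the one it logically refers to, so erasing the
  referred-to element silently retargets the reverse iterator rather than
  invalidating it in an obvious way. The example below is illustrative and is
  not taken from the commit.

    #include <cassert>
    #include <iostream>
    #include <iterator>
    #include <list>

    int main() {
      std::list<int> L = {1, 2, 3, 4};

      auto RI = L.rbegin();                    // logically refers to 4...
      assert(&*RI == &*std::prev(RI.base()));  // ...via prev(base())
      assert(RI.base() == L.end());

      L.erase(std::prev(RI.base()));           // erase the 4 that RI "points at"

      // RI is still dereferenceable (its stored base(), L.end(), survived the
      // erase), but it now refers to a different element.
      std::cout << *RI << "\n";                // prints 3, not 4
      return 0;
    }
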
* [AVX512] Fix pattern for vgetmantsd and all other instructions that use same class.
  Igor Breger, 2016-09-11 (1 file, -8/+1)

  Fix memory operand size, remove unnecessary pattern.

  Differential Revision: http://reviews.llvm.org/D24443

  llvm-svn: 281164

* [AVX-512] Add VPTERNLOG to load folding tables.
  Craig Topper, 2016-09-11 (1 file, -0/+18)

  llvm-svn: 281156

* [X86] Make a helper method into a static function local to the cpp file.
  Craig Topper, 2016-09-11 (2 files, -11/+10)

  llvm-svn: 281154

* [CodeGen] Split out the notions of MI invariance and MI dereferenceability.
  Justin Lebar, 2016-09-11 (1 file, -1/+3)

  Summary:
  An IR load can be invariant, dereferenceable, neither, or both. But
  currently, MI's notion of invariance is IR-invariant && IR-dereferenceable.

  This patch splits up the notions of invariance and dereferenceability at the
  MI level. It's NFC, so adds some probably-unnecessary "is-dereferenceable"
  checks, which we can remove later if desired.

  Reviewers: chandlerc, tstellarAMD

  Subscribers: jholewinski, arsenm, nemanjai, llvm-commits

  Differential Revision: https://reviews.llvm.org/D23371

  llvm-svn: 281151

* We also need to pass swifterror in R12 under swiftcc not only under ccc
  Arnold Schwaighofer, 2016-09-10 (1 file, -0/+3)

  rdar://28190687

  llvm-svn: 281138

* [CodeGen] Rename MachineInstr::isInvariantLoad to isDereferenceableInvariantLoad. NFC
  Justin Lebar, 2016-09-10 (1 file, -1/+1)

  Summary:
  I want to separate out the notions of invariance and dereferenceability at
  the MI level, so that they correspond to the equivalent concepts at the IR
  level. (Currently an MI load is MI-invariant iff it's IR-invariant and
  IR-dereferenceable.)

  First step is renaming this function.

  Reviewers: chandlerc

  Subscribers: MatzeB, jfb, llvm-commits

  Differential Revision: https://reviews.llvm.org/D23370

  llvm-svn: 281125

* X86: Fold tail calls into conditional branches also for 64-bit (PR26302)
  Hans Wennborg, 2016-09-09 (4 files, -12/+40)

  This extends the optimization in r280832 to also work for 64-bit. The only
  quirk is that we can't do this for 64-bit Windows (yet).

  Differential Revision: https://reviews.llvm.org/D24423

  llvm-svn: 281113

* [X86][XOP] Fix VPERMIL2PD mask creation on 32-bit targets
  Simon Pilgrim, 2016-09-09 (1 file, -5/+5)

  Use getConstVector helper to correctly create v2i64/v4i64 constants on 32-bit
  targets.

  llvm-svn: 281105

* [AVX-512] Add VPCMP instructions to the load folding tables and make them commutable.
  Craig Topper, 2016-09-09 (2 files, -1/+57)

  llvm-svn: 281013

* [X86] Tighten up a comment which confused x64 ABI terminology.
  David Majnemer, 2016-09-09 (1 file, -3/+3)

  The x64 ABI has two major function types:
  - frame functions
  - leaf functions

  A frame function is one which requires a stack frame. A leaf function is one
  which does not.

  A frame function may or may not have a frame pointer. A leaf function does
  not require a stack frame and may never modify SP except via a return (RET,
  tail call via JMP).

  A frame function which has a frame pointer is permitted to use the LEA
  instruction in the epilogue; a frame function which doesn't establish a frame
  pointer must use ADD to adjust the stack pointer in the epilogue.

  Fun fact: leaf functions don't require a function table entry (associated
  PDATA/XDATA).

  llvm-svn: 281006

* Win64: Don't use REX prefix for direct tail calls
  Hans Wennborg, 2016-09-08 (5 files, -8/+4)

  The REX prefix should be used on indirect jmps, but not direct ones. For
  direct jumps, the unwinder looks at the offset to determine if it's inside
  the current function.

  Differential Revision: https://reviews.llvm.org/D24359

  llvm-svn: 281003

* Revert "[XRay] ARM 32-bit no-Thumb support in LLVM"
  Renato Golin, 2016-09-08 (3 files, -2/+33)

  And associated commits, as they broke the Thumb bots.

  This reverts commit r280935.
  This reverts commit r280891.
  This reverts commit r280888.

  llvm-svn: 280967

* [XRay] ARM 32-bit no-Thumb support in LLVM
  Dean Michael Berris, 2016-09-08 (3 files, -33/+2)

  This is a port of XRay to ARM 32-bit, without Thumb support yet. The XRay
  instrumentation support is moving up to AsmPrinter.

  This is one of 3 commits to different repositories of XRay ARM port. The
  other 2 are:
  1. https://reviews.llvm.org/D23932 (Clang test)
  2. https://reviews.llvm.org/D23933 (compiler-rt)

  Differential Revision: https://reviews.llvm.org/D23931

  llvm-svn: 280888

* Don't reduce the width of vector mul if the target doesn't support SSE2.
  Wei Mi, 2016-09-07 (1 file, -1/+2)

  The patch is to fix PR30298, which is caused by rL272694. The solution is to
  bail out if the target has no SSE2.

  Differential Revision: https://reviews.llvm.org/D24288

  llvm-svn: 280837

* X86: Fold tail calls into conditional branches where possible (PR26302)
  Hans Wennborg, 2016-09-07 (5 files, -17/+121)

  When branching to a block that immediately tail calls, it is possible to fold
  the call directly into the branch if the call is direct and there is no stack
  adjustment, saving one byte.

  Example:

    define void @f(i32 %x, i32 %y) {
    entry:
      %p = icmp eq i32 %x, %y
      br i1 %p, label %bb1, label %bb2
    bb1:
      tail call void @foo()
      ret void
    bb2:
      tail call void @bar()
      ret void
    }

  before:

    f:
      movl 4(%esp), %eax
      cmpl 8(%esp), %eax
      jne .LBB0_2
      jmp foo
    .LBB0_2:
      jmp bar

  after:

    f:
      movl 4(%esp), %eax
      cmpl 8(%esp), %eax
      jne bar
    .LBB0_1:
      jmp foo

  I don't expect any significant size savings from this (on a Clang bootstrap I
  saw 288 bytes), but it does make the code a little tighter.

  This patch only does 32-bit, but 64-bit would work similarly.

  Differential Revision: https://reviews.llvm.org/D24108

  llvm-svn: 280832

* [x86] move combines of 'select of 2 constants' to its own function; NFC
  Sanjay Patel, 2016-09-07 (1 file, -92/+103)

  There are missing folds here and possibly folds that could be made generic.

  llvm-svn: 280817

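  A hedged illustration (plain C++, not the X86 combine code, and not
  necessarily the exact set of folds in the moved function) of the kind of
  rewrite a "select of two constants" helper performs: the select can become
  arithmetic on the i1 condition, for example cond ? C1 : C2 equals
  C2 + zext(cond) * (C1 - C2), which collapses to a single add when
  C1 - C2 == 1. The helper name and values below are made up for the example.

    #include <cassert>
    #include <cstdint>

    // Rewrite (cond ? c1 : c2) as arithmetic on the zero-extended condition.
    static int32_t selectAsArith(bool cond, int32_t c1, int32_t c2) {
      return c2 + static_cast<int32_t>(cond) * (c1 - c2);
    }

    int main() {
      const int32_t pairs[][2] = {{5, 4}, {-7, 3}, {100, -100}, {1, 0}};
      for (const auto &p : pairs)
        for (bool cond : {false, true})
          assert(selectAsArith(cond, p[0], p[1]) == (cond ? p[0] : p[1]));
      return 0;
    }
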
* AVX512F: FMA intrinsic + FNEG - sequence optimization
  Elena Demikhovsky, 2016-09-07 (1 file, -90/+102)

  The previous commit (r280368 - https://reviews.llvm.org/D23313) does not
  cover AVX-512F, the KNL set.

  The FNEG(x) operation is lowered to
  (bitcast (vpxor (bitcast x), (bitcast constfp(0x80000000)))).
  It happens because FP XOR is not supported for 512-bit data types on KNL and
  we use integer XOR instead. I added a pattern match for the integer XOR.

  Differential Revision: https://reviews.llvm.org/D24221

  llvm-svn: 280785

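  A standalone sketch (plain C++, not the ISel pattern itself) of the identity
  the lowering above relies on: flipping the IEEE-754 sign bit with an integer
  XOR against 0x80000000 is exactly floating-point negation, which is why an
  integer XOR can stand in for an FP XOR. The sample values are illustrative
  only.

    #include <cassert>
    #include <cstdint>
    #include <cstring>

    static float fnegViaIntXor(float f) {
      uint32_t bits;
      std::memcpy(&bits, &f, sizeof(bits));   // bitcast float -> i32
      bits ^= 0x80000000u;                    // flip only the sign bit
      float out;
      std::memcpy(&out, &bits, sizeof(out));  // bitcast i32 -> float
      return out;
    }

    int main() {
      const float samples[] = {0.0f, -0.0f, 1.5f, -3.25f, 1e30f};
      for (float f : samples) {
        float viaXor = fnegViaIntXor(f);
        float negated = -f;
        // Compare bit patterns so the +0.0f / -0.0f case is checked too.
        uint32_t a, b;
        std::memcpy(&a, &viaXor, sizeof(a));
        std::memcpy(&b, &negated, sizeof(b));
        assert(a == b);
      }
      return 0;
    }
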
* [X86] Add hasSideEffects=0 to some instructions.
  Craig Topper, 2016-09-07 (2 files, -3/+5)

  llvm-svn: 280782

* [AVX-512] Add support for commuting masked instructions in findCommutedOpIndices.
  Craig Topper, 2016-09-07 (1 file, -1/+23)

  The default implementation doesn't skip the mask input or the preserved
  input.

  llvm-svn: 280781

* [AVX-512] Fix masked VPERMI2PS isel when the index comes from a bitcast.
  Craig Topper, 2016-09-06 (3 files, -58/+39)

  We need to bitcast the index operand to a floating point type so that it
  matches the result type. If not then the passthru part of the DAG will be a
  bitcast from the index's original type to the destination type. This makes it
  very difficult to match. The other option would be to add 5 sets of patterns
  for every other possible type.

  llvm-svn: 280696

* [X86] Remove unused encoding from IntrinsicType enum.
  Craig Topper, 2016-09-06 (2 files, -4/+1)

  llvm-svn: 280694