bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[X86] Remove redundant bitcast patterns for 128/256-bit vectors. These only ↵	Craig Topper	2016-06-03	1	-64/+0
\| \| \| \| \| \|	differ from the SSE/AVX versions by the register class, but register class has no bearing on isel. llvm-svn: 271623
*	Revert "[WebAssembly] Emit type signatures for declared functions"	Derek Schuff	2016-06-02	3	-50/+10
\| \| \| \| \| \| \| \|	This reverts r271599, it broke the integration tests. More places than I expected had nontrival return types in imports, or else the check was wrong. llvm-svn: 271606
*	[WebAssembly] Emit type signatures for declared functions	Derek Schuff	2016-06-02	3	-10/+50
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Under emscripten, C code can take the address of a function implemented in Javascript (which is exposed via an import in wasm). Because imports do not have linear memory address in wasm, we need to generate a thunk to be the target of the indirect call; it call the import directly. To make this possible, LLVM needs to emit the type signatures for these functions, because they may not be called directly or referred to other than where the address is taken. This uses s new .s directive (.functype) which specifies the signature. Differential Revision: http://reviews.llvm.org/D20891 llvm-svn: 271599
*	AMDGPU: Handle flat in getMemOpBaseRegImmOfs	Matt Arsenault	2016-06-02	1	-0/+7
\| \| \| \| \| \| \|	It can still report the base register, and the uses give up when it fails. llvm-svn: 271575
*	transform obscured FP sign bit ops into a fabs/fneg using TLI hook	Sanjay Patel	2016-06-02	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is effectively a revert of: http://reviews.llvm.org/rL249702 - [InstCombine] transform masking off of an FP sign bit into a fabs() intrinsic call (PR24886) and: http://reviews.llvm.org/rL249701 - [ValueTracking] teach computeKnownBits that a fabs() clears sign bits and a reimplementation as a DAG combine for targets that have IEEE754-compliant fabs/fneg instructions. This is intended to resolve the objections raised on the dev list: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098154.html and: https://llvm.org/bugs/show_bug.cgi?id=24886#c4 In the interest of patch minimalism, I've only partly enabled AArch64. PowerPC, MIPS, x86 and others can enable later. Differential Revision: http://reviews.llvm.org/D19391 llvm-svn: 271573
*	AMDGPU: Cleanup load tests	Matt Arsenault	2016-06-02	1	-0/+14
\| \| \| \| \| \| \| \| \|	There are a lot of different kinds of loads to test for, and these were scattered around inconsistently with some redundancy. Try to comprehensively test all loads in a consistent way. llvm-svn: 271571
*	AMDGPU: Temporary fix for broken store combine	Matt Arsenault	2016-06-02	1	-0/+2
\| \| \| \|	llvm-svn: 271567
*	AMDGPU: Fix crashes on unknown processor name	Matt Arsenault	2016-06-02	4	-7/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the processor name failed to parse for amdgcn, the resulting output would have R600 ISA in it. If the processor name was missing or invalid for R600, the wavefront size would not be set and there would be crashes from missing itinerary data. Fixes crashes in future commit caused by dividing by the unset/0 wavefront size. llvm-svn: 271561
*	[X86] Define segment MI operands as regs instead of i8imm.	Ahmed Bougacha	2016-06-02	2	-11/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We've been pretending that segments are i8imm since the initial support (r68645), predating the addition of the SEGMENT_REG class (r81895). That happens to works, but is wrong, and inconsistent with how we print (e.g., X86ATTInstPrinter::printMemReference) and parse them (e.g., X86Operand::addMemOperands). This change shouldn't affect any tool users, but is visible to library users or out-of-tree tablegen backends: this causes MCOperandInfo for the segment op to have an RC instead of "unknown", and TII::getRegClass to actually return something. As the registers are reserved and no vregs of the class ever created, that shouldn't change anything. No test change; no suspicious getRegClass() in X86 and CodeGen. llvm-svn: 271559
*	AArch64: Do not test for CPUs, use SubtargetFeatures	Matthias Braun	2016-06-02	10	-115/+224
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Testing for specific CPUs has a number of problems, better use subtarget features: - When some tweak is added for a specific CPU it is often desirable for the next version of that CPU as well, yet we often forget to add it. - It is hard to keep track of checks scattered around the target code; Declaring all target specifics together with the CPU in the tablegen file is a clear representation. - Subtarget features can be tweaked from the command line. To discourage people from using CPU checks in the future I removed the isCortexXX(), isCyclone(), ... functions. I added an getProcFamily() function for exceptional circumstances but made it clear in the comment that usage is discouraged. Reformat feature list in AArch64.td to have 1 feature per line in alphabetical order to simplify merging and sorting for out of tree tweaks. No functional change intended. Differential Revision: http://reviews.llvm.org/D20762 llvm-svn: 271555
*	Only attempt to detect AVG if SSE2 is available	Dimitry Andric	2016-06-02	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: In PR29973 Sanjay Patel reported an assertion failure when a certain loop was optimized, for a target without SSE2 support. It turned out this was because of the AVG pattern detection introduced in rL253952. Prevent the assertion failure by bailing out early in `detectAVGPattern()`, if the target does not support SSE2. Also add a minimized test case. Reviewers: congh, eli.friedman, spatel Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D20905 llvm-svn: 271548
*	[PEI, AArch64] Use empty spaces in stack area for local stack slot allocation.	Geoff Berry	2016-06-02	3	-3/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: If the target requests it, use emptry spaces in the fixed and callee-save stack area to allocate local stack objects. AArch64: Change last callee-save reg stack object alignment instead of size to leave a gap to take advantage of above change. Reviewers: t.p.northover, qcolombet, MatzeB Subscribers: rengolin, mcrosier, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D20220 llvm-svn: 271527
*	[Hexagon] Expand COPY pseudo-instruction	Krzysztof Parzyszek	2016-06-02	1	-6/+11
\| \| \| \| \| \| \| \|	Handle it locally instead of having the target-independent pass deal with it. The generic pass does not preserve implicit uses, which may be necessary. llvm-svn: 271520
*	[RDF] Ignore implicit defs when resetting <kill> flags	Krzysztof Parzyszek	2016-06-02	1	-1/+5
\| \| \| \|	llvm-svn: 271519
*	[X86][SSE] Replace (V)CVTTPS2DQ and VCVTTPD2DQ truncating (round to zero) ↵	Simon Pilgrim	2016-06-02	1	-23/+8
\| \| \| \| \| \| \| \| \| \| \| \|	f32/f64 to i32 with generic IR (llvm) This patch removes the llvm intrinsics (V)CVTTPS2DQ and VCVTTPD2DQ truncation (round to zero) conversions and auto-upgrades to FP_TO_SINT calls instead. Note: I looked at updating CVTTPD2DQ as well but this still requires a lot more work to correctly lower. Differential Revision: http://reviews.llvm.org/D20860 llvm-svn: 271510
*	This adds support for Cortex-A73 as an available target.	Sjoerd Meijer	2016-06-02	3	-2/+12
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20865 llvm-svn: 271508
*	[AVX512] Add 512-bit load/stores to fast isel.	Craig Topper	2016-06-02	1	-0/+46
\| \| \| \|	llvm-svn: 271486
*	[X86] No need to use 256-bit VMOVNTPS for integer types when only AVX1 is ↵	Craig Topper	2016-06-02	1	-15/+1
\| \| \| \| \| \| \| \|	supported. VMOVNTDQ is available with AVX1. We were getting this right for v4i64 but not the other integer types. llvm-svn: 271482
*	[X86] Add AVX 256-bit load and stores to fast isel.	Craig Topper	2016-06-02	1	-9/+52
\| \| \| \| \| \| \| \|	I'm not sure why this was missing for so long. This also exposed that we were picking floating point 256-bit VMOVNTPS for some integer types in normal isel for AVX1 even though VMOVNTDQ is available. In practice it doesn't matter due to the execution dependency fix pass, but it required extra isel patterns. Fixing that in a follow up commit. llvm-svn: 271481
*	[X86] Use uint16_t for a couple arrays of instruction opcodes. NFC	Craig Topper	2016-06-02	1	-2/+2
\| \| \| \|	llvm-svn: 271480
*	[AVX512] Remove LOADA/LOADU/STOREA/STOREU intrinsic types now that they are ↵	Craig Topper	2016-06-02	2	-52/+3
\| \| \| \| \| \|	unused. llvm-svn: 271479
*	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load ↵	Craig Topper	2016-06-02	1	-30/+0
\| \| \| \| \| \| \| \|	intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271478
*	AMDGPU: Fix incorrectly setting kill flag when copying register tuples	Matt Arsenault	2016-06-02	1	-1/+1
\| \| \| \| \| \| \|	This fixes some verifier errors when trackLivenessAfterRegAlloc is enabled. llvm-svn: 271446
*	AMDGPU: SIDebuggerInsertNops preserves CFG	Matt Arsenault	2016-06-02	2	-0/+6
\| \| \| \| \| \| \|	This saves an additional run of the DominatorTree and MachineLoopInfo llvm-svn: 271444
*	Avoid a load for local functions.	Rafael Espindola	2016-06-01	1	-2/+7
\| \| \| \|	llvm-svn: 271437
*	[PPC64] Fix SUBFC8 Defs list	Keno Fischer	2016-06-01	2	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Fix PR27943 "Bad machine code: Using an undefined physical register". SUBFC8 implicitly defines the CR0 register, but this was omitted in the instruction definition. Patch by Jameson Nash <jameson@juliacomputing.com> Reviewers: hfinkel Differential Revision: http://reviews.llvm.org/D20802 llvm-svn: 271425
*	Adding back-end support to two bit scanning intrinsics	Michael Zuckerman	2016-06-01	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Adding LLVM back-end support to two intrinsics dealing with bit scan: _bit_scan_forward and _bit_scan_reverse. Their functionality is as described in Intel intrinsics guide: https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_forward&expand=371,370 https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_reverse&expand=371,370 Commit on behalf of Omer Paparo Bivas Differential Revision: http://reviews.llvm.org/D19915 llvm-svn: 271386
*	[ARM] Add additional matching for UBFX instructions	Oliver Stannard	2016-06-01	1	-0/+21
\| \| \| \| \| \| \| \| \| \| \|	This adds an additional matcher to select UBFX(..) from SRL(AND(..)) in ARMISelDAGToDAG to help with code size. Patch by David Green. Differential Revision: http://reviews.llvm.org/D20667 llvm-svn: 271384
*	[Sparc] Allow passing of empty structs.	Chris Dewhurst	2016-06-01	1	-11/+21
\| \| \| \| \| \| \| \|	Passing an empty struct as a function call argument is now supported. unit tests for various scenarios added. llvm-svn: 271374
*	Revert r271362 "[AVX512] Remove masked load intrinsics. Clang now emits ↵	Craig Topper	2016-06-01	1	-0/+30
\| \| \| \| \| \| \| \|	generic masked load intrinsics instead." Looks like something isn't quite right still. Also forgot to move the test cases to an autoupgrade test. llvm-svn: 271363
*	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load ↵	Craig Topper	2016-06-01	1	-30/+0
\| \| \| \| \| \| \| \|	intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271362
*	[X86]: Add a pattern that uses GR16_ABCD rather than GR32_ABCD to avoid ↵	Kevin B. Smith	2016-05-31	1	-0/+4
\| \| \| \| \| \| \| \|	falsely marking whole 32 bit register as live. Differential Revision: http://reviews.llvm.org/D20649 llvm-svn: 271341
*	ARM: Do not attempt to modify register class of physregs.	Matthias Braun	2016-05-31	1	-4/+9
\| \| \| \| \| \| \|	Physregs have no associated register class, do not attempt to modify it in Thumb2InstrInfo::storeRegToStackSlot()/loadFromStackSlot(). llvm-svn: 271339
*	Delete AArch64II::MO_CONSTPOOL.	Rafael Espindola	2016-05-31	5	-38/+4
\| \| \| \| \| \| \| \|	A constant pool holding the address of a variable in equivalent to a got entry. It produces exactly the same instruction sequence as a got use and unlike a got use this is not uniqued by the linker. llvm-svn: 271311
*	[mips] Enforce compact branch register restrictions	Simon Dardis	2016-05-31	2	-14/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Enforce compact branch register restrictions such as the use of the zero register, both operands being the same register. Emit clear error in such cases as the issue is subtle. For bovc and bnvc, silently fixup such cases when emitting objects directly, like LLVM started doing in rL269899. Reviewers: vkalintiris, dsanders Differential Review: http://reviews.llvm.org/D20475 llvm-svn: 271301
*	AMDGPU: Remove unused address space	Matt Arsenault	2016-05-31	2	-12/+10
\| \| \| \| \| \|	Also return a single StringRef instead of building a string. llvm-svn: 271296
*	Add a use of shouldAssumeDSOLocal to ARM.	Rafael Espindola	2016-05-31	1	-2/+6
\| \| \| \| \| \|	Now this code path knows about position independent executables. llvm-svn: 271290
*	[Hexagon] Disable expanding MUX instructions that define a subregister	Krzysztof Parzyszek	2016-05-31	1	-0/+5
\| \| \| \| \| \| \|	The code in HexagonExpandCondsets.cpp does not handle those cases at the moment. llvm-svn: 271281
*	Do not modify a std::vector while looping it.	Yaron Keren	2016-05-31	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	Introduced in r271244, this is probably undefined behaviour and asserts when compiled with Visual C++ debug mode. On further note, the loop is quadratic with regard to the number of successors since removeSuccessor is linear and could probably be modified to linear time. llvm-svn: 271278
*	[ARM] Add backend support for load/store intrinsics.	Ranjeet Singh	2016-05-31	2	-37/+39
\| \| \| \| \| \| \| \| \| \|	Added support to map intrinsics __builtin_arm_{ldc,ldcl,ldc2,ldc2l,stc,stcl,stc2,stc2l} to their ARM instructions. Differential Revision: http://reviews.llvm.org/D20564 llvm-svn: 271271
*	[X86][SSE] Add load-folding patterns for (V)CVTDQ2PD (PR27291)	Simon Pilgrim	2016-05-31	1	-0/+4
\| \| \| \| \| \|	Added patterns for (V)CVTDQ2PD -> 2f64 loading from a 64-bit source. llvm-svn: 271269
*	[mips] bnec/beqc register constraint fix	Simon Dardis	2016-05-31	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \|	beqc and bnec cannot have $rs == $rt. Inhibit compact branch creation if that would occur. Reviewers: vkalintiris, dsanders Differential Revision: http://reviews.llvm.org/D20624 llvm-svn: 271260
*	[AVX512] Fix intrinsic vcvtps2ph lowering.	Igor Breger	2016-05-31	2	-8/+11
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20788 llvm-svn: 271255
*	Fix intrinsic vbroadcast{i32\|f32}x2 lowering.	Igor Breger	2016-05-31	3	-37/+46
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20780 llvm-svn: 271254
*	[AVX512] Remove masked store intrinsics. Clang now emits generic masked ↵	Craig Topper	2016-05-31	1	-30/+0
\| \| \| \| \| \| \| \|	store intrinsics instead. The intrinsics will be autoupgraded to the same generic masked stores. llvm-svn: 271245
*	X86: permit using SjLj EH on x86 targets as an option	Saleem Abdulrasool	2016-05-31	3	-1/+277
\| \| \| \| \| \| \| \| \| \| \|	This adds support to the backed to actually support SjLj EH as an exception model. This is NOT the default model, and requires explicitly opting into it from the frontend. GCC supports this model and for MinGW can still be enabled via the `--using-sjlj-exceptions` options. Addresses PR27749! llvm-svn: 271244
*	[X86] Remove SSE/AVX unaligned store intrinsics as clang no longer uses ↵	Craig Topper	2016-05-30	1	-29/+0
\| \| \| \| \| \|	them. Auto upgrade to native unaligned store instructions. llvm-svn: 271236
*	Fix a crash when producing COFF.	Rafael Espindola	2016-05-30	1	-0/+2
\| \| \| \|	llvm-svn: 271229
*	[BPF] Remove exit-on-error from tests (PR27768, PR27769)	Diana Picus	2016-05-30	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \|	The exit-on-error flag is necessary to avoid some assertions/unreachables. We can get past them by creating a few dummy nodes. Fixes PR27768, PR27769. Differential Revision: http://reviews.llvm.org/D20726 llvm-svn: 271200
*	Move RelaxELFRel out to llvm-mc.	Rafael Espindola	2016-05-29	1	-6/+0
\| \| \| \|	llvm-svn: 271160