bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	[X86] Remove masking from 512-bit VPERMIL intrinsics in preparation for ↵	Craig Topper	2016-12-11	1	-4/+5
\| \| \| \| \| \|	being able to constant fold them in InstCombineCalls like we do for 128/256-bit. llvm-svn: 289350
*	[X86] Remove masking from 512-bit PSHUFB intrinsics in preparation for being ↵	Craig Topper	2016-12-10	1	-2/+3
\| \| \| \| \| \|	able to constant fold it in InstCombineCalls like we do for 128/256-bit. llvm-svn: 289344
*	[AVX-512] Remove 128/256 masked vpermil intrinsics and autoupgrade to a ↵	Craig Topper	2016-12-10	1	-0/+22
\| \| \| \| \| \|	select around the unmasked avx1 intrinsics. llvm-svn: 289340
*	[X86][IR] Move the autoupgrading of store intrinsics out of the main nested ↵	Craig Topper	2016-12-10	1	-90/+102
\| \| \| \| \| \| \| \|	if/else chain. This should buy a little more time against the MSVC limit mentioned in PR31034. The handlers for stores all return at the end of their block so they can be picked off early. llvm-svn: 289339
*	[AVX-512] Remove intrinsics for valignd/q and autoupgrade them to native ↵	Craig Topper	2016-11-23	1	-11/+30
\| \| \| \| \| \|	shuffles. llvm-svn: 287744
*	[AVX-512] Replace masked 16-bit element variable shift intrinsics with new ↵	Craig Topper	2016-11-18	1	-16/+27
\| \| \| \| \| \| \| \| \| \|	unmasked versions and selects. The same thing was done to 32-bit and 64-bit element sizes previously. This will allow us to support these shuffles in InstCombineCalls along with the other variable shift intrinsics. llvm-svn: 287312
*	[X86][AVX512] Autoupgrade lossless i32/u32 to f64 conversion intrinsics with ↵	Simon Pilgrim	2016-11-16	1	-3/+14
\| \| \| \| \| \| \| \| \| \| \| \|	generic IR Both the (V)CVTDQ2PD (i32 to f64) and (V)CVTUDQ2PD (u32 to f64) conversion instructions are lossless and can be safely represented as generic SINT_TO_FP/UINT_TO_FP calls instead of x86 intrinsics without affecting final codegen. LLVM counterpart to D26686 Differential Revision: https://reviews.llvm.org/D26736 llvm-svn: 287108
*	[X86][AVX512] Removing llvm x86 intrinsics for _mm_mask_move_{ss\|sd} intrinsics.	Ayman Musa	2016-11-16	1	-0/+16
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26128 llvm-svn: 287087
*	[X86] Remove the scalar intrinsics for fadd/fsub/fdiv/fmul	Craig Topper	2016-11-16	1	-0/+44
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: These intrinsics have been unused for clang for a while. This patch removes them. We auto upgrade them to extractelements, a scalar operation and then an insertelement. This matches the sequence used by clang's intrinsic file. Reviewers: zvi, delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26660 llvm-svn: 287083
*	[X86] Add LLVM version number for each intrinsic handled by auto upgrade for ↵	Craig Topper	2016-11-15	1	-152/+158
\| \| \| \| \| \| \| \| \| \|	age tracking. One day we'd like to remove some of this autoupgrade support and it will be easier if we know how long some of it has been around. Differential Revision: https://reviews.llvm.org/D26321 llvm-svn: 286933
*	[AVX-512] Remove and autoupgrade masked dword/qword variable shift ↵	Craig Topper	2016-11-14	1	-24/+35
\| \| \| \| \| \|	intrinsics to the new unmasked versions and selects. llvm-svn: 286786
*	[X86][IR] Reduce the number of full string comparisons in the code that ↵	Craig Topper	2016-11-13	1	-156/+173
\| \| \| \| \| \|	autoupgrades masked shift intrinsics. llvm-svn: 286768
*	revert commit r286761, some builds failed on Win platforms	Igor Breger	2016-11-13	1	-17/+0
\| \| \| \|	llvm-svn: 286765
*	[X86][AVX512] Removing llvm x86 intrinsics for _mm_mask_move_{ss\|sd} intrinsics.	Ayman Musa	2016-11-13	1	-0/+17
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D26128 llvm-svn: 286761
*	[AVX-512] Remove the remaining masked shift by immediate or by single value. ↵	Craig Topper	2016-11-12	1	-55/+84
\| \| \| \| \| \| \| \|	Autoupgrade them to recently introduced unmasked versions and a select. After this I'll add the unmasked intrinsics to InstCombineCalls to finish making our handling of these types of shuffles consistent between AVX-512 and the legacy intrinsics. llvm-svn: 286725
*	Add a missing break statement. NFC.	George Burgess IV	2016-11-08	1	-0/+1
\| \| \| \|	llvm-svn: 286203
*	[AVX-512] Remove masked pmovzx/pmovsx builtins and autoupgrade them to ↵	Craig Topper	2016-11-07	1	-1/+9
\| \| \| \| \| \| \| \|	selects and native zext/sext. This mostly reuses earlier autoupgrade support for the sse and avx equivalents. Just needed to add the code to add the select. llvm-svn: 286092
*	[X86] Use StringRef::startswith to reduce a few compares in the intrinsic ↵	Craig Topper	2016-11-07	1	-12/+3
\| \| \| \| \| \|	autoupgrade code. llvm-svn: 286090
*	[AVX-512] Remove 128/256 masked pshufb intrinsics. Autoupgrade them to ↵	Craig Topper	2016-11-07	1	-0/+16
\| \| \| \| \| \|	legacy intrinsics and a select. llvm-svn: 286089
*	[AVX-512] Remove intrinsics for 128/256-bit masked variable shift. Instead ↵	Craig Topper	2016-11-06	1	-0/+30
\| \| \| \| \| \|	upgrade them to a select and the older AVX2 intrinsic. llvm-svn: 286073
*	[AVX-512] Remove intrinsics for 128/256-bit masked shift by immediate. ↵	Craig Topper	2016-11-06	1	-0/+48
\| \| \| \| \| \|	Instead upgrade them to a select and the older SSE/AVX2 intrinsic. llvm-svn: 286072
*	[AVX-512] Remove intrinsics for 128/256-bit masked shift by single element ↵	Craig Topper	2016-11-06	1	-0/+59
\| \| \| \| \| \|	in xmm. Instead upgrade them to a select and the older SSE/AVX2 intrinsic. llvm-svn: 286070
*	[AVX-512] Use an equality compare instead of StringRef::startswith in a few ↵	Craig Topper	2016-11-05	1	-32/+29
\| \| \| \| \| \|	places in auto upgrade that were looking for the complete intrinsic name anyway. llvm-svn: 286033
*	[X86] Remove broken support for autoupgrading llvm.x86.fma4.* intrinsics to ↵	Craig Topper	2016-11-05	1	-6/+0
\| \| \| \| \| \| \| \|	llvm.x86.fma.*. It currently fires an assert if you even try. Looking back, I don't think it ever worked because it only changed the name of the function object, but not the intrinsic ID stored in it. Given that, I think it can be removed since no one has noticed or complained in the past 4 years. llvm-svn: 286031
*	[AVX-512] Remove masked pmin/pmax intrinsics and autoupgrade to native IR.	Craig Topper	2016-10-24	1	-5/+16
\| \| \| \| \| \|	Clang patch to replace 512-bit vector and 64-bit element versions with native IR will follow. llvm-svn: 284955
*	Don't drop the llvm. prefix when renaming.	Rafael Espindola	2016-10-03	1	-14/+16
\| \| \| \| \| \| \| \| \| \|	If the llvm. prefix is dropped other parts of llvm don't see this as an intrinsic. This means that the number of regular symbols depends on the context the module is loaded into, which causes LTO to abort. Fixes PR30509. llvm-svn: 283117
*	Fix autoupgrade logic for Objective-C class properties module flag	Mehdi Amini	2016-09-16	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously we were issuing an error when linking a module containing the new Objective-C metadata structure for class properties with an "old" one. Now instead we downgrade the module flag so that the Objective-C runtime does not expect the new metadata structure. This is consistent with what ld64 is doing on binary files. Differential Revision: https://reviews.llvm.org/D24620 llvm-svn: 281685
*	Fix auto-upgrade of TBAA tags in Bitcode Reader	Mehdi Amini	2016-09-14	1	-17/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If TBAA is on an intrinsic and it gets upgraded, it'll delete the call instruction that we collected in a vector. Even if we were to use WeakVH, it'll drop the TBAA and we'll hit the assert on the upgrade path. r263673 gave a shot to make sure the TBAA upgrade happens before intrinsics upgrade, but failed to account for all cases. Instead of collecting instructions in a vector, this patch makes it just upgrade the TBAA on the fly, because metadata are always already loaded at this point. Differential Revision: https://reviews.llvm.org/D24533 llvm-svn: 281549
*	[X86] Remove masked shufpd/shufps intrinsics and autoupgrade to native ↵	Craig Topper	2016-09-13	1	-0/+26
\| \| \| \| \| \|	vector shuffles. They were removed from clang previously but accidentally left in the backend. llvm-svn: 281300
*	[AVX-512] Remove 128-bit and 256-bit masked floating point add/sub/mul/div ↵	Craig Topper	2016-09-04	1	-0/+44
\| \| \| \| \| \|	intrinsics and upgrade to native IR. llvm-svn: 280633
*	[AVX-512] Remove masked integer add/sub/mull intrinsics and upgrade to ↵	Craig Topper	2016-09-04	1	-0/+15
\| \| \| \| \| \|	native IR. llvm-svn: 280611
*	[X86] Combine some of the strings in autoupgrade code.	Craig Topper	2016-09-03	1	-35/+7
\| \| \| \|	llvm-svn: 280603
*	[AVX-512] Remove floating point logical operation intrinsics and replace ↵	Craig Topper	2016-09-02	1	-0/+37
\| \| \| \| \| \|	them with native IR. llvm-svn: 280466
*	Revert "Revert "Invariant start/end intrinsics overloaded for address space""	Mehdi Amini	2016-08-13	1	-1/+27
\| \| \| \| \| \|	This reverts commit 32fc6488e48eafc0ca1bac1bd9cbf0008224d530. llvm-svn: 278609
*	Revert "Invariant start/end intrinsics overloaded for address space"	Mehdi Amini	2016-08-13	1	-27/+1
\| \| \| \| \| \|	This reverts commit r276447. llvm-svn: 278608
*	Use range algorithms instead of unpacking begin/end	David Majnemer	2016-08-11	1	-1/+1
\| \| \| \| \| \|	No functionality change is intended. llvm-svn: 278417
*	[x86] Fix a bug in the auto-upgrade from r276416 where we failed to give	Chandler Carruth	2016-08-10	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a sufficiently low alignment for the IR load created. There is no test case because we don't have any test cases for the IR produced by the autoupgrade, only the x86 assembly, and it happens that the x86 assembly for this intrinsic as it is tested in the autoupgrade path just happens to not produce a separate load instruction where we might have observed the alignment. I'm going to follow up on the original commit to suggest getting IR-level testing in addition to the asm level testing here so that we can see and test these kinds of issues. We might never get an x86 instruction out with an alignment constraint, but we could still miscompile code by folding against the alignment marked on (or inferred for in this case) the load. llvm-svn: 278203
*	Invariant start/end intrinsics overloaded for address space	Anna Thomas	2016-07-22	1	-1/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any address space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: apilipenko, reames Subscribers: llvm-commits llvm-svn: 276447
*	[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128 ↵	Simon Pilgrim	2016-07-22	1	-7/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(reapplied) As reported on PR26235, we don't currently make use of the VBROADCASTF128/VBROADCASTI128 instructions (or the AVX512 equivalents) to load+splat a 128-bit vector to both lanes of a 256-bit vector. This patch enables lowering from subvector insertion/concatenation patterns and auto-upgrades the llvm.x86.avx.vbroadcastf128.pd.256 / llvm.x86.avx.vbroadcastf128.ps.256 intrinsics to match. We could possibly investigate using VBROADCASTF128/VBROADCASTI128 to load repeated constants as well (similar to how we already do for scalar broadcasts). Reapplied with fix for PR28657 - removed intrinsic definitions (clang companion patch to be submitted shortly). Differential Revision: https://reviews.llvm.org/D22460 llvm-svn: 276416
*	Revert "[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128"	Benjamin Kramer	2016-07-22	1	-14/+7
\| \| \| \| \| \| \| \|	It caused PR28657. This reverts commit r276281. llvm-svn: 276405
*	Revert "Invariant start/end intrinsics overloaded for address space"	Anna Thomas	2016-07-21	1	-27/+1
\| \| \| \| \| \|	This reverts commit r276316. llvm-svn: 276320
*	Invariant start/end intrinsics overloaded for address space	Anna Thomas	2016-07-21	1	-1/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The llvm.invariant.start and llvm.invariant.end intrinsics currently support specifying invariant memory objects only in the default address space. With this change, these intrinsics are overloaded for any address space for memory objects and we can use these llvm invariant intrinsics in non-default address spaces. Example: llvm.invariant.start.p1i8(i64 4, i8 addrspace(1)* %ptr) This overloaded intrinsic is needed for representing final or invariant memory in managed languages. Reviewers: tstellarAMD, reames, apilipenko Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D22519 llvm-svn: 276316
*	[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128	Simon Pilgrim	2016-07-21	1	-7/+14
\| \| \| \| \| \| \| \| \| \| \| \|	As reported on PR26235, we don't currently make use of the VBROADCASTF128/VBROADCASTI128 instructions (or the AVX512 equivalents) to load+splat a 128-bit vector to both lanes of a 256-bit vector. This patch enables lowering from subvector insertion/concatenation patterns and auto-upgrades the llvm.x86.avx.vbroadcastf128.pd.256 / llvm.x86.avx.vbroadcastf128.ps.256 intrinsics to match. We could possibly investigate using VBROADCASTF128/VBROADCASTI128 to load repeated constants as well (similar to how we already do for scalar broadcasts). Differential Revision: https://reviews.llvm.org/D22460 llvm-svn: 276281
*	[X86][SSE] Reimplement SSE fp2si conversion intrinsics instead of using ↵	Simon Pilgrim	2016-07-19	1	-8/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	generic IR D20859 and D20860 attempted to replace the SSE (V)CVTTPS2DQ and VCVTTPD2DQ truncating conversions with generic IR instead. It turns out that the behaviour of these intrinsics is different enough from generic IR that this will cause problems, INF/NAN/out of range values are guaranteed to result in a 0x80000000 value - which plays havoc with constant folding which converts them to either zero or UNDEF. This is also an issue with the scalar implementations (which were already generic IR and what I was trying to match). This patch changes both scalar and packed versions back to using x86-specific builtins. It also deals with the other scalar conversion cases that are runtime rounding mode dependent and can have similar issues with constant folding. A companion clang patch is at D22105 Differential Revision: https://reviews.llvm.org/D22106 llvm-svn: 275981
*	[AVX512] Remove masked logic op intrinsics and autoupgrade them to native IR.	Craig Topper	2016-07-12	1	-0/+21
\| \| \| \|	llvm-svn: 275155
*	[X86,IR] Remove unnecessary or unused LLVMContext parameter from some of the ↵	Craig Topper	2016-07-12	1	-17/+16
\| \| \| \| \| \|	X86 intrinsic upgrade functions. llvm-svn: 275138
*	[X86] Remove and autoupgrade 512-bit non-temporal store intrinsics.	Craig Topper	2016-07-09	1	-2/+6
\| \| \| \|	llvm-svn: 274966
*	Move setName after accessing Name	Eric Liu	2016-07-08	1	-5/+2
\| \| \| \|	llvm-svn: 274862
*	Make a std::string copy of StringRef Name so that it remains valid when the ↵	Eric Liu	2016-07-08	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	original Name is overridden. Summary: lib/IR/AutoUpgrade.cpp:348 and lib/IR/AutoUpgrade.cpp:350 upset sanitizer. Reviewers: bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D22140 llvm-svn: 274861
*	[AVX512] Remove and autoupgrade a duplicate set of 512-bit masked shift ↵	Craig Topper	2016-07-08	1	-1/+24
\| \| \| \| \| \| \| \|	intrinsics. I'm not sure if clang ever used these builtin names or not. llvm-svn: 274827