bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[x86-64] allow mfence even with -mno-sse (PR23203)	Sanjay Patel	2016-02-13	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	As shown in: https://llvm.org/bugs/show_bug.cgi?id=23203 ...we currently die because lowering believes that mfence is allowed without SSE2 on x86-64, but the instruction def doesn't know that. I don't know if allowing mfence without SSE is right, but if not, at least now it's consistently wrong. :) Differential Revision: http://reviews.llvm.org/D17219 llvm-svn: 260828
*	[TableGen] Fix sort order of asm operand classes	Oliver Stannard	2016-01-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a fix for https://llvm.org/bugs/show_bug.cgi?id=22796. The previous implementation of ClassInfo::operator< allowed cycles of classes such that x < y < z < x, meaning that a list of them cannot be correctly sorted, and the sort order could differ with different standard libraries. The original implementation sorted classes by ValueName if they were otherwise equal. This isn't strictly necessary, but some backends seem to accidentally rely on it. If I reverse this comparison I get 8 test failures spread across the AArch64, Mips and X86 backends, so I have left it in until those backends can be fixed. There was one case in the X86 backend where the observable behaviour of the assembler is changed by this patch. This was because some of the memory asm operands were not marked as children of X86MemAsmOperand. Differential Revision: http://reviews.llvm.org/D16141 llvm-svn: 258677
*	Added Skylake client to X86 targets and features	Elena Demikhovsky	2016-01-24	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Changes in X86.td: I set features of Intel processors in incremental form: IVB = SNB + X HSW = IVB + X .. I added Skylake client processor and defined it's features FeatureADX was missing on KNL Added some new features to appropriate processors SMAP, IFMA, PREFETCHWT1, VMFUNC and others Differential Revision: http://reviews.llvm.org/D16357 llvm-svn: 258659
*	[AVX512] Adding VPERMB instruction	Michael Zuckerman	2016-01-19	1	-1/+2
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D16294 llvm-svn: 258144
*	[X86] Adding support for missing variations of X86 string related instructions	Marina Yatsina	2016-01-19	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The following are legal according to X86 spec: ins mem, DX outs DX, mem lods mem stos mem scas mem cmps mem, mem movs mem, mem Differential Revision: http://reviews.llvm.org/D14827 llvm-svn: 258132
*	[AVX512] adding AVXVBMI feature flag	Michael Zuckerman	2016-01-17	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	The feature flag is for VPERMB,VPERMI2B,VPERMT2B and VPMULTISHIFTQB instructions. More about the instruction can be found in: hattps://software.intel.com/sites/default/files/managed/07/b7/319433-023.pdf Differential Revision: http://reviews.llvm.org/D16190 llvm-svn: 258012
*	[X86] Use \t instead of space after mnemonics in a bunch InstAliases for ↵	Craig Topper	2016-01-08	1	-79/+79
\| \| \| \| \| \|	consistency. llvm-svn: 257148
*	[X86] Add hasSideEffects=0 and mayLoad=1 to MOVZX64* instructions. While ↵	Craig Topper	2016-01-07	1	-2/+2
\| \| \| \| \| \|	there remove a superfluous _Q from the instruction names. llvm-svn: 257032
*	[X86] STOSQ without a rep prefix doesn't read or write RCX.	Craig Topper	2016-01-07	1	-1/+1
\| \| \| \|	llvm-svn: 257030
*	[X86] Use PS instead of TB for instructions that have PD/XS/XD variations. ↵	Craig Topper	2016-01-06	1	-1/+2
\| \| \| \| \| \|	Use OpSize32 on an instruction that has an OpSize16 variant. llvm-svn: 256918
*	Revert "[X86] Use push-pop for materializing small constants under 'minsize'"	David Majnemer	2016-01-05	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \|	The red zone consists of 128 bytes beyond the stack pointer so that the allocation of objects in leaf functions doesn't require decrementing rsp. In r255656, we introduced an optimization that would cheaply materialize certain constants via push/pop. Push decrements the stack pointer and stores it's result at what is now the top of the stack. However, this means that using push/pop would encroach on the red zone. PR26023 gives an example where this corrupts an object in the red zone. llvm-svn: 256808
*	[X86] Add intrinsics for reading and writing to the flags register	David Majnemer	2016-01-01	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	LLVM's targets need to know if stack pointer adjustments occur after the prologue. This is needed to correctly determine if the red-zone is appropriate to use or if a frame pointer is required. Normally, LLVM can figure this out very precisely by reasoning about the contents of the MachineFunction. There is an interesting corner case: inline assembly. The vast majority of inline assembly which will perform a push or pop is done so to pair up with pushf or popf as appropriate. Unfortunately, this inline assembly doesn't mark the stack pointer as clobbered because, well, it isn't. The stack pointer is decremented and then immediately incremented. Because of this, LLVM was changed in r256456 to conservatively assume that inline assembly contain a sequence of stack operations. This is unfortunate because the vast majority of inline assembly will not end up manipulating the stack pointer in any way at all. Instead, let's provide a more principled solution: an intrinsic. FWIW, other compilers (MSVC and GCC among them) also provide this functionality as an intrinsic. llvm-svn: 256685
*	[X86] Add proper Uses/Defs/mayLoad flags for AAA/AAD/AAM/AAS/DAA/DAS/XLAT ↵	Craig Topper	2015-12-28	1	-6/+7
\| \| \| \| \| \|	instructions. llvm-svn: 256481
*	Implemented Support of IA interrupt and exception handlers:	Amjad Aboud	2015-12-21	1	-0/+2
\| \| \| \| \| \| \| \|	http://lists.llvm.org/pipermail/cfe-dev/2015-September/045171.html Differential Revision: http://reviews.llvm.org/D15567 llvm-svn: 256155
*	[X86] Use push-pop for materializing small constants under 'minsize'	Hans Wennborg	2015-12-17	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	Use the 3-byte (4 with REX prefix) push-pop sequence for materializing small constants. This is smaller than using a mov (5, 6 or 7 bytes depending on size and REX prefix), but it's likely to be slower, so only used for 'minsize'. This is a follow-up to r255656. Differential Revision: http://reviews.llvm.org/D15549 llvm-svn: 255936
*	[x86] adding PKU feature flag	Asaf Badouh	2015-12-15	1	-0/+1
\| \| \| \| \| \| \| \| \|	the feature flag is essential for RDPKRU and WRPKRU instruction more about the instruction can be found in the SDM rev 56, vol 2 from http://www.intel.com/sdm Differential Revision: http://reviews.llvm.org/D15491 llvm-svn: 255644
*	[X86] Part 2 to fix x86-64 fp128 calling convention.	Chih-Hung Hsieh	2015-12-14	1	-5/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Part 1 was submitted in http://reviews.llvm.org/D15134. Changes in this part: * X86RegisterInfo.td, X86RecognizableInstr.cpp: Add FR128 register class. * X86CallingConv.td: Pass f128 values in XMM registers or on stack. * X86InstrCompiler.td, X86InstrInfo.td, X86InstrSSE.td: Add instruction selection patterns for f128. * X86ISelLowering.cpp: When target has MMX registers, configure MVT::f128 in FR128RegClass, with TypeSoftenFloat action, and custom actions for some opcodes. Add missed cases of MVT::f128 in places that handle f32, f64, or vector types. Add TODO comment to support f128 type in inline assembly code. * SelectionDAGBuilder.cpp: Fix infinite loop when f128 type can have VT == TLI.getTypeToTransformTo(Ctx, VT). * Add unit tests for x86-64 fp128 type. Differential Revision: http://reviews.llvm.org/D11438 llvm-svn: 255558
*	VX-512: Fixed a bug in FP logic operation lowering	Elena Demikhovsky	2015-12-07	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	FP logic instructions are supported in DQ extension on AVX-512 target. I use integer operations instead. Added tests. I also enabled FABS in this patch in order to check ANDPS. The operations are FOR, FXOR, FAND, FANDN. The instructions, that supported for 512-bit vector under DQ are: VORPS/PD, VXORPS/PD, VANDPS/PD, FANDNPS/PD. Differential Revision: http://reviews.llvm.org/D15110 llvm-svn: 254913
*	[X86] Add support for loopz, loopnz for Intel syntax	Marina Yatsina	2015-12-06	1	-2/+2
\| \| \| \| \| \| \| \|	According to x86 spec, loopz and loopnz should be supported for Intel syntax, where loopz is equivalent to loope and loopnz is equivalent to loopne. Differential Revision: http://reviews.llvm.org/D15148 llvm-svn: 254877
*	X86: Don't emit SAHF/LAHF for 64-bit targets unless explicitly supported	Hans Wennborg	2015-12-04	1	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These instructions are not supported by all CPUs in 64-bit mode. Emitting them causes Chromium to crash on start-up for users with such chips. (GCC puts these instructions behind -msahf on 64-bit for the same reason.) This patch adds FeatureLAHFSAHF, enables it by default for 32-bit targets and modern CPUs, and changes X86InstrInfo::copyPhysReg back to the lowering from before r244503 when the instructions are not available. Differential Revision: http://reviews.llvm.org/D15240 llvm-svn: 254793
*	[X86] Add support for fcomip, fucomip for Intel syntax	Marina Yatsina	2015-12-03	1	-2/+2
\| \| \| \| \| \| \| \|	According to x86 spec, fcomip and fucomip should be supported for Intel syntax. Differential Revision: http://reviews.llvm.org/D15104 llvm-svn: 254595
*	[x86] translating "fp" (floating point) instructions from ↵	Michael Zuckerman	2015-11-12	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	{fadd,fdiv,fmul,fsub,fsubr,fdivr} to {faddp,fdivp,fmulp,fsubp,fsubrp,fdivrp} LLVM Missing the following instructions: fadd\fdiv\fmul\fsub\fsubr\fdivr. GAS and MS supporting this instruction and lowering them in to a faddp\fdivp\fmulp\fsubp\fsubrp\fdivrp instructions. Differential Revision: http://reviews.llvm.org/D14217 llvm-svn: 252908
*	[X86] Add AMD mwaitx, monitorx, and clzero instructions to the assembly ↵	Craig Topper	2015-10-21	1	-0/+26
\| \| \| \| \| \|	parser and disassembler. llvm-svn: 250911
*	[X86] Add fxsr feature flag for fxsave/fxrestore instructions.	Craig Topper	2015-10-16	1	-0/+1
\| \| \| \|	llvm-svn: 250497
*	function names should start with a lower case letter; NFC	Sanjay Patel	2015-10-13	1	-10/+10
\| \| \| \|	llvm-svn: 250174
*	[X86] Mark the AAD and AAM aliases as not valid in 64-bit mode.	Craig Topper	2015-10-13	1	-2/+2
\| \| \| \|	llvm-svn: 250148
*	[X86] Add XSAVE intrinsic family	Amjad Aboud	2015-10-12	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Add intrinsics for the XSAVE instructions (XSAVE/XSAVE64/XRSTOR/XRSTOR64) XSAVEOPT instructions (XSAVEOPT/XSAVEOPT64) XSAVEC instructions (XSAVEC/XSAVEC64) XSAVES instructions (XSAVES/XSAVES64/XRSTORS/XRSTORS64) Differential Revision: http://reviews.llvm.org/D13012 llvm-svn: 250029
*	[X86] Change the immediate for IN/OUT instructions to u8imm so the assembly ↵	Craig Topper	2015-10-12	1	-6/+6
\| \| \| \| \| \|	parser will check the size. llvm-svn: 250012
*	[X86] Add some instruction aliases to get the assembly parser table to favor ↵	Craig Topper	2015-10-12	1	-0/+31
\| \| \| \| \| \| \| \|	arithmetic instructions with 8-bit immediates over the forms that implicitly use the ax/eax/rax. This allows us to remove the explicit code for working around the existing priority llvm-svn: 250011
*	[X86] Simplify immediate range checking code.	Craig Topper	2015-10-11	1	-6/+6
\| \| \| \|	llvm-svn: 249979
*	[WinEH] Make funclet return instrs pseudo instrs	Reid Kleckner	2015-09-17	1	-4/+0
\| \| \| \| \| \| \| \| \|	This makes catchret look more like a branch, and less like a weird use of BlockAddress. It also lets us get away from llvm.x86.seh.restoreframe, which relies on the old parentfpoffset label arithmetic. llvm-svn: 247936
*	[WinEH] Add codegen support for cleanuppad and cleanupret	Reid Kleckner	2015-09-10	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	All of the complexity is in cleanupret, and it mostly follows the same codepaths as catchret, except it doesn't take a return value in RAX. This small example now compiles and executes successfully on win32: extern "C" int printf(const char *, ...) noexcept; struct Dtor { ~Dtor() { printf("~Dtor\n"); } }; void has_cleanup() { Dtor o; throw 42; } int main() { try { has_cleanup(); } catch (int) { printf("caught it\n"); } } Don't try to put the cleanup in the same function as the catch, or Bad Things will happen. llvm-svn: 247219
*	x32. Fixes a bug in i8mem_NOREX declaration.	Derek Schuff	2015-09-08	1	-5/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The old implementation assumed LP64 which is broken for x32. Specifically, the MOVE8rm_NOREX and MOVE8mr_NOREX, when selected, would cause a 'Cannot emit physreg copy instruction' error message to be reported. This patch also enable the h-register*ll tests for x32. Differential Revision: http://reviews.llvm.org/D12336 Patch by João Porto llvm-svn: 247058
*	[WinEH] Add some support for code generating catchpad	Reid Kleckner	2015-08-27	1	-0/+2
\| \| \| \| \| \| \|	We can now run 32-bit programs with empty catch bodies. The next step is to change PEI so that we get funclet prologues and epilogues. llvm-svn: 246235
*	[X86] Remove references to _ftol2	Michael Kuperstein	2015-08-25	1	-5/+0
\| \| \| \| \| \| \|	As of r245924, _ftol2 is no longer used for fptoui on MS platforms. Remove the dead code associated with it. llvm-svn: 245925
*	[X86] Allow merging of immediates within a basic block for code size savings	Michael Kuperstein	2015-08-11	1	-3/+37
\| \| \| \| \| \| \| \| \| \| \|	First step in preventing immediates that occur more than once within a single basic block from being pulled into their users, in order to prevent unnecessary large instruction encoding .Currently enabled only when optimizing for size. Patch by: zia.ansari@intel.com Differential Revision: http://reviews.llvm.org/D11363 llvm-svn: 244601
*	[X86] Add SAL mnemonics for Intel syntax	Marina Yatsina	2015-08-11	1	-0/+1
\| \| \| \| \| \| \| \|	SAL and SHL instructions perform the same operation Differential Revision: http://reviews.llvm.org/D11882 llvm-svn: 244588
*	[X86] Fix REPE, REPZ, REPNZ for intel syntax	Marina Yatsina	2015-08-11	1	-3/+3
\| \| \| \| \| \| \| \| \|	REPE, REPZ, REPNZ, REPNE should have mnemonics for Intel syntax as well. Currently using these instructions causes compilation errors for Intel syntax. Differential Revision: http://reviews.llvm.org/D11794 llvm-svn: 244584
*	[X86] Fix imul alias for intel syntax	Marina Yatsina	2015-08-11	1	-6/+6
\| \| \| \| \| \| \| \| \|	The "imul reg, imm" alias is not defined for intel syntax. In intel syntax there is no w/l/q suffix for the imul instruction. Differential Revision: http://reviews.llvm.org/D11887 llvm-svn: 244582
*	[X86] Allow load folding into PUSH instructions	Michael Kuperstein	2015-07-23	1	-5/+11
\| \| \| \| \| \| \| \| \| \|	Adds pushes to the folding tables. This also required a fix to the TD definition, since the memory forms of the push instructions did not have the right mayLoad/mayStore flags. Differential Revision: http://reviews.llvm.org/D11340 llvm-svn: 243010
*	AVX : Fix ISA disabling in case AVX512VL , some instructions should be ↵	Igor Breger	2015-07-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	disabled only if AVX512BW and AVX512VL present. Tests added. Differential Revision: http://reviews.llvm.org/D11414 llvm-svn: 242987
*	AVX : Fix ISA disabling in case AVX512VL , some instructions should be ↵	Igor Breger	2015-07-15	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	disabled only if AVX512BW present. Tests added. Differential Revision: http://reviews.llvm.org/D11122 llvm-svn: 242270
*	Rename llvm.frameescape and llvm.framerecover to localescape and localrecover	Reid Kleckner	2015-07-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Initially, these intrinsics seemed like part of a family of "frame" related intrinsics, but now I think that's more confusing than helpful. Initially, the LangRef specified that this would create a new kind of allocation that would be allocated at a fixed offset from the frame pointer (EBP/RBP). We ended up dropping that design, and leaving the stack frame layout alone. These intrinsics are really about sharing local stack allocations, not frame pointers. I intend to go further and add an `llvm.localaddress()` intrinsic that returns whatever register (EBP, ESI, ESP, RBX) is being used to address locals, which should not be confused with the frame pointer. Naming suggestions at this point are welcome, I'm happy to re-run sed. Reviewers: majnemer, nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11011 llvm-svn: 241633
*	[X86] Fix incorrect/inefficient pushw encodings for x86-64 targets	Michael Kuperstein	2015-07-05	1	-8/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Correctly support assembling "pushw $imm8" on x86-64 targets. Also some cleanup of the PUSH instructions (PUSH64i16 and PUSHi16 actually represent the same instruction) This fixes PR23996 Patch by: david.l.kreitzer@intel.com Differential Revision: http://reviews.llvm.org/D10878 llvm-svn: 241404
*	AVX-512: Added all SKX forms of GATHER instructions.	Elena Demikhovsky	2015-06-28	1	-1/+9
\| \| \| \| \| \| \|	Added intrinsics. Added encoding and tests. llvm-svn: 240905
*	X86-MPX: Implemented encoding for MPX instructions.	Elena Demikhovsky	2015-06-09	1	-0/+3
\| \| \| \| \| \|	Added encoding tests. llvm-svn: 239403
*	X86: Added MPX feature and bound registers.	Elena Demikhovsky	2015-06-03	1	-0/+1
\| \| \| \| \| \| \| \| \|	Intel® Memory Protection Extensions (Intel® MPX) is a new feature in Skylake. It is a part of KNL and SKX sets. It is also a part of Skylake client. I added definition of %bnd0 - %bnd3 registers, each register is a pair of 64-bit integers. llvm-svn: 238916
*	Masked gather and scatter - added DAGCombine visitors	Elena Demikhovsky	2015-04-30	1	-0/+2
\| \| \| \| \| \| \| \| \|	and AVX-512 instruction selection patterns. All other patches, including tests will follow. http://reviews.llvm.org/D7665 llvm-svn: 236211
*	Reapply r235977 "[DebugInfo] Add debug locations to constant SD nodes"	Sergey Dmitrouk	2015-04-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	[DebugInfo] Add debug locations to constant SD nodes This adds debug location to constant nodes of Selection DAG and updates all places that create constants to pass debug locations (see PR13269). Can't guarantee that all locations are correct, but in a lot of cases choice is obvious, so most of them should be. At least all tests pass. Tests for these changes do not cover everything, instead just check it for SDNodes, ARM and AArch64 where it's easy to get incorrect locations on constants. This is not complete fix as FastISel contains workaround for wrong debug locations, which drops locations from instructions on processing constants, but there isn't currently a way to use debug locations from constants there as llvm::Constant doesn't cache it (yet). Although this is a bit different issue, not directly related to these changes. Differential Revision: http://reviews.llvm.org/D9084 llvm-svn: 235989
*	Revert "[DebugInfo] Add debug locations to constant SD nodes"	Daniel Jasper	2015-04-28	1	-1/+1
\| \| \| \| \| \| \|	This breaks a test: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/23870 llvm-svn: 235987