bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Equally to NetBSD, Bitrig/ARM uses the Itanium-ABI.	Renato Golin	2015-02-27	1	-0/+1
\| \| \| \| \| \|	Patch by Patrick Wildt. llvm-svn: 230762
*	[mips][microMIPS] Change register class for GP register	Zoran Jovanovic	2015-02-27	1	-6/+11
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D7934 llvm-svn: 230760
*	R600/SI: Add missing mubuf instructions	Tom Stellard	2015-02-27	2	-9/+25
\| \| \| \|	llvm-svn: 230759
*	R600/SI: Consistently put soffset before the offset operand for mubuf ↵	Tom Stellard	2015-02-27	3	-20/+20
\| \| \| \| \| \| \| \|	instructions This matches the assembly syntax. llvm-svn: 230758
*	R600/SI: Add slc, glc, and tfe to non-atomic _ADDR64 instructions	Tom Stellard	2015-02-27	4	-17/+33
\| \| \| \|	llvm-svn: 230757
*	[x86] Run most of the rest of the shuffle combining over non-128-bit	Chandler Carruth	2015-02-27	1	-16/+11
\| \| \| \| \| \| \| \| \| \| \| \|	vectors. This lets us fix the rest of the v16 lowering problems when pshufb is clearly better. We might still be able to improve some of the lowerings by enabling the other combine-based rewriting to fire for non-128-bit vectors, but this at least should remove any regressions from using the fancy v16i16 lowering strategy. llvm-svn: 230753
*	[x86] Teach a bunch of the x86-specific shuffle combining to work with	Chandler Carruth	2015-02-27	1	-13/+28
\| \| \| \| \| \| \|	256-bit vectors as well as 128-bit vectors. Fixes some of the redundant shuffles for v16i16. llvm-svn: 230752
*	[x86] Make the v8i16 clever single-input shuffle lowering usable for	Chandler Carruth	2015-02-27	1	-15/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	repeated 128-bit lane shuffles of wider vector types and use it to lower 256-bit v16i16 vector shuffles where applicable. This should let us perfectly lowering the pattern of pshuflw and pshufhw even for AVX2 256-bit patterns. I've not added AVX-512 support, but it should be trivial for someone working on that to wire up. Note that currently this generates bad, long shuffle chains because we don't combine 256-bit target shuffles. The subsequent patches will fix that. llvm-svn: 230751
*	[mips] Remove redundant periods from -mattr=help descriptions for MIPS.	Toma Tabacu	2015-02-27	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Also fixes an infringement of the 80-column limit rule. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7910 llvm-svn: 230748
*	[x86] Make the single-input v8i16 lowering directly recurse rather than	Chandler Carruth	2015-02-27	1	-2/+2
\| \| \| \| \| \| \| \|	going back through the entire vector shuffle lowering. This is an important step to being able to re-use this logic. llvm-svn: 230743
*	[mips] Account for constant-zero operands in ADDE nodes.	Vasileios Kalintiris	2015-02-27	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We identify the cases where the operand to an ADDE node is a constant zero. In such cases, we can avoid generating an extra ADDu instruction disguised as an identity move alias (ie. addu $r, $r, 0 --> move $r, $r). Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7906 llvm-svn: 230742
*	Target/X86: Save Win64 non-volatile registers in a Win64 ABI function.	Charles Davis	2015-02-27	1	-1/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This change causes us to actually save non-volatile registers in a Win64 ABI function that calls a System V ABI function, and vice-versa. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7919 llvm-svn: 230714
*	Rewrite MachineOperand::print and MachineInstr::print to avoid	Eric Christopher	2015-02-27	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \|	uses of TM->getSubtargetImpl and propagate to all calls. This could be a debugging regression in places where we had a TargetMachine and/or MachineFunction but don't have it as part of the MachineInstr. Fixing this would require passing a MachineFunction/Function down through the print operator, but none of the existing uses in tree seem to do this. llvm-svn: 230710
*	getRegForInlineAsmConstraint wants to use TargetRegisterInfo for	Eric Christopher	2015-02-26	22	-66/+86
\| \| \| \| \| \| \| \| \|	a lookup, pass that in rather than use a naked call to getSubtargetImpl. This involved passing down and around either a TargetMachine or TargetRegisterInfo. Update all callers/definitions around the targets and SelectionDAG. llvm-svn: 230699
*	[x86] Fix PR22706 where we would incorrectly try lower a v32i8 dynamic	Chandler Carruth	2015-02-26	1	-13/+30
\| \| \| \| \| \| \| \| \| \| \| \| \|	blend as legal. We made the same mistake in two different places. Whenever we are custom lowering a v32i8 blend we need to check whether we are custom lowering it only for constant conditions that can be shuffled, or whether we actually have AVX2 and full dynamic blending support on bytes. Both are fixed, with comments added to make it clear what is going on and a new test case. llvm-svn: 230695
*	[x86] Restructure the comments and the conditions for handling	Chandler Carruth	2015-02-26	1	-13/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	dynamic blends. This makes it much more clear what is going on. The case we're handling is that of dynamic conditions, and we're bailing when the nature of the vector types and subtarget preclude lowering the dynamic condition vselect as an actual blend. No functionality changed here, but this will make a subsequent bug-fix to this code much more clear. llvm-svn: 230690
*	[x86] Re-order the combines of select in the X86 backend. This doesn't	Chandler Carruth	2015-02-26	1	-19/+19
\| \| \| \| \| \| \|	change functionality, but makes it more clear that the dynamic case and the shuffle case don't overlap in any interesting way. llvm-svn: 230689
*	[x86] Add an assert to catch if we ever try to blend a v32i8 without	Chandler Carruth	2015-02-26	1	-0/+3
\| \| \| \| \| \|	AVX2. llvm-svn: 230688
*	Don't sibcall between SysV and Win64 convention functions	Reid Kleckner	2015-02-26	1	-0/+6
\| \| \| \| \| \| \| \|	The shadow stack space expectations won't match. Fixes PR22709. llvm-svn: 230667
*	Fix justify error for small structures in varargs for MIPS64BE	Petar Jovanovic	2015-02-26	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There was a problem when passing structures as variable arguments. The structures smaller than 64 bit were not left justified on MIPS64 big endian. This is now fixed by shifting the value to make it left- justified when appropriate. This fixes the bug http://llvm.org/bugs/show_bug.cgi?id=21608 Patch by Aleksandar Beserminji. Differential Revision: http://reviews.llvm.org/D7881 llvm-svn: 230657
*	Use ".arch_extension" ARM directive to support hwdiv on krait	Sumanth Gundapaneni	2015-02-26	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \|	In case of "krait" CPU, asm printer doesn't emit any ".cpu" so the features bits are not computed. This patch lets the asm printer emit ".cpu cortex-a9" directive for krait and the hwdiv feature is enabled through ".arch_extension". In short, krait is treated as "cortex-a9" with hwdiv. We can not emit ".krait" as CPU since it is not supported bu GNU GAS yet llvm-svn: 230651
*	Use ".arch_extension" ARM directive to specify the additional CPU features	Sumanth Gundapaneni	2015-02-26	4	-0/+75
\| \| \| \| \| \| \| \| \| \| \|	This patch is in response to r223147 where the avaiable features are computed based on ".cpu" directive. This will work clean for the standard variants like cortex-a9. For custom variants which rely on standard cpu names for assembly, the additional features of a CPU should be propagated. This can be done via ".arch_extension" as long as the assembler supports it. The implementation for krait along with unit test will be submitted in next patch. llvm-svn: 230650
*	R600/SI: Remove M0 from DS assembly strings	Tom Stellard	2015-02-26	1	-8/+8
\| \| \| \| \| \|	This matches the assembly syntax for the proprietary compiler. llvm-svn: 230645
*	[X86][Haswell][SchedModel] Fix WriteMULm latency.	Michael Kuperstein	2015-02-26	1	-1/+1
\| \| \| \| \| \| \|	The latency for the WriteMULm class was set to 4, which is actually lower than the latency for WriteMULr (5). A better estimate would be 4 added to WriteMULr, that is, 9. llvm-svn: 230634
*	[x86] Sink the single-input v8i16 lowering code that is actually	Chandler Carruth	2015-02-26	1	-24/+26
\| \| \| \| \| \| \| \| \| \|	formulaic into the top v8i16 lowering routine. This makes the generalized lowering a completely general and single path lowering which will allow generalizing it in turn for multiple 128-bit lanes. llvm-svn: 230623
*	[x86] Remove a SimpleTy usage. No need for it here, we already have the	Chandler Carruth	2015-02-26	1	-2/+2
\| \| \| \| \| \|	MVT. llvm-svn: 230622
*	[x86] Make the vector shuffle helpers order the SDLoc and MVT arguments.	Chandler Carruth	2015-02-26	1	-27/+27
\| \| \| \| \| \|	This ordering matches that of DAG.getNode. llvm-svn: 230617
*	Pass /nologo to ml64 for quieter builds	Reid Kleckner	2015-02-26	1	-1/+1
\| \| \| \| \| \| \|	It still prints "Assembling path/to/X86CompilationCallback_Win64.asm", but linking does the same thing. llvm-svn: 230596
*	Remove a FIXME.	Eric Christopher	2015-02-26	1	-1/+0
\| \| \| \| \| \| \| \| \| \|	Explanation: This function is in TargetLowering because it uses RegClassForVT which would need to be moved to TargetRegisterInfo and would necessitate moving isTypeLegal over as well - a massive change that would just require TargetLowering having a TargetRegisterInfo class member that it would use. llvm-svn: 230585
*	Remove an argument-less call to getSubtargetImpl from TargetLoweringBase.	Eric Christopher	2015-02-26	21	-31/+39
\| \| \| \| \| \| \| \| \|	This required plumbing a TargetRegisterInfo through computeRegisterProperties and into findRepresentativeClass which uses it for register class iteration. This required passing a subtarget into a few target specific initializations of TargetLowering. llvm-svn: 230583
*	[PowerPC] Make LDtocL and friends invariant loads	Hal Finkel	2015-02-25	4	-34/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	LDtocL, and other loads that roughly correspond to the TOC_ENTRY SDAG node, represent loads from the TOC, which is invariant. As a result, these loads can be hoisted out of loops, etc. In order to do this, we need to generate GOT-style MMOs for TOC_ENTRY, which requires treating it as a legitimate memory intrinsic node type. Once this is done, the MMO transfer is automatically handled for TableGen-driven instruction selection, and for nodes generated directly in PPCISelDAGToDAG, we need to transfer the MMOs manually. Also, we were not transferring MMOs associated with pre-increment loads, so do that too. Lastly, this fixes an exposed bug where R30 was not added as a defined operand of UpdateGBR. This problem was highlighted by an example (used to generate the test case) posted to llvmdev by Francois Pichet. llvm-svn: 230553
*	X86, Win64: Allow 'mov' to restore the stack pointer if we have a FP	David Majnemer	2015-02-25	1	-13/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The Win64 epilogue structure is very restrictive, it permits a very small number of opcodes and none of them are 'mov'. This means that given: mov %rbp, %rsp pop %rbp The mov isn't the epilogue, only the pop is. This is problematic unless a frame pointer is present in which case we are free to do whatever we'd like in the "body" of the function. If a frame pointer is present, unwinding will undo the prologue operations in reverse order regardless of the fact that we are at an instruction which is reseting the stack pointer. llvm-svn: 230543
*	[PowerPC] Cleanup unused target-specific SDAG nodes	Hal Finkel	2015-02-25	4	-35/+7
\| \| \| \| \| \| \| \|	We had somehow accumulated a few target-specific SDAG nodes dealing with PPC64 TOC access that were referenced only in TableGen patterns. The associated (pseudo-)instructions are used, but are being generated directly. NFC. llvm-svn: 230518
*	AArch64: Add debug message for large shift constants.	Matthias Braun	2015-02-25	1	-2/+8
\| \| \| \| \| \|	As requested in code review. llvm-svn: 230517
*	[MIPS]Multiple and add instructions for Mips are currently available in ↵	Vladimir Medic	2015-02-25	1	-14/+14
\| \| \| \| \| \|	mips32r2/mips64r2 and later but should also be available in mips4, mips5, and mips64. This patch fixes the requested features and updates the corresponding test files. llvm-svn: 230500
*	[X86][MMX] Reapply: Add MMX instructions to foldable tables	Bruno Cardoso Lopes	2015-02-25	1	-0/+84
\| \| \| \| \| \| \| \| \| \|	Reapply r230248. Teach the peephole optimizer to work with MMX instructions by adding entries into the foldable tables. This covers folding opportunities not handled during isel. llvm-svn: 230499
*	[X86][MMX] Prevent MMX_MOVD64rm folding	Bruno Cardoso Lopes	2015-02-25	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MMX_MOVD64rm zero-extends i32 load results into i64 registers. The peephole optimizer will try to fold it in other MMX foldable instructions, the wrong thing to do, since there's no MMX memory instruction that loads from i32 and does implict zero extension. Remove 'canFoldAsLoad' from MOVD64rm in order to prevent such folding. The current MMX tests already test this, but since there are no MMX instructions in the foldable tables yet, this did not trigger. This commit prepares the addition of those instructions. llvm-svn: 230498
*	Improve handling of stack accesses in Thumb-1	Renato Golin	2015-02-25	4	-12/+57
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Thumb-1 only allows SP-based LDR and STR to be word-sized, and SP-base LDR, STR, and ADD only allow offsets that are a multiple of 4. Make some changes to better make use of these instructions: * Use word loads for anyext byte and halfword loads from the stack. * Enforce 4-byte alignment on objects accessed in this way, to ensure that the offset is valid. * Do the same for objects whose frame index is used, in order to avoid having to use more than one ADD to generate the frame index. * Correct how many bits of offset we think AddrModeT1_s has. Patch by John Brawn. llvm-svn: 230496
*	Silencing a "result of 32-bit shift implicitly converted to 64 bits (was ↵	Aaron Ballman	2015-02-25	1	-1/+1
\| \| \| \| \| \|	64-bit shift intended?)" warning in MSVC; NFC. llvm-svn: 230489
*	Silencing a -Wsign-compare warning triggered in MSVC; NFC.	Aaron Ballman	2015-02-25	1	-1/+1
\| \| \| \|	llvm-svn: 230488
*	AVX-512: Gather and Scatter patterns	Elena Demikhovsky	2015-02-25	3	-44/+108
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Gather and scatter instructions additionally write to one of the source operands - mask register. In this case Gather has 2 destination values - the loaded value and the mask. Till now we did not support code gen pattern for gather - the instruction was generated from intrinsic only and machine node was hardcoded. When we introduce the masked_gather node, we need to select instruction automatically, in the standard way. I added a flag "hasTwoExplicitDefs" that allows to handle 2 destination operands. (Some code in the X86InstrFragmentsSIMD.td is commented out, just to split one big patch in many small patches) llvm-svn: 230471
*	[PowerPC] Add support for the QPX vector instruction set	Hal Finkel	2015-02-25	19	-67/+2675
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds support for the QPX vector instruction set, which is used by the enhanced A2 cores on the IBM BG/Q supercomputers. QPX vectors are 256 bytes wide, holding 4 double-precision floating-point values. Boolean values, modeled here as <4 x i1> are actually also represented as floating-point values (essentially { -1, 1 } for { false, true }). QPX shares many features with Altivec and VSX, but is distinct from both of them. One major difference is that, instead of adding completely-separate vector registers, QPX vector registers are extensions of the scalar floating-point registers (lane 0 is the corresponding scalar floating-point value). The operations supported on QPX vectors mirrors that supported on the scalar floating-point values (with some additional ones for permutations and logical/comparison operations). I've been maintaining this support out-of-tree, as part of the bgclang project, for several years. This is not the entire bgclang patch set, but is most of the subset that can be cleanly integrated into LLVM proper at this time. Adding this to the LLVM backend is part of my efforts to rebase bgclang to the current LLVM trunk, but is independently useful (especially for codes that use LLVM as a JIT in library form). The assembler/disassembler test coverage is complete. The CodeGen test coverage is not, but I've included some tests, and more will be added as follow-up work. llvm-svn: 230413
*	Rename UpdateRegAllocHint to match style guidelines.	Eric Christopher	2015-02-24	2	-2/+2
\| \| \| \|	llvm-svn: 230357
*	AArch64: Relax assert about large shift sizes.	Matthias Braun	2015-02-24	1	-3/+9
\| \| \| \| \| \| \| \| \| \|	The reason why these large shift sizes happen is because OpaqueConstants currently inhibit alot of DAG combining, but that has to be addressed in another commit (like the proposal in D6946). Differential Revision: http://reviews.llvm.org/D6940 llvm-svn: 230355
*	R600/SI: Remove isel mubuf legalization	Tom Stellard	2015-02-24	2	-130/+0
\| \| \| \| \| \| \|	We legalize mubuf instructions post-instruction selection, so this code is no longer needed. llvm-svn: 230352
*	ARM: treat [N x i32] and [N x i64] as AAPCS composite types	Tim Northover	2015-02-24	3	-61/+100
\| \| \| \| \| \| \| \| \| \| \|	The logic is almost there already, with our special homogeneous aggregate handling. Tweaking it like this allows front-ends to emit AAPCS compliant code without ever having to count registers or add discarded padding arguments. Only arrays of i32 and i64 are needed to model AAPCS rules, but I decided to apply the logic to all integer arrays for more consistency. llvm-svn: 230348
*	simplify control flow; NFC	Sanjay Patel	2015-02-24	1	-8/+9
\| \| \| \|	llvm-svn: 230342
*	[x32] Mark RBX as reserved when EBX is the base pointer.	Michael Kuperstein	2015-02-24	1	-1/+3
\| \| \| \| \| \|	This should have gone into r230334. llvm-svn: 230339
*	fix typo in comment; NFC	Sanjay Patel	2015-02-24	1	-1/+1
\| \| \| \|	llvm-svn: 230338
*	[x32] x32 should use ebx as the base pointer.	Michael Kuperstein	2015-02-24	1	-8/+9
\| \| \| \| \| \|	This fixes the original issue in PR22655, but not the secondary one. llvm-svn: 230334