bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[X86] Add ISel patterns to select 'f32_to_f16' and 'f16_to_f32' dag nodes.	Andrea Di Biagio	2014-07-03	2	-0/+23
\| \| \| \| \| \| \| \| \| \|	This patch adds tablegen patterns to select F16C float-to-half-float conversion instructions from 'f32_to_f16' and 'f16_to_f32' dag nodes. If the target doesn't have F16C, then 'f32_to_f16' and 'f16_to_f32' are expanded into library calls. llvm-svn: 212293
*	[ARM] Implement ISB memory barrier intrinsic	Yi Kong	2014-07-03	2	-7/+8
\| \| \| \| \| \| \|	Adds support for __builtin_arm_isb. Also corrects DSB and ISB instructions modelling by adding has-side-effects property. llvm-svn: 212276
*	[x86] Fix crashes in lowering bitcast instructions with the widening	Chandler Carruth	2014-07-03	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	mode. This also runs the test in that mode which would reproduce the crash. What I love is that every single FIXME in the test is addressed by switching to widening. llvm-svn: 212254
*	[x86] Based on a long conversation between myself, Jim Grosbach, Hal	Chandler Carruth	2014-07-03	2	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Finkel, Eric Christopher, and a bunch of other people I'm probably forgetting (sorry), add an option to the x86 backend to widen vectors during type legalization rather than promote them. This still would promote vNi1 vectors to get the masks right, but would widen other vectors. A lot of experiments are piling up right now showing that widening should probably be the default legalization strategy outside of vNi1 cases, but it is very hard to test the rammifications of that and fix bugs in widening-based legalization without an option that enables it. I'll be checking in tests shortly that use this option to exercise cases where widening doesn't work well and hopefully we'll be able to switch fully to this soon. llvm-svn: 212249
*	Make these preprocessor directives match all of the others in the port.	Eric Christopher	2014-07-03	2	-4/+4
\| \| \| \|	llvm-svn: 212245
*	Remove dead code.	Eric Christopher	2014-07-03	1	-7/+0
\| \| \| \|	llvm-svn: 212244
*	[codegen,aarch64] Add a target hook to the code generator to control	Chandler Carruth	2014-07-03	6	-6/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vector type legalization strategies in a more fine grained manner, and change the legalization of several v1iN types and v1f32 to be widening rather than scalarization on AArch64. This fixes an assertion failure caused by scalarizing nodes like "v1i32 trunc v1i64". As v1i64 is legal it will fail to scalarize v1i32. This also provides a foundation for other targets to have more granular control over how vector types are legalized. Patch by Hao Liu, reviewed by Tim Northover. I'm committing it to allow some work to start taking place on top of this patch as it adds some really important hooks to the backend that I'd like to immediately start using. =] http://reviews.llvm.org/D4322 llvm-svn: 212242
*	Move subtarget dependent features into the subtarget from the target	Eric Christopher	2014-07-03	4	-96/+97
\| \| \| \| \| \| \|	machine. Includes a fix for a subtarget initialization for hard floating point on mips16. llvm-svn: 212240
*	So that we can include frame lowering in the subtarget, remove include	Eric Christopher	2014-07-02	5	-5/+10
\| \| \| \| \| \| \|	circular dependency with the subtarget by inlining accessor methods and outlining a routine. llvm-svn: 212236
*	So that we can include target lowering in the subtarget, remove include	Eric Christopher	2014-07-02	4	-64/+80
\| \| \| \| \| \| \|	circular dependency with the subtarget by inlining accessor methods and outlining a routine. llvm-svn: 212234
*	Fix typos.	Eric Christopher	2014-07-02	1	-1/+1
\| \| \| \|	llvm-svn: 212228
*	Move the data layout and selection dag info from the mips target machine	Eric Christopher	2014-07-02	4	-42/+46
\| \| \| \| \| \|	down to the subtarget. llvm-svn: 212224
*	[X86] AVX512: Allow writemask argument in vpermt* intrinsics	Adam Nemet	2014-07-02	1	-5/+15
\| \| \| \|	llvm-svn: 212223
*	[X86] AVX512: Generate Pat<>'s for the vpermt2* intrinsics via multiclass	Adam Nemet	2014-07-02	1	-19/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	This new multiclass, avx512_perm_table_3src derives from the current one and provides the Pat<>. The next patch will add another Pat<> that uses the writemask. Note that I dropped the type annotation from the intrinsic call, i.e.: (v16f32 VR512:$src1) -> R512:$src1. I think that this should be fine (at least many intrinsic calls don't provide them) and it greatly reduces the number of template arguments. llvm-svn: 212222
*	[X86] AVX512: Add writemask variants for vperm2	Adam Nemet	2014-07-02	1	-14/+68
\| \| \| \| \| \| \| \| \|	This includes assembler and codegen support (see the new tests in avx512-encodings.s and avx512-shuffle.ll). <rdar://problem/17492620> llvm-svn: 212221
*	R600: Add a comment that llvm.AMDGPU.trunc is a legacy intrinsic	Tom Stellard	2014-07-02	1	-1/+1
\| \| \| \|	llvm-svn: 212218
*	R600/SI: Use a ComplexPattern for ADDR64 addressing of MUBUF loads	Tom Stellard	2014-07-02	2	-37/+35
\| \| \| \|	llvm-svn: 212217
*	R600: Promote i64 loads to v2i32	Tom Stellard	2014-07-02	3	-7/+12
\| \| \| \|	llvm-svn: 212216
*	R600/SI: Adjsut SGPR live ranges before register allocation	Tom Stellard	2014-07-02	4	-0/+118
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SGPRs are written by instructions that sometimes will ignore control flow, which means if you have code like: if (VGPR0) { SGPR0 = S_MOV_B32 0 } else { SGPR0 = S_MOV_B32 1 } The value of SGPR0 will 1 no matter what the condition is. In order to deal with this situation correctly, we need to view the program as if it were a single basic block when we calculate the live ranges for the SGPRs. They way we actually update the live range is by iterating over all of the segments in each LiveRange object and setting the end of each segment equal to the start of the next segment. So a live range like: [3888r,9312r:0)[10032B,10384B:0) 0@3888r will become: [3888r,10032B:0)[10032B,10384B:0) 0@3888r This change will allow us to use SALU instructions within branches. llvm-svn: 212215
*	R600/SI: Add verifier check for immediates in register operands.	Tom Stellard	2014-07-02	4	-2/+33
\| \| \| \|	llvm-svn: 212214
*	[RegAllocGreedy] Provide a subtarget hook to disable the local reassignment	Quentin Colombet	2014-07-02	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \|	heuristic. By default, no functionality change. This is a follow-up of r212099. This hook provides a finer grain to control the optimization. <rdar://problem/17444599> llvm-svn: 212204
*	AArch64: Re-enable AArch64AddressTypePromotion	Duncan P. N. Exon Smith	2014-07-02	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \|	This reverts commits r212189 and r212190. While this pass was accidentally disabled (until r212073), r205437 slipped in a use of `auto` that should have been `auto&`. This fixes PR20188. llvm-svn: 212201
*	AArch64: Remove unnecessary parens	Duncan P. N. Exon Smith	2014-07-02	1	-1/+1
\| \| \| \|	llvm-svn: 212199
*	R600: Fix crashes when an illegal type load or store is not handled.	Matt Arsenault	2014-07-02	1	-2/+6
\| \| \| \| \| \| \|	I don't think anything hits this now, but will be exposed in future patches. llvm-svn: 212197
*	AArch64: Merge isa with dyn_cast	Duncan P. N. Exon Smith	2014-07-02	1	-2/+1
\| \| \| \|	llvm-svn: 212194
*	AArch64: Temporarily disable AArch64AddressTypePromotion	Duncan P. N. Exon Smith	2014-07-02	1	-2/+0
\| \| \| \| \| \| \|	Temporarily disable AArch64AddressTypePromotion, which was effectively re-enabled in r212073 and r212075, while I look into PR20188. llvm-svn: 212189
*	X86: When combining shuffles just remove shuffles that are completely redundant.	Benjamin Kramer	2014-07-02	1	-0/+7
\| \| \| \| \| \| \|	CombineTo doesn't allow replacing a node with itself so this would crash if the combined shuffle is the same as the input shuffle. llvm-svn: 212181
*	AVX-512: dec/inc instructions are slow on KNL	Elena Demikhovsky	2014-07-02	1	-1/+2
\| \| \| \| \| \| \|	After Alexey Volkov, I'm adding the same property for KNL, that prefers ADD/SUB instead of INC/DEC. Added a test. llvm-svn: 212178
*	aarch64: support target-specific .req assembler directive	Saleem Abdulrasool	2014-07-02	1	-3/+96
\| \| \| \| \| \| \| \| \| \|	Based on the support for .req on ARM. The aarch64 variant has to keep track if the alias register was a vector register (v0-31) or a general purpose or VFP/Advanced SIMD ([bhsdq]0-31) register. Patch by Janne Grunau! llvm-svn: 212161
*	Break out subtarget initialization that dependent variables need into	Eric Christopher	2014-07-02	2	-11/+17
\| \| \| \| \| \|	a separate function and clean up calling convention for helper function. llvm-svn: 212153
*	Unify these two lines.	Eric Christopher	2014-07-02	1	-2/+1
\| \| \| \|	llvm-svn: 212152
*	Move MipsJITInfo to the subtarget rather than the target machine.	Eric Christopher	2014-07-02	4	-5/+10
\| \| \| \|	llvm-svn: 212151
*	Remove unnecessary include.	Eric Christopher	2014-07-02	1	-1/+0
\| \| \| \|	llvm-svn: 212150
*	Remove the cached InstrItineraryData on the TargetMachine, it's unnecessary.	Eric Christopher	2014-07-02	2	-15/+13
\| \| \| \|	llvm-svn: 212149
*	Move the subtarget dependent features from XCoreTargetMachine	Eric Christopher	2014-07-02	6	-37/+42
\| \| \| \| \| \|	down to the subtarget. llvm-svn: 212147
*	Make XCoreSelectionDAGInfo take a DataLayout since it only needs	Eric Christopher	2014-07-02	3	-4/+4
\| \| \| \| \| \|	that information. llvm-svn: 212146
*	X86: remove atomic instructions after we've iterated through them.	Tim Northover	2014-07-01	1	-3/+6
\| \| \| \| \| \| \| \| \|	Otherwise they get freed and the implicit "isa<XYZ>" tests following turn out badly (at least under sanitizers). Also corrects the ordering of unordered atomic stores. llvm-svn: 212136
*	[DAG] Pass the argument list to the CallLoweringInfo via move semantics. NFCI.	Juergen Ributzka	2014-07-01	11	-16/+19
\| \| \| \| \| \| \| \|	The argument list vector is never used after it has been passed to the CallLoweringInfo and moving it to the CallLoweringInfo is cleaner and pretty much as cheap as keeping a pointer to it. llvm-svn: 212135
*	X86: delegate expanding atomic libcalls to generic code.	Tim Northover	2014-07-01	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On targets without cmpxchg16b or cmpxchg8b, the borderline atomic operations were slipping through the gaps. X86AtomicExpand.cpp was delegating to ISelLowering. Generic ISelLowering was delegating to X86ISelLowering and X86ISelLowering was asserting. The correct behaviour is to expand to a libcall, preferably in generic ISelLowering. This can be achieved by X86ISelLowering deciding it doesn't want the faff after all. llvm-svn: 212134
*	Move the subtarget dependent features from SystemZTargetMachine	Eric Christopher	2014-07-01	6	-38/+55
\| \| \| \| \| \|	down to the subtarget. Add an initialization routine to assist. llvm-svn: 212124
*	Remove the use and initialization of the target machine and subtarget	Eric Christopher	2014-07-01	3	-29/+19
\| \| \| \| \| \|	from SystemZFrameLowering. llvm-svn: 212123
*	AArch64: fix comment typo	Tim Northover	2014-07-01	1	-1/+1
\| \| \| \|	llvm-svn: 212120
*	X86: expand atomics in IR instead of as MachineInstrs.	Tim Northover	2014-07-01	9	-976/+294
\| \| \| \| \| \| \| \| \| \| \| \|	The logic for expanding atomics that aren't natively supported in terms of cmpxchg loops is much simpler to express at the IR level. It also allows the normal optimisations and CodeGen improvements to help out with atomics, instead of using a limited set of possible instructions.. rdar://problem/13496295 llvm-svn: 212119
*	[X86] AVX512: Allow writemasks with vpcmp	Adam Nemet	2014-07-01	1	-0/+10
\| \| \| \| \| \| \| \| \|	For now I only updated the _alt variants. The main variants are used by codegen and that will need a bit more work to trigger. <rdar://problem/17492620> llvm-svn: 212114
*	[X86] AVX512: Factor generating the AsmString into avx512_icmp_cc	Adam Nemet	2014-07-01	1	-25/+24
\| \| \| \| \| \| \| \| \|	Adding a writemask variant would require a third asm string to be passed to the template. Generate the AsmString in the template instead. No change in X86.td.expanded. llvm-svn: 212113
*	Fix .seh_stackalloc 0	Reid Kleckner	2014-07-01	1	-4/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	seh_stackalloc 0 is not representable in Win64 SEH info, so emitting it is a bug. Reviewers: rnk Differential Revision: http://reviews.llvm.org/D4334 Patch by Vadim Chugunov! llvm-svn: 212081
*	AArch64: Follow-up to r212073	Duncan P. N. Exon Smith	2014-07-01	1	-4/+4
\| \| \| \| \| \| \| \|	In r212073 I missed a call of `use_begin()` that assumed the wrong semantics. It's not clear to me at all what this code does without the fix, so I'm not sure how to write a testcase. llvm-svn: 212075
*	AArch64: Actually do address type promotion	Duncan P. N. Exon Smith	2014-06-30	1	-3/+3
\| \| \| \| \| \| \| \|	AArch64AddressTypePromotion was doing nothing because it was using the old semantics of `Use` and `uses()`, when it really wanted to get at the `users()`. llvm-svn: 212073
*	Fix 'platform-specific' hyphenations	Alp Toker	2014-06-30	2	-3/+3
\| \| \| \|	llvm-svn: 212056
*	R600: Move mul combine to separate function	Matt Arsenault	2014-06-30	2	-28/+35
\| \| \| \|	llvm-svn: 212052