bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	This patch changes the ownership of TLOF from TargetLoweringBase to ↵	Aditya Nandakumar	2014-11-13	1	-1/+1
\| \| \| \| \| \|	TargetMachine so that different subtargets could share the TLOF effectively llvm-svn: 221878
*	Add minnum / maxnum codegen	Matt Arsenault	2014-10-21	1	-0/+2
\| \| \| \|	llvm-svn: 220342
*	R600/SI: Add missing parameter to div_fmas intrinsic	Matt Arsenault	2014-10-21	1	-0/+2
\| \| \| \|	llvm-svn: 220338
*	R600: Fix nonsensical implementation of computeKnownBits for BFE	Matt Arsenault	2014-10-16	1	-5/+1
\| \| \| \| \| \|	This was resulting in invalid simplifications of sdiv llvm-svn: 219953
*	R600: Remove dead function	Matt Arsenault	2014-10-16	1	-12/+0
\| \| \| \|	llvm-svn: 219879
*	R600: Remove unnecessary part of computeKnownBitsForTargetNode	Matt Arsenault	2014-10-15	1	-5/+0
\| \| \| \| \| \| \|	Zero-width BFEs are combined away already, so there's no point in handling them. llvm-svn: 219868
*	Move variable down to use	Matt Arsenault	2014-10-15	1	-4/+4
\| \| \| \|	llvm-svn: 219867
*	R600: Fix miscompiles when BFE has multiple uses	Matt Arsenault	2014-10-15	1	-7/+10
\| \| \| \| \| \|	SimplifyDemandedBits would break the other uses of the operand. llvm-svn: 219819
*	R600: Use existing variable	Matt Arsenault	2014-10-15	1	-1/+1
\| \| \| \|	llvm-svn: 219778
*	R600: Remove outdated comment	Matt Arsenault	2014-10-15	1	-3/+0
\| \| \| \|	llvm-svn: 219777
*	R600/SI: Custom lower f64 -> i64 conversions	Matt Arsenault	2014-10-03	1	-0/+53
\| \| \| \|	llvm-svn: 219038
*	R600: Custom lower [s\|u]int_to_fp for i64 -> f64	Matt Arsenault	2014-10-03	1	-2/+43
\| \| \| \|	llvm-svn: 219037
*	R600/SI: Fix ftrunc f64 conformance failures.	Matt Arsenault	2014-10-03	1	-1/+1
\| \| \| \| \| \|	Re-add the tests since they were deleted at some point llvm-svn: 219036
*	R600/SI: Add a note about the order of the operands to div_scale	Matt Arsenault	2014-09-26	1	-0/+6
\| \| \| \|	llvm-svn: 218534
*	R600: Don't set BypassSlowDiv for 64-bit division	Tom Stellard	2014-09-22	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	BypassSlowDiv is used by codegen prepare to insert a run-time check to see if the operands to a 64-bit division are really 32-bit values and if they are it will do 32-bit division instead. This is not useful for R600, which has predicated control flow since both the 32-bit and 64-bit paths will be executed in most cases. It also increases code size which can lead to more instruction cache misses. llvm-svn: 218252
*	R600/SI: Use ISD::MUL instead of ISD::UMULO when lowering division	Tom Stellard	2014-09-22	1	-3/+3
\| \| \| \| \| \| \|	ISD::MUL and ISD:UMULO are the same except that UMULO sets an overflow bit. Since we aren't using the overflow bit, we should use ISD::MUL. llvm-svn: 218251
*	R600: Better fix for bug 20982	Matt Arsenault	2014-09-19	1	-6/+3
\| \| \| \| \| \|	Just do the left shift as unsigned to avoid the UB. llvm-svn: 218092
*	R600: Bug 20982 - Avoid undefined left shift of negative value	Matt Arsenault	2014-09-18	1	-3/+10
\| \| \| \| \| \| \|	I'm not sure what the hardware actually does, so don't bother trying to fold it for now. llvm-svn: 218057
*	R600: Custom lower frem	Matt Arsenault	2014-09-10	1	-0/+19
\| \| \| \|	llvm-svn: 217553
*	R600/SI: Use mad for fsub + fmul	Matt Arsenault	2014-08-29	1	-0/+1
\| \| \| \| \| \| \|	We can use a negate source modifier to match this for fsub. llvm-svn: 216735
*	name change: isPow2DivCheap -> isPow2SDivCheap	Sanjay Patel	2014-08-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	isPow2DivCheap That name doesn't specify signed or unsigned. Lazy as I am, I eventually read the function and variable comments. It turns out that this is strictly about signed div. But I discovered that the comments are wrong: srl/add/sra is not the general sequence for signed integer division by power-of-2. We need one more 'sra': sra/srl/add/sra That's the sequence produced in DAGCombiner. The first 'sra' may be removed when dividing by exactly '2', but that's a special case. This patch corrects the comments, changes the name of the flag bit, and changes the name of the accessor methods. No functional change intended. Differential Revision: http://reviews.llvm.org/D5010 llvm-svn: 216237
*	R600/SI: Use source modifiers for f64 fneg	Matt Arsenault	2014-08-15	1	-1/+1
\| \| \| \|	llvm-svn: 215748
*	R600/SI: Use source modifier for f64 fabs	Matt Arsenault	2014-08-15	1	-1/+1
\| \| \| \|	llvm-svn: 215747
*	R600/SI: Add intrinsic for ldexp	Matt Arsenault	2014-08-15	1	-0/+5
\| \| \| \|	llvm-svn: 215734
*	R600: Use optimized 24bit path in udivrem	Jan Vesely	2014-08-12	1	-17/+38
\| \| \| \| \| \| \| \| \|	v2: drop enum keyword use correct extension mode don't bother computing the sign in unsinged case Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215462
*	R600: Remove unused code.	Jan Vesely	2014-08-12	1	-168/+0
\| \| \| \| \|	Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215461
*	R600: Use i24 optimized path for SREM	Jan Vesely	2014-08-12	1	-7/+27
\| \| \| \| \| \| \| \| \|	v2: add tests rename LowerSDIV24 to LowerSDIVREM24 handle the rem part in this function Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215460
*	R600: Disable FP exceptions.	Matt Arsenault	2014-08-09	1	-0/+5
\| \| \| \|	llvm-svn: 215277
*	R600/SI: Avoid generating REGISTER_LOAD instructions.	Tom Stellard	2014-08-05	1	-1/+2
\| \| \| \| \| \| \|	SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214865
*	Remove the TargetMachine forwards for TargetSubtargetInfo based	Eric Christopher	2014-08-04	1	-4/+4
\| \| \| \| \| \|	information and update all callers. No functional change. llvm-svn: 214781
*	Use the known address space constant rather than checking it	Matt Arsenault	2014-08-04	1	-1/+1
\| \| \| \|	llvm-svn: 214729
*	Revert "R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp"	Tom Stellard	2014-08-01	1	-1/+37
\| \| \| \| \| \| \| \|	This reverts commit r214566. I did not mean to commit this yet. llvm-svn: 214572
*	R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp	Tom Stellard	2014-08-01	1	-37/+1
\| \| \| \| \| \| \|	SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214566
*	Make sure no loads resulting from load->switch DAGCombine are marked invariant	Louis Gerbarg	2014-07-31	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently when DAGCombine converts loads feeding a switch into a switch of addresses feeding a load the new load inherits the isInvariant flag of the left side. This is incorrect since invariant loads can be reordered in cases where it is illegal to reoarder normal loads. This patch adds an isInvariant parameter to getExtLoad() and updates all call sites to pass in the data if they have it or false if they don't. It also changes the DAGCombine to use that data to make the right decision when creating the new load. llvm-svn: 214449
*	[SDAG] Enable the new assert for out-of-range result numbers in	Chandler Carruth	2014-07-25	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SDValues, fixing the two bugs left in the regression suite. The key for both of these was the use a single value type rather than a VTList which caused an unintentionally single-result merge-value node. Fix this by getting the appropriate VTList in place. Doing this exposed that the comments in x86's code abouth how MUL_LOHI operands are handle is wrong. The bug with the use of out-of-range result numbers was hiding the bug about the order of operands here (as best i can tell). There are more places where the code appears to get this backwards still... llvm-svn: 213931
*	R600: Add new functions for splitting vector loads and stores.	Matt Arsenault	2014-07-24	1	-20/+129
\| \| \| \| \| \|	These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877
*	R600: Fix LowerSDIV24	Matt Arsenault	2014-07-24	1	-51/+50
\| \| \| \| \| \| \| \| \| \|	Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843
*	R600/SI: Store constant initializer data in constant memory	Tom Stellard	2014-07-21	1	-16/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves use the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530
*	R600: support fpext/fptrunc operations to and from f16.	Tim Northover	2014-07-18	1	-0/+4
\| \| \| \|	llvm-svn: 213376
*	R600: support f16 -> f64 conversion intrinsic.	Tim Northover	2014-07-18	1	-0/+2
\| \| \| \| \| \| \| \|	Unfortunately, we don't seem to have a direct truncation, but the extension can be legally split into two operations so we should support that. llvm-svn: 213357
*	R600: Implement zero undef variants of ctlz/cttz	Jan Vesely	2014-07-15	1	-0/+6
\| \| \| \| \| \| \| \| \|	v2: use ffbh/l if available v3: Rebase on top of Matt's SI patches Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 213072
*	R600: Add dag combine for copy of an illegal type.	Matt Arsenault	2014-07-15	1	-1/+55
\| \| \| \| \| \| \| \| \|	This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031
*	R600: Implement float to long/ulong	Jan Vesely	2014-07-10	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773
*	R600: Fix mishandling of load / store chains.	Matt Arsenault	2014-07-07	1	-11/+49
\| \| \| \| \| \| \| \|	Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473
*	R600: Add a comment that llvm.AMDGPU.trunc is a legacy intrinsic	Tom Stellard	2014-07-02	1	-1/+1
\| \| \| \|	llvm-svn: 212218
*	R600: Promote i64 loads to v2i32	Tom Stellard	2014-07-02	1	-0/+3
\| \| \| \|	llvm-svn: 212216
*	R600: Fix crashes when an illegal type load or store is not handled.	Matt Arsenault	2014-07-02	1	-2/+6
\| \| \| \| \| \| \|	I don't think anything hits this now, but will be exposed in future patches. llvm-svn: 212197
*	R600: Move mul combine to separate function	Matt Arsenault	2014-06-30	1	-28/+33
\| \| \| \|	llvm-svn: 212052
*	R600: Move load/store ReplaceNodeResults to common code.	Matt Arsenault	2014-06-27	1	-0/+14
\| \| \| \| \| \|	Future patches will want to custom lower loads on SI. llvm-svn: 211848
*	Silencing a warning about isZExtFree hiding an inherited virtual function. ↵	Aaron Ballman	2014-06-26	1	-0/+4
\| \| \| \| \| \|	No functional change intended. llvm-svn: 211783