bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	AMDGPU: Split flat offsets that don't fit in DAG	Matt Arsenault	2019-10-20	1	-13/+14
	We handle it this way for some other address spaces. Since r349196, SILoadStoreOptimizer has been trying to do this. This is after SIFoldOperands runs, which can change the addressing patterns. It's simpler to just split this earlier. llvm-svn: 375366
*	[AMDGPU] Switch to the new addr space mapping by default	Yaxun Liu	2018-02-02	1	-54/+54
	This requires corresponding clang change. Differential Revision: https://reviews.llvm.org/D40955 llvm-svn: 324101
*	AMDGPU: Start selecting flat instruction offsets	Matt Arsenault	2017-06-12	1	-3/+53
	llvm-svn: 305201
*	AMDGPU: Mark all unspecified CC functions in tests as amdgpu_kernel	Matt Arsenault	2017-03-21	1	-16/+16
	Currently the default C calling convention functions are treated the same as compute kernels. Make this explicit so the default calling convention can be changed to a non-kernel. Converted with perl -pi -e 's/define void/define amdgpu_kernel void/' on the relevant test directories (and undoing in one place that actually wanted a non-kernel). llvm-svn: 298444
*	AMDGPU: Enable InferAddressSpaces	Matt Arsenault	2017-02-08	1	-12/+12
	llvm-svn: 294408
*	Enable FeatureFlatForGlobal on Volcanic Islands	Matt Arsenault	2017-01-24	1	-2/+2
	This switches to the workaround that HSA defaults to for the mesa path. This should be applied to the 4.0 branch. Patch by Vedran Miletić <vedran@miletic.net> llvm-svn: 292982
*	AMDGPU/SI: Don't emit multi-dword flat memory ops when they might access scratch	Tom Stellard	2016-10-26	1	-2/+25
	Summary: A single flat memory operation that might access the scratch buffer can only access MaxPrivateElementSize bytes. Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D25788 llvm-svn: 285198
*	AMDGPU/SI: Remove unnecessary run lines from test	Tom Stellard	2016-10-26	1	-4/+2
	Summary: This test had run lines disabling/enabling the promote alloca pass, but enabling/disabling promote alloca had no impact on the output. Reviewers: arsenm Subscribers: mgrang, kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25787 llvm-svn: 285197
*	AMDGPU/SI: Don't allow unaligned scratch access	Tom Stellard	2016-10-14	1	-6/+30
	Summary: The hardware doesn't support this. Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25523 llvm-svn: 284257
*	[AMDGPU] Assembler: Swap operands of flat_store instructions to match AMD assembler	Tom Stellard	2016-02-12	1	-1/+1
	Historically, AMD internal sp3 assembler has flat_store* addr, data format. To match existing code and to enable reuse, change LLVM definitions to match. Also update MC and CodeGen tests. Differential Revision: http://reviews.llvm.org/D16927 Patch by: Nikolay Haustov llvm-svn: 260694
*	AMDGPU: Remove some old intrinsic uses from tests	Matt Arsenault	2016-02-11	1	-3/+0
	llvm-svn: 260493
*	AMDGPU: Switch barrier intrinsics to using convergent	Matt Arsenault	2015-12-19	1	-1/+1
	noduplicate prevents unrolling of small loops that happen to have barriers in them. If a loop has a barrier in it, it is OK to duplicate it for the unroll. llvm-svn: 256075
*	AMDGPU: fix overlapping copies in copyPhysReg	Nicolai Haehnle	2015-12-19	1	-3/+6
	Summary: When copying aggregate registers within the same register class, there may be an overlap between source and destination that forces us to do the copy backwards. Do the simplest possible thing that guarantees the correct order of moves when there are overlaps, and does whatever when there is no overlap. (The last part forces some trivial adjustments to test cases.) Together with r255906, this fixes a VM fault in Unreal Elemental Demo. While at it, change the generation of kill and def flags to something that looks more reasonable. This method is used very late during compilation, so it probably doesn't matter in practice, and to be honest, I don't know if this change is actually correct because the semantics in connection with aggregate registers vs. sub-registers are not clear to me. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93264 Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15622 llvm-svn: 256072
*	AMDGPU: Error on addrspacecasts that aren't actually implemented	Matt Arsenault	2015-12-01	1	-52/+0
	llvm-svn: 254469
*	Fix CHECK directives that weren't checking.	Hans Wennborg	2015-08-31	1	-7/+7
	llvm-svn: 246485
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+184
	llvm-svn: 239657