bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	AMDGPU: Partially fix implicit.buffer.ptr intrinsic handling	Matt Arsenault	2017-06-26	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This should not be treated as a different version of private_segment_buffer. These are distinct things with different uses and register classes, and they require the function argument info to have more context about the function's type and environment. Also add missing test coverage for the intrinsic, and emit an error for HSA. This also uncovers that the intrinsic is broken unless there happen to be stack objects. llvm-svn: 306264
*	Sort the remaining #include lines in include/... and lib/....	Chandler Carruth	2017-06-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787
*	AMDGPU: Start defining a calling convention	Matt Arsenault	2017-05-17	1	-3/+2
\| \| \| \| \| \| \| \|	Partially implement callee-side for arguments and return values. byval doesn't work properly, and most likely sret and other on-stack return values don't either. llvm-svn: 303308
*	AMDGPU: Add StackPtr and FramePtr registers to MFI	Matt Arsenault	2017-04-24	1	-0/+24
\| \| \| \| \| \|	These will be necessary for setting up call sequences. llvm-svn: 301208
*	AMDGPU: Make MFI fields private	Matt Arsenault	2017-04-18	1	-3/+5
\| \| \| \|	llvm-svn: 300596
*	AMDGPU: Refactor argument lowering	Matt Arsenault	2017-04-11	1	-3/+10
\| \| \| \| \| \| \|	Split into smaller functions and prepare for handling non-entry functions. llvm-svn: 299998
*	AMDGPU: Fix crash when disassembling VOP3 mac	Matt Arsenault	2017-04-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The unused dummy src2_modifiers is missing, so it crashes when trying to print it. I tried to fully remove src2_modifiers, but there are some irritations in the places where it is converted to mad since it starts to require modifying use lists while iterating over them. llvm-svn: 299861
*	AMDGPU: Don't use stack space for SGPR->VGPR spills	Matt Arsenault	2017-02-21	1	-4/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before frame offsets are calculated, try to eliminate the frame indexes used by SGPR spills. Then we can delete them after. I think for now we can be sure that no other instruction will be re-using the same frame indexes. It should be easy to notice if this assumption ever breaks since everything asserts if it tries to use a dead frame index later. The unused emergency stack slot seems to still be left behind, so an additional 4 bytes is still wasted. llvm-svn: 295753
*	AMDGPU: Add support for spilling to a buffer pointed to by user SGPRs	Tom Stellard	2017-01-25	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This lets you select which sort of spilling you want, either s[0:1] or 64-bit loads from s[0:1]. Patch By: Dave Airlie Reviewers: nhaehnle, arsenm, tstellarAMD Reviewed By: arsenm Subscribers: mareko, llvm-commits, kzhuravl, wdng, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25428 llvm-svn: 293000
*	[AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).	Eugene Zelenko	2017-01-21	1	-7/+13
\| \| \| \| \| \|	llvm-svn: 292688
*	AMDGPU/SI: Make a function const	Tom Stellard	2016-12-20	1	-3/+3
\| \| \| \|	llvm-svn: 290185
*	AMDGPU/SI: Add a MachineMemOperand when lowering llvm.amdgcn.buffer.load.*	Tom Stellard	2016-12-20	1	-0/+28
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm, nhaehnle, mareko Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D27834 llvm-svn: 290184
*	AMDGPU/SI: Add a MachineMemOperand to MIMG instructions	Tom Stellard	2016-12-20	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Without a MachineMemOperand, the scheduler was assuming MIMG instructions were ordered memory references, so no loads or stores could be reordered across them. Reviewers: arsenm Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27536 llvm-svn: 290179
*	[AMDGPU] Wave and register controls	Konstantin Zhuravlyov	2016-09-06	1	-8/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Implemented amdgpu-flat-work-group-size attribute - Implemented amdgpu-num-active-waves-per-eu attribute - Implemented amdgpu-num-sgpr attribute - Implemented amdgpu-num-vgpr attribute - Dynamic LDS constraints are in a separate patch Patch by Tom Stellard and Konstantin Zhuravlyov Differential Revision: https://reviews.llvm.org/D21562 llvm-svn: 280747
*	AMDGPU: fix mismatch tags, NFC	Saleem Abdulrasool	2016-08-29	1	-1/+1
\| \| \| \|	llvm-svn: 280006
*	AMDGPU: Remove unused tracking of flat instructions	Matt Arsenault	2016-08-11	1	-9/+0
\| \| \| \|	llvm-svn: 278361
*	AMDGPU: Make AMDGPUMachineFunction fields private	Matt Arsenault	2016-07-26	1	-1/+0
\| \| \| \| \| \| \| \| \|	ABIArgOffset is a problem because properly setting the KernArgSize requires that the reserved area before the real kernel arguments be correctly aligned, which requires fixing clover. llvm-svn: 276766
*	AMDGPU: Add HSA dispatch id intrinsic	Matt Arsenault	2016-07-22	1	-5/+6
\| \| \| \|	llvm-svn: 276437
*	AMDGPU/SI: Emit the number of SGPR and VGPR spills	Marek Olsak	2016-07-13	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: v2: don't count SGPRs spilled to scratch twice I think this is sufficient. It doesn't count private memory usage, which happens often and uses scratch but isn't technically a spill. The private memory usage can be computed by: [scratch_per_thread - vgpr_spills - a random multiple of SGPR spills]. The fact that SGPR spills add very high numbers to the scratch size makes that computation a guessing game, but I don't have a solution to that. Reviewers: tstellarAMD Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D22197 llvm-svn: 275288
*	SIMachineFunctionInfo.cpp: Appease msc18 to use std::array.	NAKAMURA Takumi	2016-06-27	1	-2/+3
\| \| \| \|	llvm-svn: 273860
*	Reformat blank lines.	NAKAMURA Takumi	2016-06-27	1	-1/+0
\| \| \| \|	llvm-svn: 273858
*	[AMDGPU] Emit debugger prologue and emit the rest of the debugger fields in the kernel code header	Konstantin Zhuravlyov	2016-06-25	1	-0/+60
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Debugger prologue is emitted if -mattr=+amdgpu-debugger-emit-prologue. Debugger prologue writes work group IDs and work item IDs to scratch memory at fixed location in the following format: - offset 0: work group ID x - offset 4: work group ID y - offset 8: work group ID z - offset 16: work item ID x - offset 20: work item ID y - offset 24: work item ID z Set - amd_kernel_code_t::debug_wavefront_private_segment_offset_sgpr to scratch wave offset reg - amd_kernel_code_t::debug_private_segment_buffer_sgpr to scratch rsrc reg - amd_kernel_code_t::is_debug_supported to true if all debugger features are enabled Differential Revision: http://reviews.llvm.org/D20335 llvm-svn: 273769
*	[AMDGPU][NFC] Rename ReserveTrapVGPRs -> ReserveRegs	Konstantin Zhuravlyov	2016-05-24	1	-4/+5
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20081 llvm-svn: 270594
*	[AMDGPU] Move reserved vgpr count for trap handler usage to SIMachineFunctionInfo + minor commenting changes	Konstantin Zhuravlyov	2016-04-26	1	-0/+7
\| \| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D19537 llvm-svn: 267573
*	AMDGPU: Implement addrspacecast	Matt Arsenault	2016-04-25	1	-0/+4
\| \| \| \|	llvm-svn: 267452
*	AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit	Tom Stellard	2016-04-14	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For GL_ARB_compute_shader we need to support workgroup sizes of at least 1024. However, if we want to allow large workgroup sizes, we may need to use fewer registers, as we have to run more waves per SIMD. This patch adds an attribute to specify the maximum work group size the compiled program needs to support. It defaults to 256, as that has no wave restrictions. Reducing the number of registers available is done similarly to how the registers were reserved for chips with the sgpr init bug. Reviewers: mareko, arsenm, tstellarAMD, nhaehnle Subscribers: FireBurn, kerberizer, llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D18340 Patch By: Bas Nieuwenhuizen llvm-svn: 266337
*	AMDGPU/SI: Use the correct scratch wave offset register for shaders.	Tom Stellard	2016-04-14	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The code previously always used s1 as it was using the user + system SGPR information for compute kernels. This is incorrect for Mesa shaders, though. The register should be the next SGPR after all user and system SGPRs. We rely on the fact that Mesa adds arguments for all input and system SGPRs and take the next available SGPR for the scratch wave offset register. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewers: mareko, arsenm, nhaehnle, tstellarAMD Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18941 Patch By: Bas Nieuwenhuizen llvm-svn: 266336
*	AMDGPU: R600 code splitting cleanup	Matt Arsenault	2016-03-11	1	-5/+3
\| \| \| \| \| \| \|	Move a few functions only used by R600 to R600 specific code, fix header macros to stop using R600, mark classes as final. llvm-svn: 263204
*	AMDGPU/SI: Add support for spilling SGPRs to scratch buffer	Tom Stellard	2016-03-04	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is necessary for when we run out of VGPRs and can no longer use v_{read,write}_lane for spilling SGPRs. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17592 llvm-svn: 262732
*	AMDGPU: Set flat_scratch from flat_scratch_init reg	Matt Arsenault	2016-02-12	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was hardcoded to the static private size, but this would be missing the offset and additional size for someday when we have dynamic sizing. Also stops always initializing flat_scratch even when unused. In the future we should stop emitting this unless flat instructions are used to access private memory. For example this will initialize it almost always on VI because flat is used for global access. llvm-svn: 260658
*	AMDGPU/SI: Add s_waitcnt at the end of non-void functions	Marek Olsak	2016-01-13	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: v2: Make ReturnsVoid private, so that I can add another 8 lines of code and look more productive. Reviewers: tstellarAMD, arsenm Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D16034 llvm-svn: 257622
*	AMDGPU/SI: Add new target attribute InitialPSInputAddr	Marek Olsak	2016-01-13	1	-1/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This allows Mesa to pass initial SPI_PS_INPUT_ADDR to LLVM. The register assigns VGPR locations to PS inputs, while the ENA register determines whether or not they are loaded. Mesa needs to set some inputs as not-movable, so that a pixel shader prolog binary appended at the beginning can assume where some inputs are. v2: Make PSInputAddr private, because there are never enough silly getters and setters for people to read. Reviewers: tstellarAMD, arsenm Subscribers: arsenm Differential Revision: http://reviews.llvm.org/D16030 llvm-svn: 257591
*	AMDGPU: Rework how private buffer passed for HSA	Matt Arsenault	2015-11-30	1	-2/+111
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we know we have stack objects, we reserve the registers in which the private buffer resource and wave offset are passed and use them directly. If not, reserve the last 5 SGPRs just in case we need to spill. After register allocation, try to pick the next available registers instead of the last SGPRs, and then insert copies from the inputs to the reserved registers in the prologue. This also only selectively enables the input registers which are really required instead of always enabling all of them. llvm-svn: 254331
*	AMDGPU: Check feature attributes in SIMachineFunctionInfo	Matt Arsenault	2015-11-25	1	-6/+98
\| \| \| \|	llvm-svn: 254091
*	AMDGPU: Also track whether SGPRs were spilled	Matt Arsenault	2015-11-05	1	-2/+17
\| \| \| \|	llvm-svn: 252145
*	R600 -> AMDGPU rename	Tom Stellard	2015-06-13	1	-0/+66
\| \| \| \|	llvm-svn: 239657
*	Revert "AMDGPU: Add core backend files for R600/SI codegen v6"	Tom Stellard	2012-07-16	1	-37/+0
\| \| \| \| \| \|	This reverts commit 4ea70107c5e51230e9e60f0bf58a0f74aa4885ea. llvm-svn: 160303
*	AMDGPU: Add core backend files for R600/SI codegen v6	Tom Stellard	2012-07-16	1	-0/+37
	llvm-svn: 160270