bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Reapply "AMDGPU: Add 32-bit constant address space"	Matt Arsenault	2018-02-09	1	-1/+7
\| \| \| \| \| \|	This reverts r324494 and reapplies r324487. llvm-svn: 324747
*	Revert "AMDGPU: Add 32-bit constant address space"	Rafael Espindola	2018-02-07	1	-7/+1
\| \| \| \| \| \| \| \|	This reverts commit r324487. It broke clang tests. llvm-svn: 324494
*	AMDGPU: Add 32-bit constant address space	Marek Olsak	2018-02-07	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Note: This is a candidate for LLVM 6.0, because it was planned to be in that release but was delayed due to a long review period. Merge conflict in release_60 - resolution: Add "-p6:32:32" into the second (non-amdgiz) string. Only scalar loads support 32-bit pointers. An address in a VGPR will fail to compile. That's OK because the results of loads will only be used in places where VGPRs are forbidden. Updated AMDGPUAliasAnalysis and used SReg_64_XEXEC. The tests cover all uses cases we need for Mesa. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D41651 llvm-svn: 324487
*	AMDGPU: Use unique PSVs for buffer resources	Matt Arsenault	2017-12-29	1	-1/+0
\| \| \| \| \| \| \|	Also fixes using the wrong memory type for some intrinsics when custom lowering them. llvm-svn: 321557
*	AMDGPU: Implement getTgtMemIntrinsic for images	Matt Arsenault	2017-12-29	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	Currently all images are lowered to have a single image PseudoSourceValue. Image stores happen to have overly strict mayLoad/mayStore/hasSideEffects flags set on them, so this happens to work. When these are fixed to be correct, the scheduler breaks this because the identical PSVs are assumed to be the same address. These need to be unique to the image resource value. llvm-svn: 321555
*	MachineFunction: Return reference from getFunction(); NFC	Matthias Braun	2017-12-15	1	-21/+21
\| \| \| \| \| \|	The Function can never be nullptr so we can return a reference. llvm-svn: 320884
*	[AMDGPU] AMDPAL scratch buffer support	Tim Renouf	2017-09-29	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added support for scratch (including spilling) for OS type amdpal: generates code to set up the scratch descriptor if it is needed. With amdpal, the scratch resource descriptor is loaded from offset 0 of the global information table. The low 32 bits of the address of the global information table is passed in s0. Added amdgpu-git-ptr-high function attribute to hard-wire the high 32 bits of the address of the global information table. If the function attribute is not specified, or is 0xffffffff, then the backend generates code to use the high 32 bits of pc. The documentation for the AMDPAL ABI will be added in a later commit. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye Differential Revision: https://reviews.llvm.org/D37483 llvm-svn: 314501
*	Add AddresSpace to PseudoSourceValue.	Jan Sjodin	2017-09-14	1	-0/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D35089 llvm-svn: 313297
*	AMDGPU: trivial comment change	Tim Renouf	2017-09-11	1	-1/+1
\| \| \| \| \| \|	... to check commit access for new committer. llvm-svn: 312900
*	[AMDGPU] Fix some Clang-tidy modernize-use-using and Include What You Use ↵	Eugene Zelenko	2017-08-08	1	-25/+10
\| \| \| \| \| \|	warnings; other minor fixes (NFC). llvm-svn: 310328
*	AMDGPU: Fix implicitarg.ptr handling special inputs	Matt Arsenault	2017-08-03	1	-1/+2
\| \| \| \|	llvm-svn: 310002
*	AMDGPU: Pass special input registers to functions	Matt Arsenault	2017-08-03	1	-45/+34
\| \| \| \|	llvm-svn: 309998
*	AMDGPU: Fix clobbering CSR VGPRs when spilling SGPR to it	Matt Arsenault	2017-08-02	1	-2/+20
\| \| \| \|	llvm-svn: 309783
*	AMDGPU: Annotate implicitarg.ptr usage	Matt Arsenault	2017-07-28	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \|	We need to pass something to functions for this to work. It isn't derivable just from the kernarg segment pointer because the implicit arguments are placed after the kernel arguments. Also fixes missing test for the intrinsic. llvm-svn: 309398
*	AMDGPU: Annotate necessity of flat-scratch-init	Matt Arsenault	2017-07-18	1	-7/+8
\| \| \| \| \| \| \| \|	As an approximation of the existing handling to avoid regressions. Fixes using too many registers with calls on subtargets with the SGPR allocation bug. llvm-svn: 308326
*	AMDGPU: Figure out private memory regs after lowering	Matt Arsenault	2017-07-18	1	-5/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce pseudo-registers for registers needed for stack access, which are replaced during finalizeLowering. Note these pseudo-registers are currently only used for the used register location, and not for determining their input argument register. This is better because it avoids the need to try to predict whether a call will be emitted from the IR, and also detects stack objects introduced by legalization. Test changes are from the HasStackObjects check being more accurate since stack objects introduced during legalization are now known. llvm-svn: 308325
*	AMDGPU: Annotate features from x work item/group IDs.	Matt Arsenault	2017-07-17	1	-13/+26
\| \| \| \| \| \| \| \|	This wasn't necessary before since they are always enabled for kernels, but this is necessary if they need to be forwarded to a callable function. llvm-svn: 308226
*	AMDGPU: Detect kernarg segment pointer	Matt Arsenault	2017-07-14	1	-1/+4
\| \| \| \| \| \| \| \|	This is necessary to pass the kernarg segment pointer to callee functions. Also don't unconditionally enable for kernels. llvm-svn: 307978
*	AMDGPU: Setup SP/FP in callee function prolog/epilog	Matt Arsenault	2017-06-26	1	-0/+1
\| \| \| \|	llvm-svn: 306312
*	AMDGPU: Partially fix implicit.buffer.ptr intrinsic handling	Matt Arsenault	2017-06-26	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This should not be treated as a different version of private_segment_buffer. These are distinct things with different uses and register classes, and requires the function argument info to have more context about the function's type and environment. Also add missing test coverage for the intrinsic, and emit an error for HSA. This also encovers that the intrinsic is broken unless there happen to be stack objects. llvm-svn: 306264
*	AMDGPU: Start defining a calling convention	Matt Arsenault	2017-05-17	1	-7/+12
\| \| \| \| \| \| \| \|	Partially implement callee-side for arguments and return values. byval doesn't work properly, and most likely sret or other on-stack return values most as well. llvm-svn: 303308
*	AMDGPU: GFX9 GS and HS shaders always have the scratch wave offset in SGPR5	Marek Olsak	2017-05-04	1	-1/+7
\| \| \| \| \| \| \| \| \| \|	Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D32645 llvm-svn: 302200
*	AMDGPU: Add StackPtr and FramePtr registers to MFI	Matt Arsenault	2017-04-24	1	-0/+2
\| \| \| \| \| \|	These will be necessary for setting up call sequences. llvm-svn: 301208
*	AMDGPU: Refactor SIMachineFunctionInfo slightly	Matt Arsenault	2017-04-11	1	-15/+25
\| \| \| \| \| \|	Prepare for handling non-entry functions. llvm-svn: 299999
*	AMDGPU: Refactor argument lowering	Matt Arsenault	2017-04-11	1	-1/+1
\| \| \| \| \| \| \|	Split into smaller functions and prepare for handling non-entry functions. llvm-svn: 299998
*	AMDGPU: Don't use stack space for SGPR->VGPR spills	Matt Arsenault	2017-02-21	1	-42/+51
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Before frame offsets are calculated, try to eliminate the frame indexes used by SGPR spills. Then we can delete them after. I think for now we can be sure that no other instruction will be re-using the same frame indexes. It should be easy to notice if this assumption ever breaks since everything asserts if it tries to use a dead frame index later. The unused emergency stack slot seems to still be left behind, so an additional 4 bytes is still wasted. llvm-svn: 295753
*	AMDGPU add support for spilling to a user sgpr pointed buffers	Tom Stellard	2017-01-25	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This lets you select which sort of spilling you want, either s[0:1] or 64-bit loads from s[0:1]. Patch By: Dave Airlie Reviewers: nhaehnle, arsenm, tstellarAMD Reviewed By: arsenm Subscribers: mareko, llvm-commits, kzhuravl, wdng, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25428 llvm-svn: 293000
*	AMDGPU/SI: Make a function const	Tom Stellard	2016-12-20	1	-1/+0
\| \| \| \|	llvm-svn: 290185
*	AMDGPU/SI: Add a MachineMemOperand to MIMG instructions	Tom Stellard	2016-12-20	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Without a MachineMemOperand, the scheduler was assuming MIMG instructions were ordered memory references, so no loads or stores could be reordered across them. Reviewers: arsenm Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27536 llvm-svn: 290179
*	AMDGPU/SI: Add support for triples with the mesa3d operating system	Tom Stellard	2016-09-16	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: mesa3d will use the same kernel calling convention as amdhsa, but it will handle everything else like the default 'unknown' OS type. Reviewers: arsenm Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D22783 llvm-svn: 281779
*	[AMDGPU] Wave and register controls	Konstantin Zhuravlyov	2016-09-06	1	-14/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Implemented amdgpu-flat-work-group-size attribute - Implemented amdgpu-num-active-waves-per-eu attribute - Implemented amdgpu-num-sgpr attribute - Implemented amdgpu-num-vgpr attribute - Dynamic LDS constraints are in a separate patch Patch by Tom Stellard and Konstantin Zhuravlyov Differential Revision: https://reviews.llvm.org/D21562 llvm-svn: 280747
*	AMDGPU: Remove unused tracking of flat instructions	Matt Arsenault	2016-08-11	1	-1/+0
\| \| \| \|	llvm-svn: 278361
*	MachineFunction: Return reference for getFrameInfo(); NFC	Matthias Braun	2016-07-28	1	-4/+4
\| \| \| \| \| \| \|	getFrameInfo() never returns nullptr so we should use a reference instead of a pointer. llvm-svn: 277017
*	AMDGPU/SI: Don't use reserved VGPRs for SGPR spilling	Tom Stellard	2016-07-28	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We were using reserved VGPRs for SGPR spilling and this was causing some programs with a workgroup size of 1024 to use more than 64 registers, which is illegal. Reviewers: arsenm, mareko, nhaehnle Subscribers: nhaehnle, arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D22032 llvm-svn: 276980
*	AMDGPU: Make AMDGPUMachineFunction fields private	Matt Arsenault	2016-07-26	1	-3/+0
\| \| \| \| \| \| \| \| \|	ABIArgOffset is a problem because properly fsetting the KernArgSize requires that the reserved area before the real kernel arguments be correctly aligned, which requires fixing clover. llvm-svn: 276766
*	AMDGPU: Add HSA dispatch id intrinsic	Matt Arsenault	2016-07-22	1	-1/+11
\| \| \| \|	llvm-svn: 276437
*	AMDGPU/SI: Emit the number of SGPR and VGPR spills	Marek Olsak	2016-07-13	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: v2: don't count SGPRs spilled to scratch twice I think this is sufficient. It doesn't count private memory usage, which happens often and uses scratch but isn't technically a spill. The private memory usage can be computed by: [scratch_per_thread - vgpr_spills - a random multiple of SGPR spills]. The fact SGPR spills add very high numbers to the scratch size make that computation a guessing game, but I don't have a solution to that. Reviewers: tstellarAMD Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D22197 llvm-svn: 275288
*	SIMachineFunctionInfo.cpp: Appease msc18 to use std::array.	NAKAMURA Takumi	2016-06-27	1	-2/+2
\| \| \| \|	llvm-svn: 273860
*	Reformat.	NAKAMURA Takumi	2016-06-27	1	-1/+1
\| \| \| \|	llvm-svn: 273859
*	Reformat blank lines.	NAKAMURA Takumi	2016-06-27	1	-2/+0
\| \| \| \|	llvm-svn: 273858
*	[AMDGPU] Emit debugger prologue and emit the rest of the debugger fields in ↵	Konstantin Zhuravlyov	2016-06-25	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the kernel code header Debugger prologue is emitted if -mattr=+amdgpu-debugger-emit-prologue. Debugger prologue writes work group IDs and work item IDs to scratch memory at fixed location in the following format: - offset 0: work group ID x - offset 4: work group ID y - offset 8: work group ID z - offset 16: work item ID x - offset 20: work item ID y - offset 24: work item ID z Set - amd_kernel_code_t::debug_wavefront_private_segment_offset_sgpr to scratch wave offset reg - amd_kernel_code_t::debug_private_segment_buffer_sgpr to scratch rsrc reg - amd_kernel_code_t::is_debug_supported to true if all debugger features are enabled Differential Revision: http://reviews.llvm.org/D20335 llvm-svn: 273769
*	AMDGPU: Cleanup subtarget handling.	Matt Arsenault	2016-06-24	1	-5/+6
\| \| \| \| \| \| \| \| \|	Split AMDGPUSubtarget into amdgcn/r600 specific subclasses. This removes most of the static_casting of the basic codegen classes everywhere, and tries to restrict the features visible on the wrong target. llvm-svn: 273652
*	AMDGPU: Add option to disable spilling SGPRs to VGPRs.	Matt Arsenault	2016-06-23	1	-2/+9
\| \| \| \| \| \|	This can help debug spilling problems. llvm-svn: 273605
*	[AMDGPU][NFC] Rename ReserveTrapVGPRs -> ReserveRegs	Konstantin Zhuravlyov	2016-05-24	1	-3/+3
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D20081 llvm-svn: 270594
*	[AMDGPU] Move reserved vgpr count for trap handler usage to ↵	Konstantin Zhuravlyov	2016-04-26	1	-0/+4
\| \| \| \| \| \| \| \|	SIMachineFunctionInfo + minor commenting changes Differential Revision: http://reviews.llvm.org/D19537 llvm-svn: 267573
*	AMDGPU: Add queue ptr intrinsic	Matt Arsenault	2016-04-25	1	-0/+3
\| \| \| \|	llvm-svn: 267451
*	AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit	Tom Stellard	2016-04-14	1	-6/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: For GL_ARB_compute_shader we need to support workgroup sizes of at least 1024. However, if we want to allow large workgroup sizes, we may need to use less registers, as we have to run more waves per SIMD. This patch adds an attribute to specify the maximum work group size the compiled program needs to support. It defaults, to 256, as that has no wave restrictions. Reducing the number of registers available is done similarly to how the registers were reserved for chips with the sgpr init bug. Reviewers: mareko, arsenm, tstellarAMD, nhaehnle Subscribers: FireBurn, kerberizer, llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D18340 Patch By: Bas Nieuwenhuizen llvm-svn: 266337
*	AMDGPU/SI: Use the correct scratch wave offset register for shaders.	Tom Stellard	2016-04-14	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The code previously always used s1 as it was using the user + system SGPR information for compute kernels. This is incorrect for Mesa shaders though, The register should be the next SGPR after all user and system SGPR's. We use that Mesa adds arguments for all input and system SGPR's and take the next available SGPR for the scratch wave offset register. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewers: mareko, arsenm, nhaehnle, tstellarAMD Subscribers: qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D18941 Patch By: Bas Nieuwenhuizen llvm-svn: 266336
*	AMDGPU: Add a shader calling convention	Nicolai Haehnle	2016-04-06	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \|	This makes it possible to distinguish between mesa shaders and other kernels even in the presence of compute shaders. Patch By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Differential Revision: http://reviews.llvm.org/D18559 llvm-svn: 265589
*	AMDGPU/SI: Add support for spiling SGPRs to scratch buffer	Tom Stellard	2016-03-04	1	-10/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This is necessary for when we run out of VGPRs and can no longer use v_{read,write}_lane for spilling SGPRs. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17592 llvm-svn: 262732