bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[AMDGPU][MC] Added validation of d16 and r128 modifiers of MIMG opcodes	Dmitry Preobrazhensky	2018-02-05	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	See bugs 36094, 36095: https://bugs.llvm.org/show_bug.cgi?id=36094 https://bugs.llvm.org/show_bug.cgi?id=36095 Differential Revision: https://reviews.llvm.org/D42692 Reviewers: vpykhtin, artem.tamazov, arsenm llvm-svn: 324231
*	AMDGPU/SI: Add d16 support for buffer intrinsics.	Changpeng Fang	2018-01-12	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D38906 Reviewers: Matt and Brian. llvm-svn: 322402
*	MachineFunction: Return reference from getFunction(); NFC	Matthias Braun	2017-12-15	1	-2/+2
\| \| \| \| \| \|	The Function can never be nullptr so we can return a reference. llvm-svn: 320884
*	AMDGPU: Fix missing subtarget feature initializer	Matt Arsenault	2017-12-05	1	-0/+1
\| \| \| \|	llvm-svn: 319733
*	AMDGPU: Disable fp64 support on pre GCN asics	Jan Vesely	2017-12-04	1	-9/+14
\| \| \| \| \| \| \| \| \| \| \|	It's not implemented. Passing +fp64-fp16-denormal feature enables fp64 even on asics that don't support it v2: fix hasFP64 query Differential Revision: https://reviews.llvm.org/D39931 llvm-svn: 319709
*	AMDGPU: Don't use MUBUF vaddr if address may overflow	Matt Arsenault	2017-11-15	1	-0/+1
\| \| \| \| \| \| \|	Effectively revert r263964. Before we would not allow this if vaddr was not known to be positive. llvm-svn: 318240
*	Move TargetFrameLowering.h to CodeGen where it's implemented	David Blaikie	2017-11-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	This header already includes a CodeGen header and is implemented in lib/CodeGen, so move the header there to match. This fixes a link error with modular codegeneration builds - where a header and its implementation are circularly dependent and so need to be in the same library, not split between two like this. llvm-svn: 317379
*	[AMDGPU] Clean up symbols in the global namespace.	Benjamin Kramer	2017-10-31	1	-0/+2
\| \| \| \|	llvm-svn: 317051
*	AMDGPU: Add max-mix-insts subtarget feature	Matt Arsenault	2017-10-25	1	-0/+1
\| \| \| \|	llvm-svn: 316553
*	AMDGPU: Initialize WavefrontSize from TD files	Konstantin Zhuravlyov	2017-10-23	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D39205 llvm-svn: 316389
*	AMDGPU: Fix default range in non-kernel functions	Matt Arsenault	2017-10-23	1	-4/+21
\| \| \| \| \| \| \| \| \|	The range should be assumed to be the hardware maximum if a workitem intrinsic is used in a callable function which does not know the restricted limit of the calling kernel. llvm-svn: 316346
*	AMDGPU: Do not emit deprecated notes for code object v3	Konstantin Zhuravlyov	2017-10-14	1	-0/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D38749 llvm-svn: 315810
*	[AMDGPU] Prevent post-RA scheduler from breaking memory clauses	Stanislav Mekhanoshin	2017-09-19	1	-0/+54
\| \| \| \| \| \| \| \| \|	The pre-RA scheduler does load/store clustering, but post-RA scheduler undoes it. Add mutation to prevent it. Differential Revision: https://reviews.llvm.org/D38014 llvm-svn: 313670
*	[AMDGPU][MC][GFX9] Added integer clamping support for VOP3 opcodes	Dmitry Preobrazhensky	2017-08-16	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	See Bug 34152: https://bugs.llvm.org//show_bug.cgi?id=34152 Reviewers: SamWot, artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D36674 llvm-svn: 311006
*	Reapply "[GlobalISel] Remove the GISelAccessor API."	Quentin Colombet	2017-08-15	1	-31/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r310425, thus reapplying r310335 with a fix for link issue of the AArch64 unittests on Linux bots when BUILD_SHARED_LIBS is ON. Original commit message: [GlobalISel] Remove the GISelAccessor API. Its sole purpose was to avoid spreading around ifdefs related to building global-isel. Since r309990, GlobalISel is not optional anymore, thus, we can get rid of this mechanism all together. NFC. ---- The fix for the link issue consists in adding the GlobalISel library in the list of dependencies for the AArch64 unittests. This dependency comes from the use of AArch64Subtarget that needs to know how to destruct the GISel related APIs when being detroyed. Thanks to Bill Seurer and Ahmed Bougacha for helping me reproducing and understand the problem. llvm-svn: 310969
*	Revert "[GlobalISel] Remove the GISelAccessor API."	Quentin Colombet	2017-08-08	1	-6/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit r310115. It causes a linker failure for the one of the unittests of AArch64 on one of the linux bot: http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/3429 : && /home/fedora/gcc/install/gcc-7.1.0/bin/g++ -fPIC -fvisibility-inlines-hidden -Werror=date-time -std=c++11 -Wall -W -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wno-missing-field-initializers -pedantic -Wno-long-long -Wno-maybe-uninitialized -Wdelete-non-virtual-dtor -Wno-comment -ffunction-sections -fdata-sections -O2 -L/home/fedora/gcc/install/gcc-7.1.0/lib64 -Wl,-allow-shlib-undefined -Wl,-O3 -Wl,--gc-sections unittests/Target/AArch64/CMakeFiles/AArch64Tests.dir/InstSizes.cpp.o -o unittests/Target/AArch64/AArch64Tests lib/libLLVMAArch64CodeGen.so.6.0.0svn lib/libLLVMAArch64Desc.so.6.0.0svn lib/libLLVMAArch64Info.so.6.0.0svn lib/libLLVMCodeGen.so.6.0.0svn lib/libLLVMCore.so.6.0.0svn lib/libLLVMMC.so.6.0.0svn lib/libLLVMMIRParser.so.6.0.0svn lib/libLLVMSelectionDAG.so.6.0.0svn lib/libLLVMTarget.so.6.0.0svn lib/libLLVMSupport.so.6.0.0svn -lpthread lib/libgtest_main.so.6.0.0svn lib/libgtest.so.6.0.0svn -lpthread -Wl,-rpath,/home/buildbots/ppc64le-clang-multistage-test/clang-ppc64le-multistage/stage1/lib && : unittests/Target/AArch64/CMakeFiles/AArch64Tests.dir/InstSizes.cpp.o:(.toc+0x0): undefined reference to `vtable for llvm::LegalizerInfo' unittests/Target/AArch64/CMakeFiles/AArch64Tests.dir/InstSizes.cpp.o:(.toc+0x8): undefined reference to `vtable for llvm::RegisterBankInfo' The particularity of this bot is that it is built with BUILD_SHARED_LIBS=ON However, I was not able to reproduce the problem so far. Reverting to unblock the bot. llvm-svn: 310425
*	AMDGPU: Cleanup subtarget features	Matt Arsenault	2017-08-07	1	-2/+13
\| \| \| \| \| \| \| \| \| \| \| \|	Try to avoid mutually exclusive features. Don't use a real default GPU, and use a fake "generic". The goal is to make it easier to see which set of features are incompatible between feature strings. Most of the test changes are due to random scheduling changes from not having a default fullspeed model. llvm-svn: 310258
*	[GlobalISel] Remove the GISelAccessor API.	Quentin Colombet	2017-08-04	1	-31/+6
\| \| \| \| \| \| \| \| \| \|	Its sole purpose was to avoid spreading around ifdefs related to building global-isel. Since r309990, GlobalISel is not optional anymore, thus, we can get rid of this mechanism all together. NFC. llvm-svn: 310115
*	[GlobalISel] Make GlobalISel a non-optional library.	Quentin Colombet	2017-08-03	1	-8/+0
\| \| \| \| \| \| \| \|	With this change, the GlobalISel library gets always built. In particular, this is not possible to opt GlobalISel out of the build using the LLVM_BUILD_GLOBAL_ISEL variable any more. llvm-svn: 309990
*	AMDGPU: Add encoding for carryless add/sub instructions	Matt Arsenault	2017-07-20	1	-0/+1
\| \| \| \|	llvm-svn: 308639
*	AMDGPU: Fix amdgpu-flat-work-group-size/amdgpu-waves-per-eu check	Konstantin Zhuravlyov	2017-07-16	1	-1/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D35433 llvm-svn: 308147
*	[AMDGPU] Fix -Wimplicit-fallthrough warnings. NFCI.	Simon Pilgrim	2017-07-07	1	-0/+3
\| \| \| \|	llvm-svn: 307381
*	[AMDGPU] Move GISel accessor initialization from TargetMachine to Subtarget.	Quentin Colombet	2017-07-05	1	-5/+50
\| \| \| \| \| \|	NFC llvm-svn: 307186
*	[AMDGPU] SDWA: several fixes for V_CVT and VOPC instructions	Sam Kolton	2017-06-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: 1. Instruction V_CVT_U32_F32 allow omod operand (see SIInstrInfo.td:1435). In fact this operand shouldn't be allowed here. This fix checks if SDWA pseudo instruction has OMod operand and then copy it. 2. There were several problems with support of VOPC instructions in SDWA peephole pass. Reviewers: tstellar, arsenm, vpykhtin, airlied, kzhuravl Subscribers: wdng, nhaehnle, yaxunl, dstuttard, tpr, sarnex, t-tye Differential Revision: https://reviews.llvm.org/D34626 llvm-svn: 306413
*	[AMDGPU] SDWA: add support for GFX9 in peephole pass	Sam Kolton	2017-06-22	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Added support based on merged SDWA pseudo instructions. Now peephole allow one scalar operand, omod and clamp modifiers. Added several subtarget features for GFX9 SDWA. This diff also contains changes from D34026. Depends D34026 Reviewers: vpykhtin, rampitec, arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D34241 llvm-svn: 305986
*	AMDGPU: Make auto waitcnt before barrier a feature	Konstantin Zhuravlyov	2017-06-02	1	-0/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D33793 llvm-svn: 304571
*	AMDGPU: Add new subtarget features for gfx9 flat instructions	Matt Arsenault	2017-05-10	1	-0/+3
\| \| \| \| \| \| \|	Flat instructions gain an immediate offset, and 2 new sets of segment specific flat instructions are added. llvm-svn: 302729
*	[AMDGPU] Generate range metadata for workitem id	Stanislav Mekhanoshin	2017-04-12	1	-0/+60
\| \| \| \| \| \| \| \| \|	If workgroup size is known inform llvm about range returned by local id and local size queries. Differential Revision: https://reviews.llvm.org/D31804 llvm-svn: 300102
*	AMDGPU: Fix crash when disassembling VOP3 mac	Matt Arsenault	2017-04-10	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \|	The unused dummy src2_modifiers is missing, so it crashes when trying to print it. I tried to fully remove src2_modifiers, but there are some irritations in the places where it is converted to mad since it starts to require modifying use lists while iterating over them. llvm-svn: 299861
*	[AMDGPU] Get address space mapping by target triple environment	Yaxun Liu	2017-03-27	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As we introduced target triple environment amdgiz and amdgizcl, the address space values are no longer enums. We have to decide the value by target triple. The basic idea is to use struct AMDGPUAS to represent address space values. For address space values which are not depend on target triple, use static const members, so that they don't occupy extra memory space and is equivalent to a compile time constant. Since the struct is lightweight and cheap, it can be created on the fly at the point of usage. Or it can be added as member to a pass and created at the beginning of the run* function. Differential Revision: https://reviews.llvm.org/D31284 llvm-svn: 298846
*	AMDGPU: Add VOP3P instruction format	Matt Arsenault	2017-02-27	1	-0/+1
\| \| \| \| \| \| \| \|	Add a few non-VOP3P but instructions related to packed. Includes hack with dummy operands for the benefit of the assembler llvm-svn: 296368
*	AMDGPU: Redefine clamp node as clamp 0.0-1.0	Matt Arsenault	2017-02-21	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	Change implementation to use max instead of add. min/max/med3 do not flush denormals regardless of the mode, so it is OK to use it whether or not they are enabled. Also allow using clamp with f16, and use knowledge of dx10_clamp. llvm-svn: 295788
*	AMDGPU: Fix assembler subtarget predicate for gfx9	Matt Arsenault	2017-02-18	1	-0/+1
\| \| \| \| \| \|	This was accepting GFX9 instructions on VI. llvm-svn: 295557
*	AMDGPU: Merge initial gfx9 support	Matt Arsenault	2017-02-18	1	-0/+1
\| \| \| \|	llvm-svn: 295554
*	Revert "[AMDGPU] Fix for SIMachineScheduler crash. SI Scheduler should track"	Alexander Timofeev	2017-02-14	1	-1/+3
\| \| \| \| \| \|	This reverts commit ce06d9cb99298eb844b66e117f5108a06747c907. llvm-svn: 295054
*	AMDGPU : Add trap handler support.	Wei Ding	2017-02-10	1	-1/+2
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D26010 llvm-svn: 294692
*	[AMDGPU] Calculate number of min/max SGPRs/VGPRs for WavesPerEU instead of ↵	Konstantin Zhuravlyov	2017-02-09	1	-1/+1
\| \| \| \| \| \| \| \|	using switch statement Differential Revision: https://reviews.llvm.org/D29741 llvm-svn: 294627
*	[AMDGPU] Add target information that is required by tools to metadata	Konstantin Zhuravlyov	2017-02-08	1	-80/+1
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D28760#fb670e28 llvm-svn: 294449
*	[AMDGPU][NFC] De-tabify	Konstantin Zhuravlyov	2017-02-08	1	-1/+1
\| \| \| \|	llvm-svn: 294445
*	[AMDGPU] Move register related queries to subtarget class	Konstantin Zhuravlyov	2017-02-08	1	-5/+173
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D29318 llvm-svn: 294440
*	[AMDGPU] Fix for SIMachineScheduler crash. SI Scheduler should track	Alexander Timofeev	2017-02-07	1	-3/+1
\| \| \| \| \| \| \| \|	lane masks. Differential revision: https://reviews.llvm.org/D29442 llvm-svn: 294324
*	[AMDGPU] Account workgroup size in LDS occupancy limits	Stanislav Mekhanoshin	2017-02-01	1	-53/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Functions matching LDS use to occupancy return results for a workgroup of 64 workitems. The numbers has to be adjusted for bigger workgroups. For example a workgroup of size 256 already occupies 4 waves just by itself. Given that all numbers of LDS use in the compiler are per workgroup, occupancy shall be multiplied by 4 in this case. Each 64 workitems still limited by the same number, but 4 subrgoups 64 workitems each can afford 4 times more LDS to get the same occupancy. In addition change initializes LDS size in the subtarget to a real value for SI+ targets. This is required since LDS size is a variable in these calculations. Differential Revision: https://reviews.llvm.org/D29423 llvm-svn: 293837
*	AMDGPU: Enable FeatureFlatForGlobal on Volcanic Islands	Matt Arsenault	2017-01-27	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \|	Accomplishes what r292982 was supposed to, which ended up only really making the necessary test changes. This should be applied to the 4.0 branch. Patch by Vedran Miletić <vedran@miletic.net> llvm-svn: 293310
*	AMDGPU add support for spilling to a user sgpr pointed buffers	Tom Stellard	2017-01-25	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This lets you select which sort of spilling you want, either s[0:1] or 64-bit loads from s[0:1]. Patch By: Dave Airlie Reviewers: nhaehnle, arsenm, tstellarAMD Reviewed By: arsenm Subscribers: mareko, llvm-commits, kzhuravl, wdng, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25428 llvm-svn: 293000
*	Enable FeatureFlatForGlobal on Volcanic Islands	Matt Arsenault	2017-01-24	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	This switches to the workaround that HSA defaults to for the mesa path. This should be applied to the 4.0 branch. Patch by Vedran Miletić <vedran@miletic.net> llvm-svn: 292982
*	AMDGPU: Combine fp16/fp64 subtarget features	Matt Arsenault	2017-01-23	1	-5/+4
\| \| \| \| \| \| \|	The same control register controls both, and are set to the same defaults. Keep the old names around as aliases. llvm-svn: 292837
*	Pacify -Wreorder.	Benjamin Kramer	2017-01-20	1	-1/+1
\| \| \| \|	llvm-svn: 292599
*	[AMDGPU] Add subtarget features for SDWA/DPP	Sam Kolton	2017-01-20	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D28900 llvm-svn: 292596
*	[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What ↵	Eugene Zelenko	2016-12-12	1	-13/+5
\| \| \| \| \| \|	You Use warnings; other minor fixes (NFC). llvm-svn: 289475
*	[AMDGPU] Scalarization of global uniform loads.	Alexander Timofeev	2016-12-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: LC can currently select scalar load for uniform memory access basing on readonly memory address space only. This restriction originated from the fact that in HW prior to VI vector and scalar caches are not coherent. With MemoryDependenceAnalysis we can check that the memory location corresponding to the memory operand of the LOAD is not clobbered along the all paths from the function entry. Reviewers: rampitec, tstellarAMD, arsenm Subscribers: wdng, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D26917 llvm-svn: 289076