bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[ARM][MVE] validForTailPredication insts	Sam Parker	2019-10-15	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reverse the logic for valid tail predication instructions and create a whitelist instead. Added other instruction groups that aren't obviously safe: - instructions that 'narrow' their result. - lane moves. - byte swapping instructions. - interleaving loads and stores. - cross-beat carries. - top/bottom instructions. - complex operations. Hopefully we should be able to add more of these instructions to the whitelist, once we have a more concrete idea of the transform. Differential Revision: https://reviews.llvm.org/D67904 llvm-svn: 374887
*	[ARM][MVE] Add invalidForTailPredication to TSFlags	Sam Parker	2019-09-17	1	-0/+4
\| \| \| \| \| \| \| \| \|	Set this bit for the MVE reduction instructions to prevent a loop from becoming tail predicated in their presence. Differential Revision: https://reviews.llvm.org/D67444 llvm-svn: 372076
*	[ARM] Add MVE vector load/store instructions.	Simon Tatham	2019-06-25	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds the rest of the vector memory access instructions. It includes contiguous loads/stores, with an ordinary addressing mode such as [r0,#offset] (plus writeback variants); gather loads and scatter stores with a scalar base address register and a vector of offsets from it (written [r0,q1] or similar); and gather/scatters with a vector of base addresses (written [q0,#offset], again with writeback). Additionally, some of the loads can widen each loaded value into a larger vector lane, and the corresponding stores narrow them again. To implement these, we also have to add the addressing modes they need. Also, in AsmParser, the `isMem` query function now has subqueries `isGPRMem` and `isMVEMem`, according to which kind of base register is used by a given memory access operand. I've also had to add an extra check in `checkTargetMatchPredicate` in the AsmParser, without which our last-minute check of `rGPR` register operands against SP and PC was failing an assertion because Tablegen had inserted an immediate 0 in place of one of a pair of tied register operands. (This matches the way the corresponding check for `MCK_rGPR` in `validateTargetOperandClass` is guarded.) Apparently the MVE load instructions were the first to have ever triggered this assertion, but I think only because they were the first to have a combination of the usual Arm pre/post writeback system and the `rGPR` class in particular. Reviewers: dmgreen, samparker, SjoerdMeijer, t.p.northover Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62680 llvm-svn: 364291
*	[ARM] Add the non-MVE instructions in Arm v8.1-M.	Simon Tatham	2019-06-11	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds support for the new family of conditional selection / increment / negation instructions; the low-overhead branch instructions (e.g. BF, WLS, DLS); the CLRM instruction to zero a whole list of registers at once; the new VMRS/VMSR and VLDR/VSTR instructions to get data in and out of 8.1-M system registers, particularly including the new VPR register used by MVE vector predication. To support this, we also add a register name 'zr' (used by the CSEL family to force one of the inputs to the constant 0), and operand types for lists of registers that are also allowed to include APSR or VPR (used by CLRM). The VLDR/VSTR instructions also need a new addressing mode. The low-overhead branch instructions exist in their own separate architecture extension, which we treat as enabled by default, but you can say -mattr=-lob or equivalent to turn it off. Reviewers: dmgreen, samparker, SjoerdMeijer, t.p.northover Reviewed By: samparker Subscribers: miyuki, javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62667 llvm-svn: 363039
*	[ARM] Add an MVE execution domain	Sjoerd Meijer	2019-05-30	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MVE architecturally specifies a 'beat' system in which a vector instruction executed now will complete its actual operation over the next four cycles, so it can overlap with the execution of the previous and next MVE instruction. This makes it generally an advantage to avoid moving values back and forth between MVE registers and anywhere else, if there's any sensible way to do the same processing in whatever register type the values already occupied. That's just what the 'execution domain' system is supposed to achieve. So here we add a new execution domain which will contain all the MVE vector instructions when they are added. Patch by: Simon Tatham Differential Revision: https://reviews.llvm.org/D60703 llvm-svn: 362068
*	Update the file headers across all of the LLVM projects in the monorepo	Chandler Carruth	2019-01-19	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636
*	ARM: fix Thumb2 CodeGen for ldrex with folded frame-index.	Tim Northover	2018-09-07	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \|	Because t2LDREX (& t2STREX) were marked as AddrModeNone, but did allow a FrameIndex operand, rewriteT2FrameIndex asserted. This gives them a proper addressing-mode and tells the rewriter about it so that encodable offsets are exploited and others are rejected. Should fix PR38828. llvm-svn: 341642
*	[MinGW] [ARM] Add stubs for potential automatic dllimported variables	Martin Storsjo	2018-08-31	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \|	The runtime pseudo relocations can't handle the ARM format embedded addresses in movw/movt pairs. By using stubs, the potentially dllimported addresses can be touched up by the runtime pseudo relocation framework. Differential Revision: https://reviews.llvm.org/D51450 llvm-svn: 341176
*	[AArch64][ARM] Armv8.4-A: Trace synchronization barrier instruction	Sjoerd Meijer	2018-07-06	1	-0/+14
\| \| \| \| \| \| \| \|	This adds the Armv8.4-A Trace synchronization barrier (TSB) instruction. Differential Revision: https://reviews.llvm.org/D48918 llvm-svn: 336418
*	[ARM] Armv8.2-A FP16 code generation (part 1/3)	Sjoerd Meijer	2018-01-26	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the groundwork for Armv8.2-A FP16 code generation . Clang passes and returns _Float16 values as floats, together with the required bitconverts and truncs etc. to implement correct AAPCS behaviour, see D42318. We will implement half-precision argument passing/returning lowering in the ARM backend soon, but for now this means that this: _Float16 sub(_Float16 a, _Float16 b) { return a + b; } gets lowered to this: define float @sub(float %a.coerce, float %b.coerce) { entry: %0 = bitcast float %a.coerce to i32 %tmp.0.extract.trunc = trunc i32 %0 to i16 %1 = bitcast i16 %tmp.0.extract.trunc to half <SNIP> %add = fadd half %1, %3 <SNIP> } When FullFP16 is not supported, we don't make f16 a legal type, and we get legalization for "free", i.e. nothing changes and everything works as before. And also f16 argument passing/returning is handled. When FullFP16 is supported, we do make f16 a legal type, and have 2 places that we need to patch up: f16 argument passing and returning, which involves minor tweaks to avoid unnecessary code generation for some bitcasts. As a "demonstrator" that this works for the different FP16, FullFP16, softfp modes, etc., I've added match rules to the VSUB instruction description showing that we can codegen this instruction from IR, but more importantly, also to some conversion instructions. These conversions were causing issue before in the FP16 and FullFP16 cases. I've also added match rules to the VLDRH and VSTRH desriptions, so that we can actually compile the entire half-precision sub code example above. This showed that these loads and stores had the wrong addressing mode specified: AddrMode5 instead of AddrMode5FP16, which turned out not be implemented at all, so that has also been added. This is the minimal patch that shows all the different moving parts. In patch 2/3 I will add some efficient lowering of bitcasts, and in 2/3 I will add the remaining Armv8.2-A FP16 instruction descriptions. Thanks to Sam Parker and Oliver Stannard for their help and reviews! Differential Revision: https://reviews.llvm.org/D38315 llvm-svn: 323512
*	[arm] Fix Unnecessary reloads from GOT.	Evgeniy Stepanov	2017-11-13	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: This fixes PR35221. Use pseudo-instructions to let MachineCSE hoist global address computation. Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D39871 llvm-svn: 318081
*	[ARM] v8.3-a complex number support	Sam Parker	2017-09-29	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	New instructions are added to AArch32 and AArch64 to aid floating-point multiplication and addition of complex numbers, where the complex numbers are packed in a vector register as a pair of elements. The Imaginary part of the number is placed in the more significant element, and the Real part of the number is placed in the less significant element. This patch adds assembler for the ARM target. Differential Revision: https://reviews.llvm.org/D36789 llvm-svn: 314511
*	[ARM] Tidy-up condition-code support functions	Javed Absar	2017-08-27	1	-64/+1
\| \| \| \| \| \| \| \| \|	Move condition code support functions to Utils and remove code duplication. Reviewed by: @fhahn, @asb Differential Revision: https://reviews.llvm.org/D37179 llvm-svn: 311860
*	[ARM] Make RWPI use movw/movt when available	Christof Douma	2017-02-07	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When constructing global address literals while targeting the RWPI relocation model. LLVM currently only uses literal pools. If MOVW/MOVT instructions are available we can use these instead. Beside being more efficient it allows -arm-execute-only to work with -relocation-model=RWPI as well. When we generate MOVW/MOVT for global addresses when targeting the RWPI relocation model, we need to use base relative relocations. This patch does the needed plumbing in MC to generate these for MOVW/MOVT. Differential Revision: https://reviews.llvm.org/D29487 Change-Id: I446786e43a6f5aa9b6a5bb2cd216d60d41c7755d llvm-svn: 294298
*	Don't print (PLT) on arm.	Rafael Espindola	2016-06-16	1	-4/+0
\| \| \| \| \| \| \| \| \|	The R_ARM_PLT32 relocation is deprecated and is not produced by MC. This means that the code being deleted is dead from the .o point of view and was making the .s more confusing. llvm-svn: 272909
*	ARM: correct TLS access on WoA	Saleem Abdulrasool	2016-06-07	1	-2/+8
\| \| \| \| \| \| \| \| \| \| \| \|	TLS access requires an offset from the TLS index. The index itself is the section-relative distance of the symbol. For ARM, the relevant relocation (IMAGE_REL_ARM_SECREL) is applied as a constant. This means that the value may not be an immediate and must be lowered into a constant pool. This offset will not be base relocated. We were previously emitting the actual address of the symbol which would be base relocated and would therefore be the vaue offset by the ImageBase + TLS Offset. llvm-svn: 271974
*	Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC)	Alexander Kornienko	2015-06-23	1	-2/+2
\| \| \| \| \| \|	Apparently, the style needs to be agreed upon first. llvm-svn: 240390
*	Fixed/added namespace ending comments using clang-tidy. NFC	Alexander Kornienko	2015-06-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	The patch is generated using this command: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ llvm/lib/ Thanks to Eugene Kosov for the original patch! llvm-svn: 240137
*	Revert "Move dllimport name mangling to IR mangler."	Reid Kleckner	2015-06-11	1	-1/+6
\| \| \| \| \| \| \| \| \|	This reverts commit r239437. This broke clang-cl self-hosts. We'd end up calling the __imp_ symbol directly instead of using it to do an indirect function call. llvm-svn: 239502
*	Move dllimport name mangling to IR mangler.	Peter Collingbourne	2015-06-09	1	-6/+1
\| \| \| \| \| \| \| \|	This ensures that LTO clients see the correct external symbol name. Differential Revision: http://reviews.llvm.org/D10318 llvm-svn: 239437
*	Canonicalize header guards into a common format.	Benjamin Kramer	2014-08-13	1	-2/+2
\| \| \| \| \| \| \| \| \| \|	Add header guards to files that were missing guards. Remove #endif comments as they don't seem common in LLVM (we can easily add them back if we decide they're useful) Changes made by clang-tidy with minor tweaks. llvm-svn: 215558
*	ARM: correctly mangle dllimport symbols	Saleem Abdulrasool	2014-07-07	1	-1/+6
\| \| \| \| \| \| \| \|	Add support for tracking DLLImport storage class information on a per symbol basis in the ARM instruction selection. Use that information to correctly mangle the symbol (dllimport symbols are referenced via *__imp_<name>). llvm-svn: 212430
*	Fix known typos	Alp Toker	2014-01-24	1	-1/+2
\| \| \| \| \| \| \|	Sweep the codebase for common typos. Includes some changes to visible function names that were misspelt. llvm-svn: 200018
*	ARM: remove special cases for Darwin dynamic-no-pic mode.	Tim Northover	2013-11-25	1	-27/+21
\| \| \| \| \| \| \| \| \|	These are handled almost identically to static mode (and ELF's global address materialisation), except that a symbol may have "$non_lazy_ptr" appended. This can be handled by passing appropriate flags along with the instruction instead of using entirely separate pseudo-instructions. llvm-svn: 195655
*	[ARMv8] Implement the new DMB/DSB operands.	Joey Gouly	2013-09-05	1	-9/+9
\| \| \| \| \| \| \|	This removes the custom ISD Node: MEMBARRIER and replaces it with an intrinsic. llvm-svn: 190055
*	ARM: ISB cannot be passed the same options as DMB	Amaury de la Vieuville	2013-06-10	1	-0/+43
\| \| \| \| \| \|	ISB should only accepts full system sync, other options are reserved llvm-svn: 183656
*	Remove getARMRegisterNumbering and replace with calls into	Eric Christopher	2012-08-09	1	-76/+0
\| \| \| \| \| \| \| \| \| \| \|	the register info for getEncodingValue. This builds on the small patch of yesterday to set HWEncoding in the register file. One (deprecated) use was turned into a hard number to avoid needing register info in the old JIT. llvm-svn: 161628
*	Fix #13138, a bug around ARM instruction DSB encoding and decoding issue.	Jiangning Liu	2012-08-02	1	-7/+23
\| \| \| \|	llvm-svn: 161161
*	ARM more NEON VLD/VST composite physical register refactoring.	Jim Grosbach	2012-03-06	1	-15/+31
\| \| \| \| \| \|	Register pair, all lanes subscripting. llvm-svn: 152157
*	ARM refactor away a bunch of VLD/VST pseudo instructions.	Jim Grosbach	2012-03-05	1	-0/+17
\| \| \| \| \| \| \| \| \|	With the new composite physical registers to represent arbitrary pairs of DPR registers, we don't need the pseudo-registers anymore. Get rid of a bunch of them that use DPR register pairs and just use the real instructions directly instead. llvm-svn: 152045
*	Move default case for covered enum outside of switch.	Richard Smith	2012-01-10	1	-1/+1
\| \| \| \|	llvm-svn: 147870
*	Fix a -Wreturn-type warning in g++.	Richard Smith	2012-01-10	1	-0/+1
\| \| \| \|	llvm-svn: 147867
*	Remove unnecessary default cases in switches that cover all enum values.	David Blaikie	2012-01-10	1	-2/+0
\| \| \| \|	llvm-svn: 147855
*	Thumb parsing diagnostics for low-reg requirements on ADD and MOV.	Jim Grosbach	2011-08-16	1	-0/+13
\| \| \| \|	llvm-svn: 137779
*	ARM thumb assembly parsing for arithmetic flag setting instructions.	Jim Grosbach	2011-08-16	1	-0/+6
\| \| \| \| \| \| \| \| \|	Thumb one requires that many arithmetic instruction forms have an 'S' suffix. For Thumb2, the whether the suffix is required or precluded depends on whether the instruction is in an IT block. Use target parser predicates to check for these sorts of context-sensitive constraints. llvm-svn: 137746
*	Code clean up.	Evan Cheng	2011-07-25	1	-3/+0
\| \| \| \|	llvm-svn: 135954
*	Sink ARM mc routines into MCTargetDesc.	Evan Cheng	2011-07-23	1	-0/+433
	llvm-svn: 135825