bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[SystemZ] Add translate/convert instructions	Ulrich Weigand	2017-05-10	1	-0/+324
\| \| \| \| \| \| \|	This adds the set of character-set translate and convert instructions for assembler / disassembler use. llvm-svn: 302644
*	[SystemZ] Add missing memory/string instructions	Ulrich Weigand	2017-05-10	1	-0/+186
\| \| \| \| \| \| \|	This adds a number of missing memory and string instructions for assembler / disassembler use. llvm-svn: 302643
*	[SystemZ] Reformat assembler/disassembler tests	Ulrich Weigand	2017-05-10	2	-4198/+4228
\| \| \| \| \| \| \| \|	The assembler and disassmebler test cases started out formatted and sorted in a particular way, but this got lost over time as patches were added. Reformat them again. NFC. llvm-svn: 302642
*	[AArch64] armv8-A doesn't have CRC.	Ahmed Bougacha	2017-05-03	1	-17/+0
\| \| \| \| \| \| \| \| \| \| \|	That's only a required extension as of v8.1a. Remove it from the "generic" CPU as well: it should only support the base ISA (and binutils agrees). Also unify the MC tests into crc.s and arm64-crc32.s llvm-svn: 302077
*	[X86][LWP] Add llvm support for LWP instructions (reapplied).	Simon Pilgrim	2017-05-03	2	-0/+42
\| \| \| \| \| \| \| \| \| \|	This patch adds support for the the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Reapplied - this time without changing line endings of existing files. Differential Revision: https://reviews.llvm.org/D32769 llvm-svn: 302041
*	Revert rL302028 due to accidental line ending changes.	Simon Pilgrim	2017-05-03	2	-42/+0
\| \| \| \|	llvm-svn: 302038
*	[X86][LWP] Add llvm support for LWP instructions.	Simon Pilgrim	2017-05-03	2	-0/+42
\| \| \| \| \| \| \| \|	This patch adds support for the the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Differential Revision: https://reviews.llvm.org/D32769 llvm-svn: 302028
*	[X86][SSE2] Fix asm string for movq (Move Quadword) instruction.	Ayman Musa	2017-04-26	1	-4/+4
\| \| \| \| \| \| \| \|	Replace "mov{d\|q}" with "movq". Differential Revision: https://reviews.llvm.org/D32220 llvm-svn: 301386
*	[AMDGPU][mc][tests][NFC] Bulk ISA tests: update for Gfx7/Gfx8, add for Gfx9.	Artem Tamazov	2017-04-24	1	-57799/+65599
\| \| \| \|	llvm-svn: 301247
*	[AMDGPU][MC] Corrected src0 size for s_cbranch_join	Dmitry Preobrazhensky	2017-04-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Fix for bug 28159: https://bugs.llvm.org//show_bug.cgi?id=28159 Reviewers: vpykhtin, arsenm Differential Revision: https://reviews.llvm.org/D31595 llvm-svn: 300055
*	AMDGPU: Fix crash when disassembling VOP3 mac	Matt Arsenault	2017-04-10	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \|	The unused dummy src2_modifiers is missing, so it crashes when trying to print it. I tried to fully remove src2_modifiers, but there are some irritations in the places where it is converted to mad since it starts to require modifying use lists while iterating over them. llvm-svn: 299861
*	[AMDGPU][MC] Fix for Bug 28211 + LIT tests	Dmitry Preobrazhensky	2017-04-07	1	-10/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- corrected DS_GWS_* opcodes (see VI_Shader_Programming#16.pdf for detailed description) - address operand is not used - several opcodes have data operand - all opcodes have offset modifier - DS_AND_SRC2_B32: corrected typo in mnemo - DS_WRAP_RTN_F32 replaced with DS_WRAP_RTN_B32 - added CI/VI opcodes: - DS_CONDXCHG32_RTN_B64 - DS_GWS_SEMA_RELEASE_ALL - added VI opcodes: - DS_CONSUME - DS_APPEND - DS_ORDERED_COUNT Differential Revision: https://reviews.llvm.org/D31707 llvm-svn: 299767
*	[AMDGPU][MC] Fix for Bugs 28200, 28202 + LIT tests	Dmitry Preobrazhensky	2017-03-20	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	Fixed several related issues with VOP3 fp modifiers. Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30821 llvm-svn: 298255
*	[PowerPC][Altivec] Add mfvrd and mffprd extended mnemonic	Nemanja Ivanovic	2017-03-15	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	mfvrd and mffprd are both alias to mfvrsd. This patch enables correct parsing of the aliases, but we still emit a mfvrsd. Committing on behalf of brunoalr (Bruno Rosa). Differential Revision: https://reviews.llvm.org/D29177 llvm-svn: 297849
*	AMDGPU: Add definition for v_swap_b32	Matt Arsenault	2017-02-28	1	-0/+4
\| \| \| \| \| \| \| \|	This is somewhat tricky because there are two pairs of tied operands, and it isn't allowed to be VOP3 encoded. llvm-svn: 296519
*	AMDGPU: Fix disassembly of aperture registers	Matt Arsenault	2017-02-18	1	-0/+13
\| \| \| \|	llvm-svn: 295555
*	AMDGPU: Replace assert with report_fatal_error	Matt Arsenault	2017-02-15	1	-0/+4
\| \| \| \| \| \|	Also use a more refined condition. llvm-svn: 295239
*	[Hexagon] Replace instruction definitions with auto-generated ones	Krzysztof Parzyszek	2017-02-10	18	-931/+931
\| \| \| \|	llvm-svn: 294753
*	[X86] Clzero intrinsic and its addition under znver1	Craig Topper	2017-02-09	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch does the following. 1. Adds an Intrinsic int_x86_clzero which works with __builtin_ia32_clzero 2. Identifies clzero feature using cpuid info. (Function:8000_0008, Checks if EBX[0]=1) 3. Adds the clzero feature under znver1 architecture. 4. The custom inserter is added in Lowering. 5. A testcase is added to check the intrinsic. 6. The clzero instruction is added to assembler test. Patch by Ganesh Gopalasubramanian with a couple formatting tweaks, a disassembler test, and using update_llc_test.py from me. Differential revision: https://reviews.llvm.org/D29385 llvm-svn: 294558
*	Revert r294437 as it broke an asan buildbot.	Amara Emerson	2017-02-08	1	-24/+24
\| \| \| \|	llvm-svn: 294523
*	[AArch64][TableGen] Skip tied result operands for InstAlias	Amara Emerson	2017-02-08	1	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch checks the number of operands in the resulting instruction instead of just the alias, then skips over tied operands when generating the printing method. This allows us to generate the preferred assembly syntax for the AArch64 'ins' instruction, which should always be displayed as 'mov' according to the ARMARM. Several unit tests have changed as a result, but only to reflect the preferred disassembly. Some other InstAlias patterns (movk/bic/orr) needed a slight adjustment to stop them becoming the default and breaking other unit tests. Patch by Graham Hunter. Differential Revision: https://reviews.llvm.org/D29219 llvm-svn: 294437
*	[X86] Remove PCOMMIT instruction support since Intel has deprecated this ↵	Craig Topper	2017-02-08	1	-3/+0
\| \| \| \| \| \| \| \|	instruction with no plans to release products with it. Intel's documentation for the deprecation https://software.intel.com/en-us/blogs/2016/09/12/deprecate-pcommit-instruction llvm-svn: 294405
*	[Hexagon] Adding gp+ to the syntax of gp-relative instructions	Krzysztof Parzyszek	2017-02-06	3	-14/+14
\| \| \| \| \| \|	Patch by Colin LeMahieu. llvm-svn: 294258
*	[X86] Fix printing of sha256rnds2 to include the implicit %xmm0 argument.	Craig Topper	2017-02-05	1	-2/+2
\| \| \| \|	llvm-svn: 294132
*	Reapply [AMDGPU][mc][tests][NFC] Add coverage/smoke tests for Gfx7 and Gfx8.	Artem Tamazov	2017-01-30	1	-0/+87676
\| \| \| \|	llvm-svn: 293552
*	[X86][Disassembler] Added SALC instruction	Chris Ray	2017-01-29	1	-790/+793
\| \| \| \| \| \| \| \| \| \| \| \|	Reviewers: joe.abbey, craig.topper Reviewed By: craig.topper Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D29201 llvm-svn: 293447
*	Add some Book-E instructions to the asm parser and printer.	Justin Hibbits	2017-01-29	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Adds the following instructions: * mfpmr * mtpmr * icblc * icblq * icbtls Fix the scheduling for mtspr on e5500, which uses CFX0, instead of SFX0/SFX1 as on e500mc. Addresses PR 31538. Differential Revision: https://reviews.llvm.org/D29002 llvm-svn: 293417
*	[X86] Adding FFREEP instruction.	Chris Ray	2017-01-27	1	-1037/+1061
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Small change to get the FREEP instruction to decode properly. Reviewers: craig.topper Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29193 llvm-svn: 293314
*	Revert [AMDGPU][mc][tests][NFC] Add coverage/smoke tests for Gfx7 and Gfx8.	Ivan Krasin	2017-01-24	1	-87676/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Reason: broke ASAN bots with a global buffer overflow. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/2291 Each test contains 20-30K test cases but takes only several (from 4 to 10) seconds to complete on average machine. The tests cover the majority of AMDGPU Gfx7/Gfx8 instructions, including many dark corners, and intended to quickly find out if something is broken. llvm-svn: 292974
*	[AMDGPU][mc][tests][NFC] Add coverage/smoke tests for Gfx7 and Gfx8.	Artem Tamazov	2017-01-24	1	-0/+87676
\| \| \| \| \| \| \| \| \|	Each test contains 20-30K test cases but takes only several (from 4 to 10) seconds to complete on average machine. The tests cover the majority of AMDGPU Gfx7/Gfx8 instructions, including many dark corners, and intended to quickly find out if something is broken. llvm-svn: 292922
*	[mips] Correct c.cond.fmt instruction definition.	Simon Dardis	2017-01-16	1	-38/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Permit explicit $fcc<X> operand in c.cond.fmt instruction. Add c.cond.fmt to the MIPS to microMIPS instruction mapping table. Check that $fcc1 - $fcc7 are unusable for MIPS-I to MIPS-III for c.cond.fmt, bc1t, bc1f. Reviewers: seanbruno, zoran.jovanovic, vkalintiris Differential Revision: https://reviews.llvm.org/D24510 llvm-svn: 292117
*	[AVX-512] Teach the disassembler about all of the EVEX gather and scatter ↵	Craig Topper	2017-01-16	1	-0/+95
\| \| \| \| \| \|	instructions. llvm-svn: 292094
*	[PowerPC] Implement missing ISA 2.06 instructions.	Tony Jiang	2017-01-05	1	-0/+18
\| \| \| \| \| \| \|	Instructions: fctidu[.], fctiwu[.], ftdiv, ftsqrt are not implemented. Implement them and add corresponding test cases in this patch. llvm-svn: 291116
*	[AMDGPU] Disassembler: fix for disaasembling v_mac_f32/16_dpp/sdwa	Sam Kolton	2016-12-22	2	-3/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Real instruction should copy constraints from real instruction. This allows auto-generated disassembler to correctly process tied operands. Reviewers: nhaustov, vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27847 llvm-svn: 290336
*	AMDGPU: Fix name for v_ashrrev_i16	Matt Arsenault	2016-12-16	2	-2/+2
\| \| \| \|	llvm-svn: 289967
*	AMDGPU: Change vintrp printing	Matt Arsenault	2016-12-14	1	-16/+16
\| \| \| \|	llvm-svn: 289664
*	AMDGPU: Fix handling of 16-bit immediates	Matt Arsenault	2016-12-10	2	-2/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since 32-bit instructions with 32-bit input immediate behavior are used to materialize 16-bit constants in 32-bit registers for 16-bit instructions, determining the legality based on the size is incorrect. Change operands to have the size specified in the type. Also adds a workaround for a disassembler bug that produces an immediate MCOperand for an operand that is supposed to be OPERAND_REGISTER. The assembler appears to accept out of bounds immediates and truncates them, but this seems to be an issue for 32-bit already. llvm-svn: 289306
*	AMDGPU: Fix vintrp disassembly	Matt Arsenault	2016-12-10	2	-5/+49
\| \| \| \|	llvm-svn: 289292
*	AMDGPU: Change vintrp printing to better match sc	Matt Arsenault	2016-12-10	1	-1/+1
\| \| \| \| \| \| \|	Some of the immediates need to be printed differently eventually. llvm-svn: 289291
*	AMDGPU: Fix operand name for v_interp_*	Matt Arsenault	2016-12-06	1	-1/+1
\| \| \| \| \| \|	Other VOP instructions call the output vdst llvm-svn: 288856
*	[AMDGPU] Disassembler: fix s_buffer_store_dword instructions	Sam Kolton	2016-12-05	1	-0/+36
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: s_buffer_store_dword instructions sdata operand was called sdst in encoding. This caused disassembler to fail. Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: arsenm, nhaehnle, rampitec Differential Revision: https://reviews.llvm.org/D27100 llvm-svn: 288657
*	[SystemZ] Support remaining atomic instructions	Ulrich Weigand	2016-12-02	1	-0/+252
\| \| \| \| \| \| \| \|	Add assembler support for all atomic instructions that weren't already supported. Some of those could be used to implement codegen for 128-bit atomic operations, but this isn't done here yet. llvm-svn: 288526
*	[SystemZ] Support floating-point control register instructions	Ulrich Weigand	2016-12-02	1	-0/+135
\| \| \| \| \| \| \| \| \| \|	Add assembler support for instructions manipulating the FPC. Also add codegen support via the GCC compatibility builtins: __builtin_s390_sfpc __builtin_s390_efpc llvm-svn: 288525
*	[SystemZ] Support execution hint instructions	Ulrich Weigand	2016-11-28	2	-0/+76
\| \| \| \| \| \| \| \| \|	This adds assembler support for the instructions provided by the execution-hint facility (NIAI and BP(R)P). This required adding support for the new relocation types for 12-bit and 24-bit PC- relative offsets used by the BP(R)P instructions. llvm-svn: 288031
*	[SystemZ] Support load-and-trap instructions	Ulrich Weigand	2016-11-28	1	-0/+150
\| \| \| \| \| \| \|	This adds support for the instructions provided with the load-and-trap facility. llvm-svn: 288030
*	[SystemZ] Add remaining branch instructions	Ulrich Weigand	2016-11-28	2	-1/+287
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds assembler support for the remaining branch instructions: the non-relative branch on count variants, and all variants of branch on index. The only one of those that can be readily exploited for code generation is BRCTH (branch on count using a high 32-bit register as count). Do use it, however, it is necessary to also introduce a hew CHIMux pseudo to allow comparisons of a 32-bit value agains a short immediate to go into a high register as well (implemented via CHI/CIH). This causes a bit of codegen changes overall, but those have proven to be neutral (or even beneficial) in performance measurements. llvm-svn: 288029
*	[SystemZ] Improve use of conditional instructions	Ulrich Weigand	2016-11-28	1	-0/+193
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch moves formation of LOC-type instructions from (late) IfConversion to the early if-conversion pass, and in some cases additionally creates them directly from select instructions during DAG instruction selection. To make early if-conversion work, the patch implements the canInsertSelect / insertSelect callbacks. It also implements the commuteInstructionImpl and FoldImmediate callbacks to enable generation of the full range of LOC instructions. Finally, the patch adds support for all instructions of the load-store-on-condition-2 facility, which allows using LOC instructions also for high registers. Due to the use of the GRX32 register class to enable high registers, we now also have to handle the cases where there are still no single hardware instructions (conditional move from a low register to a high register or vice versa). These are converted back to a branch sequence after register allocation. Since the expandRAPseudos callback is not allowed to create new basic blocks, this requires a simple new pass, modelled after the ARM/AArch64 ExpandPseudos pass. Overall, this patch causes significantly more LOC-type instructions to be used, and results in a measurable performance improvement. llvm-svn: 288028
*	[AVX-512] Fix a disassembler failure for AVX-512 vcmpss/vcmpsd with an ↵	Craig Topper	2016-11-13	1	-0/+31
\| \| \| \| \| \| \| \|	immediate larger than 32. Fix the same bug with VLX vcmpps/vcmppd. Fixes PR24941. llvm-svn: 286775
*	[AMDGPU] Add f16 support (VI+)	Konstantin Zhuravlyov	2016-11-13	1	-2/+2
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D25975 llvm-svn: 286753
*	[SystemZ] Support CL(G)T instructions	Ulrich Weigand	2016-11-11	1	-0/+84
\| \| \| \| \| \| \| \|	This adds support for the compare logical and trap (memory) instructions that were added as part of the miscellaneous instruction extensions feature with zEC12. llvm-svn: 286587