bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	[MCA] Show aggregate over Average Wait times for the whole snippet (PR43219)	Roman Lebedev	2019-10-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: As disscused in https://bugs.llvm.org/show_bug.cgi?id=43219, i believe it may be somewhat useful to show //some// aggregates over all the sea of statistics provided. Example: ``` Average Wait times (based on the timeline view): [0]: Executions [1]: Average time spent waiting in a scheduler's queue [2]: Average time spent waiting in a scheduler's queue while ready [3]: Average time elapsed from WB until retire stage [0] [1] [2] [3] 0. 3 1.0 1.0 4.7 vmulps %xmm0, %xmm1, %xmm2 1. 3 2.7 0.0 2.3 vhaddps %xmm2, %xmm2, %xmm3 2. 3 6.0 0.0 0.0 vhaddps %xmm3, %xmm3, %xmm4 3 3.2 0.3 2.3 <total> ``` I.e. we average the averages. Reviewers: andreadb, mattd, RKSimon Reviewed By: andreadb Subscribers: gbedwell, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68714 llvm-svn: 374361
*	[MCA][X86] Add tests for LOCK variants of standard X86 arithmetic ops	Simon Pilgrim	2019-08-20	1	-1/+382
\| \| \| \| \| \|	D66424 adds the base support for LOCK so we should be able to add special case support for all these cases in future patches llvm-svn: 369367
*	[X86] Move scheduling tests for CMPXCHG to the corresponding ↵	Andrea Di Biagio	2019-08-19	2	-41/+14
\| \| \| \| \| \| \| \| \| \|	resources-x86_64.s files. NFC In D66424 it has been requested to move all the new tests added by r369278 into resources-x86_64.s. That is because only the 8b/16 ops should be tested by resources-cmpxchg.s. This partially reverts r369278. llvm-svn: 369288
*	[X86] Added extensive scheduling model tests for all the CMPXCHG variants. NFC	Andrea Di Biagio	2019-08-19	1	-1/+46
\| \| \| \| \| \|	Addresses a review comment in D66424 llvm-svn: 369279
*	[X86] Add missing properties on llvm.x86.sse.{st,ld}mxcsr	Clement Courbet	2019-06-19	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: llvm.x86.sse.stmxcsr only writes to memory. llvm.x86.sse.ldmxcsr only reads from memory, and might generate an FPE. Reviewers: craig.topper, RKSimon Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62896 llvm-svn: 363773
*	[X86] Add zero idioms to the haswell, broadwell, and skylake schedule ↵	Craig Topper	2019-05-25	1	-509/+509
\| \| \| \| \| \| \| \| \| \|	models. Add 256-bit fp xor to sandybridge zero idioms This copies the Sandy Bridge zero idiom support to later CPUs. Adding the AVX2 and AVX512F/VL instructions as appropriate. Differential Revision: https://reviews.llvm.org/D62360 llvm-svn: 361690
*	[X86][llvm-mca] Add zero idiom tests for Intel CPUs. NFC	Craig Topper	2019-05-25	1	-0/+778
\| \| \| \| \| \|	This pre-commits tests for D62360 llvm-svn: 361689
*	[X86] Remove the suffix on vcvt[u]si2ss/sd register variants in assembly ↵	Craig Topper	2019-05-06	3	-16/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	printing. We require d/q suffixes on the memory form of these instructions to disambiguate the memory size. We don't require it on the register forms, but need to support parsing both with and without it. Previously we always printed the d/q suffix on the register forms, but it's redundant and inconsistent with gcc and objdump. After this patch we should support the d/q for parsing, but not print it when its unneeded. llvm-svn: 360085
*	[llvm-mca][x86] Fix MMX PMOVMSKB test	Simon Pilgrim	2019-04-29	1	-3/+3
\| \| \| \| \| \|	This is defined as part of SSE1, XMM PMOVMSKB doesn't appear until SSE2 llvm-svn: 359477
*	[MCA] Fix typo in AVX2 gather tests. NFC	Andrea Di Biagio	2019-04-28	1	-3/+3
\| \| \| \|	llvm-svn: 359397
*	[X86] Remove the _alt forms of (V)CMP instructions. Use a combination of ↵	Craig Topper	2019-03-18	3	-40/+40
\| \| \| \| \| \| \| \| \| \|	custom printing and custom parsing to achieve the same result and more Similar to previous change done for VPCOM and VPCMP Differential Revision: https://reviews.llvm.org/D59468 llvm-svn: 356384
*	[X86] Correct scheduler information for rotate by constant for Haswell, ↵	Craig Topper	2019-03-07	1	-17/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Broadwell, and Skylake. Rotate with explicit immediate is a single uop from Haswell on. An immediate of 1 has a dependency on the previous writer of flags, but the other immediate values do not. The implicit rotate by 1 instruction is 2 uops. But the flags are merged after the rotate uop so the data result does not see the flag dependency. But I don't think we have any way of modeling that. RORX is 1 uop without the load. 2 uops with the load. We currently model these with WriteShift/WriteShiftLd. Differential Revision: https://reviews.llvm.org/D59077 llvm-svn: 355636
*	[llvm-mca][X86] Add ADC/SBB with zero test cases	Simon Pilgrim	2019-03-06	1	-1/+73
\| \| \| \| \| \|	Some targets have fast-path handling for these patterns that we should model. llvm-svn: 355498
*	[X86] Correct some ADC/SBB with immediate scheduler data for Broadwell and ↵	Craig Topper	2019-02-24	1	-17/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Skylake. Summary: The AX/EAX/RAX with immediate forms are 2 uops just like the AL with immediate. The modrm form with r8 and immediate is a single uop just like r16/r32/r64 with immediate. Reviewers: RKSimon, andreadb Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58581 llvm-svn: 354754
*	[X86] Print all register forms of x87 fadd/fsub/fdiv/fmul as having two ↵	Craig Topper	2019-02-04	1	-44/+44
\| \| \| \| \| \| \| \| \| \|	arguments where on is %st. All of these instructions consume one encoded register and the other register is %st. They either write the result to %st or the encoded register. Previously we printed both arguments when the encoded register was written. And we printed one argument when the result was written to %st. For the stack popping forms the encoded register is always the destination and we didn't print both operands. This was inconsistent with gcc and objdump and just makes the output assembly code harder to read. This patch changes things to always print both operands making us consistent with gcc and objdump. The parser should still be able to handle the single register forms just as it did before. This also matches the GNU assembler behavior. llvm-svn: 353061
*	[X86] Print %st(0) as %st when its implicit to the instruction. Continue ↵	Craig Topper	2019-02-04	1	-42/+42
\| \| \| \| \| \| \| \|	printing it as %st(0) when its encoded in the instruction. This is a step back from the change I made in r352985. This appears to be more consistent with gcc and objdump behavior. llvm-svn: 353015
*	Revert r352985 "[X86] Print %st(0) as %st to match what gcc inline asm uses ↵	Craig Topper	2019-02-04	1	-54/+54
\| \| \| \| \| \| \| \| \| \|	as the clobber name to make MS inline asm work correctly" Looking into gcc and objdump behavior more this was overly aggressive. If the register is encoded in the instruction we should print %st(0), if its implicit we should print %st. I'll be making a more directed change in a future patch. llvm-svn: 353013
*	[X86] Print %st(0) as %st to match what gcc inline asm uses as the clobber ↵	Craig Topper	2019-02-03	1	-54/+54
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	name to make MS inline asm work correctly Summary: When calculating clobbers for MS style inline assembly we fail if the asm clobbers stack top because we print st(0) and try to pass it through the gcc register name check. This was found with when I attempted to make a emms/femms clobber all ST registers. If you use emms/femms in MS inline asm we would try to use st(0) as the clobber name but clang would think that wasn't a valid clobber name. This also matches what objdump disassembly prints. It's also what is printed by gcc -S. Reviewers: RKSimon, rnk, efriedma, spatel, andreadb, lebedev.ri Reviewed By: rnk Subscribers: eraman, gbedwell, lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D57621 llvm-svn: 352985
*	[llvm-mca][X86] Add some missing DQI tests	Simon Pilgrim	2019-01-26	4	-2/+1979
\| \| \| \| \| \|	Match more of the coverage of test\CodeGen\X86\avx512-schedule.ll as discussed on D57244 llvm-svn: 352273
*	[llvm-mca][X86] Add missing shuffle tests	Simon Pilgrim	2019-01-25	4	-4/+1962
\| \| \| \| \| \|	Match the coverage of test\CodeGen\X86\avx512-shuffle-schedule.ll so we can get rid of -print-schedule (and fix PR37160) without losing schedule tests llvm-svn: 352179
*	[llvm-mca][X86] Tidyup avx512 placeholder tests	Simon Pilgrim	2019-01-22	4	-78/+676
\| \| \| \| \| \|	Ensure we keep avx512f/bw/dq + vl versions separate, add example broadcast tests - this should allow us to better the test coverage of test\CodeGen\X86\avx512-schedule.ll llvm-svn: 351848
*	[llvm-mca][X86] Add missing CLWB/CLZERO/FSGSBASE/LWP/MWAITX/RDPID/SHA tests	Simon Pilgrim	2019-01-22	2	-0/+94
\| \| \| \| \| \|	We're getting pretty close to matching/exceeding test coverage of the test\CodeGen\X86\*-schedule.ll files, which should allow us to get rid of -print-schedule and fix PR37160 llvm-svn: 351836
*	[llvm-mca][X86] Add missing enter/leave, invlpg/invlpga, rdmsr/wrmsr, rdpmc ↵	Simon Pilgrim	2019-01-22	1	-1/+33
\| \| \| \| \| \|	and rdtsc/rdtscp tests llvm-svn: 351835
*	[llvm-mca][X86] Add missing mfence/pinsrw tests	Simon Pilgrim	2019-01-22	1	-1/+12
\| \| \| \|	llvm-svn: 351831
*	[llvm-mca][X86] Add missing monitor/mwait tests	Simon Pilgrim	2019-01-22	1	-1/+9
\| \| \| \| \| \|	These technically should be under a MONITOR cpuid bit, but we tag them as SSE3 so I've done that here as well. llvm-svn: 351829
*	[llvm-mca][X86] Add missing vperm2i128 tests	Simon Pilgrim	2019-01-22	1	-1/+8
\| \| \| \|	llvm-svn: 351828
*	[llvm-mca][X86] Add missing tzcntw tests	Simon Pilgrim	2019-01-22	1	-1/+8
\| \| \| \|	llvm-svn: 351827
*	[llvm-mca][x86] Add missing AES instruction resource tests	Simon Pilgrim	2018-12-07	1	-0/+73
\| \| \| \| \| \|	Add missing non-VEX instructions llvm-svn: 348623
*	[llvm-mca][x86] Add RDRAND/RDSEED instruction resource tests	Simon Pilgrim	2018-12-07	2	-0/+82
\| \| \| \|	llvm-svn: 348622
*	[X86] Fix VZEROUPPER scheduling info on SNB,HSW,BDW,SXL,SKX.	Clement Courbet	2018-11-09	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Starting from SNB, VZEROUPPER is handled by the renamer and uses no proc resources. After HSW, it also has zero latency. This fixes PR35606. To reproduce: Uops: llvm-exegesis -mode=uops -opcode-name=VZEROUPPER Latency: echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper\naddps %xmm0, %xmm1' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D54107 llvm-svn: 346482
*	[X86][Sched] Update scheduling information for VZEROALL on HWS, BDW, SKX, SNB.	Clement Courbet	2018-10-01	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: While looking at PR35606, I found out that the scheduling info is incorrect. One can check that it's really a P5+P6 and not a 2*P56 with: echo -e 'vzeroall\nvandps %xmm1, %xmm2, %xmm3' \| ./bin/llvm-exegesis -mode=uops -snippets-file=- (vandps executes on P5 only) Reviewers: craig.topper, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52541 llvm-svn: 343447
*	[LLVM-MCA][X86] Add missing VCMPESTR/VCMPESTR tests	Simon Pilgrim	2018-09-30	1	-1/+29
\| \| \| \|	llvm-svn: 343421
*	[LLVM-MCA][X86] Add some AVX512 tests	Simon Pilgrim	2018-09-30	2	-0/+708
\| \| \| \| \| \|	These are going to be necessary to check I don't mess up when I start cleaning up all the remaining vector integer overrides llvm-svn: 343414
*	[X86] Split WriteIMul into 8/16/32/64 implementations (PR36931)	Simon Pilgrim	2018-09-24	1	-4/+4
\| \| \| \| \| \| \| \|	Split WriteIMul by size and also by IMUL multiply-by-imm and multiply-by-reg cases. This removes all the scheduler overrides for gpr multiplies and stops WriteMULH being ignored for BMI2 MULX instructions. llvm-svn: 342892
*	[X86] RORmCL instruction models should match ROLmCL etc.	Simon Pilgrim	2018-09-23	1	-9/+9
\| \| \| \| \| \| \| \|	Confirmed with Craig Topper - fix a typo that was missing a Port4 uop for ROR*mCL instructions on some Intel models. Yet another step on the scheduler model cleanup marathon...... llvm-svn: 342846
*	[X86] MCA tests for XCHG, XADD and CMPXCHG* instructions	Andrew V. Tischenko	2018-08-07	1	-1/+94
\| \| \| \| \| \|	Differential Revision: https://reviews.llvm.org/D49912 llvm-svn: 339145
*	[llvm-mca][x86] Add CMPXCHG instruction resource tests	Simon Pilgrim	2018-08-01	1	-0/+38
\| \| \| \| \| \|	I've put CMPXCHG8B/CMPXCHG16B in the same file, even though technically they are under separate CPUID bits all targets seem to support both (or neither). llvm-svn: 338595
*	[llvm-mca][x86] Add PREFETCHW instruction resource tests	Simon Pilgrim	2018-08-01	1	-0/+38
\| \| \| \| \| \|	These aren't just available via 3DNow! so test for them separately as well. llvm-svn: 338584
*	[llvm-mca][x86] Add PCLMUL instruction resource tests	Simon Pilgrim	2018-08-01	1	-0/+38
\| \| \| \| \| \|	Renamed the btver2 file that already contained them - the other targets were only testing the AVX versions llvm-svn: 338583
*	[llvm-mca][x86] Add SET/TEST instruction resource tests	Simon Pilgrim	2018-08-01	1	-1/+180
\| \| \| \|	llvm-svn: 338576
*	[llvm-mca][x86] Add LEA instruction resource tests	Simon Pilgrim	2018-08-01	1	-0/+439
\| \| \| \| \| \|	We already added these to btver2, now add them to other targets, even though none of their models treat them specially (yet). llvm-svn: 338565
*	[llvm-mca][x86] Add more x86-64 system instruction resource tests	Simon Pilgrim	2018-08-01	1	-1/+92
\| \| \| \| \| \|	CPUID, IN/OUT, INS/OUTS, INT, PAUSE, SCAS, UD2, XLAT llvm-svn: 338563
*	[llvm-mca][x86] Add CLFLUSHOPT instruction resource tests	Simon Pilgrim	2018-08-01	1	-0/+35
\| \| \| \|	llvm-svn: 338550
*	[llvm-mca][x86] Add CMPS/LODS/MOVS/STOS string instruction resource tests	Simon Pilgrim	2018-08-01	1	-1/+53
\| \| \| \|	llvm-svn: 338532
*	[llvm-mca][x86] Add STC + STD instruction resource tests	Simon Pilgrim	2018-08-01	1	-1/+8
\| \| \| \|	llvm-svn: 338514
*	[llvm-mca][x86] Add 32-bit instruction resource tests	Simon Pilgrim	2018-07-31	1	-0/+80
\| \| \| \| \| \|	These aren't exhaustive, but cover some instructions that are only available in 32-bit mode (where would we be without good BCD math performance?). llvm-svn: 338404
*	[llvm-mca][x86] Add movsx/movzx instructions to general x86_64 resource tests	Simon Pilgrim	2018-07-20	1	-1/+70
\| \| \| \|	llvm-svn: 337586
*	[llvm-mca][x86] Add extend, carry-flag and CMP instructions to general ↵	Simon Pilgrim	2018-07-17	1	-1/+120
\| \| \| \| \| \|	x86_64 resource tests llvm-svn: 337306
*	[llvm-mca][x86] Add MOVBE resource tests to all supporting targets	Simon Pilgrim	2018-07-17	1	-0/+52
\| \| \| \| \| \|	SNB doesn't support MOVBE but the numbers in Generic (which use the SNB model) look sane. llvm-svn: 337305
*	[llvm-mca][x86] Add BSWAP resource tests	Simon Pilgrim	2018-07-17	1	-1/+8
\| \| \| \|	llvm-svn: 337302