| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
| |
These were being over cautious for costs for one/two op general shuffles - VSHUFPD doesn't have to replicate the same shuffle in both lanes like VSHUFPS does.
llvm-svn: 335216
|
|
|
|
| |
llvm-svn: 330433
|
|
|
|
|
|
| |
We're mostly testing with generic isa attributes, but PR36550 will require testing of specific target's scheduler models as well.
llvm-svn: 330056
|
|
|
|
|
|
| |
update_analyze_test_checks.py
llvm-svn: 329410
|
|
|
|
|
|
| |
VPPERM/VPERMIL2PD/VPERMIL2PS all provide more effective 2-input shuffles than regular AVX instructions
llvm-svn: 311005
|
|
|
|
| |
llvm-svn: 310654
|
|
|
|
| |
llvm-svn: 310650
|
|
|
|
| |
llvm-svn: 310645
|
|
|
|
|
|
| |
Cover most 128/256/512/1024-bit cases for vXf64/vXi64, vXf32/vXi32, vXi16 + vXi8
llvm-svn: 310641
|
|
X86 target does not provide any target specific cost calculation for interleave patterns.It uses the common target-independent calculation, which gives very high numbers. As a result, the scalar version is chosen in many cases. The situation on AVX-512 is even worse, since we have 3-src shuffles that significantly reduce the cost.
In this patch I calculate the cost on AVX-512. It will allow to compare interleave pattern with gather/scatter and choose a better solution (PR31426).
* Shiffle-broadcast cost will be changed in Simon's upcoming patch.
Differential Revision: https://reviews.llvm.org/D28118
llvm-svn: 290810
|