[X86] Enable -x86-experimental-vector-widening-legalization by default.

This patch changes our defualt legalization behavior for 16, 32, and 64 bit vectors with i8/i16/i32/i64 scalar types from promotion to widening. For example, v8i8 will now be widened to v16i8 instead of promoted to v8i16. This keeps the elements widths the same and pads with undef elements. We believe this is a better legalization strategy. But it carries some issues due to the fragmented vector ISA. For example, i8 shifts and multiplies get widened and then later have to be promoted/split into vXi16 vectors. This has the potential to cause regressions so we wanted to get it in early in the 10.0 cycle so we have plenty of time to address them. Next steps will be to merge tests that explicitly test the command line option. And then we can remove the option and its associated code. llvm-svn: 367901
author: Craig Topper <craig.topper@intel.com> 2019-08-05 18:25:36 +0000
committer: Craig Topper <craig.topper@intel.com> 2019-08-05 18:25:36 +0000
commit: 3de33245d2c992c9e0af60372043540b60f3a810 (patch)
tree: afdcbe5ccb52da1ddb8e42c619cd1f7dc586b78d /llvm/test/CodeGen/X86/extract-concat.ll
parent: 9bf59384c640959421b3e8005dec31f78d472750 (diff)
download: bcm5719-llvm-3de33245d2c992c9e0af60372043540b60f3a810.tar.gz
bcm5719-llvm-3de33245d2c992c9e0af60372043540b60f3a810.zip
1 files changed, 7 insertions, 2 deletions
diff --git a/llvm/test/CodeGen/X86/extract-concat.ll b/llvm/test/CodeGen/X86/extract-concat.ll
index 029c69a34cb..95e592a6d46 100644
--- a/llvm/test/CodeGen/X86/extract-concat.ll
+++ b/llvm/test/CodeGen/X86/extract-concat.ll
@@ -5,9 +5,14 @@ define void @foo(<4 x float> %in, <4 x i8>* %out) {
 ; CHECK-LABEL: foo:
 ; CHECK:       # %bb.0:
 ; CHECK-NEXT:    cvttps2dq %xmm0, %xmm0
+; CHECK-NEXT:    pextrb $8, %xmm0, %eax
+; CHECK-NEXT:    pextrb $4, %xmm0, %ecx
+; CHECK-NEXT:    pextrb $0, %xmm0, %edx
+; CHECK-NEXT:    movd %edx, %xmm0
+; CHECK-NEXT:    pinsrb $1, %ecx, %xmm0
+; CHECK-NEXT:    pinsrb $2, %eax, %xmm0
 ; CHECK-NEXT:    movl $255, %eax
-; CHECK-NEXT:    pinsrd $3, %eax, %xmm0
-; CHECK-NEXT:    pshufb {{.*#+}} xmm0 = xmm0[0,4,8,12,u,u,u,u,u,u,u,u,u,u,u,u]
+; CHECK-NEXT:    pinsrb $3, %eax, %xmm0
 ; CHECK-NEXT:    movd %xmm0, (%rdi)
 ; CHECK-NEXT:    retq
   %t0 = fptosi <4 x float> %in to <4 x i32>
author	Craig Topper <craig.topper@intel.com>	2019-08-05 18:25:36 +0000
committer	Craig Topper <craig.topper@intel.com>	2019-08-05 18:25:36 +0000
commit	3de33245d2c992c9e0af60372043540b60f3a810 (patch)
tree	afdcbe5ccb52da1ddb8e42c619cd1f7dc586b78d /llvm/test/CodeGen/X86/extract-concat.ll
parent	9bf59384c640959421b3e8005dec31f78d472750 (diff)
download	bcm5719-llvm-3de33245d2c992c9e0af60372043540b60f3a810.tar.gz bcm5719-llvm-3de33245d2c992c9e0af60372043540b60f3a810.zip