[AVX] Lower / fast-isel scalar FP selects into VBLENDV instructions (PR22483) - bcm5719-llvm

diff options

author	Sanjay Patel <spatel@rotateright.com>	2015-03-05 21:46:54 +0000
committer	Sanjay Patel <spatel@rotateright.com>	2015-03-05 21:46:54 +0000
commit	302404b2772a6d8cb290958784fb4d1414f5dd35 (patch)
tree	c8a1ef6ca4d72ba87dbc3563d60b56e2de2893d1 /clang/test/CodeGenCXX/catch-undef-behavior.cpp
parent	6caca38f684299d11f80ec4b3c5dfee5cc8f5785 (diff)
download	bcm5719-llvm-302404b2772a6d8cb290958784fb4d1414f5dd35.tar.gz bcm5719-llvm-302404b2772a6d8cb290958784fb4d1414f5dd35.zip

[AVX] Lower / fast-isel scalar FP selects into VBLENDV instructions (PR22483)

This patch reduces code size for all AVX targets and increases speed for some chips. SSE 4.1 introduced the useless (see code comments) 2-register form of BLENDV and only in the packed float/double flavors. AVX subsequently made the instruction useful by adding a 4-register operand form. So we just need to paper over the lack of scalar forms of this instruction, complicate the code to choose float or double forms, and use blendv on scalars since all FP is in xmm registers anyway. This gives us an approximately 50% speed up for a blendv microbenchmark sequence on SandyBridge and Haswell: blendv : 29.73 cycles/iter logic : 43.15 cycles/iter No new test cases with this patch because: 1. fast-isel-select-sse.ll tests the positive side for regular X86 lowering and fast-isel 2. sse-minmax.ll and fp-select-cmp-and.ll confirm that we're not firing for scalar selects without AVX 3. fp-select-cmp-and.ll and logical-load-fold.ll confirm that we're not firing for scalar selects with constants. http://llvm.org/bugs/show_bug.cgi?id=22483 Differential Revision: http://reviews.llvm.org/D8063 llvm-svn: 231408

Diffstat (limited to 'clang/test/CodeGenCXX/catch-undef-behavior.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: