llvmpipe: Optimize lp_rast_triangle_32_3_16 for POWER8
authorOded Gabbay <oded.gabbay@gmail.com>
Tue, 29 Dec 2015 16:12:35 +0000 (18:12 +0200)
committerOded Gabbay <oded.gabbay@gmail.com>
Wed, 6 Jan 2016 12:54:16 +0000 (14:54 +0200)
commit9d59b9d00cdb1e0e8bd139fba5250df869727386
tree984de785fb66ed240bc8e97a8045ce7e50762148
parent925c46cfc48042ec0bc5a83df962d2d7dd038394
llvmpipe: Optimize lp_rast_triangle_32_3_16 for POWER8

This patch converts the SSE-optimized lp_rast_triangle_32_3_16()
to VMX/VSX.

I measured the results on POWER8 machine with 32 cores at 3.4GHz and
16GB of RAM.

                      FPS/Score
 Name            Before     After    Delta
------------------------------------------------
openarena        16.35      16.7     2.14%
xonotic          4.707      4.97     5.57%

glmark2 didn't show a significant (more than 1%) difference.

v2: Make sure code is build only on POWER8 LE machine

Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Roland Scheidegger <sroland@vmware.com>
src/gallium/drivers/llvmpipe/lp_rast_tri.c