nv50/ir: optimize multiplication by 16-bit immediates into two xmads
authorRhys Perry <pendingchaos02@gmail.com>
Sat, 18 Aug 2018 14:06:50 +0000 (15:06 +0100)
committerRhys Perry <pendingchaos02@gmail.com>
Mon, 27 Aug 2018 12:57:11 +0000 (13:57 +0100)
commitd27c7918916cdc8092959124955f887592e37d72
tree56bba3b7d22cc57c741025917dceb89a63e5e6d1
parent400a4eb964bc43c339ce213ee40b41e11c8d44dc
nv50/ir: optimize multiplication by 16-bit immediates into two xmads

Rather than the usual three that would be created.

total instructions in shared programs : 5796385 -> 5786560 (-0.17%)
total gprs used in shared programs    : 670103 -> 669968 (-0.02%)
total shared used in shared programs  : 548832 -> 548832 (0.00%)
total local used in shared programs   : 21164 -> 21068 (-0.45%)

                local     shared        gpr       inst      bytes
    helped           1           0          64        1040        1040
      hurt           0           0          27           0           0

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Karol Herbst <kherbst@redhat.com>
src/gallium/drivers/nouveau/codegen/nv50_ir_peephole.cpp