aco: fix consecutively written vgprs from vmem instructions
authorRhys Perry <pendingchaos02@gmail.com>
Thu, 7 May 2020 13:27:42 +0000 (14:27 +0100)
committerMarge Bot <eric+marge@anholt.net>
Wed, 13 May 2020 12:26:42 +0000 (12:26 +0000)
commitc1c0cf7a66905e8d7ad506842a41b0ad0c5b10da
treec4006e8b60d44046a091b1a16e289945a5116bfd
parent0c7bed72f7948d51a2109f181e7a2d3c77dbd19e
aco: fix consecutively written vgprs from vmem instructions

If one VMEM instruction uses a sampler and the other doesn't, we can't do
this optimization.

Totals from 47 (0.04% of 127638) affected shaders:
CodeSize: 271744 -> 271656 (-0.03%); split: -0.04%, +0.01%
Instrs: 52783 -> 52761 (-0.04%); split: -0.05%, +0.01%
Cycles: 5547040 -> 5546952 (-0.00%); split: -0.00%, +0.00%
VMEM: 10022 -> 9887 (-1.35%)

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4949>
src/amd/compiler/aco_insert_waitcnt.cpp