aco: implement 16-bit nir_intrinsic_quad_* on GFX6-GFX7