aco: implement 8-bit/16-bit nir_intrinsic_read_first_invocation