radeonsi/ngg: add VGT_FLUSH when enabling fast launch