radeonsi: split input upload off from si_launch_grid