radeonsi/gfx9: set EXEC for non-mono merged shaders, add a barrier between them