radeonsi: enable CU0 in each SE for LS-HS execution