radeonsi: optimize out the loop in si_get_ps_input_cntl