r300/compiler: fix buffer underflow when setting SEM_WAIT on last instruction