gallium: call tgsi_set_exec_mask() and use exec mask in SSE ARL code