i965/fs: Implement HSW BFI exec size workarounds in the SIMD lowering pass.