intel/fs: Do 8-bit subgroup scan operations in 16 bits