gallivm: faster iround implementation for sse2