Normally the progression of the SV for-loop would move on to the
next register. Instead however in the scalar case these instructions
**remain in the same register** and insert or transfer between **bits**
-of the scalar integer source or destination.
+of the scalar integer source or destination. The reason is that when
+using CR Fields as predicate masks and there is a need to transfer
+into a GPR, again for use as a predicate mask, the CR Field bits
+need to be efficiently packed into that one GPR (r3, r10 or r31).
Further useful violation of the normal SV Elwidth override rules allows
for packing (or unpacking) of multiple CR test results into (or out of)