i++;
j++;
-Note above that the source elwidth is *not used at all* in LD-immediate
+Note above that the source elwidth is *not used at all* in LD-immediate.
+*(For Pack/Unpack Mode, which shares the same source elwidth bits, this
+is no great loss)*.
For LD/Indexed, the key is that in the calculation of the Effective Address,
RA has no elwidth override but RB does. Pseudocode below is simplified