scalar. If VL=1 behaviour is also activated then this is potentially
interfered with, except that, again, RT may be set as a vector destination.
-
```
if svctx.ldstmode == elementstride:
EA = ireg[RA] + ireg[RB]*j # register-strided
```
+Vector destination is again "VLSPLAT" mode, but if a Scalar
+destination was set with VL>1, then just as with LD-immediate
+it is the entire predicate mask which must be zero to stop
+the scalar element from being loaded, and the same effect may
+be achieved with VL=1 by ORing all predicate mask bits down to
+a single bit as a new predicate.
+
## answers to 4, loops/uses
### REMAP
operation effectively tests **ALL** relevant bits 0..VL-1 as nonzero in the
decision-making, whereas VL=1 will only test the first.
-a need for merging all bits into a single alternative predicate mask
+a need for merging (ORing) all bits into a single alternative predicate mask
(single-bit) is the sort of thing we can probably live with.
### fast traditional packed SIMD