the immediate, D, is zero. Modifications to support this
"cheat" on top of pre-existing Scalar HDL (and Simulators)
have both turned out to be minimal.[^mul]
-
Also added was the option to perform signed or unsigned Effective
Address calculation, which comes into play only on LD/ST Indexed,
when elwidth overrides are used. Another quirk: `RA` is never
*All of the above modes are covered by Twin-Predication*
In particular, a special predicate mode `1<<r3` uses the register `r3`
-*binary* value effectively as a single (Scalar) Index offset into
-what would otherwise be a Vector operation. Combined with the
+*binary* value, converted to single-bit unary mask,
+effectively as a single (Scalar) Index *runtime*-dynamic offset into
+a Vector.[^r3] Combined with the
(mis-named) "mapreduce" mode when used as a source predicate
-a `VSPLAT` is performed. When used as a destination predicate `1<<r3`
+a `VSPLAT` (broadcast) is performed. When used as a destination
+predicate `1<<r3`
provides `VINSERT` behaviour.
+[^r3]: Effectively: `GPR(RA+r3)`
+
Also worth an explicit mention is that Twin Predication when using
different source from destination predicate masks effectively combines
back-to-back `VCOMPRESS` and `VEXPAND` (in a single instruction), and,