sequentially-numbered registers.
Note that there are no immediate versions of cache-inhibited LD/ST
-(no *Scalar* cache-inhibited immediate instructions to Vectorise)
+(no *Scalar* cache-inhibited immediate instructions to Vectorise).
+A future version of the Power ISA *may* have such Scalar instructions.
**LD/ST Indexed**
Note that cache-inhibited LD/ST (`ldcix`) when VSPLAT is activated will perform **multiple** LD/ST operations, sequentially. `ldcix` even with scalar src will read the same memory location *multiple times*, storing the result in successive Vector destination registers. This because the cache-inhibit instructions are used to read and write memory-mapped peripherals.
If a genuine cache-inhibited LD-VSPLAT is required then a *scalar*
-cache-inhibited LD should be performed, followed by a VSPLAT-augmented mv.
+cache-inhibited LD should be performed, followed by a VSPLAT-augmented mv,
+copying the one *scalar* value into multiple register destinations.
Note also that cache-inhibited VSPLAT with Predicate-result is possible.
This allows for example to issue a massive batch of memory-mapped
truncating VL to that point. No branch is needed to issue that large burst
of LDs.
+The multiple reads/writes to/from the same destination address is,
+in Vector-Indexed LD/ST, very similar to the relaxed constraints of
+mapreduce mode,
+
# Vectorisation of Scalar Power ISA v3.0B
OpenPOWER Load/Store operations may be seen from [[isa/fixedload]] and