The benefit of Cache-inhibited LD-splats is that it allows
for memory-mapped peripherals to have multiple
data values read in quick succession and stored in sequentially
-numbered registers.
+numbered registers (but, see Note below).
For non-cache-inhibited ST from a vector source onto a scalar
destination: with the Vector
written out in quick succession to a memory-mapped peripheral from
sequentially-numbered registers.
-Note that there are no immediate versions of cache-inhibited LD/ST.
+Note that there are no immediate versions of cache-inhibited LD/ST
+(no *Scalar* cache-inhibited immediate instructions to Vectorise)
**LD/ST Indexed**
If a genuine cache-inhibited LD-VSPLAT is required then a *scalar*
cache-inhibited LD should be performed, followed by a VSPLAT-augmented mv.
+Note also that cache-inhibited VSPLAT with Predicate-result is possible.
+This allows for example to issue a massive batch of memory-mapped
+peripheral reads, stopping at the first NULL-terminated character and
+truncating VL to that point. No branch is needed to issue that large burst
+of LDs.
+
# Vectorisation of Scalar Power ISA v3.0B
OpenPOWER Load/Store operations may be seen from [[isa/fixedload]] and