For `LD-VSPLAT`, on non-cache-inhibited Loads, the read can occur
just the once and be copied, rather than hitting the Data Cache
multiple times with the same memory read at the same location.
-This would allow for memory-mapped peripherals to have multiple
+The benefit of Cache-inhibited LD-splats is that it allows
+for memory-mapped peripherals to have multiple
data values read in quick succession and stored in sequentially
numbered registers.
masks will skip some elements (in source non-zeroing mode).
Cache-inhibited ST operations on the other hand **MUST** write out
a Vector source multiple successive times to the exact same Scalar
-destination.
+destination. Just like Cache-inhibited LDs, multiple values may be
+written out in quick succession to a memory-mapped peripheral from
+sequentially-numbered registers.
Note that there are no immediate versions of cache-inhibited LD/ST.