The resource savings from Vector LD/ST are significant and stem from
the fact that one single instruction can trigger a dozen (or in some
-microarchitectures such as Cray or NEC SX Aurora) hundreds of element
-level Memory accesses.
+microarchitectures such as Cray or NEC SX Aurora) hundreds of element-level Memory accesses.
Additionally, and simply: if the Arithmetic side of an ISA supports
Vector Operations, then in order to keep the ALUs 100% occupied the