* Writing an awful lot of Specification Documentation
(4 years and counting)
-Once the basics of this concept have sunk in, advancements quickly
-follow:
+Once the basics of this concept have sunk in, early
+advancements quickly follow naturally from analysis
+of the problem-space:
-* Predication (an absolutely critical component for a Vector ISA).
+* Predication (an absolutely critical component for a Vector ISA),
+ then the next logical advancement is to allow separate predication masks
+ to be applied to *both* the source *and* the destination, independently.
* Element-width overrides: most Scalar ISAs today are 64-bit only,
with primarily Load and Store being able to handle 8/16/32/64
and sometimes 128-bit (quad-word), where Vector ISAs need to
* "Reordering" of the assumption of linear sequential element
access, for Matrices, rotations, transposition, Convolutions,
DCT, FFT, Parallel Prefix-Sum and other common transformations
- that require significant effort in other ISAs.
+ that require significant programming effort in other ISAs.