It does have to be pointed out that there is huge pressure on the
Mode bits. There was therefore insufficient room, unlike the way that
EXT001 was designed, to provide "identifying bits" *without first partially
-decoding the Suffix*. This should in no way be conflated with the
-complexity of a *full* Suffix Decode.
+decoding the Suffix*.
Some considerable care has been taken to ensure that Decoding may be
performed in a strict forward-pipelined fashion that, aside from changes in
tests succeeding (or failing) or whether 'SOME' tests succeed (or fail).
These options provide the ability to cover the majority of Parallel
3D GPU Conditions, saving a not inconsiderable number of instructions
-especially given the close interaction with CTR in hot-loops.
+especially given the close interaction with CTR in hot-loops.[^parity]
+
+[^parity]: adding a parity (XOR) option was too much. instead a parallel-reduction on `crxor` may be used in combination with a Scalar Branch.
Also `SVLR` is introduced, which is a parallel twin of `LR`, and saving
and restoring of LR and SVLR may be deferred until the final decision