it is critical to think in terms of the "rules", that everything is
Scalar instructions in strict Program Order.
-Branch is the one and only place where the Scalar
+Branch is the one and only place where the Scalar
(non-prefixed) operations differ from the Vector (element)
instructions, as explained in a separate section.
The
which are expected of a Vector / GPU ISA. These save a considerable
number of instructions in tight inner loop situations.
+Condition Register Fields are 4-bit wide and consequently element-width
+overrides make absolutely no sense whatsoever. Therefore the elwidth
+override field bits can be used for other purposes when Vectorising
+CR Field instructions. Moreover, Rc=1 is completely invalid for
+CR operations such as `crand`: Rc=1 is for arithmetic operations, producing
+a "co-result" that goes into CR0 or CR1.
+
# CR weird instructions
[[sv/int_cr_predication]] is by far the biggest violator of the SVP64