256 + 2 = 2 # this is correct whether we use the larger or smaller width
# aka hw can optimize narrowing addition
-
# Notes about Swizzle
Basically, there isn't enough room to try to fit two src src1/2 swizzle, and SV, even into 64 bit (actually 24) without severely compromising on the number of bits allocated to either swizzle, or SV, or both.
# note about INT predicate
-001 ALWAYS (implicit) Operation is not masked
+ 001 ALWAYS (implicit) Operation is not masked
this means by default that 001 will always be in nonpredicated ops, which seems anomalous. would 000 be better to indicate "no predication"?
[[sv/branches]] is so heavily interdependent in CTR-test and VLSet
Modes, and only having a single source (BI) that it is simply strongly
-recommended not to interfere with its behaviour, at all.
+recommended not to interfere with its behaviour, at all. additionally
+the unaltered behaviour is needed to substitute for the loss of
+all-ones predicate mask behaviour on SV-scalar-regs
## answers to 4, loops/uses
A major motivation for changing SVP64 with all isvec=0 to temporarily
override VL to 1 is to allow supporting traditional SIMD that has
constantly varying element sizes (and therefore vector lengths too)
-without needing setvl every few instructions.
+without needing setvl every few instructions, by using SUBVL and
+elwidth overrides.
Examples of use cases: