first element with a predicate bit set.
In either case the result is in the element with the first bit set in
-the predicate mask.
+the predicate mask. Thus, no move/copy *within the Reduction itself* was needed.
Programmer's Note: For *some* hardware implementations
the vector-to-scalar copy may be a slow operation, as may the Predicated
of the values, compressing them (VREDUCE-style) into a contiguous block,
which will guarantee that the result goes into the very first element
of the destination vector, in which case clearly no follow-up
-vector-to-scalar MV operation is needed.
+predicated vector-to-scalar MV operation is needed.
**Usage conditions**
from occurring or other workaround deployed. When persistence is set this issue
is moot.
-It is critical to note that if persistence is clear `svremap` is the **only** way
+It is critical to note that if persistence is clear then `svremap` is the *only* way
to activate REMAP on any given (following) instruction. If persistence is set however then
**all** SVP64 instructions go through REMAP as long as `SVme` is non-zero.