or `addex` is perfectly sufficient to produce arbitrary-length
big-integer add due to the rules set in SVP64 that all Vector Operations
are directly equivalent to the strict Program Order Execution of
-their element-level operations.
+their element-level operations. Assuming that the two bigints (or
+a part thereof) have been loaded into sequentially-contiguous
+registers, with the least-significant bits being in the lowest-numbered
+register in each case:
R0,CA = A0+B0+CA adde r0,a0,b0
|
only store the last XER.CA. The size of the underlying back-end SIMD ALU
is entirely at the discretion of the implementer.
+If there is pressure on the register file (multi-million-digit big integers)
+then a partial-sum may be carried out with LD and ST in a standard
+Cray-style Vector Loop:
+
+
+
# Multiply
Long-multiply, assuming an O(N^2) algorithm, is performed by summing