RA*RB+RC+RD cannot overflow, so does not require
setting an additional CA flag.
+Normally, in a Scalar ISA, the use of a register as both a source
+and destination like this would create costly Dependency Hazards, so
+such an instruction would never be proposed. However: it turns out
+that, just as with repeated chained application of `addeo`, macro-op
+fusion may be internally applied to a sequence of these strange multiply
+operations. Such a trick works equally as well in Scalar-only.
+
## Divide
-The simplest implementation of big-int divide is the standard textbook
+The simplest implementation of big-int divide is the standard schoolbook
"Long Division", set with RADIX 64 instead of Base 10. Donald Knuth's
Algorithm D performs estimates which, if wrong, are compensated for
afterwards. Essentially however there are three phases: