RT = lowerhalf(product)
RC = upperhalf(product)
-Successive iterations effectively use RC as a 64-bit carry.
+Successive iterations effectively use RC as a 64-bit carry, and
+as noted by Intel in their notes on mulx,
+RA*RB+RC+RD cannot overflow, so does not require
+setting an additional CA flag.
+
+## Divide
+
+The simplest implementation of big-int divide is the standard textbook
+"Long Division", set with RADIX 64 instead of Base 10. Donald Knuth's
+Algorithm D performs estimates which, if wrong, are compensated for
+afterwards. Essentially however there are three phases:
+