In the case of divide or Transcendentals the algorithms needed are so
complex that simple implementations can often take an astounding 128
-clock cycles to complete. Other instructions waiting for the results
+clock cycles to complete (Goldschmidtt reduces that significantly).
+Other instructions waiting for the results
will back up and eventually stall, where in-order systems pretty much
just stall straight away.