Implementations have a hard time recognizing existing atomic operations
via macro-op fusion because they would often have to detect and fuse a
-large number of instructions, including branches.
+large number of instructions, including branches. This is contrary
+to the RISC paradigm.
There is also the issue that PowerISA's memory fences are unnecessarily
strong, particularly `isync` which is used for a lot of `acquire` and