**Notes and Observations**:
-1. minimum/maximum instructions are needed for vector reductions, where the
- SVP64 tree reduction needs a single instruction to work properly.
+1. SVP64 REMAP Parallel Reduction needs a single Scalar instruction to
+ work with, for best effectiveness. With no SFFS minimum/maximum instructions
+ Simple-V min/max Parallel Reduction is severely compromised.
2. if you implement any of the FP min/max modes, the rest are not much more
- hardware.
+ hardware.
3. SVP64/VSX may have different meaning from SVP64/SFFS,
- so it is *really* crucial to have SVP64/SFFS ops even if "equivalent" to VSX.
+ so it is *really* crucial to have SFFS ops even if "equivalent" to VSX.
4. FP min/max are rather complex to implement in software, the most commonly
used FP max function `fmax` from glibc compiled for SFFS is 32 (!)
instructions.
\newpage{}
+# Instructions
+
## `FMM` -- Floating Min/Max Mode
<a id="fmm-floating-min-max-mode"></a>