in both Horizontal-First and Vertical-First Mode as well as Predication
(Single and Twin) for the GPRs r3, r10 and r30. CR-Field-based
Predicates, if used, may still raise illegal instruction trap.
-* **DSP/AV**: 128 registers, all SV Branch instructions,
- crweird instructions, element-width
+* **DSP/AV**: 128 registers,
+ element-width
overrides, and all Modes (Saturation, Fail-First, Predicate-Result,
Mapreduce/Iteration)
-* **3D/Advanced/Supercomputing**: Matrix, DCT/FFT and Indexing
+* **3D/Advanced/Supercomputing**: all SV Branch instructions;
+ crweird instructions;
+ Matrix, DCT/FFT and Indexing
REMAP capability
These requirements within each Level constitute the minimum mandatory
This level is more suitable for Hardware implementations where performance and power saving begins to matter. A second instruction, `svstep`, used
by Vertical-First Mode, is required, as is hardware-level looping in
-Horizontal-First Mode. Illegal Instruction trap may not be called to
+Horizontal-First Mode. Illegal Instruction trap may not be used to
emulate `svstep`.
At the bare minimum, Twin and Single Predication must be supported for
at least the GPRs r3, r10 and r30. CR Field Predication may also be
supported in hardware but only by also increasing the number of CR Fields
-to 128.
+to the required total of 128.
Another important aspect is that when Rc=1 is set, CR Field Vector co-results
are produced. Should these exceed CR7 (CR8-CR127) and the number of CR Fields
has not been increased to 128 then an Illegal Instruction Trap must be
raised. In practical terms, to avoid this scenario, MAXVL should not
-exceed 8 for Arithmetic or Logical operations when Rc=1.
+exceed 8 for Arithmetic or Logical operations, when Rc=1.
Zeroing on source and destination for Predicates
must also be supported (sz, dz) however
ISA, completely without
modifying the Scalar Power ISA. The cost is that instructions are Prefixed
to 64-bit.
+
+# DSP / Audio / Video Level
+
+This level is best suited to high-performance, power-efficient but
+specialist Compute workloads. 128 GPRs, FPRs and CR Fields are all
+required, as are element-width overrides to allow data processing
+down to the 8-bit level.