form to a 2D or 3D transposed form, or "offset" to permit arbitrary
access to elements within a register.
-Their primary use is for Matrix Multiplication, reordering of sequential data in-place.
+Their primary use is for Matrix Multiplication, reordering of sequential data in-place. Three CSRs are provided so that a single FMAC may be used in a single loop to perform 4x4 times 4x4 Matrix multiplication, generating 64 FMACs
The 32-bit REMAP CSR may reshape up to 3 registers: