may be performed in total.
Also, given that it is in-registers only at present, some care has to be
taken on regfile resource utilisation. However, it is perfectly possible
-to utilise Matrix REMAP to perform the three inner-most "kernel" loops of
+to utilise Matrix REMAP to perform the three inner-most "kernel"
+(Tiling) loops of
the usual 6-level large Matrix Multiply, without the usual difficulties
associated with SIMD.
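
As a rough illustration of the loop structure referred to above, the following is a minimal software sketch of a 6-level tiled Matrix Multiply. It does not model REMAP or any SVP64 instruction: it only shows which three loops are the inner-most "kernel" loops. The function names (`matmul_tiled`, `matmul_reference`) and the `tile` parameter are hypothetical, chosen for this example only, and the tile size is an arbitrary assumption rather than anything dictated by the regfile.

```python
def matmul_tiled(a, b, tile=2):
    """Illustrative 6-level tiled matrix multiply (plain Python lists).

    Assumption: 'tile' is a hypothetical tile edge length; in practice it
    would be chosen so that one tile's operands fit in the register file.
    """
    n, depth, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    # Outer three loops: step across the large matrices one tile at a time.
    for i0 in range(0, n, tile):
        for j0 in range(0, m, tile):
            for k0 in range(0, depth, tile):
                # Inner three "kernel" loops: multiply-accumulate one tile.
                # These are the loops the text says Matrix REMAP can cover.
                for i in range(i0, min(i0 + tile, n)):
                    for j in range(j0, min(j0 + tile, m)):
                        for k in range(k0, min(k0 + tile, depth)):
                            c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_reference(a, b):
    """Straightforward 3-loop multiply, used only to check the tiled version."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]
```

The `min(...)` bounds let the sketch handle matrix dimensions that are not a multiple of the tile size; a kernel with fixed dimensions would instead need such edge tiles handled separately.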