From 21860c083c431b21ecbe3200f34b6503fe6e8b73 Mon Sep 17 00:00:00 2001 From: lkcl Date: Sat, 22 Jun 2019 17:37:49 +0100 Subject: [PATCH] --- simple_v_extension/specification.mdwn | 10 +++++++--- 1 file changed, 7 insertions(+), 3 deletions(-) diff --git a/simple_v_extension/specification.mdwn b/simple_v_extension/specification.mdwn index 2eb60764f..af163d608 100644 --- a/simple_v_extension/specification.mdwn +++ b/simple_v_extension/specification.mdwn @@ -242,9 +242,11 @@ hardware loop is reduced to a single element: scalar operations. ## SUBVL - Sub Vector Length -This is a "group by quantity" that effectively divides VL into groups -of elements of length SUBVL. VL itself must therefore be set in advance -to a multiple of SUBVL. +This is a "group by quantity" that effectivrly asks each iteration of the hardware loop to load SUBVL elements of width elwidth at a time. Effectively, SUBVL is like a SIMD multiplier: instead of just 1 operation issued, SUBVL operations are issued. + +Another way to view SUBVL is that each element in the VL length vector is now SUBVL times elwidth bits in length. + +The primary use case for SUBVL is for 3D FP Vectors. A Vector of 3D coordinates X,Y,Z for example may be loaded and multiplied the stored, per VL element iteration, rather than having to set VL to three times larger. Legal values are 1, 2, 3 and 4, and the STATE CSR must hold the 2 bit values 0b00 thru 0b11. @@ -258,6 +260,8 @@ This saves a not insignificant number of instructions when handling 3D vectors, as otherwise a much longer predicate mask would have to be set up with regularly-repeated bit patterns. +See SUBVL Pseudocode illustration for details. + ## STATE This is a standard CSR that contains sufficient information for a -- 2.30.2