From: Jacob Lifshay <programmerjake@gmail.com>
Date: Fri, 22 Oct 2021 03:49:06 +0000 (-0700)
Subject: add description of SimdSlice and why it needs padding
X-Git-Tag: opf_rfc_ls005_v1~3567
X-Git-Url: https://git.libre-soc.org/?a=commitdiff_plain;h=16ef2669382877afc198ff382633ad75c34be5dc;p=libreriscv.git

add description of SimdSlice and why it needs padding
---

diff --git a/3d_gpu/architecture/dynamic_simd.mdwn b/3d_gpu/architecture/dynamic_simd.mdwn
index 52f75e3f1..df87177ba 100644
--- a/3d_gpu/architecture/dynamic_simd.mdwn
+++ b/3d_gpu/architecture/dynamic_simd.mdwn
@@ -237,6 +237,7 @@ constructs in the form described above.
 * [[dynamic_simd/mul]]
 * [[dynamic_simd/shift]]
 * [[dynamic_simd/logicops]] Horizontal reduction: some all xor bool
+* [[dynamic_simd/slice]] nmigen ast.Slice
 
 # Integration with nmigen: "Type 2" (dsl.Module)
 
diff --git a/3d_gpu/architecture/dynamic_simd/slice.mdwn b/3d_gpu/architecture/dynamic_simd/slice.mdwn
new file mode 100644
index 000000000..48cebda9e
--- /dev/null
+++ b/3d_gpu/architecture/dynamic_simd/slice.mdwn
@@ -0,0 +1,124 @@
+# Dynamic Partitioned Slice (`SimdSlice`)
+
+In order to match the semantics of nmigen's `Slice` class, `SimdSlice` has to have each element of the result have
+exactly the same `Shape` as the result of slicing the input `SimdSignal`'s corresponding element.
+
+## Example code:
+
+```python
+a = SimdSignal(...)
+a_s = a.sig # shorthand to make table smaller
+b = a[3:6]
+b_s = a.sig # shorthand to make table smaller
+```
+
+## `a`'s Elements:
+
+<table>
+    <tr class="text-right">
+        <th scope="row" class="text-left">Bit #</th>
+        <td>63&#8288;&hellip;&#8288;56</td>
+        <td>55&#8288;&hellip;&#8288;48</td>
+        <td>47&#8288;&hellip;&#8288;40</td>
+        <td>39&#8288;&hellip;&#8288;32</td>
+        <td>31&#8288;&hellip;&#8288;24</td>
+        <td>23&#8288;&hellip;&#8288;16</td>
+        <td>15&#8288;&hellip;&#8288;8</td>
+        <td>7&#8288;&hellip;&#8288;0</td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 8-bit</th>
+        <td><code>a_s[56:64]</code></td>
+        <td><code>a_s[48:56]</code></td>
+        <td><code>a_s[40:48]</code></td>
+        <td><code>a_s[32:40]</code></td>
+        <td><code>a_s[24:32]</code></td>
+        <td><code>a_s[16:24]</code></td>
+        <td><code>a_s[8:16]</code></td>
+        <td><code>a_s[0:8]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 16-bit</th>
+        <td colspan="2"><code>a_s[48:64]</code></td>
+        <td colspan="2"><code>a_s[32:48]</code></td>
+        <td colspan="2"><code>a_s[16:32]</code></td>
+        <td colspan="2"><code>a_s[0:16]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 32-bit</th>
+        <td colspan="4"><code>a_s[32:64]</code></td>
+        <td colspan="4"><code>a_s[0:32]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 64-bit</th>
+        <td colspan="8"><code>a_s[0:64]</code></td>
+    </tr>
+</table>
+
+So, slicing bits `3:6` of a 32-bit element of `a` must, because we have to match nmigen, produce a 3-bit element, which might seem like no problem, however, slicing bits `3:6` of a 16-bit element of a 64-bit `SimdSignal` must *also* produce a 3-bit element, so, in order to get a `SimdSignal` where *all* elements are 3-bit elements, as required by `SimdSlice`'s output, we have to introduce padding:
+
+## `b`'s Elements:
+
+<table>
+    <tr class="text-right">
+        <th scope="row" class="text-left">Bit #</th>
+        <td>23&#8288;&hellip;&#8288;21</td>
+        <td>20&#8288;&hellip;&#8288;18</td>
+        <td>17&#8288;&hellip;&#8288;15</td>
+        <td>14&#8288;&hellip;&#8288;12</td>
+        <td>11&#8288;&hellip;&#8288;9</td>
+        <td>8&#8288;&hellip;&#8288;6</td>
+        <td>5&#8288;&hellip;&#8288;3</td>
+        <td>2&#8288;&hellip;&#8288;0</td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 8-bit</th>
+        <td><code>b_s[21:24]</code></td>
+        <td><code>b_s[18:21]</code></td>
+        <td><code>b_s[15:18]</code></td>
+        <td><code>b_s[12:15]</code></td>
+        <td><code>b_s[9:12]</code></td>
+        <td><code>b_s[6:9]</code></td>
+        <td><code>b_s[3:6]</code></td>
+        <td><code>b_s[0:3]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 16-bit</th>
+        <td class="text-center"><i>Padding</i></td>
+        <td><code>b_s[18:21]</code></td>
+        <td class="text-center"><i>Padding</i></td>
+        <td><code>b_s[12:15]</code></td>
+        <td class="text-center"><i>Padding</i></td>
+        <td><code>b_s[6:9]</code></td>
+        <td class="text-center"><i>Padding</i></td>
+        <td><code>b_s[0:3]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 32-bit</th>
+        <td colspan="3" class="text-center"><i>Padding</i></td>
+        <td><code>b_s[12:15]</code></td>
+        <td colspan="3" class="text-center"><i>Padding</i></td>
+        <td><code>b_s[0:3]</code></td>
+    </tr>
+    <tr class="text-right">
+        <th scope="row" class="text-left">ElWid: 64-bit</th>
+        <td colspan="7" class="text-center"><i>Padding</i></td>
+        <td><code>b_s[0:3]</code></td>
+    </tr>
+</table>
+
+<style>
+    /* duplicated from bootstrap so text editors can see it
+        -- ignored by ikiwiki */
+    .text-left {
+        text-align: left !important
+    }
+
+    .text-right {
+        text-align: right !important
+    }
+
+    .text-center {
+        text-align: center !important
+    }
+</style>

Bit #	63⁠…⁠56	55⁠…⁠48	47⁠…⁠40	39⁠…⁠32	31⁠…⁠24	23⁠…⁠16	15⁠…⁠8	7⁠…⁠0
ElWid: 8-bit	`a_s[56:64]`	`a_s[48:56]`	`a_s[40:48]`	`a_s[32:40]`	`a_s[24:32]`	`a_s[16:24]`	`a_s[8:16]`	`a_s[0:8]`
ElWid: 16-bit	`a_s[48:64]`		`a_s[32:48]`		`a_s[16:32]`		`a_s[0:16]`
ElWid: 32-bit	`a_s[32:64]`				`a_s[0:32]`
ElWid: 64-bit	`a_s[0:64]`
Bit #	23⁠…⁠21	20⁠…⁠18	17⁠…⁠15	14⁠…⁠12	11⁠…⁠9	8⁠…⁠6	5⁠…⁠3	2⁠…⁠0
ElWid: 8-bit	`b_s[21:24]`	`b_s[18:21]`	`b_s[15:18]`	`b_s[12:15]`	`b_s[9:12]`	`b_s[6:9]`	`b_s[3:6]`	`b_s[0:3]`
ElWid: 16-bit	Padding	`b_s[18:21]`	Padding	`b_s[12:15]`	Padding	`b_s[6:9]`	Padding	`b_s[0:3]`
ElWid: 32-bit	Padding			`b_s[12:15]`	Padding			`b_s[0:3]`
ElWid: 64-bit	Padding							`b_s[0:3]`