(no commit message)

[libreriscv.git] / simple_v_extension / appendix.mdwn
diff --git a/simple_v_extension/appendix.mdwn b/simple_v_extension/appendix.mdwn

index 553d3eee758bc6ad4ef29574dc8d5efbc88b48fc..c29044cfea6b9772be22c43d9b8dc3d968f819ee 100644 (file)
--- a/simple_v_extension/appendix.mdwn
+++ b/simple_v_extension/appendix.mdwn
@@ -1,4 +1,8 @@
-# Simple-V (Parallelism Extension Proposal) Appendix
+[[!oldstandards]]
+
+# Simple-V (Parallelism Extension Proposal) Appendix (OBSOLETE)
+
+**OBSOLETE**
  
  * Copyright (C) 2017, 2018, 2019 Luke Kenneth Casson Leighton
  * Status: DRAFTv0.6
@@ -194,7 +198,9 @@ comprehensive in its effect on instructions.
  Branch operations are augmented slightly to be a little more like FP
  Compares (FEQ, FNE etc.), by permitting the cumulation (and storage)
  of multiple comparisons into a register (taken indirectly from the predicate
-table).  As such, "ffirst" - fail-on-first - condition mode can be enabled.
+table) and enhancing them to branch "consensually" depending on *multiple*
+tests.  "ffirst" - fail-on-first - condition mode can also be enabled,
+to terminate the comparisons early.
  See ffirst mode in the Predication Table section.
  
  There are two registers for the comparison operation, therefore there
@@ -232,6 +238,13 @@ element tests must be performed (and the result optionally stored in
  the result mask), with a "post-analysis" phase carried out which checks
  whether to branch.
  
+Note also that whilst it may seem excessive to have all four (because
+conditional comparisons may be inverted by swapping src1 and src2),
+data-dependent fail-on-first is *not* invertible and *only* terminates
+on first zero-condition encountered.  Additionally it may be inconvenient
+to have to swap the predicate registers associated with src1 and src2,
+because this involves a new VBLOCK Context.
+
  ### Standard Branch <a name="standard_branch"></a>
  
  Branch operations use standard RV opcodes that are reinterpreted to
@@ -326,16 +339,20 @@ complex), this becomes:
  
      if pred_inversion:
          if pred_zeroing:
-            if result != 0:
+            # NOR
+            if result == 0:
                  goto branch
          else:
-            if result == 0:
+            # NAND
+            if (result & ps) != result:
                  goto branch
      else:
          if pred_zeroing:
-            if (result & ps) != result:
+            # OR
+            if result != 0:
                  goto branch
          else:
+            # AND
              if (result & ps) == result:
                  goto branch
  
@@ -1083,7 +1100,7 @@ Note:
    is also marked as scalar, this is how the compatibility with
    standard RV LOAD/STORE is preserved by this algorithm.
  
-### Example Tables showing LOAD elements
+### Example Tables showing LOAD elements <a name="load_example"></a>
  
  This section contains examples of vectorised LOAD operations, showing
  how the two stage process works (three if zero/sign-extension is included).
@@ -1443,7 +1460,7 @@ circumstances it is perfectly fine to simply have the lanes
  "inactive" for predicated elements, even though it results in
  less than 100% ALU utilisation.
  
-## Twin-predication (based on source and destination register)
+## Twin-predication (based on source and destination register) <a name="tpred"></a>
  
  Twin-predication is not that much different, except that that
  the source is independently zero-predicated from the destination.