# Condition Register SVP64 Operations Condition Register Fields are only 4 bits wide: this presents some interesting conceptual challenges for SVP64, particularly with respect to element width (which is clearly meaningless). Likewise, arithmetic saturation (an important part of Arithmetic SVP64) has no meaning. Consequently an alternative Mode Format is required. This alternative mapping **only** applies to instructions that **only** reference a CR Field or CR bit as the sole exclusive result. This section **does not** apply to instructions which primarily produce arithmetic results that also, as an aside, produce a corresponding CR Field (such as when Rc=1). Instructions that involve Rc=1 are definitively arithmetic in nature, where the corresponding Condition Register Field can be considered to be a "co-result". Thus, if the arithmetic result is Vectorised, so is the CR Field "co-result", which puts both firmly out of scope for this section. Other modes are still applicable and include: * **Data-dependent fail-first**. useful to truncate VL based on analysis of a Condition Register result bit. * **Scalar and parallel reduction**. Reduction is useful for turning a Vector of Condition Register Fields into one single Condition Register. * **Predicate-result**. Equivalent to python "filter", in that only elements which pass a test will end up actually being modified. This is in effect the same as ANDing the Condition Test with the destination predicate mask (hence the name, "predicate-result"). SVP64 RM `MODE` (includes `ELWIDTH` bits) for CR-based operations: | 4 | 5 | 19-20 | 21 | 22 23 | description | | - | - | ----- | --- |---------|----------------- | | / | / | 00 | 0 | dz sz | normal mode | | / | / | 00 | 1 | 0 RG | scalar reduce mode (mapreduce), SUBVL=1 | | / | / | 00 | 1 | 1 CRM | parallel reduce mode (mapreduce), SUBVL=1 | | / | / | 00 | 1 | SVM RG | subvector reduce mode, SUBVL>1 | |dz |VLi| 01 | inv | CR-bit | Ffirst 3-bit mode | |sz |VLi| 01 | inv | dz Rc1 | Ffirst 5-bit mode | | / | / | 10 | / | / / | RESERVED | |sz |SNZ| 11 | inv | CR-bit | 3-bit pred-result CR sel | | / |SNZ| 11 | inv | dz sz | 5-bit pred-result z/nonz | Fields: TODO # Data-dependent fail-first on CR operations Data-dependent SVP64 Vectorised Operations involving the creation or modification of a CR require an extra two bits, which are not available in the compact space of the `MODE` Field. With the concept of element width overrides being meaningless for CR Fields it is possible to use the `ELWIDTH` field for extra fields. Condition Register based operations such as `mfcr` and `crand` can thus be made more flexible. However the rules that apply in this section also apply to future CR-based instructions. There are two primary different types of CR operations: * Those which have a 3-bit operand field (referring to a CR Field) * Those which have a 5-bit operand (referring to a bit within the whole 32-bit CR) Examining these two as has already been done it is observed that the difference may be considered to be that the 5-bit variant provides additional information about which CR Field bit (EQ, GE, LT, SO) is to be operated on by the instruction. Thus, logically, we may set the following rule: * When a 5-bit CR Result field is used in an instruction, the `inv, VLi and RC1` variant of Data-Dependent Fail-First must be used. i.e. the bit of the CR field to be tested is the one that has just been modified by the operation. * When a 3-bit CR Result field is used the `inv CRbit` variant must be used in order to select which CR Field bit shall be tested (EQ, LE, GE, SO) The reason why the 3-bit CR variant needs the additional CR-bit field should be obvious from the fact that the 3-bit CR Field from the base Power ISA v3.0B operation clearly does not contain and is missing the two CR Field Selector bits. Thus, these two bits (to select EQ, LE, GE or SO) must be provided in another way. Examples of the former type: * crand, cror, crnor. These all are 5-bit (BA, BB, BT). The bit to be tested against `inv` is the one selected by `BT` * mcrf. This has only 3-bit (BF, BFA). In order to select the bit to be tested, the alternative FFirst encoding must be used. This limits sv.mcrf in that it may not use the `VLi` (VL inclusive) Mode. This is unfortunste but unavoidable due to encoding pressure on SVP64. # Predicate-result Condition Register operations These are again slightly different compared to SVP64 arithmetic pred-result (described in [[svp64/appendix]]). The reason is that, again, for arithmetic operations the production of a CR Field when Rc=1 is a *co-result* accompanying the main arithmetic result, whereas for CR-based operations the CR Field or CR bit *is* itself the result of the operation. Therefore, logically, Predicate-result needs to be adapted to test the actual result of the CR-based instruction, rather than test the co-resultant CR when Rc=1. for i in range(VL): # predication test, skip all masked out elements. # skips when sz=0 if predicate_masked_out(i): continue # result is to go into CR. may be a 4-bit CR Field # (3-bit mode) or just a single bit (5-bit mode) result = op(...) # obtain CRbit from BA result operand field, # if this CR op has 5-bit CR result operands if 5bit mode: CRbit = BA[3:4] # now test CR, similar to branch if CRnew[CRbit] != inv: continue # test failed: cancel store # result optionally stored update_CR(result)