add AMX
authorLuke Kenneth Casson Leighton <lkcl@lkcl.net>
Fri, 29 Jul 2022 01:37:57 +0000 (02:37 +0100)
committerLuke Kenneth Casson Leighton <lkcl@lkcl.net>
Fri, 29 Jul 2022 01:37:57 +0000 (02:37 +0100)
openpower/sv/comparison_table.mdwn

index 25d7f0e248a1134a46cd580fc858746f28e7edc7..a086ecf42f15f664a8cb7728e0b4358f82e8b5e1 100644 (file)
@@ -6,7 +6,7 @@
 |VSX           |700+           |700+? (26)        |Packed SIMD          |no                 |no                  |no                   |yes (12)                 |yes             |no      |no                   |no                     |no              |yes (13)             | no                    |
 |NEON          |~250 (14)      |7088 (28)         |Packed SIMD          |no                 |no                  |no                   |yes                      |yes             |no      |no                   |no                     |no              |no                   | no                    |
 |SVE2          |~1000 (15)     |6040 (29)         |Predicated SIMD(16)  |no (16)            |yes                 |no                   |yes                      |yes             |no      |yes (7)              |no                     |no              |yes (33)             | no                    |
-|AVX512 (17)   |~1000s (18)    |7256 (30)         |Predicated SIMD      |no                 |yes                 |no                   |yes                      |yes             |no      |no                   |no                     |no              |no                   | no                    |
+|AVX512 (17)   |~1000s (18)    |7256 (30)         |Predicated SIMD      |no                 |yes                 |no                   |yes                      |yes             |no      |no                   |no                     |no              |yes (34)             | no                    |
 |RVV (19)      |~190 (20)      |~25000 (31)       |Scalable (21)        |yes                |yes                 |no                   |yes                      |yes (22)        |no      |yes                  |no                     |no              |no                   | no                    |
 |Aurora SX(23) |~200 (24)      |unknown (32)      |Scalable (25)        |yes                |yes                 |no                   |yes                      |no              |no      |no                   |no                     |no              |?                    | no                    |
 
@@ -50,3 +50,4 @@
 * (32): Unknown. estimated to be of the order of length of RVV due to also being a Cray-style Scalable ISA.
 * (33): [Scalable Matrix Optional Extension](https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/scalable-matrix-extension-armv9-a-architecture)
   the key is an outer-product instruction [SMOPA](https://developer.arm.com/documentation/ddi0602/2022-06/SME-Instructions/SMOPA--Signed-integer-sum-of-outer-products-and-accumulate-?lang=en) which is very hard to tell at a glance if it is power-2 or non-power-2
+* (34:) [Advanced matrix Extensions](https://en.wikipedia.org/wiki/Advanced_Matrix_Extensions) supports BF16 and INT8 only. Separate regfile, power-of-two "tiles". Not general-purpose at all.