aarch64: ACLE I8MM multiply-accumulate intrinsics
authorDennis Zhang <dennis.zhang@arm.com>
Fri, 7 Feb 2020 15:04:23 +0000 (15:04 +0000)
committerDennis Zhang <dennis.zhang@arm.com>
Fri, 7 Feb 2020 15:04:23 +0000 (15:04 +0000)
commit40f648378061c170cf6a9ab680af01b3a3a83569
tree7c67d016392906eb12ab5507e948ca8b10c074b1
parentb7903d9f5beb5db440e56fa057d32c6f13f7c5ec
aarch64: ACLE I8MM multiply-accumulate intrinsics

This patch adds intrinsics for 8-bit integer matrix multiply-accumulate
operations including vmmlaq_s32, vmmlaq_u32, and vusmmlaq_s32.

gcc/ChangeLog:

2020-02-07  Dennis Zhang  <dennis.zhang@arm.com>

* config/aarch64/aarch64-simd-builtins.def (simd_smmla): New entry.
(simd_ummla, simd_usmmla): Likewise.
* config/aarch64/aarch64-simd.md (aarch64_simd_<sur>mmlav16qi): New.
* config/aarch64/arm_neon.h (vmmlaq_s32, vmmlaq_u32): New.
(vusmmlaq_s32): New.

gcc/testsuite/ChangeLog:

2020-02-07  Dennis Zhang  <dennis.zhang@arm.com>

* gcc.target/aarch64/simd/vmmla.c: New test.
gcc/ChangeLog
gcc/config/aarch64/aarch64-simd-builtins.def
gcc/config/aarch64/aarch64-simd.md
gcc/config/aarch64/arm_neon.h
gcc/testsuite/ChangeLog
gcc/testsuite/gcc.target/aarch64/simd/vmmla.c [new file with mode: 0644]