aarch64: Use RTL builtins for [su]mull_n intrinsics
authorJonathan Wright <jonathan.wright@arm.com>
Tue, 19 Jan 2021 22:44:24 +0000 (22:44 +0000)
committerJonathan Wright <jonathan.wright@arm.com>
Fri, 29 Jan 2021 13:53:44 +0000 (13:53 +0000)
commitee4c4fe289e768d3c6b6651c8bfa3fdf458934f4
tree7c808551f1a00f5f0807d78e4b93e6fa131c3ba4
parent9b588cfb4288ef41eb59edc6ab8cb83056b3b6a4
aarch64: Use RTL builtins for [su]mull_n intrinsics

Rewrite [su]mull_n Neon intrinsics to use RTL builtins rather than
inline assembly code, allowing for better scheduling and
optimization.

gcc/ChangeLog:

2021-01-19  Jonathan Wright  <jonathan.wright@arm.com>

* config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
builtin generator macros.
* config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
Define.
* config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
instead of inline asm.
(vmull_n_s32): Likewise.
(vmull_n_u16): Likewise.
(vmull_n_u32): Likewise.
gcc/config/aarch64/aarch64-simd-builtins.def
gcc/config/aarch64/aarch64-simd.md
gcc/config/aarch64/arm_neon.h