PR79262 has been fixed for almost all AArch64 cpus, however the example is still
vectorized in a few cases, resulting in lower performance. Adjust the vector
cost slightly so that so that -mcpu=cortex-a53 now has identical performance as
-mcpu=cortex-a57 on libquantum.
gcc/
PR target/79262
* config/aarch64/aarch64.c (generic_vector_cost): Adjust
vec_to_scalar_cost.
From-SVN: r278452
(toplev::main): ... here as we need init_options_struct
being called.
+2019-11-19 Wilco Dijkstra <wdijkstr@arm.com>
+
+ PR target/79262
+ * config/aarch64/aarch64.c (generic_vector_cost): Adjust
+ vec_to_scalar_cost.
+
2019-11-19 Wilco Dijkstra <wdijkstr@arm.com>
* config/arm/arm-cpus.in (armv7): Set tune to Cortex-A53.
1, /* vec_int_stmt_cost */
1, /* vec_fp_stmt_cost */
2, /* vec_permute_cost */
- 1, /* vec_to_scalar_cost */
+ 2, /* vec_to_scalar_cost */
1, /* scalar_to_vec_cost */
1, /* vec_align_load_cost */
1, /* vec_unalign_load_cost */