aco: implement 8-bit/16-bit reductions