aco: implement 16-bit vertex fetches with tbuffer_load_format_d16_*