move ack inside the loop; seems to reduce time spent in the InputGroup parallel test