QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services