Paper information - CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU–GPU system - 字舞流文

CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU–GPU system

D. Qian | Qi Zhang | Yi Liu | Tao Liu