论文信息 - Atom: Low-bit Quantization for Efficient and Accurate LLM Serving - 字舞流文

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Zihao Ye | Baris Kasikci | Lequn Chen | Size Zheng | Tianqi Chen | Yilong Zhao | Chien-Yu Lin | Kan Zhu | Lequn Chen | Size Zheng | Luis Ceze | Arvind Krishnamurthy | Kan Zhu