Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
暂无分享,去创建一个
Zihao Ye | Baris Kasikci | Lequn Chen | Size Zheng | Tianqi Chen | Yilong Zhao | Chien-Yu Lin | Kan Zhu | Lequn Chen | Size Zheng | Luis Ceze | Arvind Krishnamurthy | Kan Zhu