High-throughput Generative Inference of Large Language Models with a Single GPU
暂无分享,去创建一个
Daniel Y. Fu | Percy Liang | Christopher Ré | Lianmin Zheng | Joseph Gonzalez | Binhang Yuan | Ce Zhang | Beidi Chen | Zhuohan Li | Max Ryabinin | I. Stoica | Zhiqiang Xie | Clark W. Barrett | Ying Sheng