Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling
Michael W. Mahoney | Steven K. Esser | Prafulla Dhariwal | Benjamin Mann | Arvind Neelakantan | Melanie Subbiah | Pranav Shyam | Girish Sastry | J. McKinstry | A. Gholami | Z. Yao | Sehoon Kim | Jared Kaplan | Xiuying Wei | Zhiyu Chen | Kai-Min Yang | Yunchen Zhang | Xiangguo Zhang | Ruihao Gong | Deepika Bablani | Yuhang Li | Zhengang Li | Qing Jin | Richard Zhuang | Jian Ren | Nick Ryder | Kurt Keutzer | Xianglong Liu | Jinyang Guo | Tom Brown | Yaohui Cai | Zhen Dong | Edward J. Hu | Yelong Shen | Zeyuan Phillip Wallis | Sumant Hanumante | Yanzhi Wang | Sergey Tulyakov