Yuxiong He
Publications
Olatunji Ruwase,
Samyam Rajbhandari,
Jeff Rasley,
2021,
SC21: International Conference for High Performance Computing, Networking, Storage and Analysis.
Xiangru Lian,
Ammar Ahmad Awan,
Hanlin Tang,
2021,
ICML.
1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB’s Convergence Speed
Ammar Ahmad Awan,
Hanlin Tang,
Samyam Rajbhandari,
2021,
2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics (HiPC).
Curriculum Learning: A Regularization Method for Efficient and Stable Billion-Scale GPT Model Pre-Training
Conglong Li,
Yuxiong He,
Minjia Zhang,
2021,
ArXiv.
Alexandre Muzio,
Ammar Ahmad Awan,
Hany Hassan Awadalla,
2021,
ArXiv.
Bo Wu,
Yuxiong He,
Minjia Zhang,
2021,
NeurIPS.
Shaolei Ren,
Yuxiong He,
2013.
Olatunji Ruwase,
Samyam Rajbhandari,
Jeff Rasley,
2020,
SC20: International Conference for High Performance Computing, Networking, Storage and Analysis.