DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
暂无分享,去创建一个
Sang Michael Xie | Sang Michael Xie | Quoc V. Le | Percy Liang | Hieu Pham | Hanxiao Liu | Yifeng Lu | Nan Du | Tengyu Ma | Xuanyi Dong | A. Yu
暂无分享,去创建一个
Sang Michael Xie | Sang Michael Xie | Quoc V. Le | Percy Liang | Hieu Pham | Hanxiao Liu | Yifeng Lu | Nan Du | Tengyu Ma | Xuanyi Dong | A. Yu