The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
暂无分享,去创建一个
David Ifeoluwa Adelani | Teven Le Scao | Leandro von Werra | Stella Rose Biderman | Javier de la Rosa | Pedro Ortiz Suarez | Albert Villanova del Moral | Kyle Lo | Yacine Jernite | Margaret Mitchell | Aitor Soroa Etxabe | Itziar Gonzalez-Dios | Anna Rogers | Tristan Thrush | Aaron Gokaslan | Jian Zhu | S. Longpre | Olivier Nguyen | Zaid Alyafeai | Manan Dey | Thomas Wang | Leon Weber | Sasha Luccioni | Pierre Colombo | Jenny Chim | Jorg Frohberg | Huu Nguyen | Maraim Masoud | Gérard Dupont | Somaieh Nikpoor | Christopher Akiki | F. Toni | Daniel Alexander van Strien | Shamik Bose | Hugo Laurenccon | Paulo Villegas | Quentin Lhoest | Lucile Saulnier | Long Phan | Angelina McMillan-Major | Chenghao Mou | Giada Pistilli | Khalid Almubarak | Mario vSavsko | Minh Chien Vu | Sebastian Nagel | S. Pai | Violette Lepercq | Loubna Ben Allal | I. Yu | H. Tran | E. G. Ponferrada | M. Muñoz | Suzana Ilic | Long Phan | So-maieh Nikpoor | Stella Biderman | Aaron Gokaslan