J. Lamy-Poirier
发表
J. Lamy-Poirier,
2022,
ArXiv.
D. Gaiotto,
J. Lamy-Poirier,
2013
.
P. Mathieu,
J. Lamy-Poirier,
2010,
1012.4722.
J. Lamy-Poirier,
2016
.
J. Lamy-Poirier,
2015,
1511.08275.
J. Lamy-Poirier,
2014,
1412.0530.
D. Gaiotto,
J. Lamy-Poirier,
2013,
1301.5342.
Harm de Vries,
Carolyn Jane Anderson,
Leandro von Werra,
2023,
ArXiv.
H. Kröger,
R. Zomorrodi,
J. Laprise,
2010
.
P. Mathieu,
J. Lamy-Poirier,
2010,
1010.2462.
Harm de Vries,
Carolyn Jane Anderson,
Leandro von Werra,
2023,
ArXiv.
Anqi Xu,
Joel Lamy-Poirier,
Anqi Xu,
2018,
ArXiv.
Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models
pdf
Joel Lamy-Poirier,
J. Lamy-Poirier,
2021,
ArXiv.
Leandro von Werra,
Carlos Muñoz Ferrandis,
Terry Yue Zhuo,
2024,
ArXiv.