Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set

In the sixth edition of the WMT Biomedical Task, we addressed a total of eight language pairs: English/German, English/French, English/Spanish, English/Portuguese, English/Chinese, English/Russian, English/Italian, and English/Basque. The evaluation used three types of test sets: scientific abstracts, terminologies, and, new this year, summaries of animal experiments. We received a total of 107 submissions from 15 teams from 6 countries.
