Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation

This paper highlights the impressive utility of multi-parallel corpora for transfer learning in a one-to-many low-resource neural machine translation (NMT) setting. We report on a systematic comparison of multistage fine-tuning configurations, consisting of (1) pre-training on a large external (209k–440k) parallel corpus for English and a helping target language, (2) mixed pre-training or fine-tuning on a mixture of the external and low-resource (18k) target parallel corpora, and (3) pure fine-tuning on the target parallel corpora. Our experiments confirm that multi-parallel corpora are extremely useful despite their scarcity and content-wise redundancy, thus exhibiting the true power of multilingualism. Even when the helping target language is not one of the target languages of our concern, our multistage fine-tuning yields gains of 3–9 BLEU points over a simple one-to-one model.
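
The three-stage schedule described above can be summarized, very roughly, by the Python sketch below. The corpus file names and the `load_corpus` / `train_nmt` helpers are hypothetical placeholders used only to show the order in which the corpora would be consumed; they are not part of the paper's actual implementation.

```python
# Minimal sketch of the multistage fine-tuning schedule (hypothetical helpers).

def load_corpus(path):
    """Placeholder: read a tab-separated parallel corpus as (source, target) pairs."""
    with open(path, encoding="utf-8") as f:
        return [tuple(line.rstrip("\n").split("\t")) for line in f]

def train_nmt(model, corpus, epochs):
    """Placeholder for one training run of an NMT model on a parallel corpus."""
    for _ in range(epochs):
        pass  # optimizer/update steps of a real NMT toolkit would go here
    return model

# Stage 1: pre-train on the large external corpus (English -> helping target language).
external = load_corpus("external.en-helper.tsv")   # on the order of 209k-440k pairs
model = train_nmt(model=None, corpus=external, epochs=10)

# Stage 2: mixed fine-tuning on a mixture of the external and low-resource target corpora.
target = load_corpus("target.en-xx.tsv")           # on the order of 18k pairs
model = train_nmt(model, corpus=external + target, epochs=5)

# Stage 3: pure fine-tuning on the low-resource target corpora only.
model = train_nmt(model, corpus=target, epochs=5)
```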
