Neural Machine Translation Training in a Multi-Domain Scenario

In this paper, we explore alternative ways to train a neural machine translation system in a multi-domain scenario. We investigate data concatenation (with fine-tuning), model stacking (multi-level fine-tuning), data selection, and weighted ensembling. We evaluate these methods on three criteria: i) translation quality, ii) training time, and iii) robustness on out-of-domain test sets. Our findings on Arabic-English and German-English language pairs show that the best translation quality is achieved by building an initial system on a concatenation of the available out-of-domain data and then fine-tuning it on the in-domain data. Model stacking works best when training begins with the furthest out-of-domain data and the model is incrementally fine-tuned on each successively closer domain. Data selection did not give the best results, but it offers a decent compromise between training time and translation quality. A weighted ensemble of the individual models outperformed data selection and is useful in scenarios where there is no time for fine-tuning.
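The weighted-ensemble idea above can be illustrated with a minimal sketch: at each decoding step, combine the next-token probability distributions of several per-domain models using scalar weights. The function name and the toy distributions below are hypothetical; in a real NMT system the distributions would come from the decoders of separately trained models.

```python
def weighted_ensemble(distributions, weights):
    """Combine per-model probability distributions with scalar weights.

    distributions: list of dicts mapping token -> probability
    weights: list of non-negative floats, one per model (need not sum to 1)
    """
    total = sum(weights)
    norm = [w / total for w in weights]  # normalize so the mixture sums to 1
    combined = {}
    for dist, w in zip(distributions, norm):
        for token, p in dist.items():
            combined[token] = combined.get(token, 0.0) + w * p
    return combined

# Hypothetical next-token distributions from an out-of-domain
# and an in-domain model at one decoding step.
ood = {"house": 0.6, "home": 0.3, "hut": 0.1}
ind = {"house": 0.2, "home": 0.7, "hut": 0.1}

# Weighting the in-domain model more heavily shifts the prediction
# toward its preference without discarding the out-of-domain model.
mix = weighted_ensemble([ood, ind], weights=[1.0, 3.0])
best = max(mix, key=mix.get)  # -> "home"
```

With normalized weights the combined scores remain a valid probability distribution, which is why this kind of interpolation can be dropped directly into beam-search decoding.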
