Paper Information - Two-Phase Cross-Lingual Language Model Fine-Tuning for Machine Translation Quality Estimation

Two-Phase Cross-Lingual Language Model Fine-Tuning for Machine Translation Quality Estimation

In this paper, we describe the Bering Lab's submission to the WMT 2020 Shared Task on Quality Estimation (QE). For word-level and sentence-level translation quality estimation, we fine-tune XLM-RoBERTa, the state-of-the-art cross-lingual language model, with a few additional parameters. Model training consists of two phases. We first pre-train our model on a huge artificially generated QE dataset, and then we fine-tune the model with a human-labeled dataset. When evaluated on the WMT 2020 English-German QE test set, our systems achieve the best result on the target-side of word-level QE and the second best results on the source-side of word-level QE and sentence-level QE among all submissions.
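The two-phase schedule described in the abstract can be illustrated with a toy sketch. This is not the authors' system: a tiny linear regressor stands in for XLM-RoBERTa plus its additional parameters, and the data generator (`make_data`), the systematic label offset in the synthetic set, the learning rates, and the epoch counts are all assumptions chosen for illustration. The sketch only shows the order of training: a first pass over a large, noisy, artificially generated dataset, followed by fine-tuning on a small, clean, human-labeled one.

```python
import random


def predict(weights, bias, features):
    """Linear stand-in for the real model: a weighted sum plus a bias."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def mse(params, data):
    """Mean squared error of params = (weights, bias) on a dataset."""
    weights, bias = params
    return sum((predict(weights, bias, x) - y) ** 2 for x, y in data) / len(data)


def train_phase(params, data, lr, epochs, rng):
    """One training phase: plain SGD on squared error, starting from params."""
    weights, bias = list(params[0]), params[1]
    for _ in range(epochs):
        order = list(data)
        rng.shuffle(order)
        for x, y in order:
            err = predict(weights, bias, x) - y
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias


def make_data(n, noise, offset, rng):
    """Hypothetical data: score = 0.6*x0 - 0.3*x1 + 0.2 (+ offset + noise)."""
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        y = 0.6 * x[0] - 0.3 * x[1] + 0.2 + offset + rng.gauss(0, noise)
        data.append((x, y))
    return data


rng = random.Random(0)
# Assumption: synthetic labels are plentiful but noisy and systematically shifted.
synthetic = make_data(2000, noise=0.1, offset=0.2, rng=rng)
# Assumption: human labels are scarce but clean.
human = make_data(50, noise=0.02, offset=0.0, rng=rng)

init = ([0.0, 0.0], 0.0)
# Phase 1: pre-train on the large artificial dataset.
phase1 = train_phase(init, synthetic, lr=0.05, epochs=3, rng=rng)
# Phase 2: fine-tune on the human-labeled dataset (smaller learning rate is an assumption).
phase2 = train_phase(phase1, human, lr=0.01, epochs=20, rng=rng)

loss_init = mse(init, human)
loss_phase1 = mse(phase1, human)
loss_phase2 = mse(phase2, human)
print(loss_init, loss_phase1, loss_phase2)
```

In this toy setting, the error on the human-labeled set should drop after phase 1 and drop again after phase 2, because the second phase corrects the systematic shift in the synthetic labels. Whether the same holds for a real QE model depends on the data and model, which this sketch does not reproduce.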

Dongjun Lee
