Domain Robustness in Neural Machine Translation

Translating text that diverges from the training domain is a key challenge for neural machine translation (NMT). Domain robustness, i.e. the ability of models to generalize to unseen test domains, is low compared to statistical machine translation. In this paper, we investigate the performance of NMT on out-of-domain test sets and ways to improve it. We observe that hallucination (fluent translations that are unrelated to the source) is common in out-of-domain settings, and we empirically compare methods designed to improve adequacy (reconstruction), out-of-domain translation (subword regularization), or robustness against adversarial examples (defensive distillation), as well as noisy channel models. In experiments on German to English OPUS data and on German to Romansh, a low-resource scenario, we find that several methods improve domain robustness, with reconstruction standing out as a method that not only improves automatic scores but also adequacy in a manual assessment, albeit at some loss in fluency. However, out-of-domain performance is still relatively low, and domain robustness remains an open problem.
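
As a brief sketch of one of the approaches named above: noisy channel models score a candidate translation y for a source sentence x not only with the direct model p(y|x), but also with a channel model p(x|y) and a language model p(y), motivated by Bayes' rule, p(y|x) ∝ p(x|y) p(y). A typical reranking or decoding objective takes the form below; the weights λ1 and λ2 are illustrative placeholders rather than values from this paper, and published noisy channel approaches for NMT additionally apply length normalization:

    score(y) = log p(y|x) + λ1 · log p(x|y) + λ2 · log p(y)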
