Paper Information - Neural Machine Translation from Simplified Translations

Text simplification aims at reducing the lexical, grammatical and structural complexity of a text while keeping the same meaning. In the context of machine translation, we introduce the idea of simplified translations in order to boost the learning ability of deep neural translation models. We conduct preliminary experiments showing that when the source side of a bi-text is translated by a neural machine translation (NMT) system trained on that same bi-text, the resulting translation is less complex than the target reference of the bi-text. Building on the idea of knowledge distillation, we then train an NMT system on the simplified bi-text and show that it outperforms the initial system built on the reference data set. Performance is further boosted when both reference and automatic translations are used to train the network. We perform an elementary analysis of the translated corpus and report accuracy results of the proposed approach on English-to-French and English-to-German translation tasks.
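The data preparation described in the abstract can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the names `simplify_bitext`, `combine_bitexts`, and `toy_teacher` are hypothetical, and the word-for-word dictionary lookup only stands in for an NMT system trained on the reference bi-text.

```python
from typing import Callable, List, Tuple

# A bi-text is modelled here as a list of (source, target) sentence pairs.
BiText = List[Tuple[str, str]]


def simplify_bitext(bitext: BiText, translate: Callable[[str], str]) -> BiText:
    """Replace each reference target with the teacher's translation of the source."""
    return [(src, translate(src)) for src, _ in bitext]


def combine_bitexts(reference: BiText, simplified: BiText) -> BiText:
    """Concatenate reference and automatic translations into one training set."""
    return reference + simplified


# Hypothetical stand-in for the teacher system: a word-for-word lookup.
# A real teacher would be an NMT model trained on the reference bi-text.
_TOY_LEXICON = {"the": "le", "cat": "chat", "sleeps": "dort"}


def toy_teacher(src: str) -> str:
    return " ".join(_TOY_LEXICON.get(word, word) for word in src.split())


if __name__ == "__main__":
    reference = [("the cat sleeps", "le chat est en train de dormir")]
    simplified = simplify_bitext(reference, toy_teacher)
    combined = combine_bitexts(reference, simplified)
    print(simplified)
    print(len(combined))
```

A student system would then be trained on `simplified` alone, or on `combined` for the setting in which both reference and automatic translations are used.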

Josep Maria Crego | Jean Senellart
