论文信息 - A Neural Network based Approach to Automatic Post-Editing - 字舞流文

A Neural Network based Approach to Automatic Post-Editing

We present a neural network based automatic post-editing (APE) system to improve raw machine translation (MT) output. Our neural model of APE (NNAPE) is based on a bidirectional recurrent neural network (RNN) model and consists of an encoder that encodes an MT output into a fixed-length vector from which a decoder provides a post-edited (PE) translation. APE translations produced by NNAPE show statistically significant improvements of 3.96, 2.68 and 1.35 BLEU points absolute over the original MT, phrase-based APE and hierarchical APE outputs, respectively. Furthermore, human evaluation shows that the NNAPE generated PE translations are much better than the original MT output.

Josef van Genabith | Santanu Pal | Sudip Kumar Naskar | Mihaela Vela | M. Vela | S. Naskar | Santanu Pal

[1] Ralph Weischedel,et al. A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[3] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[4] Ondrej Dusek,et al. DEPFIX: A System for Automatic Correction of Czech MT Outputs , 2012, WMT@NAACL-HLT.

[5] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[6] Matthew G. Snover,et al. A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[7] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[8] Wojciech Zaremba,et al. An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.

[9] Mihaela Vela,et al. Quantifying the Influence of MT Output in the Translators’ Performance: A Case Study in Technical Translation , 2014, HaCaT@EACL.

[10] Alon Lavie,et al. Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[11] Phil Blunsom,et al. Recurrent Continuous Translation Models , 2013, EMNLP.

[12] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.

[13] Rudolf Rosa,et al. Two-step translation with grammatical post-processing , 2011, WMT@EMNLP.

[14] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[15] A. Loffler-Laurian. Traduction automatique et style , 1985 .

[16] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[17] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.

[18] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[19] Donald A. DePalma,et al. Project management for crowdsourced translation: How user-translated content projects work in real life , 2011 .

[20] Sharon O'Brien,et al. Quality and Machine Translation: A realistic objective? , 2009 .

[21] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .

[22] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[23] Michael J. Denkowski,et al. Machine Translation for Human Translators , 2015 .

[24] Michel Simard,et al. Statistical Phrase-Based Post-Editing , 2007, NAACL.

[25] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[26] Philipp Koehn. A Process Study of Computed Aided Translation , 2009 .

[27] T. Veale. Gaijin : A Bootstrapping , Template-Driven Approach to Example-Based MT , 1997 .

[28] Josef van Genabith,et al. USAAR-SAPE: An English–Spanish Statistical Automatic Post-Editing System , 2015, WMT@EMNLP.

[29] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[30] George F. Foster,et al. Batch Tuning Strategies for Statistical Machine Translation , 2012, NAACL.

[31] Roland Kuhn,et al. Rule-Based Translation with Statistical Phrase-Based Post-Editing , 2007, WMT@ACL.

[32] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[33] Francisco Casacuberta,et al. Statistical Post-Editing of a Rule-Based Machine Translation System , 2009, NAACL.