Translation Quality Estimation using Recurrent Neural Network

This paper describes our submission to the shared task on word/phrase-level Quality Estimation (QE) at the First Conference on Machine Translation (WMT16). The objective of the shared task was to predict whether each word/phrase in a translated sentence is a correct (OK) or incorrect (BAD) translation. We propose a novel approach to word-level QE based on the Recurrent Neural Network Language Model (RNN-LM) architecture. RNN-LMs have proven very effective across Natural Language Processing (NLP) applications, where they are chiefly used for vector-space language modeling. For this task, we modify the RNN-LM architecture: given a word sequence as input, just as in a standard RNN-LM, the modified system predicts a label (OK/BAD) for the word in the slot rather than predicting the word itself. The approach is language independent and requires only the translated text for QE. Phrase-level quality is then estimated from the output of the word-level QE system.
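To make the modification concrete, here is a minimal sketch in PyTorch of the idea (the paper does not ship this code; the class name, layer sizes, and the choice of an LSTM cell are illustrative assumptions, not details from the paper): a network that reads a window of translated words and, instead of predicting the next word as an RNN-LM would, emits OK/BAD logits for the word occupying the middle slot.

```python
# Minimal sketch, assuming an LSTM-based RNN-LM variant; not the authors'
# implementation. All names and hyperparameters below are illustrative.

import torch
import torch.nn as nn

class WordQELabeler(nn.Module):
    def __init__(self, vocab_size, emb_dim=100, hidden_dim=100, num_labels=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # Recurrent layer over the word sequence, as in a standard RNN-LM.
        self.rnn = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        # Output layer scores a quality label (OK/BAD) instead of a word.
        self.out = nn.Linear(hidden_dim, num_labels)

    def forward(self, word_ids):
        # word_ids: (batch, window) indices of the context window, with the
        # word to be judged occupying the middle slot of the window.
        emb = self.embed(word_ids)          # (batch, window, emb_dim)
        hidden, _ = self.rnn(emb)           # (batch, window, hidden_dim)
        mid = word_ids.size(1) // 2
        # Classify from the hidden state at the middle (slot) position.
        return self.out(hidden[:, mid, :])  # (batch, 2) logits for OK/BAD

# Toy usage: a 1000-word vocabulary and a batch of two 5-word windows.
model = WordQELabeler(vocab_size=1000)
logits = model(torch.randint(0, 1000, (2, 5)))  # shape (2, 2)
```

Phrase-level quality can then be read off the word-level predictions; one simple aggregation rule, assumed here for illustration rather than taken from the paper, is to mark a phrase BAD whenever any word inside it is predicted BAD.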
