Multiple confusion network application in MT system combination

System combination has emerged as a powerful method for machine translation (MT). In the construction of word level confusion network (CN), the alignment and skeleton selection are two key issues of system combination. The paper introduces multi-CN for solving skeleton selection. We fail to yield the better performance through using simple prior score as CN-based feature, so we introduce more sophisticated CN-based and consensus-decoding-based features into combination framework to test multi-CN's validity. The approaches of multi-CN are shown to be superior to single-CN in the setting of the Chinese-to-English track of the 2008 NIST Open MT evaluation.

[1]  Kristina Toutanova,et al.  Joint Optimization for Machine Translation System Combination , 2009, EMNLP.

[2]  Andy Way,et al.  An Incremental Three-pass System Combination Framework by Combining Multiple Hypothesis Alignment Methods , 2010, Int. J. Asian Lang. Process..

[3]  Jianfeng Gao,et al.  Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems , 2008, EMNLP.

[4]  Shankar Kumar,et al.  Minimum Bayes-Risk Decoding for Statistical Machine Translation , 2004, NAACL.

[5]  Shankar Kumar,et al.  Lattice Minimum Bayes-Risk Decoding for Statistical Machine Translation , 2008, EMNLP.

[6]  Giuseppe Riccardi,et al.  Computing consensus translation from multiple machine translation systems , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[7]  Richard M. Schwartz,et al.  Improved Word-Level System Combination for Machine Translation , 2007, ACL.

[8]  Sanjeev Khudanpur,et al.  Machine Translation System Combination using ITG-based Alignments , 2008, ACL.

[9]  Hichem Sahbi,et al.  Consensus Network Decoding for Statistical Machine Translation System Combination , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[10]  Ralph Weischedel,et al.  A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[11]  Yang Feng,et al.  Lattice-based System Combination for Statistical Machine Translation , 2009, EMNLP.

[12]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[13]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[14]  Richard M. Schwartz,et al.  Combining Outputs from Multiple Machine Translation Systems , 2007, NAACL.