Paper Information - Accurate Non-Hierarchical Phrase-Based Translation

A principal weakness of conventional (i.e., non-hierarchical) phrase-based statistical machine translation is that it can only exploit continuous phrases. In this paper, we extend phrase-based decoding to allow both source and target phrasal discontinuities, which provide better generalization on unseen data and yield significant improvements to a standard phrase-based system (Moses). More interestingly, our discontinuous phrase-based system also outperforms a state-of-the-art hierarchical system (Joshua) by a very significant margin (+1.03 BLEU on average on five Chinese-English NIST test sets), even though both Joshua and our system support discontinuous phrases. Since the key difference between these two systems is that ours is not hierarchical (i.e., our system uses a string-based decoder instead of CKY, and it imposes no hard hierarchical reordering constraints during training and decoding), this paper sets out to challenge the commonly held belief that the tree-based parameterization of systems such as Hiero and Joshua is crucial to their good performance against Moses.

Christopher D. Manning | Michel Galley