Reordering Model for Forest-to-String Machine Translation

In this paper, we present a novel extension of a forest-to-string machine translation system with a reordering model. We predict reordering probabilities for every pair of source words, using a model whose features are extracted from the input parse forest. Our approach deals naturally with the ambiguity present in the input parse forest while taking into account only those parts of the forest used by the current translation hypothesis. The method yields improvements of 0.6 to 1.0 points as measured by the (TER − BLEU)/2 metric.
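As a small illustration of the combined metric named above, the following sketch computes (TER − BLEU)/2 from a TER score and a BLEU score given on the same 0 to 100 scale. Because TER is an error rate (lower is better) and BLEU is a quality score (higher is better), a lower combined value indicates a better system. The function name and all numbers are hypothetical and are not results from the paper.

```python
def ter_minus_bleu_half(ter: float, bleu: float) -> float:
    """Combined score (TER - BLEU) / 2; lower is better.

    Assumes both inputs are on the same scale (here 0 to 100).
    """
    return (ter - bleu) / 2.0


# Hypothetical scores, chosen only for illustration (not from the paper).
baseline = ter_minus_bleu_half(ter=60.0, bleu=25.0)
with_reordering = ter_minus_bleu_half(ter=59.0, bleu=26.0)

# A positive difference means the second system improved on the baseline.
improvement = baseline - with_reordering
print(f"baseline={baseline}, with_reordering={with_reordering}, improvement={improvement}")
```

Note that a gain of one point in this metric can come from a two-point drop in TER, a two-point rise in BLEU, or any mix of the two that sums to two points.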
