A Dependency Edge-based Transfer Model for Statistical Machine Translation

Previous models in syntax-based statistical machine translation usually resort to some kind of synchronous procedure; few of them are based on the analysis-transfer-generation methodology. In this paper, we present a statistical implementation of the analysis-transfer-generation methodology from rule-based translation. Syntax analysis, syntax transfer and language generation are modeled independently in order to break the synchronous constraint, using dependency structures with dependency edges as the atomic units of manipulation. Large-scale experiments on Chinese-to-English translation show that our model exhibits state-of-the-art performance, significantly outperforming the phrase-based model. The statistical transfer-generation method achieves significantly better performance with much smaller models.
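As a rough illustration of what treating dependency edges as atomic units can mean, the sketch below decomposes a dependency tree into individual head-dependent edges. It is not the paper's implementation: the `Edge` type, the `tree_to_edges` function, the head-index encoding and the relation labels are all assumptions made for this example.

```python
from typing import List, NamedTuple


class Edge(NamedTuple):
    """One dependency edge (hypothetical representation for illustration)."""
    head: str
    dependent: str
    relation: str


def tree_to_edges(tokens: List[str], heads: List[int], relations: List[str]) -> List[Edge]:
    """Split a dependency tree into its edges.

    heads[i] is the 0-based index of the head of tokens[i], or -1 for the root.
    The root token has no incoming edge, so it produces no Edge.
    """
    edges = []
    for i, (h, rel) in enumerate(zip(heads, relations)):
        if h < 0:
            continue  # skip the root: it has no head
        edges.append(Edge(head=tokens[h], dependent=tokens[i], relation=rel))
    return edges


# Toy sentence; the relation labels are assumed, not taken from the paper.
tokens = ["she", "reads", "books"]
heads = [1, -1, 1]
relations = ["nsubj", "root", "obj"]
edges = tree_to_edges(tokens, heads, relations)
for e in edges:
    print(e)
```

In a transfer model of this kind, each such edge could then be translated and recombined independently of the others, which is one way to avoid a synchronous derivation over the whole tree.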
