Modeling Syntactic and Semantic Structures in Hierarchical Phrase-based Translation

Incorporating semantic structure into a linguistics-free translation model is challenging, since semantic structures are closely tied to syntax. In this paper, we propose a two-level approach to exploiting predicate-argument structure reordering in a hierarchical phrase-based translation model. First, we introduce linguistically motivated constraints into a hierarchical model, guiding translation phrase choices in favor of those that respect syntactic boundaries. Second, based on such translation phrases, we propose a predicate-argument structure reordering model that predicts reordering not only between an argument and its predicate, but also between two arguments. Experiments on Chinese-to-English translation demonstrate that both advances significantly improve translation accuracy.
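
The two levels described above can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the inclusive span representation, and the orientation labels are all hypothetical, chosen only to show (a) what it means for a candidate phrase to respect syntactic boundaries and (b) how the relative target-side order of two predicate-argument elements could be read off a word alignment.

```python
from typing import List, Optional, Set, Tuple

# Assumption: spans are inclusive token-index pairs (start, end).
Span = Tuple[int, int]
# Assumption: an alignment is a list of (source_index, target_index) links.
Alignment = List[Tuple[int, int]]


def crosses_boundary(span: Span, constituents: Set[Span]) -> bool:
    """Return True if `span` partially overlaps any parse constituent,
    i.e. the two spans overlap but neither contains the other."""
    s, e = span
    for cs, ce in constituents:
        overlaps = s <= ce and cs <= e
        nested = (cs <= s and e <= ce) or (s <= cs and ce <= e)
        if overlaps and not nested:
            return True
    return False


def target_span(src: Span, alignment: Alignment) -> Optional[Span]:
    """Project a source span onto the target side via alignment links."""
    tgt = [t for s, t in alignment if src[0] <= s <= src[1]]
    return (min(tgt), max(tgt)) if tgt else None


def orientation(first: Span, second: Span, alignment: Alignment) -> str:
    """Relative target-side order of two source spans, where `first`
    precedes `second` in the source. The labels are illustrative only."""
    ta, tb = target_span(first, alignment), target_span(second, alignment)
    if ta is None or tb is None:
        return "unaligned"
    if ta[1] < tb[0]:
        return "monotone"
    if tb[1] < ta[0]:
        return "swap"
    return "overlap"


# Toy example: a 5-token source sentence with three constituents.
constituents = {(0, 1), (2, 4), (0, 4)}
print(crosses_boundary((1, 2), constituents))  # straddles (0, 1) and (2, 4)
print(crosses_boundary((2, 4), constituents))  # matches a constituent exactly

# Toy alignment in which the first two source tokens move to the end.
alignment = [(0, 3), (1, 4), (2, 0), (3, 1), (4, 2)]
print(orientation((0, 1), (2, 4), alignment))
```

In a setup like this, the boundary check could act as a soft feature that favors phrases aligned with constituents, while the orientation labels could serve as training targets for a reordering model over predicate-argument and argument-argument pairs. Both uses are assumptions about how such signals might be applied, not a description of the authors' system.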
