Two monolingual parses are better than one (synchronous parse)

We describe a synchronous parsing algorithm that is based on two successive monolingual parses of an input sentence pair. Although the worst-case complexity of this algorithm is and must be O(n6) for binary SCFGs, its average-case run-time is far better. We demonstrate that for a number of common synchronous parsing problems, the two-parse algorithm substantially outperforms alternative synchronous parsing strategies, making it efficient enough to be utilized without resorting to a pruned search.

[1]  Daniel Gildea,et al.  Bayesian Learning of Non-Compositional Phrases with Synchronous Parsing , 2008, ACL.

[2]  Mehryar Mohri,et al.  Weighted Automata Algorithms , 2009 .

[3]  Cyril Allauzen,et al.  N-Way Composition of Weighted Finite-State Transducers , 2009, Int. J. Found. Comput. Sci..

[4]  Walter L. Ruzzo,et al.  An Improved Context-Free Recognizer , 1980, ACM Trans. Program. Lang. Syst..

[5]  Giorgio Satta,et al.  Some Computational Complexity Results for Synchronous Context-Free Grammars , 2005, HLT/EMNLP.

[6]  Phil Blunsom,et al.  Probabilistic Inference for Machine Translation , 2008, EMNLP.

[7]  Ceriel J. H. Jacobs,et al.  Parsing as Intersection , 2008 .

[8]  John DeNero,et al.  Better Word Alignments with Supervised ITG Models , 2009, ACL.

[9]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[10]  David Chiang,et al.  Hierarchical Phrase-Based Translation , 2007, CL.

[11]  William J. Byrne,et al.  Hierarchical Phrase-Based Translation with Weighted Finite State Transducers , 2009, HLT-NAACL.

[12]  Giorgio Satta,et al.  Translation Algorithms by Means of Language Intersection , 2006 .

[13]  David Chiang,et al.  Forest Rescoring: Faster Decoding with Integrated Language Models , 2007, ACL.

[14]  Cyril Allauzen,et al.  3-Way Composition of Weighted Finite-State Transducers , 2008, CIAA.

[15]  Richard Edwin Stearns,et al.  Syntax-Directed Transduction , 1966, JACM.

[16]  Phil Blunsom,et al.  A Discriminative Latent Variable Model for Statistical Machine Translation , 2008, ACL.

[17]  Franz Josef Och,et al.  A Systematic Comparison of Phrase-Based, Hierarchical and Syntax-Augmented Statistical MT , 2008, COLING.

[18]  Gertjan van Noord The Intersection of Finite State Automata and Definite Clause Grammars , 1995, ACL.

[19]  Andreas Zollmann,et al.  Syntax Augmented Machine Translation via Chart Parsing , 2006, WMT@HLT-NAACL.

[20]  Daniel Gildea,et al.  Binarization of Synchronous Context-Free Grammars , 2009, CL.