A $${\mathcal{O}(|G|n^6)}$$ time extension of inversion transduction grammars

Range concatenation grammars are viewed as a hierarchy of synchronous grammars. It is shown how inversion transduction grammars (ITGs) and extensions thereof, including synchronous tree-adjoining grammars, are captured by the hierarchy, and the expressivity and linguistic relevance of subclasses of the hierarchy are discussed. A $${\mathcal{O}(|G|n^6)}$$ time extension of ITGs is proposed. The extension translates cross-serial dependencies into nested ones and handles complex kinds of discontinuous translation units and so-called inside-out alignments. In fact, our $${\mathcal{O}(|G|n^6)}$$ time extension generates all possible alignments. It is shown that this additional expressivity comes at the cost of probabilistic parsing.

[1]  Zhiyi Chi,et al.  Estimation of Probabilistic Context-Free Grammars , 1998, Comput. Linguistics.

[2]  Anders Søgaard,et al.  Using a maximum entropy-based tagger to improve a very fast vine parser , 2009, IWPT.

[3]  Tadao Kasami,et al.  On Multiple Context-Free Grammars , 1991, Theor. Comput. Sci..

[4]  Dekai Wu,et al.  Empirical lower bounds on translation unit error rate for the full class of inversion transduction grammars , 2009, IWPT.

[5]  H. Karlgren COLING-90 : 計 : papers presented to the 13th International Conference on Computational Linguistics : on the occasion of the 25th anniversary of COLING and the 350th anniversary of Helsinki University , 1990 .

[6]  Alfred V. Aho,et al.  The Theory of Parsing, Translation, and Compiling , 1972 .

[7]  Éric Gaussier,et al.  Aligning words using matrix factorisation , 2004, ACL.

[8]  David J. Weir,et al.  The equivalence of four extensions of context-free grammars , 1994, Mathematical systems theory.

[9]  Pierre Boullier A Proposal for a Natural Lan-guage Processing Syntactic Backbone , 1997 .

[10]  Tadao Kasami,et al.  RNA Pseudoknotted Structure Prediction Using Stochastic Multiple Context-Free Grammar , 2006 .

[11]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[12]  Hermann Ney,et al.  A Comparative Study on Reordering Constraints in Statistical Machine Translation , 2003, ACL.

[13]  Jason Eisner,et al.  Learning Non-Isomorphic Tree Mappings for Machine Translation , 2003, ACL.

[14]  Anders Søgaard Can inversion transduction grammars generate hand alignments , 2010, EAMT.

[15]  Laura Kallmeyer,et al.  Tree-Local Multicomponent Tree-Adjoining Grammars with Shared Nodes , 2005, Computational Linguistics.

[16]  Mirella Lapata,et al.  Optimal Constituent Alignment with Edge Covers for Semantic Projection , 2006, ACL.

[17]  I. Dan Melamed,et al.  Empirical Lower Bounds on the Complexity of Translational Equivalence , 2006, ACL.

[18]  Stuart M. Shieber,et al.  Synchronous Tree-Adjoining Grammars , 1990, COLING.

[19]  Anders Søgaard,et al.  Empirical Lower Bounds on Aligment Error Rates in Syntax-Based Machine Translation , 2009, SSST@HLT-NAACL.

[20]  Giorgio Satta,et al.  Generalized Multitext Grammars , 2004, ACL.

[21]  David Chiang,et al.  Hierarchical Phrase-Based Translation , 2007, CL.

[22]  Robert C. Berwick,et al.  Computational complexity and natural language , 1987 .

[23]  Giorgio Satta,et al.  A Two-Dimensional Hierarchy for Parallel Rewriting Systems , 1994 .

[24]  Stuart M. Shieber,et al.  Probabilistic Synchronous Tree-Adjoining Grammars for Machine Translation: The Argument from Bilingual Dictionaries , 2007, SSST@HLT-NAACL.

[25]  Thomas Sudkamp,et al.  Languages and Machines , 1988 .

[26]  Yonggang Guan Klammergrammatiken, Netzgrammatiken und Interpretationen von Netzen , 1992 .

[27]  Zhiyi Chi,et al.  Statistical Properties of Probabilistic Context-Free Grammars , 1999, CL.

[28]  Marc Dymetman,et al.  Translating with Non-contiguous Phrases , 2005, HLT.

[29]  Stuart M. Shieber,et al.  Simpler TAG semantics through synchronization , 2006 .

[30]  Gerald Gazdar,et al.  Applicability of Indexed Grammars to Natural Languages , 1988 .

[31]  Daniel Gildea,et al.  Extracting Synchronous Grammar Rules From Word-Level Alignments in Linear Time , 2008, COLING.

[32]  Bowen Zhou,et al.  An EM algorithm for SCFG in formal syntax-based translation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[33]  Giorgio Satta,et al.  Some Computational Complexity Results for Synchronous Context-Free Grammars , 2005, HLT/EMNLP.

[34]  Keith textscHall,et al.  Comparing Reordering Constraints for SMT Using Efficient BLEU Oracle Computation , 2007, HLT-NAACL 2007.

[35]  João Graça,et al.  Building a Golden Collection of Parallel Multi-Language Word Alignment , 2008, LREC.

[36]  Francisco Casacuberta,et al.  Submission to ICGI-2000 Computational complexity of problems on probabilistic grammars and transducers , 2007 .

[37]  Daniel Gildea,et al.  Synchronous Binarization for Machine Translation , 2006, NAACL.

[38]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[39]  Pierre Boullier,et al.  A Cubic Time Extension of Context-Free Grammars , 2000, Grammars.

[40]  Peter Poller,et al.  Structural translation with synchronous tree adjoining grammars in VERBMOBIL , 1996 .

[41]  Jürgen Wedekind Approaches to unification in grammar: a brief survey , 1997 .

[42]  Pierre Boullier,et al.  On TAG and Multicomponent TAG Parsing , 1998 .

[43]  Raymond J. Mooney,et al.  Learning Synchronous Grammars for Semantic Parsing with Lambda Calculus , 2007, ACL.

[44]  Dekai Wu Probabilistic synchronous tree-adjoining grammars for machine , 2007 .

[45]  Wolfgangmaier Andanderssøgaard,et al.  Treebanks and Mild Context-Sensitivity , 2008 .

[46]  Dekai Wu Trainable Coarse Bilingual Grammars for Parallel Text Bracketing , 1995, VLC@ACL.

[47]  Mary McGee Wood Natural language parsing and linguistic theories , 2004, Machine Translation.

[48]  Joakim Nivre,et al.  Learning Stochastic Bracketing Inversion Transduction Grammars with a Cubic Time Biparsing Algorithm , 2009, IWPT.

[49]  黄辉,et al.  Machine Translation Using Constraint-Based Synchronous Grammar , 2006 .