Dependency Parsing Schemata and Mildly Non-Projective Dependency Parsing

We introduce dependency parsing schemata, a formal framework based on Sikkel's parsing schemata for constituency parsers, which can be used to describe, analyze, and compare dependency parsing algorithms. We use this framework to describe several well-known projective and non-projective dependency parsers, build correctness proofs, and establish formal relationships between them. We then use the framework to define new polynomial-time parsing algorithms for various mildly non-projective dependency formalisms, including well-nested structures with their gap degree bounded by a constant k in time O(n5+2k), and a new class that includes all gap degree k structures present in several natural language treebanks (which we call mildly ill-nested structures for gap degree k) in time O(n4+3k). Finally, we illustrate how the parsing schema framework can be applied to Link Grammar, a dependency-related formalism.

[1]  Vincenzo Lombardo,et al.  An Earley-type recognizer for dependency grammar , 1996, COLING.

[2]  Marco Kuhlmann,et al.  Dependency Structures and Lexicalized Grammars An Algebraic Approach , 2010, Lecture Notes in Computer Science.

[3]  Vincenzo Lombardo,et al.  Integration of syntactic and lexical information in a hierarchical dependency grammar , 1998, Workshop On Processing Of Dependency-Based Grammars.

[4]  Alexis Nasr,et al.  Pseudo-Projectivity, A Polynomially Parsable Non-Projective Dependency Grammar , 1998, ACL.

[5]  Noah A. Smith,et al.  Compiling Comp Ling: Weighted Dynamic Programming and the Dyna Language , 2005, HLT.

[6]  Jinxi Xu,et al.  A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model , 2008, ACL.

[7]  Giorgio Satta,et al.  A faster parsing algorithm for Lexicalized Tree-Adjoining Grammars , 2000, TAG+.

[8]  Jirí Havelka Beyond Projectivity: Multilingual Evaluation of Constraints and Measures on Non-Projective Structures , 2007, ACL.

[9]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[10]  Michael A. Covington,et al.  A dependency parser for variable-word-order languages , 1990 .

[11]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[12]  Timm Lichte,et al.  Characterizing Discontinuity in Constituent Treebanks , 2009, FG.

[13]  David J. Weir,et al.  A Deductive Approach to Dependency Parsing , 2008, ACL.

[14]  Tadao Kasami,et al.  An Efficient Recognition and Syntax-Analysis Algorithm for Context-Free Languages , 1965 .

[15]  Stuart M. Shieber,et al.  Principles and Implementation of Deductive Parsing , 1994, J. Log. Program..

[16]  Yuan Ding,et al.  Machine Translation Using Probabilistic Synchronous Dependency Insertion Grammars , 2005, ACL.

[17]  Giorgio Satta,et al.  Efficient Parsing for Bilexical Context-Free Grammars and Head Automaton Grammars , 1999, ACL.

[18]  Jason Eisner,et al.  Three New Probabilistic Models for Dependency Parsing: An Exploration , 1996, COLING.

[19]  Miguel A. Alonso,et al.  Tabular Algorithms for TAG , 1999 .

[20]  Koby Crammer,et al.  Online Large-Margin Training of Dependency Parsers , 2005, ACL.

[21]  Joakim Nivre,et al.  Mildly Non-Projective Dependency Structures , 2006, ACL.

[22]  Klaas Sikkel,et al.  Parsing Schemata: A Framework for Specification and Analysis of Parsing Algorithms , 2002 .

[23]  Miguel A. Alonso,et al.  Tabular Algorithms for TAG Parsing , 1999, EACL.

[24]  Richard Johansson,et al.  The CoNLL 2008 Shared Task on Joint Parsing of Syntactic and Semantic Dependencies , 2008, CoNLL.

[25]  Giorgio Satta,et al.  Optimal Reduction of Rule Length in Linear Context-Free Rewriting Systems , 2009, NAACL.

[26]  David J. Weir,et al.  Parsing Mildly Non-Projective Dependency Structures , 2009, EACL.

[27]  Daniel H. Younger,et al.  Recognition and Parsing of Context-Free Languages in Time n^3 , 1967, Inf. Control..

[28]  Kevin Duh,et al.  Multilingual Dependency Parsing using Bayes Point Machines , 2006, NAACL.

[29]  Giuseppe Attardi,et al.  Experiments with a Multilanguage Non-Projective Dependency Parser , 2006, CoNLL.

[30]  D. G. Hays Dependency Theory: A Formalism and Some Observations , 1964 .

[31]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[32]  Yuji Matsumoto,et al.  Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[33]  Günter Hotz,et al.  On Parsing Coupled-Context-Free Languages , 1996, Theor. Comput. Sci..

[34]  Giorgio Satta,et al.  On the Complexity of Non-Projective Data-Driven Dependency Parsing , 2007, IWPT.

[35]  Gerold Schneider,et al.  A Linguistic Comparison of Constituency, Dependency and Link Grammar , 1998 .

[36]  Bernard Lang,et al.  The Structure of Shared Forests in Ambiguous Parsing , 1989, ACL.

[37]  Kemal Oflazer,et al.  The Annotation Process in the Turkish Treebank , 2003, LINC@EACL.

[38]  Yongqiang Li,et al.  A Cascaded Syntactic and Semantic Dependency Parsing System , 2008, CoNLL.

[39]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .

[40]  Noah A. Smith,et al.  Compiling Comp Ling: Weighted Dynamic Programming and the Dyna Language , 2005, HLT.

[41]  Marco Kuhlmann,et al.  Dependency structures and lexicalized grammars , 2007 .

[42]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[43]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[44]  Fernando Pereira,et al.  Non-Projective Dependency Parsing using Spanning Tree Algorithms , 2005, HLT.

[45]  Joakim Nivre,et al.  Characterizing the Errors of Data-Driven Dependency Parsing Models , 2007, EMNLP.

[46]  Ralf Zimmer,et al.  RelEx - Relation extraction using dependency parse trees , 2007, Bioinform..

[47]  Joakim Nivre,et al.  Memory-Based Dependency Parsing , 2004, CoNLL.

[48]  Carlos Gómez-Rodríguez,et al.  Book Reviews: Parsing Schemata for Practical Text Analysis by Carlos Gómez Rodríguez , 2010, CL.

[49]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[50]  Jesús Vilares,et al.  A compiler for parsing schemata , 2009 .

[51]  Saso Dzeroski,et al.  Towards a Slovene Dependency Treebank , 2006, LREC.

[52]  Joakim Nivre,et al.  Pseudo-Projective Dependency Parsing , 2005, ACL.

[53]  M. Felisa Verdejo,et al.  Textual Entailment Recognition Based on Dependency Analysis and WordNet , 2005, MLCW.

[54]  Klaas Sikkel,et al.  How to compare the structure of parsing algorithms , 1994 .

[55]  David J. Weir,et al.  Characterizing Structural Descriptions Produced by Various Grammatical Formalisms , 1987, ACL.

[56]  Manuel Bodirsky,et al.  Well-Nested Drawings as Models of Syntactic Structure ? , 2005 .

[57]  Giorgio Satta,et al.  Recognition of Linear Context-Free Rewriting Systems , 1992, ACL.

[58]  Marco Kuhlmann,et al.  Mildly Non-Projective Dependency Grammar , 2013, CL.

[59]  Marco Kuhlmann,et al.  Mildly Context-Sensitive Dependency Languages , 2007, ACL.

[60]  David Bamman,et al.  The Design and Use of a Latin Dependency Treebank , 2006 .

[61]  Joakim Nivre,et al.  Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines , 2006, CoNLL.

[62]  Eckhard Bick,et al.  Floresta Sintá(c)tica: A treebank for Portuguese , 2002, LREC.

[63]  Aravind K. Joshi,et al.  Tree-Adjoining Grammars , 1997, Handbook of Formal Languages.

[64]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[65]  Miguel A. Alonso,et al.  A compiler for parsing schemata , 2009, Softw. Pract. Exp..

[66]  Joakim Nivre Incremental Non-Projective Dependency Parsing , 2007, HLT-NAACL.

[67]  Jacques Courtin,et al.  Parsing with Dependency Relations and Robust Parsing , 2002 .

[68]  Joakim Nivre,et al.  Integrating Graph-Based and Transition-Based Dependency Parsers , 2008, ACL.

[69]  Joakim Nivre,et al.  MAMBA Meets TIGER: Reconstructing a Swedish Treebank from Antiquity , 2005 .

[70]  Gertjan van Noord,et al.  The Alpino Dependency Treebank , 2001, CLIN.

[71]  Tat-Seng Chua,et al.  Question answering passage retrieval using dependency relations , 2005, SIGIR '05.

[72]  Jay Earley,et al.  An efficient context-free parsing algorithm , 1970, Commun. ACM.

[73]  Jan Hajic,et al.  Prague Arabic Dependency Treebank: Development in Data and Tools , 2004 .

[74]  Haim Gaifman,et al.  Dependency Systems and Phrase-Structure Systems , 1965, Inf. Control..