Incremental Discontinuous Phrase Structure Parsing with the GAP Transition

This article introduces a novel transition system for discontinuous lexicalized constituent parsing called SR-GAP. It is an extension of the shift-reduce algorithm with an additional gap transition. Evaluation on two German treebanks shows that SR-GAP outperforms the previous best transition-based discontinuous parser (Maier, 2015) by a large margin (it is notably twice as accurate on the prediction of discontinuous constituents), and is competitive with the state of the art (Fernández-González and Martins, 2015). As a side contribution, we adapt span features (Hall et al., 2014) to discontinuous parsing.

[1]  Wojciech Skut,et al.  An Annotation Scheme for Free Word Order Languages , 1997, ANLP.

[2]  Danqi Chen,et al.  of the Association for Computational Linguistics: , 2001 .

[3]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[4]  Eric Villemonte de la Clergerie Parsing Mildly Context-Sensitive Languages with Thread Automata , 2002, COLING.

[5]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[6]  Frank Keller,et al.  Probabilistic Parsing for German Using Sister-Head Dependencies , 2003, ACL.

[7]  Brian Roark,et al.  Incremental Parsing with the Perceptron Algorithm , 2004, ACL.

[8]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[9]  Joakim Nivre,et al.  Parsing Discontinuous Phrase Structure with Grammatical Functions , 2008, GoTAL.

[10]  John Langford,et al.  Hash Kernels for Structured Data , 2009, J. Mach. Learn. Res..

[11]  Giorgio Satta,et al.  Optimal Reduction of Rule Length in Linear Context-Free Rewriting Systems , 2009, NAACL.

[12]  Wolfgang Maier,et al.  Direct Parsing of Discontinuous Constituents in German , 2010, SPMRL@NAACL-HLT.

[13]  Daniel Gildea,et al.  Optimal Parsing Strategies for Linear Context-Free Rewriting Systems , 2010, NAACL.

[14]  Laura Kallmeyer,et al.  Parsing Beyond Context-Free Grammars , 2010, Cognitive Technologies.

[15]  Laura Kallmeyer,et al.  PLCFRS Parsing of English Discontinuous Constituents , 2011, IWPT.

[16]  Andreas van Cranenburgh Efficient parsing with Linear Context-Free Rewriting Systems , 2012, EACL.

[17]  Rens Bod,et al.  Discontinuous Parsing with an Efficient and Accurate DOP Model , 2013, IWPT.

[18]  Nizar Habash,et al.  Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages , 2013, SPMRL@EMNLP.

[19]  Yoav Goldberg,et al.  Efficient Implementation of Beam-Search Incremental Parsers , 2013, ACL.

[20]  Benoît Crabbé,et al.  An LR-inspired generalized lexicalized phrase structure parser , 2014, COLING.

[21]  Dan Klein,et al.  Less Grammar, More Features , 2014, ACL.

[22]  Yannick Versley,et al.  Experiments with Easy-first nonprojective constituent parsing , 2014 .

[23]  Yannick Versley Incorporating Semi-supervised Features into Discontinuous Easy-First Constituent Parsing , 2014, ArXiv.

[24]  Haitao Mi,et al.  Shift-Reduce Constituency Parsing with Dynamic Programming and POS Tag Lattice , 2015, NAACL.

[25]  André F. T. Martins,et al.  Parsing as Reduction , 2015, ACL.

[26]  Carlos Gómez-Rodríguez,et al.  An Efficient Dynamic Oracle for Unrestricted Non-Projective Parsing , 2015, ACL.

[27]  Wolfgang Maier,et al.  Discontinuous Incremental Shift-reduce Parsing , 2015, ACL.

[28]  Laura Kallmeyer,et al.  LR Parsing for LCFRS , 2016, NAACL.

[29]  Benoît Crabbé,et al.  Multilingual discriminative lexicalized phrase structure parsing , 2015, EMNLP.

[30]  Timm Lichte,et al.  Discontinuous parsing with continuous trees , 2016 .

[31]  James Cross,et al.  Incremental Parsing with Minimal Features Using Bi-Directional LSTM , 2016, ACL.

[32]  Eliyahu Kiperwasser,et al.  Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations , 2016, TACL.

[33]  Rens Bod,et al.  Data-Oriented Parsing with Discontinuous Constituents and Function Tags , 2016, J. Lang. Model..