Shift-reduce Spinal TAG Parsing with Dynamic Programming

Summary The spinal tree adjoining grammar (TAG) parsing model of [Carreras 08] achieves the current state-of-the-art constituent parsing accuracy on the commonly used English Penn Treebank evaluation setting. Unfortunately, the model has the serious drawback of low parsing efficiency since its Eisner-CKY style parsing algorithm needs O ( n 4 ) computation time for input length n . This paper investigates a more practical solution and presents a beam search shift-reduce algorithm for spinal TAG parsing. Since the algorithm works in O ( bn ) ( b is beam width), it can be expected to provide a significant improvement in parsing speed. However, to achieve faster parsing, it needs to prune a large number of candidates in an exponentially large search space and often suffers from severe search errors. In fact, our experiments show that the basic beam search shift-reduce parser does not work well for spinal TAGs. To alleviate this problem, we extend the proposed shift-reduce algorithm with two techniques: Dynamic Programming of [Huang 10a] and Supertagging. The proposed extended parsing algorithm is about 8 times faster than the Berkeley parser , which is well-known to be fast constituent parsing software, while offering state-of-the-art performance. Moreover, we conduct experiments on the Keyaki Treebank for Japanese to show that the good performance of our proposed parser is language-independent.

[1]  Xavier Carreras,et al.  Transition-based Spinal Parsing , 2015, CoNLL.

[2]  Yusuke Miyao,et al.  Optimal Shift-Reduce Constituent Parsing with Structured Perceptron , 2015, ACL.

[3]  Yue Zhang,et al.  Fast and Accurate Shift-Reduce Constituent Parsing , 2013, ACL.

[4]  Hiroyuki Shindo,et al.  Bayesian Symbol-Refined Tree Substitution Grammars for Syntactic Parsing , 2012, ACL.

[5]  Yang Guo,et al.  Structured Perceptron with Inexact Search , 2012, NAACL.

[6]  Stephen Clark,et al.  Shift-Reduce CCG Parsing , 2011, ACL.

[7]  Yoav Goldberg,et al.  Language-Independent Parsing with Empty Elements , 2011, ACL.

[8]  Mary P. Harper,et al.  Self-Training with Products of Latent Variable Grammars , 2010, EMNLP.

[9]  Kenji Sagae,et al.  Dynamic Programming for Linear-Time Incremental Parsing , 2010, ACL.

[10]  Michael Collins,et al.  Efficient Third-Order Dependency Parsers , 2010, ACL.

[11]  Hiroshi Nakagawa,et al.  Deterministic Shift-Reduce Parsing for Unification-Based Grammars by Using Default Unification , 2009, EACL.

[12]  Joakim Nivre,et al.  Algorithms for Deterministic Incremental Dependency Parsing , 2008, CL.

[13]  Xavier Carreras,et al.  TAG, Dynamic Programming, and the Perceptron for Efficient, Feature-Rich Parsing , 2008, CoNLL.

[14]  Liang Huang Forest Reranking: Discriminative Parsing with Non-Local Features , 2008, ACL.

[15]  Xavier Carreras,et al.  Experiments with a Higher-Order Projective Dependency Parser , 2007, EMNLP.

[16]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[17]  Alon Lavie,et al.  A Best-First Probabilistic Shift-Reduce Parser , 2006, ACL.

[18]  Mihai Surdeanu,et al.  Projective Dependency Parsing with Perceptron , 2006, CoNLL.

[19]  Eugene Charniak,et al.  Effective Self-Training for Parsing , 2006, NAACL.

[20]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[21]  Chu-Ren Huang,et al.  OLACMS: Comparisons and Applications in Chinese and Formosan Languages , 2002, ALR@COLING.

[22]  David Chiang,et al.  Recovering Latent Information in Treebanks , 2002, COLING.

[23]  Srinivas Bangalore,et al.  Supertagging: An Approach to Almost Parsing , 1999, CL.

[24]  Stuart M. Shieber,et al.  Principles and Implementation of Deductive Parsing , 1994, J. Log. Program..

[25]  Haitao Mi,et al.  Shift-Reduce Constituency Parsing with Dynamic Programming and POS Tag Lattice , 2015, NAACL.

[26]  Alastair Butler,et al.  Keyaki Treebank: phrase structure with functional information for Japanese , 2012 .

[27]  Hitoshi Habe,et al.  The Graduate School of Information Science , 2011 .