论文信息 - Fast and Accurate Shift-Reduce Constituent Parsing - 字舞流文

Fast and Accurate Shift-Reduce Constituent Parsing

Shift-reduce dependency parsers give comparable accuracies to their chartbased counterparts, yet the best shiftreduce constituent parsers still lag behind the state-of-the-art. One important reason is the existence of unary nodes in phrase structure trees, which leads to different numbers of shift-reduce actions between different outputs for the same input. This turns out to have a large empirical impact on the framework of global training and beam search. We propose a simple yet effective extension to the shift-reduce process, which eliminates size differences between action sequences in beam-search. Our parser gives comparable accuracies to the state-of-the-art chart parsers. With linear run-time complexity, our parser is over an order of magnitude faster than the fastest chart parser.

Yue Zhang | Jingbo Zhu | Muhua Zhu | Min Zhang | Wenliang Chen | Yue Zhang | Muhua Zhu | Wenliang Chen | Min Zhang | Jingbo Zhu | Yue Zhang

[1] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[2] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[3] Adwait Ratnaparkhi,et al. A Linear Observed Time Statistical Parser Based on Maximum Entropy Models , 1997, EMNLP.

[4] Eugene Charniak,et al. A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[5] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[6] Michael Collins,et al. Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[7] Yuji Matsumoto,et al. Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[8] Brian Roark,et al. Incremental Parsing with the Perceptron Algorithm , 2004, ACL.

[9] Mitchell P. Marcus,et al. On the parameter space of generative lexicalized statistical parsing models , 2004 .

[10] Eugene Charniak,et al. Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[11] M. A. R T A P A L,et al. The Penn Chinese TreeBank: Phrase structure annotation of a large corpus , 2005, Natural Language Engineering.

[12] Percy Liang,et al. Semi-Supervised Learning for Natural Language , 2005 .

[13] Alon Lavie,et al. A Classifier-Based Parser with Linear Run-Time Complexity , 2005, IWPT.

[14] Koby Crammer,et al. Online Large-Margin Training of Dependency Parsers , 2005, ACL.

[15] Alon Lavie,et al. Parser Combination by Reparsing , 2006, NAACL.

[16] Eugene Charniak,et al. Effective Self-Training for Parsing , 2006, NAACL.

[17] Joakim Nivre,et al. MaltParser: A Data-Driven Parser-Generator for Dependency Parsing , 2006, LREC.

[18] Dan Klein,et al. Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[19] Xavier Carreras,et al. TAG, Dynamic Programming, and the Perceptron for Efficient, Feature-Rich Parsing , 2008, CoNLL.

[20] Jinxi Xu,et al. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model , 2008, ACL.

[21] Stephen Clark,et al. Joint Word Segmentation and POS Tagging Using a Single Perceptron , 2008, ACL.

[22] Xavier Carreras,et al. Simple Semi-supervised Dependency Parsing , 2008, ACL.

[23] Liang Huang,et al. Forest Reranking: Discriminative Parsing with Non-Local Features , 2008, ACL.

[24] Mary P. Harper,et al. Self-Training PCFG Grammars with Latent Annotations Across Languages , 2009, EMNLP.

[25] Kentaro Torisawa,et al. Improving Dependency Parsing with Subtrees from Auto-Parsed Data , 2009, EMNLP.

[26] Stephen Clark,et al. Transition-Based Parsing of the Chinese Treebank using a Global Discriminative Model , 2009, IWPT.

[27] Kenji Sagae,et al. Dynamic Programming for Linear-Time Incremental Parsing , 2010, ACL.

[28] Mary P. Harper,et al. Self-Training with Products of Latent Variable Grammars , 2010, EMNLP.

[29] Joakim Nivre,et al. Transition-based Dependency Parsing with Rich Non-local Features , 2011, ACL.

[30] Joakim Nivre,et al. A Transition-Based System for Joint Part-of-Speech Tagging and Labeled Non-Projective Dependency Parsing , 2012, EMNLP.

[31] Weiwei Sun,et al. Capturing Paradigmatic and Syntagmatic Lexical Relations: Towards Accurate Chinese Part-of-Speech Tagging , 2012, ACL.

[32] Jingbo Zhu,et al. Exploiting Lexical Dependencies from Large-Scale Data for Better Shift-Reduce Constituency Parsing , 2012, COLING.

[33] Ling Huang. Improve Chinese Parsing with MaxEnt Reranking Parser , .