Paper Information - What is the Minimal Set of Fragments that Achieves Maximal Parse Accuracy?

We aim to find the minimal set of fragments that achieves maximal parse accuracy in Data-Oriented Parsing. Experiments with the Penn Wall Street Journal treebank show that counts of almost arbitrary fragments within parse trees are important, leading to improved parse accuracy over previous models tested on this treebank (a precision of 90.8% and a recall of 90.6%). We isolate some dependency relations that previous models neglect but that contribute to higher parse accuracy.
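To give a sense of why a minimal fragment set matters, the sketch below counts how many fragments a single parse tree yields. It assumes the usual Data Oriented Parsing (DOP) notion of a fragment: a connected subtree in which every node keeps either all of its children or none of them, with words always kept. The tree encoding, the function names, and the toy sentence are illustrative assumptions and are not taken from the paper.

```python
from math import prod

# Hypothetical encoding: a nonterminal is (label, [children]); a word is a plain string.

def fragments_rooted_at(node):
    """Count the fragments whose root is this node.

    Each nonterminal child can either be left as an open frontier node (1 way)
    or be expanded by any fragment rooted at it; a word child must be kept,
    so it contributes a factor of 1.
    """
    if isinstance(node, str):
        return 0  # a word cannot be the root of a fragment
    _, children = node
    return prod(1 + fragments_rooted_at(child) for child in children)

def total_fragments(node):
    """Count the fragments rooted at any nonterminal node of the tree."""
    if isinstance(node, str):
        return 0
    _, children = node
    return fragments_rooted_at(node) + sum(total_fragments(c) for c in children)

# Toy tree (an assumption for illustration): [S [NP she] [VP [V saw] [NP him]]]
tree = ("S", [("NP", ["she"]), ("VP", [("V", ["saw"]), ("NP", ["him"])])])

print(fragments_rooted_at(tree))  # fragments rooted at S
print(total_fragments(tree))      # fragments rooted anywhere in the tree
```

Even this three-word tree yields 10 fragments rooted at S and 17 overall, and the count grows multiplicatively with tree size, which is why restricting the fragment set is a practical concern for treebank-sized data.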

Rens Bod
