Statistical Parser on the PARC DepBank

We evaluate the accuracy of an unlexicalized statistical parser, trained on 4K treebanked sentences from balanced data and tested on the PARC DepBank. We demonstrate that a parser which is competitive in accuracy (without sacrificing processing speed) can be quickly tuned without reliance on large in-domain manuallyconstructed treebanks. This makes it more practical to use statistical parsers in applications that need access to aspects of predicate-argument structure. The comparison of systems using DepBank is not straightforward, so we extend and validate DepBank and highlight a number of representation and scoring issues for relational evaluation schemes.

[1]  James R. Curran,et al.  The Importance of Supertagging for Wide-Coverage CCG Parsing , 2004, COLING.

[2]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[3]  Hans-Ulrich Krieger,et al.  A Bag of Useful Techniques for Efficient and Robust Parsing , 1999, ACL.

[4]  Ted Briscoe,et al.  Efficient Extraction of Grammatical Relations , 2005, IWPT.

[5]  Ted Briscoe,et al.  An introduction to tag sequence grammars and the RASP system parser , 2006 .

[6]  Takenobu Tokunaga,et al.  A New Formalization of Probabilistic GLR Parsing , 1997, IWPT.

[7]  Jun'ichi Tsujii,et al.  Probabilistic Disambiguation Models for Wide-Coverage HPSG Parsing , 2005, ACL.

[8]  Daniel M. Bikel,et al.  Intricacies of Collins’ Parsing Model , 2004, CL.

[9]  Dekang Lin,et al.  Dependency-Based Evaluation of Minipar , 2003 .

[10]  Daniel Gildea,et al.  Corpus Variation and Parser Performance , 2001, EMNLP.

[11]  Ted Briscoe,et al.  High Precision Extraction of Grammatical Relations , 2001, COLING.

[12]  Ted Briscoe,et al.  Parser evaluation: a survey and a new proposal , 1998, LREC.

[13]  Geoffrey Sampson English for the computer , 1995 .

[14]  Stefan Riezler,et al.  Speed and Accuracy in Shallow and Deep Stochastic Parsing , 2004, NAACL.

[15]  Geoffrey Nunberg,et al.  The linguistics of punctuation , 1990 .

[16]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[17]  Ted Briscoe,et al.  Robust Accurate Statistical Annotation of General Text , 2002, LREC.

[18]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.