论文信息 - Evaluating a Wide-Coverage CCG Parser

Evaluating a Wide-Coverage CCG Parser

This paper compares three evaluation metrics for a CCG parser trained and tested on a CCG version of the Penn Treebank. The standard Parseval metrics can be applied to the output of this parser; however, these metrics are problematic for CCG, and a comparison with scores given for standard Penn Treebank parsers is uninformative. As an alternative, we consider two evaluations based on headdependencies; one considers local dependencies defined in terms of the derivation tree, and one considers dependencies defined in terms of the CCG categories. The latter set of dependencies includes long-range dependencies such as those inherent in coordination and extraction phenomena.

Stephen Clark | Julia Hockenmaier

[1] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[2] Mark Steedman,et al. Generative Models for Statistical Parsing with Combinatory Categorial Grammar , 2002, ACL.

[3] Michael Collins,et al. Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[4] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[5] Mark Steedman,et al. Building Deep Dependency Structures using a Wide-Coverage CCG Parser , 2002, ACL.

[6] Mark Steedman,et al. The syntactic process , 2004, Language, speech, and communication.

[7] Eugene Charniak,et al. A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[8] Julia Hockenmaier,et al. Statistical Parsing for CCG with Simple Generative Models , 2001, ACL.

[9] Ted Briscoe,et al. Parser evaluation: a survey and a new proposal , 1998, LREC.

[10] Dekang Lin,et al. A dependency-based method for evaluating broad-coverage parsers , 1995, Natural Language Engineering.

[11] Mark Steedman,et al. Acquiring Compact Lexicalized Grammars from a Cleaner Treebank , 2002, LREC.