论文信息 - Parsing Internal Noun Phrase Structure with Collins' Models

Parsing Internal Noun Phrase Structure with Collins' Models

Collins’ widely-used parsing models treat noun phrases (NPs) in a different manner to other constituents. We investigate these differences, using the recently released internal NP bracketing data (Vadas and Curran, 2007a). Altering the structure of the Treebank, as this data does, has a number of consequences, as parsers built using Collins’ models assume that their training and test data will have structure similar to the Penn Treebank’s. Our results demonstrate that it is difficult for Collins’ models to adapt to this new NP structure, and that parsers using these models make mistakes as a result. This emphasises how important treebank structure itself is, and the large amount of influence it can have.

James R. Curran | David Vadas

[1] David Vadas. Large-Scale Supervised Models for Noun Phrase Bracketing , 2007 .

[2] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[3] Ralph Grishman,et al. An Improved Extraction Pattern Representation Model for Automatic IE Pattern Acquisition , 2003, ACL.

[4] Dan Klein,et al. Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank , 2001, ACL.

[5] Sanda M. Harabagiu,et al. COGEX: A Logic Prover for Question Answering , 2003, NAACL.

[6] Sandra Kübler. How Do Treebank Annotation Schemes Influence Parsing Results? Or How Not to Compare Apples And Oranges , 2005 .

[7] Mark Johnson,et al. PCFG Models of Linguistic Tree Representations , 1998, CL.

[8] James R. Curran,et al. Adding Noun Phrase Structure to the Penn Treebank , 2007, ACL.

[9] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[10] Josef van Genabith,et al. Treebank Annotation Schemes and Parser Evaluation for German , 2007, EMNLP.

[11] Joshua Goodman,et al. Probabilistic Feature Grammars , 1997, IWPT.

[12] Mitchell P. Marcus,et al. On the parameter space of generative lexicalized statistical parsing models , 2004 .

[13] Donald W. Lee. Close Apposition: An Unresolved Pattern , 1952 .

[14] Michael Collins,et al. Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.