Treebanks Gone Bad Parser Evaluation and Retraining using a Treebank of Ungrammatical Sentences

This article describes how a treebank of ungrammatical sentences can be created from a treebank of well-formed sentences. The treebank creation procedure involves the automatic introduction of frequently occurring grammatical errors into the sentences in an existing treebank, and the minimal transformation of the original analyses in the treebank so that they describe the newly created ill-formed sentences. Such a treebank can be used to test how well a parser is able to ignore grammatical errors in texts (as people do), and can be used to induce a grammar capable of analysing such sentences. This article demonstrates these two applications using the Penn Treebank. In a robustness evaluation experiment, two state-of-the-art statistical parsers are evaluated on an ungrammatical version of Section 23 of the Wall Street Journal (WSJ) portion of the Penn Treebank. This experiment shows that the performance of both parsers degrades with grammatical noise. A breakdown by error type is provided for both parsers. A second experiment retrains both parsers using an ungrammatical version of WSJ Sections 2-21. This experiment indicates that an ungrammatical treebank is a useful resource in improving parser robustness to grammatical errors, but that the correct combination of grammatical and ungrammatical training data has yet to be determined.

[1]  Joseph Paul Stemberger,et al.  Syntactic errors in speech , 1982 .

[2]  Lance A. Miller,et al.  Parse Fitting and Prose Fixing: Getting a Hold on III-Formedness , 1983, Am. J. Comput. Linguistics.

[3]  Ralph Grishman,et al.  A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars , 1991, HLT.

[4]  Robert Dale,et al.  Towards Robust PATR , 1992, COLING.

[5]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[6]  Fuliang Weng,et al.  Handling Syntactic Extra-Grammaticality , 1993, IWPT.

[7]  Nelleke Oostdijk,et al.  English Language Corpora , 1993 .

[8]  Ann Bies,et al.  The Penn Treebank: Annotating Predicate Argument Structure , 1994, HLT.

[9]  Robin Cooper,et al.  Robust Chart Parsing with Mildly Inconsistent Feature Structures , 1994 .

[10]  Jungyun Seo,et al.  A Robust Parser Based on Syntactic Information , 1995, EACL.

[11]  Wolfgang Menzel,et al.  Robust Processing of Natural Language , 1995, KI.

[12]  Carolyn Penstein Rosé,et al.  An Efficient Distribution of Labor in a Two Stage Robust Interpretation Process , 1997, EMNLP.

[13]  Kathleen F. McCoy,et al.  Recognizing Syntactic Errors in the Writing of Second Language Learners , 1998, ACL.

[14]  Carl James,et al.  Errors in Language Learning and Use: Exploring Error Analysis , 1998 .

[15]  Mark Johnson,et al.  Estimators for Stochastic “Unification-Based” Grammars , 1999, ACL.

[16]  D Nicholls,et al.  The Cambridge Learner Corpus-Error coding and analysis , 1999 .

[17]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[18]  Jean Aitchison,et al.  Language and the Internet , 2002, Lit. Linguistic Comput..

[19]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[20]  Frederik Fouvry Robust Processing for Constraint-based Grammar Formalisms , 2003 .

[21]  Geoffrey Sampson,et al.  A test of the leaf-ancestor metric for parse accuracy , 2003, Natural Language Engineering.

[22]  Carl Vogel,et al.  Parsing Ill-Formed Text Using an Error Grammar , 2004, Artificial Intelligence Review.

[23]  Jennifer Foster,et al.  Parsing Ungrammatical Input: an Evaluation Procedure , 2004, LREC.

[24]  Timothy Baldwin,et al.  Arboretum: Using a precision grammar for grammar checking in CALL , 2004 .

[25]  Wolfgang Sternefeld,et al.  Annotating and Querying a Treebank of Suboptimal Structures , 2004 .

[26]  Carl Vogel,et al.  Good Reasons for Noting Bad Grammar : Constructing a Corpus of Ungrammatical Language , 2004 .

[27]  Johnny Bigert Probabilistic Detection of Context-Sensitive Spelling Errors , 2004, LREC.

[28]  Erik Smitterberg,et al.  International Corpus of Learner English , 2004 .

[29]  Hitoshi Isahara,et al.  The Overview of the SST Speech Corpus of Japanese Learner English and Evaluation Through the Experiment on Automatic Detection of Learners' Errors , 2004, LREC.

[30]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[31]  Jennifer Foster Good reasons for noting bad grammar : empirical investigations into the parsing of ungrammatical written English , 2005 .

[32]  Daniel P. Lopresti Performance evaluation for text processing of noisy inputs , 2005, SAC '05.

[33]  Michael Collins,et al.  Discriminative Reranking for Natural Language Parsing , 2000, CL.

[34]  Noah A. Smith,et al.  Guiding Unsupervised Grammar Induction Using Contrastive Estimation , 2005 .

[35]  Noah A. Smith,et al.  Contrastive Estimation: Training Log-Linear Models on Unlabeled Data , 2005, ACL.

[36]  Ola Knutsson,et al.  Unsupervised Evaluation of Parser Robustness , 2005, CICLing.

[37]  Jun'ichi Tsujii,et al.  A discriminative language model with pseudo-negative samples , 2007, ACL.

[38]  Sylvana Sofkova Hashemi Ambiguity Resolution by Reordering Rules in Text Containing Errors , 2007, IWPT.

[39]  Animesh Mukherjee,et al.  Investigation and modeling of the structure of texting language , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[40]  Rohini K. Srihari,et al.  Information extraction for multi-participant, task-oriented, synchronous, computer-mediated communication: a corpus study of chat data , 2007 .

[41]  Berthold Crysmann,et al.  ANNOTATION OF ERROR TYPES FOR GERMAN NEWS CORPUS , .