An Ensemble Method for Selection of High Quality Parses

While the average performance of statistical parsers gradually improves, they still attach to many sentences annotations of rather low quality. The number of such sentences grows when the training and test data are taken from different domains, which is the case for major web applications such as information retrieval and question answering. In this paper we present a Sample Ensemble Parse Assessment (SEPA) algorithm for detecting parse quality. We use a function of the agreement among several copies of a parser, each of which trained on a different sample from the training data, to assess parse quality. We experimented with both generative and reranking parsers (Collins, Charniak and Johnson respectively). We show superior results over several baselines, both when the training and test data are from the same domain and when they are from different domains. For a test setting used by previous work, we show an error reduction of 31% as opposed to their 20%.

[1]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[2]  Oren Etzioni,et al.  Scaling question answering to the Web , 2001, WWW '01.

[3]  Sanda M. Harabagiu,et al.  COGEX: A Logic Prover for Question Answering , 2003, NAACL.

[4]  Michael Collins,et al.  Hidden-Variable Models for Discriminative Reranking , 2005, HLT.

[5]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[6]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[7]  Shlomo Argamon,et al.  Committee-Based Sample Selection for Probabilistic Classifiers , 1999, J. Artif. Intell. Res..

[8]  Oren Etzioni,et al.  Detecting Parser Errors Using Web-based Semantic Filters , 2006, EMNLP.

[9]  Antonio Cisternino,et al.  PiQASso: Pisa Question Answering System , 2001, TREC.

[10]  DaganIdo,et al.  Committee-based sample selection for probabilistic classifiers , 1999 .

[11]  Daniel Gildea,et al.  Corpus Variation and Parser Performance , 2001, EMNLP.

[12]  Miles Osborne,et al.  A Two-Stage Method for Active Learning of Statistical Grammars , 2005, IJCAI.

[13]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[14]  David Yarowsky,et al.  Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun Phrase Chunking , 2000, ACL.

[15]  Eric Brill,et al.  Bagging and Boosting a Treebank Parser , 2000, ANLP.

[16]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.