The Mention-Pair Model

This chapter introduces one of the earliest and most influential machine learning approaches to coreference resolution, the mention-pair model. First proposed in the mid-1990s and developed into a more generic resolver by Soon et al. (2001) and many others, this simple model remains a popular benchmark in learning-based coreference resolution research. The mention-pair model recasts coreference resolution as a classification task: a classifier is trained to decide, for a given pair of noun phrases, whether or not they corefer. In a second step, full coreference chains are built by clustering these pairwise decisions. This chapter reviews the main building blocks of the mention-pair model: the construction of positive and negative instances and the related problem of data set skewness, the selection of informative features, and the choice of machine learner and clustering mechanism.
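The two steps described above can be illustrated with a minimal Python sketch. It is not taken from the chapter. It assumes one commonly described instance-creation scheme, associated with Soon et al. (2001), in which each anaphor is paired positively with its closest preceding antecedent and negatively with every mention in between, followed by closest-first clustering. The names `make_training_pairs`, `closest_first_clusters`, and `string_match` are hypothetical, and the exact string-match rule merely stands in for a trained classifier.

```python
from typing import Callable, List, Optional, Tuple

# A mention is (text, gold_entity_id); the id is None for unannotated mentions.
Mention = Tuple[str, Optional[int]]


def make_training_pairs(mentions: List[Mention]) -> List[Tuple[int, int, bool]]:
    """Build (antecedent_index, anaphor_index, label) training instances."""
    pairs: List[Tuple[int, int, bool]] = []
    for j, (_, entity) in enumerate(mentions):
        if entity is None:
            continue
        # Closest preceding mention of the same entity, if any.
        closest = next(
            (i for i in range(j - 1, -1, -1) if mentions[i][1] == entity), None
        )
        if closest is None:
            continue  # first mention of its chain: no instances generated
        pairs.append((closest, j, True))
        # Mentions between the antecedent and the anaphor become negatives.
        pairs.extend((i, j, False) for i in range(closest + 1, j))
    return pairs


def closest_first_clusters(
    n: int, corefer: Callable[[int, int], bool]
) -> List[List[int]]:
    """Link each mention to the nearest preceding mention the classifier accepts."""
    chain_of = list(range(n))  # every mention starts in its own chain
    for j in range(n):
        for i in range(j - 1, -1, -1):
            if corefer(i, j):
                chain_of[j] = chain_of[i]
                break
    chains: dict = {}
    for index, chain_id in enumerate(chain_of):
        chains.setdefault(chain_id, []).append(index)
    return list(chains.values())


# Toy document (assumed data, for illustration only).
mentions: List[Mention] = [
    ("Obama", 1),
    ("the president", 1),
    ("Clinton", 2),
    ("he", 1),
    ("Clinton", 2),
]

training_pairs = make_training_pairs(mentions)


def string_match(i: int, j: int) -> bool:
    """Hypothetical stand-in for a trained pairwise classifier."""
    return mentions[i][0].lower() == mentions[j][0].lower()


predicted_chains = closest_first_clusters(len(mentions), string_match)
```

Even in this toy document the scheme produces negative instances alongside the positives, and on longer texts the negatives typically dominate, which is the skewness problem the chapter refers to. The string-match stand-in recovers only the two "Clinton" mentions as a chain, which hints at why feature selection matters.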
