Easy-first Coreference Resolution

We describe an approach to coreference resolution that relies on the intuition that easy decisions should be made early, while harder decisions should be left for later when more information is available. We are inspired by the recent success of the rule-based system of Raghunathan et al. (2010), which relies on the same intuition. Our system, however, automatically learns from training data what constitutes an easy decision. Thus, we can utilize more features, learn more precise weights, and adapt to any dataset for which training data is available. Experiments show that our system outperforms recent state-of-the-art coreference systems including Raghunathan et al.’s system as well as a competitive baseline that uses a pairwise classifier.

[1]  Breck Baldwin,et al.  Algorithms for Scoring Coreference Chains , 1998 .

[2]  Dekang Lin,et al.  Bootstrapping Path-Based Pronoun Resolution , 2006, ACL.

[3]  Jian Su,et al.  Coreference Resolution Using Competition Learning Approach , 2003, ACL.

[4]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[5]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[6]  Claire Cardie,et al.  Reconcile: A Coreference Resolution Research Platform , 2010 .

[7]  Claire Cardie,et al.  Coreference Resolution with Reconcile , 2010, ACL.

[8]  Daniel Marcu,et al.  Learning as search optimization: approximate large margin methods for structured prediction , 2005, ICML.

[9]  Dan Roth,et al.  Understanding the Value of Features for Coreference Resolution , 2008, EMNLP.

[10]  Claire Cardie,et al.  Conundrums in Noun Phrase Coreference Resolution: Making Sense of the State-of-the-Art , 2009, ACL.

[11]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[12]  John Langford,et al.  Search-based structured prediction , 2009, Machine Learning.

[13]  Su Jian,et al.  A high-performance coreference resolution system using a constraint-based multi-agent strategy , 2004, COLING 2004.

[14]  Nianwen Xue,et al.  CoNLL-2011 Shared Task: Modeling Unrestricted Coreference in OntoNotes , 2011, CoNLL Shared Task.

[15]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[16]  Lynette Hirschman,et al.  A Model-Theoretic Coreference Scoring Scheme , 1995, MUC.

[17]  Pedro M. Domingos,et al.  Joint Unsupervised Coreference Resolution with Markov Logic , 2008, EMNLP.

[18]  Yang Guo,et al.  Structured Perceptron with Inexact Search , 2012, NAACL.

[19]  Dan Klein,et al.  Coreference Resolution in a Modular, Entity-Centered Model , 2010, NAACL.

[20]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[21]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[22]  Claire Gardent,et al.  Improving Machine Learning Approaches to Coreference Resolution , 2002, ACL.

[23]  H. Sebastian Seung,et al.  Learning to Agglomerate Superpixel Hierarchies , 2011, NIPS.

[24]  Yoav Goldberg,et al.  An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing , 2010, NAACL.

[25]  Jian Su,et al.  A High-Performance Coreference Resolution System using a Constraint-based Multi-Agent Strategy , 2004, COLING.

[26]  Veselin Stoyanov,et al.  Empirical Risk Minimization of Graphical Model Parameters Given Approximate Inference, Decoding, and Model Structure , 2011, AISTATS.

[27]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[28]  Veselin Stoyanov,et al.  Minimum-Risk Training of Approximate CRF-Based NLP Systems , 2012, NAACL.

[29]  Brian Roark,et al.  Incremental Parsing with the Perceptron Algorithm , 2004, ACL.

[30]  Constantin Orasan,et al.  NPs for Events: Experiments in Coreference Annotation , 2006, LREC.

[31]  Andrew McCallum,et al.  First-Order Probabilistic Models for Coreference Resolution , 2007, NAACL.