Entity-Centric Coreference Resolution of Person Entities for Open Information Extraction

Abstract: This work presents a coreference resolution system for person entities. It is based on a multi-pass architecture that sequentially applies a set of independent modules, following an entity-centric approach. Several evaluations show that the system obtains promising results in different scenarios (71% and 81% CoNLL F1). The impact of coreference resolution on information extraction was also analyzed by applying an open information extraction system to the output of the coreference resolution tool. The results of this test indicate that resolving coreference first improves both the recall and the precision of information extraction. The evaluations were carried out in Spanish, Portuguese and Galician, and all the resources and tools are freely distributed.
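
The multi-pass, entity-centric idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Mention` and `Entity` classes, the three passes (`exact_match`, `token_subset`, `pronoun_gender`) and the sample mentions are hypothetical and are not the modules of the system itself. The sketch only shows the general pattern: passes run one after another, and each pass compares a mention against whole entities (clusters built by earlier passes) rather than against single mentions.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Mention:
    text: str
    position: int                 # order of appearance in the document
    gender: Optional[str] = None  # "m", "f", or None when unknown
    is_pronoun: bool = False


@dataclass
class Entity:
    mentions: List[Mention] = field(default_factory=list)

    @property
    def gender(self) -> Optional[str]:
        # Entity-level attribute: first known gender among its mentions.
        for m in self.mentions:
            if m.gender is not None:
                return m.gender
        return None

    def names(self) -> set:
        # Lower-cased surface forms of the non-pronominal mentions.
        return {m.text.lower() for m in self.mentions if not m.is_pronoun}


# A pass decides whether a mention may join an already built entity.
Pass = Callable[[Mention, Entity], bool]


def exact_match(mention: Mention, entity: Entity) -> bool:
    # Hypothetical pass 1: identical name strings.
    return not mention.is_pronoun and mention.text.lower() in entity.names()


def token_subset(mention: Mention, entity: Entity) -> bool:
    # Hypothetical pass 2: "Silva" may join an entity containing "Maria Silva".
    if mention.is_pronoun:
        return False
    tokens = set(mention.text.lower().split())
    return any(tokens <= set(name.split()) for name in entity.names())


def pronoun_gender(mention: Mention, entity: Entity) -> bool:
    # Hypothetical pass 3: a pronoun joins an entity with the same gender.
    return (mention.is_pronoun and mention.gender is not None
            and mention.gender == entity.gender)


def resolve(mentions: List[Mention], passes: List[Pass]) -> List[Entity]:
    # Start with one singleton entity per mention, in document order.
    entities = [Entity([m]) for m in sorted(mentions, key=lambda m: m.position)]
    for accept in passes:  # passes are applied sequentially
        merged: List[Entity] = []
        for ent in entities:
            head = ent.mentions[0]
            # Try the nearest preceding entity first.
            target = next((e for e in reversed(merged) if accept(head, e)), None)
            if target is None:
                merged.append(ent)
            else:
                target.mentions.extend(ent.mentions)
        entities = merged
    return entities


mentions = [
    Mention("Maria Silva", 0, gender="f"),
    Mention("João Costa", 1, gender="m"),
    Mention("Silva", 2),
    Mention("ela", 3, gender="f", is_pronoun=True),
    Mention("Maria Silva", 4),
]
entities = resolve(mentions, [exact_match, token_subset, pronoun_gender])
for ent in entities:
    print(sorted(m.position for m in ent.mentions))
```

In this toy run the pronoun "ela" is attached in the last pass because the entity it joins already carries a gender inherited from "Maria Silva". Sharing attributes at the entity level in this way is the kind of benefit an entity-centric design is meant to provide.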
