论文信息 - Evaluation of Coreference Resolution Tools for Polish from the Information Extraction Perspective

Evaluation of Coreference Resolution Tools for Polish from the Information Extraction Perspective

In this paper we discuss the performance of existing tools for coreference resolution for Polish from the perspective of information extraction tasks. We take into consideration the source of mentions, i.e., gold standard vs mentions recognized automatically. We evaluate three existing tools, i.e., IKAR, Ruler and Bartek on the KPWr corpus. We show that the widely used metrics for coreference evaluation (B3, MUC, CEAF, BLANC) do not reflect the real performance when dealing with the task of semantic relations recognition between named entities. Thus, we propose a supplementary metric called PARENT, which measures the correctness of linking between referential mentions and named entities.

Adam Kaczmarek | Michal Marcinczuk

[1] Eduard H. Hovy,et al. BLANC: Implementing the Rand index for coreference evaluation , 2010, Natural Language Engineering.

[2] Maciej Piasecki,et al. Approaching plWordNet 2.0 , 2012 .

[3] Breck Baldwin,et al. Algorithms for Scoring Coreference Chains , 1998 .

[4] Ian H. Witten,et al. The WEKA data mining software: an update , 2009, SKDD.

[5] Lynette Hirschman,et al. A Model-Theoretic Coreference Scoring Scheme , 1995, MUC.

[6] Bartosz Broda,et al. IKAR: An Improved Kit for Anaphora Resolution for Polish , 2012, COLING.

[7] Xiaoqiang Luo,et al. Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation , 2014, ACL.

[8] Yannick Versley,et al. BART: A Modular Toolkit for Coreference Resolution , 2008, ACL.

[9] Xiaoqiang Luo,et al. An Extension of BLANC to System Mentions , 2014, ACL.

[10] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[11] Michael Strube,et al. Evaluation Metrics For End-to-End Coreference Resolution Systems , 2010, SIGDIAL Conference.