GERBIL - Benchmarking Named Entity Recognition and Linking consistently

The ability to compare frameworks from the same domain is of central importance for their introduction into complex applications. In the domains of named entity recognition and entity linking, the large number of systems and their orthogonal evaluation w.r.t. measures and datasets has led to an unclear landscape pertaining to the abilities and weaknesses of the different frameworks. We present GERBIL—an improved platform for repeatable, storable and citable semantic annotation experiments— and how we extended it since its release. With GERBIL, we narrowed this evaluation gap by generating concise, archivable, humanand machine-readable experiments, analytics and diagnostics. The rationale behind our framework is to provide developers, end users and researchers with easy-to-use interfaces that allow for the agile, fine-grained and uniform evaluation of annotation tools on multiple datasets. By these means, we aim to ensure that both tool developers and end users can derive meaningful insights pertaining to the extension, integration and use of annotation applications. In particular, GERBIL provides comparable results to tool developers so as to allow them to easily discover the strengths and weaknesses of their implementations with respect to the state of the art. With the permanent experiment URIs provided by our framework, we ensure the reproducibility and archiving of evaluation results. Moreover, the framework generates data in machine-processable format, allowing for the efficient querying and post-processing of evaluation results. Additionally, the tool diagnostics provided by GERBIL allows deriving insights pertaining to the areas in which tools should be further refined, thus allowing developers to create an informed agenda for extensions and end users to detect the right tools for their purposes. Finally, we implemented additional types of experiments including entity typing. GERBIL aims to become a focal point for the state of the art, driving the research agenda of the community by presenting comparable objective evaluation results. Furthermore, we tackle the central problem of the evaluation of entity linking, i.e., we answer the question how an evaluation algorithm can compare two URIs to each other without being bound to a specific knowledge base. Our approach to this problem opens a way to address the deprecation of URIs of existing gold standards for named entity recognition and entity linking, a feature which is currently not supported by the state of the art. We derived the importance of this feature from usage and dataset requirements collected from the GERBIL user community, which has already carried out more than 24.000 single evaluations using our framework. Through the resulting updates, GERBIL now supports 8 tasks, 46 datasets and 20 systems.

[1]  Mark Steedman,et al.  Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, 27-31 July 2011, John McIntyre Conference Centre, Edinburgh, UK, A meeting of SIGDAT, a Special Interest Group of the ACL , 2011, EMNLP.

[2]  James Cheney,et al.  PROV-O: The PROV ontology:W3C recommendation 30 April 2013 , 2013 .

[3]  Aba-Sah Dadzie,et al.  Making Sense of Microposts (#Microposts2014) Named Entity Extraction & Linking Challenge , 2014, #MSM.

[4]  Pablo N. Mendes,et al.  Improving efficiency and accuracy in multilingual entity extraction , 2013, I-SEMANTICS '13.

[5]  Gerhard Weikum,et al.  Discovering emerging entities with ambiguous names , 2014, WWW.

[6]  Sören Auer,et al.  AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data , 2014, International Semantic Web Conference.

[7]  Achim Rettinger,et al.  X-LiSA: Cross-lingual Semantic Annotation , 2014, Proc. VLDB Endow..

[8]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[9]  Axel-Cyrille Ngonga Ngomo,et al.  CETUS - A Baseline Approach to Type Extraction , 2015, SemWebEval@ESWC.

[10]  Paolo Ferragina,et al.  Fast and Accurate Annotation of Short Texts with Wikipedia Pages , 2010, IEEE Software.

[11]  Diego Reforgiato Recupero,et al.  Using FRED for Named Entity Resolution, Linking and Typing for Knowledge Base Population , 2015, SemWebEval@ESWC.

[12]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[13]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[14]  Sebastian Hellmann,et al.  N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format , 2014, LREC.

[15]  Roberto Navigli,et al.  Multilingual Word Sense Disambiguation and Entity Linking for Everybody , 2014, International Semantic Web Conference.

[16]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[17]  Krzysztof Janowicz,et al.  The Semantic Web , 2014, Lecture Notes in Computer Science.

[18]  Adam Kilgarriff,et al.  SENSEVAL: an exercise in evaluating world sense disambiguation programs , 1998, LREC.

[19]  Scott Cotton,et al.  SENSEVAL-2: Overview , 2001, *SEMEVAL.

[20]  Raphaël Troncy,et al.  Learning with the Web: Spotting Named Entities on the Intersection of NERD and Machine Learning , 2013, #MSM.

[21]  Daniel S. Weld,et al.  Design Challenges for Entity Linking , 2015, TACL.

[22]  Philipp Cimiano,et al.  Combining Textual and Graph-Based Features for Named Entity Disambiguation Using Undirected Probabilistic Graphical Models , 2016, EKAW.

[23]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[24]  Martha Palmer,et al.  The English all-words task , 2004, SENSEVAL@ACL.

[25]  M. Dixon,et al.  - 1-Preparing Clean Views of Data for Data Mining , 2000 .

[26]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[27]  Milan Dojchinovski,et al.  Entityclassifier.eu: Real-Time Classification of Entities in Text with Wikipedia , 2013, ECML/PKDD.

[28]  Michael Granitzer,et al.  DoSeR - A Knowledge-Base-Agnostic Framework for Entity Disambiguation Using Semantic Embeddings , 2016, ESWC.

[29]  Raphaël Troncy,et al.  GERBIL: General Entity Annotator Benchmarking Framework , 2015, WWW.

[30]  Paolo Ferragina,et al.  From TagME to WAT: a new entity annotator , 2014, ERD '14.

[31]  Massimiliano Ciaramita,et al.  A framework for benchmarking entity-annotation systems , 2013, WWW.

[32]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[33]  Nathan Schneider,et al.  Association for Computational Linguistics: Human Language Technologies , 2011 .

[34]  Hinrich Schütze,et al.  A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries , 2016, WWW.

[35]  Yonghuai Liu,et al.  An easy-to-use evaluation framework for benchmarking entity recognition and disambiguation systems , 2017, Frontiers of Information Technology & Electronic Engineering.

[36]  Mark Steedman,et al.  Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013, 18-21 October 2013, Grand Hyatt Seattle, Seattle, Washington, USA, A meeting of SIGDAT, a Special Interest Group of the ACL , 2013, EMNLP.

[37]  Sören Auer,et al.  Linked SDMX Data: Path to high fidelity Statistical Linked Data , 2015, Semantic Web.

[38]  Jens Lehmann,et al.  Integrating NLP Using Linked Data , 2013, SEMWEB.

[39]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[40]  Michael Hausenblas,et al.  Describing linked datasets with the VoID vocabulary , 2011 .

[41]  Gerhard Weikum,et al.  KORE: keyphrase overlap relatedness for entity disambiguation , 2012, CIKM.

[42]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[43]  Roberto Navigli,et al.  SemEval-2013 Task 12: Multilingual Word Sense Disambiguation , 2013, *SEMEVAL.

[44]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[45]  Paul T. Groth,et al.  The Semantic Web – ISWC 2014 , 2014, Lecture Notes in Computer Science.

[46]  Harald Sack,et al.  Statistical Analyses of Named Entity Disambiguation Benchmarks , 2013, NLP-DBPEDIA@ISWC.

[47]  Salvatore Orlando,et al.  Dexter: an open source framework for entity linking , 2013, ESAIR '13.

[48]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[49]  Silviu Cucerzan,et al.  Large-Scale Named Entity Disambiguation Based on Wikipedia Data , 2007, EMNLP.

[50]  Martha Palmer,et al.  SemEval-2007 Task-17: English Lexical Sample, SRL and All Words , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[51]  Raphaël Troncy,et al.  Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web , 2014, LREC.

[52]  Harald Sack,et al.  Semantic Multimedia Information Retrieval Based on Contextual Descriptions , 2013, ESWC.

[53]  Aba-Sah Dadzie,et al.  Proceedings of the 4 th Workshop on Making Sense of Microposts ( # Microposts 2014 ) Big things come in small packages , 2014 .

[54]  Mark A. Przybocki,et al.  The Automatic Content Extraction (ACE) Program – Tasks, Data, and Evaluation , 2004, LREC.

[55]  Entity Recognition and Disambiguation Challenge Workshop , 2014 .

[56]  Roberto Navigli,et al.  SemEval-2007 Task 07: Coarse-Grained English All-Words Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[57]  Beth M. Sundheim,et al.  TIPSTER/MUC-5 Information Extraction System Evaluation , 1993, TIPSTER.

[58]  Thomas Hofmann,et al.  Probabilistic Bag-Of-Hyperlinks Model for Entity Linking , 2015, WWW.

[59]  Mark Dredze,et al.  Annotating Named Entities in Twitter Data with Crowdsourcing , 2010, Mturk@HLT-NAACL.

[60]  Axel-Cyrille Ngonga Ngomo,et al.  Evaluating Entity Annotators Using GERBIL , 2015, ESWC.

[61]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[62]  Dan Roth,et al.  Relational Inference for Wikification , 2013, EMNLP.

[63]  Harald Sack,et al.  Don't compare Apples to Oranges: Extending GERBIL for a fine grained NEL evaluation , 2016, SEMANTICS.

[64]  Markus Freudenberg,et al.  DataID: towards semantically rich metadata for complex datasets , 2014, SEM '14.

[65]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[66]  Giuseppe Ottaviano,et al.  Fast and Space-Efficient Entity Linking for Queries , 2015, WSDM.

[67]  Fabian M. Suchanek,et al.  YAGO3: A Knowledge Base from Multilingual Wikipedias , 2015, CIDR.

[68]  Andrea Giovanni Nuzzolese,et al.  Open Knowledge Extraction Challenge , 2015, SemWebEval@ESWC.

[69]  Axel-Cyrille Ngonga Ngomo,et al.  Ensemble Learning for Named Entity Recognition , 2014, SEMWEB.

[70]  Raphaël Troncy,et al.  Analysis of named entity recognition and linking for tweets , 2014, Inf. Process. Manag..

[71]  Aba-Sah Dadzie,et al.  Proceedings of the the 4th Workshop on Making Sense of Microposts co-located with the 23rd International World Wide Web Conference (WWW 2014) , 2014 .