Named Entity Linking in a Complex Domain: Case Second World War History

This paper discusses the challenges of applying named entity linking in a rich, complex domain – specifically, the linking of (1) military units, (2) places, and (3) people in the context of interlinked Second World War data. Multiple sub-scenarios are discussed in detail through concrete evaluations that analyze the problems faced and the solutions developed. A key contribution of this work is to highlight the heterogeneity of problems and approaches needed even within a single domain, depending on both the source data and the target authority.
