KLocator: An Ontology-Based Framework for Scenario-Driven Geographical Scope Resolution

The automatic extraction of geographical information from textual pieces of information is a challenging task that has been getting increasing attention from application and research areas that need to incorporate locationawareness in their methods and services. In this paper, we present KLocator, a novel ontology-based system for correctly identifying geographical entity references within texts and mapping them to knowledge sources, as well as determining the geographical scope of texts, namely the areas and regions to which the texts are geographically relevant. Compared to other similar approaches, KLocator has two important novelties: i) It does not utilize only background geographical information for performing the above tasks but allows the exploitation of any kind of semantic information that is explicitly or implicitly related to geographical entities in the given domain and application scenario. ii) It is highly customizable, allowing users to define and apply custom geographical resolution models that best fit to the domain(s) and expected content of the texts to be analyzed. Both these features, according to our experiments, manage to substantially improve the effectiveness of the geographical entity and scope resolution tasks, especially in scenarios where explicit geographical information is scarce. Keywords-Geographical Entity Resolution; Geographical Scope Resolution; Ontologies; Semantic Data.

[1]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[2]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[3]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[4]  Anja Jentzsch Linked Open Data Cloud , 2014 .

[5]  Jens Lehmann,et al.  LinkedGeoData: A core for a web of spatial open data , 2012, Semantic Web.

[6]  Krys J. Kochut,et al.  Wikipedia in Action: Ontological Knowledge in Text Categorization , 2008, 2008 IEEE International Conference on Semantic Computing.

[7]  Asunción Gómez-Pérez,et al.  GeoLinked data and INSPIRE through an application case , 2010, GIS '10.

[8]  Georg Gartner,et al.  Applications of location–based services: a selected review , 2007, J. Locat. Based Serv..

[9]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[10]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[11]  Raphaël Troncy,et al.  POLITECNICO DI TORINO Repository ISTITUZIONALE NERD : A Framework for Evaluating Named Entity Recognition Tools in the Web of Data / , 2022 .

[12]  Eero Hyvönen,et al.  Ontology-based Disambiguation of Spatiotemporal Locations , 2008, IRSW.

[13]  Panos Alexopoulos,et al.  Optimizing Geographical Entity and Scope Resolution in Texts using Non-Geographical Semantic Information , 2012 .

[14]  Daqing He,et al.  Geographic Named Entity Disambiguation with Automatic Profile Generation , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[15]  Andreas Abecker,et al.  Entity Reference Resolution via Spreading Activation on RDF-Graphs , 2010, ESWC.

[16]  Gosse Bouma,et al.  Placename Ambiguity Resolution , 2008 .

[17]  Gosse Bouma,et al.  Every document has a geographical scope , 2012, Data Knowl. Eng..

[18]  Mukesh K. Mohania,et al.  Efficiently linking text documents with relevant structured information , 2006, VLDB.

[19]  Yannis Avrithis,et al.  Automatic Thematic Categorization of Multimedia Documents using Ontological Information and Fuzzy Algebra , 2006 .