A lightweight approach to explore, enrich and use data with a geospatial dimension with semantic web technologies

The concept of "location" provides one a useful dimension to explore, align, combine, and analyze data. Though one can rely on bespoke GIS systems to conduct their data analyses, we aim to investigate the feasibility of using Semantic Web technologies to leverage the exploration and enrichment of data in CSV files with the vast amount of geographic and geospatial data that are available on the Linked Data Web. In this paper, we propose a lightweight method and set of tools for: uplift - transforming non-RDF resources into RDF documents; creating links between RDF datasets; client-side processing of geospatial functions; and downlift - transforming (enriched) RDF documents back into a non-RDF format. With this approach, people who wish to avail of the spatial dimension in data can do so from their client (e.g., in a browser) without the need to rely on bespoke technology. This could be of great utility for decision makers and scholars, amongst others. We applied our approach on datasets that are hosted on the Irish open data portal, and combined it with authoritative geospatial data made available by Ordnance Survey Ireland (OSi). Albeit aware that our approach cannot compete with specialist tools, we do demonstrate its feasibility. Though currently conducted for enriching datasets hosted on the Irish open data portal, future work will look into broader governance and provenance aspects of geospatial data enriched dataset management.

[1]  Spiros Athanasiou,et al.  TripleGeo: an ETL Tool for Transforming Geospatial Data into RDF Triples , 2014, EDBT/ICDT Workshops.

[2]  Michael Martin,et al.  Exploring the web of spatial data with facete , 2014, WWW '14 Companion.

[3]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[4]  Ruben Verborgh,et al.  Triple Pattern Fragments: A low-cost knowledge graph interface for the Web , 2016, J. Web Semant..

[5]  Robert Isele,et al.  Silk Server - Adding missing Links while consuming Linked Data , 2010, COLD.

[6]  Christophe Debruyne,et al.  R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings , 2016, LDOW@WWW.

[7]  Christophe Debruyne,et al.  Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments , 2017, LDOW@WWW.

[8]  Axel-Cyrille Ngonga Ngomo,et al.  EAGLE: Efficient Active Learning of Link Specifications Using Genetic Programming , 2012, ESWC.

[9]  Stefan Decker,et al.  Linked data authority records for Irish place names , 2014, International Journal on Digital Libraries.

[10]  森 純一郎,et al.  Proceedings of the Workshop on Linked Data on the Web, LDOW 2016, co-located with 25th International World Wide Web Conference (WWW 2016) , 2016, LDOW@WWW.

[11]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[12]  Hugh Glaser,et al.  Linked Open Government Data: Lessons from Data.gov.uk , 2012, IEEE Intelligent Systems.

[13]  Jens Lehmann,et al.  Automating RDF Dataset Transformation and Enrichment , 2015, ESWC.

[14]  Dave Kolas,et al.  Enabling the geospatial Semantic Web with Parliament and GeoSPARQL , 2012, Semantic Web.

[15]  Christophe Debruyne,et al.  Towards a project centric metadata model and lifecycle for ontology mapping governance , 2015, iiWAS.

[16]  Sören Auer,et al.  LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data , 2011, IJCAI.

[17]  Axel Polleres,et al.  Binary RDF representation for publication and exchange (HDT) , 2013, J. Web Semant..