Multidimensional and quantitative interlinking approach for Linked Geospatial Data

ABSTRACT Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data. However, data interlinking, which is the most valuable contribution of Linked Data, remains incomplete and inaccurate. This study proposes a multidimensional and quantitative interlinking approach for Linked Data in the geospatial domain. According to the characteristics and roles of geospatial data in data discovery, eight elementary data characteristics are adopted as data interlinking types. These elementary characteristics are further combined to form compound and overall data interlinking types. Each data interlinking type possesses one specific predicate to indicate the actual relationship of Linked Data and uses data similarity to represent the correlation degree quantitatively. Therefore, geospatial data interlinking can be expressed by a directed edge associated with a relation predicate and a similarity value. The approach transforms existing simple and qualitative geospatial data interlinking into complete and quantitative interlinking and promotes the establishment of high-quality and trusted Linked Geospatial Data. The approach is applied to build data intra-links in the Chinese National Earth System Scientific Data Sharing Network (NSTI-GEO) and data -links in NSTI-GEO with the Chinese Meteorological Data Network and National Population and Health Scientific Data Sharing Platform.

[1]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[2]  Thérèse Steenberghen,et al.  Publishing metadata of geospatial indicators as Linked Open Data: a policy-oriented approach , 2014, GIScience 2014.

[3]  E. Lynn Usery,et al.  Design and development of linked data from The National Map , 2012, Semantic Web.

[4]  Jens Lehmann,et al.  LinkedGeoData: A core for a web of spatial open data , 2012, Semantic Web.

[5]  David M. Shotton,et al.  Provenance and Linked Data in Biological Data Webs , 2008, LDOW.

[6]  Songshan Yue,et al.  A data description model for reusing, sharing and integrating geo-analysis models , 2015, Environmental Earth Sciences.

[7]  András Bárdossy,et al.  Downscaling daily precipitation time series using a combined circulation- and regression-based approach , 2010 .

[8]  Robert Isele,et al.  Active learning of expressive linkage rules using genetic programming , 2013, J. Web Semant..

[9]  Andreas Schmidt,et al.  Data mining and linked open data – New perspectives for data analysis in environmental research , 2015 .

[10]  David Sánchez,et al.  Ontology-based semantic similarity: A new feature-based approach , 2012, Expert Syst. Appl..

[11]  重信 池戸,et al.  ISO (International Organization for Standardization ; 国際標準化機構) , 1997 .

[12]  Elena Volpi,et al.  Rainfall downscaling in time: theoretical and empirical comparison between multifractal and Hurst-Kolmogorov discrete random cascades , 2012 .

[13]  Óscar Corcho,et al.  Transforming meteorological data into Linked Data , 2013, Semantic Web.

[14]  Markus Freitag,et al.  GovWILD: integrating open government data for transparency , 2012, WWW.

[15]  Guomo Zhou,et al.  Geostatistical interpolation of missing data and downscaling of spatial resolution for remotely sensed atmospheric methane column concentrations , 2012 .

[16]  Bruce Hewitson,et al.  Doubled CO2 precipitation changes for the Susquehanna basin: down-scaling from the Genesis general c , 1998 .

[17]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[18]  Douglas D. Nebert,et al.  Interpreting the ASTM “Content Standard for Digital Geospatial Metadata” , 1996 .

[19]  Pavel Shvaiko,et al.  Trentino Government Linked Open Geo-data: A Case Study , 2012, SEMWEB.

[20]  Ning Liu,et al.  An Algorithm to Generate Data Linkages of Linked Sensor Data , 2015, CyberC.

[21]  T. Saaty How to Make a Decision: The Analytic Hierarchy Process , 1990 .

[22]  Martin Gaedke,et al.  Silk - A Link Discovery Framework for the Web of Data , 2009, LDOW.

[23]  Kurt Keutzer,et al.  Path-delay-fault testability properties of multiplexor-based networks , 1993, Integr..

[24]  Bernhard Haslhofer,et al.  The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data , 2008, LDOW.

[25]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[26]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[27]  Asunción Gómez-Pérez,et al.  Integrating geographical information in the Linked Digital Earth , 2014, Int. J. Digit. Earth.

[28]  M. Koubarakis,et al.  Linked Open Earth Observation Data: The LEO Project , 2013 .

[29]  Jan M. H. Hendrickx,et al.  Up-scaling of SEBAL derived evapotranspiration maps from Landsat (30 m) to MODIS (250 m) scale , 2009 .

[30]  Hugh Glaser,et al.  Managing URI Synonymity to Enable Consistent Reference on the Semantic Web , 2008, IRSW.

[31]  Asunción Gómez-Pérez,et al.  GeoLinked data and INSPIRE through an application case , 2010, GIS '10.

[32]  Ning Liu,et al.  An Algorithm to Generate Data Linkages of Linked Sensor Data , 2015, 2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.

[33]  C. Arms,et al.  Digital Formats : Factors for Sustainability , Functionality , and Quality , 2005 .

[34]  Yong Liu,et al.  Using Linked Data in a heterogeneous Sensor Web: challenges, experiments and lessons learned , 2013, Int. J. Digit. Earth.

[35]  Thomas L. Saaty,et al.  How to Make a Decision: The Analytic Hierarchy Process , 1990 .

[36]  Robert Isele,et al.  Efficient Multidimensional Blocking for Link Discovery without losing Recall , 2011, WebDB.

[37]  Krzysztof Janowicz,et al.  Metadata Topic Harmonization and Semantic Search for Linked‐Data‐Driven Geoportals: A Case Study Using ArcGIS Online , 2015, Trans. GIS.

[38]  Sören Auer,et al.  LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data , 2011, IJCAI.

[39]  James A. Hendler,et al.  TWC LOGD: A portal for linked open government data ecosystems , 2011, J. Web Semant..

[40]  Ben Butchart,et al.  An Infrastructure for Publishing Geospatial Metadata as Open Linked Metadata , .

[41]  Jerry R. Hobbs,et al.  An ontology of time for the semantic web , 2004, TALIP.

[42]  Peng Yue,et al.  A Linked Data Approach for Geospatial Data Provenance , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Jens Lehmann,et al.  LinkedGeoData: Adding a Spatial Dimension to the Web of Data , 2009, SEMWEB.

[44]  Krzysztof Janowicz,et al.  Linked Data - A Paradigm Shift for Geographic Information Science , 2014, GIScience.

[45]  J. Goodwin,et al.  Geographical Linked Data: The Administrative Geography of Great Britain on the Semantic Web , 2008 .

[46]  Susan Stitt,et al.  USA/NBII: Biological Data Profile of the Content Standard for Digital Geospatial Metadata, FGDC-STD-001.1-1999 , 2005 .

[47]  Mark B. Sandler,et al.  Automatic Interlinking of Music Datasets on the Semantic Web , 2008, LDOW.

[48]  Peter Uvin,et al.  Scaling up the grass roots and scaling down the summit: The relations between Third World nongovernmental organisations and the United Nations , 1995 .