On-The-Fly Academic Linked Data Integration

The web of Linked Open Data (LOD) has a prominent and rapid evolution recently. Over the last few years, LOD had developed to involve a wide range of various domains. Due to these facts, and the great interconnections among linked open datasets, linked data integration task had gained a huge attention and became a focal point of research. LOD applications aim to incorporate data from different LOD sources. Unfortunately, these sources of data are heterogeneous in schema and/or in vocabularies. Due to this heterogeneity, numerous challenges are emerging that have to be overcome. In this paper, a LOD integration framework is proposed, which aims to tackle these challenges. It works on integrating academic LOD datasets that reside in different LOD repositories with intrinsic schema and vocabularies heterogeneity. An automatic mapping technique in the integration processes is proposed in this paper. Consequently, an obvious decrease in execution time for the entire integration process, as well as, a great progress in the integrated data quality assessment metrics has been achieved.

[1]  Václav Kaczmarczyk,et al.  Interoperability and ontology for heterogeneous systems , 2010, CI 2010.

[2]  Mark B. Sandler,et al.  Automatic Interlinking of Music Datasets on the Semantic Web , 2008, LDOW.

[3]  Christian Bizer,et al.  The R2R Framework: Publishing and Discovering Mappings on the Web , 2010, COLD.

[4]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[5]  Fatiha Saïs,et al.  Linked Data Annotation and Fusion driven by Data Quality Evaluation , 2015, EGC.

[6]  Antoine Isaac,et al.  SKOS Simple Knowledge Organization System Primer , 2009 .

[7]  Ronald Fagin,et al.  A Declarative Framework for Linking Entities , 2016, ACM Trans. Database Syst..

[8]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[9]  Rafael Corchuelo,et al.  Generating SPARQL Executable Mappings to Integrate Ontologies , 2011, ER.

[10]  Robert Isele,et al.  LDIF - Linked Data Integration Framework , 2011, COLD.

[11]  Craig A. Knoblock,et al.  Karma: A System for Mapping Structured Sources into the Semantic Web , 2012, ESWC.

[12]  Wenqiang Liu,et al.  Truth Discovery to Resolve Object Conflicts in Linked Data , 2015, ArXiv.

[13]  Douglas Tudhope,et al.  Improving interoperability using vocabulary linked data , 2015, International Journal on Digital Libraries.

[14]  Jérôme David,et al.  Context-based ontology matching and data interlinking , 2015 .

[15]  Joseph Moses Juran,et al.  Quality-control handbook , 1951 .

[16]  Patrick Lambrix,et al.  SAMBO - A system for aligning and merging biomedical ontologies , 2006, J. Web Semant..

[17]  Vassilis Christophides,et al.  Resource Description Framework (RDF) Schema (RDFS) , 2009, Encyclopedia of Database Systems.

[18]  Renée J. Miller,et al.  Semantic Link Discovery over Relational Data , 2012, Semantic Search over the Web.

[19]  Robert Isele,et al.  Interlinking and Knowledge Fusion , 2014, Linked Open Data.

[20]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[21]  Leslie F. Sikos Mastering Structured Data on the Semantic Web: From HTML5 Microdata to Linked Open Data , 2015 .

[22]  Óscar Corcho,et al.  Federating queries in SPARQL 1.1: Syntax, semantics and evaluation , 2013, J. Web Semant..

[23]  Leslie F. Sikos,et al.  Mastering Structured Data on the Semantic Web , 2015, Apress.

[24]  Mike Hobbs,et al.  Data integration approach for semi-structured and structured data (Linked Data) , 2015, 2015 IEEE 13th International Conference on Industrial Informatics (INDIN).

[25]  Martin Gaedke,et al.  Silk - A Link Discovery Framework for the Web of Data , 2009, LDOW.

[26]  Stéphane Boyera,et al.  The Semantic Web for all , 2015, Semantic Web.

[27]  Michel Dumontier,et al.  SPARQL-enabled identifier conversion with Identifiers.org , 2015, Bioinform..

[28]  Klaus Tochtermann,et al.  Interlinking Scientific Authors with the LOD Cloud: A Case Study , 2012, NDT.

[29]  Christoph Lange,et al.  Luzzu Quality Metric Language - A DSL for Linked Data Quality Assessment , 2015, ArXiv.

[30]  Ryutaro Ichise,et al.  SLINT: a schema-independent linked data interlinking system , 2012, OM.

[31]  Christian Bizer,et al.  Sieve: linked data quality assessment and fusion , 2012, EDBT-ICDT '12.

[32]  Krzysztof Janowicz,et al.  The GeoLink Framework for Pattern-based Linked Data Integration , 2015, International Semantic Web Conference.

[33]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.