Assessing linkset quality for complementing third-party datasets

Linked data best practices are getting extremely popular: various companies and public institutions have started taking advantage of linked data principles for exposing their datasets, and for relating their datasets to those served by third parties. Such enthusiasm is due to the linked data promise of evolving into a Global Data Space. Linksets are sets of links relating datasets and they surely play a fundamental role in this promise. However, a stable and well-accepted notion of linkset quality has not been yet defined. This paper contributes to overcome this lack by proposing a linkset quality measure. Among the different quality dimensions that can be addressed, the proposed measure focuses on completeness. The paper formally defines novel scoring functions and proposes an interpretation of these functions when maintaining and complementing third party datasets.

[1]  Jens Lehmann,et al.  Assessing Linked Data Mappings Using Network Measures , 2012, ESWC.

[2]  Riccardo Albertoni,et al.  Semantic Similarity and Selection of Resources Published According to Linked Data Best Practice , 2010, OTM Workshops.

[3]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[4]  Tim Berners-Lee,et al.  Linked data , 2020, Semantic Web for the Working Ontologist.

[5]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[6]  Christian Bizer,et al.  Quality-driven information filtering using the WIQA policy framework , 2009, J. Web Semant..

[7]  Christian Bizer,et al.  Sieve: linked data quality assessment and fusion , 2012, EDBT-ICDT '12.

[8]  W. Marsden I and J , 2012 .

[9]  Robert Isele,et al.  LDIF - Linked Data Integration Framework , 2011, COLD.

[10]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[11]  Michael Hausenblas,et al.  Describing linked datasets with the VoID vocabulary , 2011 .

[12]  Elena Console,et al.  Data Fusion , 2009, Encyclopedia of Database Systems.