A comprehensive quality model for Linked Data

With the increasing amount of Linked Data published on the Web, the community has recognised the importance of the quality of such data and a number of initiatives have been undertaken to specify and evaluate Linked Data quality. However, these initiatives are characterised by a high diversity in terms of the quality aspects that they address and measure. This leads to difficulties in comparing and benchmarking evaluation results, as well as in selecting the right data source according to certain quality needs. This paper presents a quality model for Linked Data, which provides a unique terminology and reference for Linked Data quality specification and evaluation. The mentioned quality model specifies a set of quality characteristics and quality measures related to Linked Data, together with formulas for the calculation of measures. Furthermore, this paper also presents an extension of the W3C Data Quality Vocabulary that can be used to capture quality information specific to Linked Data, a Linked Data representation of the Linked Data quality model, and a use case in which the benefits of the quality model proposed in this paper are presented in a tool for Linked Data evaluation.

[1]  G. Steiner,et al.  A CHINESE PROVERB , 2013 .

[2]  B. Kitchenham,et al.  DESMET : A method for evaluating Software Engineering methods and tools , 2000 .

[3]  Maria-Esther Vidal,et al.  Analyzing Linked Data Quality with LiQuate , 2013, OTM Workshops.

[4]  Riccardo Albertoni,et al.  A Linkset Quality Metric Measuring Multilingual Gain in SKOS Thesauri , 2015, LDQ@ESWC.

[5]  Mohsen Kahani,et al.  Quality Metrics for Linked Open Data , 2015, DEXA.

[6]  Christoph Lange,et al.  Luzzu -- A Framework for Linked Data Quality Assessment , 2016, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC).

[7]  O. Hartig Trustworthiness of Data on the Web , 2008 .

[8]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[9]  Mohammad Kazem Akbari,et al.  Customizing ISO 9126 quality model for evaluation of B2B applications , 2009, Inf. Softw. Technol..

[10]  Stefan Schlobach,et al.  LOD Laundromat: A Uniform Way of Publishing Other People's Dirty Data , 2014, SEMWEB.

[11]  Christian Bizer,et al.  Sieve: linked data quality assessment and fusion , 2012, EDBT-ICDT '12.

[12]  Fabien L. Gandon,et al.  RDF 1.1 XML Syntax , 2014 .

[13]  AkbariMohammad Kazem,et al.  Customizing ISO 9126 quality model for evaluation of B2B applications , 2009 .

[14]  Raúl García-Castro,et al.  Benchmarking Semantic Web Technology , 2011, Studies on the Semantic Web.

[15]  James A. Hendler,et al.  Trust Networks on the Semantic Web , 2003, WWW.

[16]  Yolanda Gil,et al.  Trusting Information Sources One Citizen at a Time , 2002, SEMWEB.

[17]  Motoei Azuma SquaRE The next generation of the ISO/IEC 9126 and 14598 international standards series on software product quality , 2001 .

[18]  Andrea Maurino,et al.  ABSTAT: Linked Data Summaries with ABstraction and STATistics , 2015, ESWC.

[19]  Antonio Vallecillo,et al.  Measuring the usability of software components , 2006, J. Syst. Softw..

[20]  Barbara A. Kitchenham,et al.  The use and usefulness of the ISO/IEC 9126 quality standard , 2005, 2005 International Symposium on Empirical Software Engineering, 2005..

[21]  Asunción Gómez-Pérez,et al.  SemQuaRE - An extension of the SQuaRE quality model for the evaluation of semantic technologies , 2015, Comput. Stand. Interfaces.

[22]  Edward G. Schilling,et al.  Juran's Quality Handbook , 1998 .

[23]  Xavier Franch,et al.  Using Quality Models in Software Package Selection , 2003, IEEE Softw..

[24]  Declan O'Sullivan,et al.  Improving Curated Web-Data Quality with Structured Harvesting and Assessment , 2014, Int. J. Semantic Web Inf. Syst..

[25]  Jens Lehmann,et al.  Assessing Linked Data Mappings Using Network Measures , 2012, ESWC.

[26]  Maribel Acosta,et al.  Crowdsourcing Linked Data Quality Assessment , 2013, SEMWEB.

[27]  Felix Naumann,et al.  Profiling linked open data with ProLOD , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[28]  Martin Hepp,et al.  Towards a vocabulary for data quality management in semantic web architectures , 2011, LWDM '11.

[29]  Jens Lehmann,et al.  Linked Open Data Statistics: Collection and Exploitation , 2013, KESW.

[30]  Christian Bizer,et al.  Quality-driven information filtering using the WIQA policy framework , 2009, J. Web Semant..

[31]  Asunción Gómez-Pérez,et al.  Loupe - An Online Tool for Inspecting Datasets in the Linked Data Cloud , 2015, SEMWEB.

[32]  Nandana Mihindukulasooriya,et al.  Linked Data Platform as a novel approach for Enterprise Application Integration , 2013, COLD.

[33]  Jens Lehmann,et al.  Test-driven evaluation of linked data quality , 2014, WWW.

[34]  Christoph Lange,et al.  daQ, an Ontology for Dataset Quality Information , 2014, LDOW.