A Quality Model for Linked Data Exploration

Linked (Open) Data (LD) offer the great opportunity to interconnect and share large amounts of data on a global scale, creating added value compared to data published via pure HTML. However, this enormous potential is not completely accessible. In fact, LD datasets are often affected by errors, inconsistencies, missing values and other quality issues that may lower their usage. Users are often not aware of the quality and characteristics of the LD datasets that they use for various and diverse tasks; thus they are not conscious of the effects that poor quality datasets may have on the results of their analyses. In this paper we present our initial results aimed to unleash LD usefulness, by providing a set of quality dimensions able to drive the selection and evaluation of LD sources. As a proof of concepts, we applied our model for assessing the quality of two LD datasets.

[1]  Joseph Moses Juran,et al.  Quality-control handbook , 1951 .

[2]  Anisa Rula,et al.  Methodology for Assessment of Linked Data Quality , 2014, LDQ@SEMANTICS.

[3]  Chiara Francalanci,et al.  Reputation-based Selection of Web Information Sources , 2010, ICEIS.

[4]  Jürgen Umbrich,et al.  An empirical survey of Linked Data conformance , 2012, J. Web Semant..

[5]  Christian Bizer,et al.  Quality-driven information filtering using the WIQA policy framework , 2009, J. Web Semant..

[6]  Cinzia Cappiello,et al.  A Quality Model for Mashup Components , 2009, ICWE.

[7]  Valeria De Antonellis,et al.  A Linked Data Perspective for Effective Exploration of Web APIs Repositories , 2013, ICWE.

[8]  Cesare Pautasso,et al.  Information Quality in Mashups , 2010, IEEE Internet Computing.

[9]  Paolo Tomeo,et al.  Building a relatedness graph from Linked Open Data: A case study in the IT domain , 2016, Expert Syst. Appl..

[10]  Carlo Batini,et al.  Methodologies for data quality assessment and improvement , 2009, CSUR.

[11]  Christian Bizer,et al.  Sieve: linked data quality assessment and fusion , 2012, EDBT-ICDT '12.

[12]  Giuseppe Desolda Enhancing Workspace Composition by Exploiting Linked Open Data as a Polymorphic Data Source , 2015 .

[13]  Felix Naumann,et al.  Data fusion , 2009, CSUR.

[14]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[15]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.