A metric-driven approach for interlinking assessment of RDF graphs

In recent years the web has evolved from a global information space of linked documents to one where both documents and data are linked. What supports this evolution is a set of best practices in publishing and connecting structured data on the web that is called linked data. The usefulness of linked data relies on how much related concepts are linked together. The aim of this research is to propose a metric-driven approach for interlinking assessment of a single dataset. The proposed metrics are categorized into three groups called internal linking, external linking and link-ability from other datasets. These metrics consider both graph structure (topology) and schema of datasets (semantic information) to evaluate interlinking with appropriate accuracy.

[1]  Mohsen Kahani,et al.  A Metrics-Driven Approach for Quality Assessment of Linked Open Data , 2014, J. Theor. Appl. Electron. Commer. Res..

[2]  Geert Poels,et al.  Distance-based software measurement: necessary and sufficient properties for software measures , 2000, Inf. Softw. Technol..

[3]  Maria-Esther Vidal,et al.  Analyzing Linked Data Quality with LiQuate , 2013, ESWC.

[4]  L. Finkelstein Widely, strongly and weakly defined measurement , 2003 .

[5]  Sandro Morasca,et al.  Property-Based Software Engineering Measurement , 1996, IEEE Trans. Software Eng..

[6]  George Varghese,et al.  Network monitoring using traffic dispersion graphs (tdgs) , 2007, IMC '07.

[7]  Asunción Gómez-Pérez,et al.  Assessing linkset quality for complementing third-party datasets , 2013, EDBT '13.

[8]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[9]  Peter Martini,et al.  Graph based Metrics for Intrusion Response Measures in Computer Networks , 2007, 32nd IEEE Conference on Local Computer Networks (LCN 2007).

[10]  Ellen W. Zegura,et al.  A quantitative comparison of graph-based models for Internet topology , 1997, TNET.

[11]  Edward T. Bullmore,et al.  Reproducibility of graph metrics of human brain functional networks , 2009, NeuroImage.

[12]  Dragan Gasevic,et al.  Assessing the maintainability of software product line feature models using structural metrics , 2011, Software Quality Journal.

[13]  O. Sporns,et al.  Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.

[14]  Nathalie Pernelle,et al.  On Evaluating the Quality of RDF Identity Links in the LOD , 2014 .

[15]  Christoph Böhm,et al.  Enriching the Web of Data with topics and links , 2013 .

[16]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[17]  A. Leon-Garcia,et al.  Comparison of network criticality, algebraic connectivity, and other graph metrics , 2009, SIMPLEX '09.

[18]  O. Hartig Trustworthiness of Data on the Web , 2008 .

[19]  Shari Lawrence Pfleeger,et al.  Software Metrics : A Rigorous and Practical Approach , 1998 .

[20]  H. D. Rombach,et al.  The Goal Question Metric Approach , 1994 .

[21]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[22]  John Feo,et al.  High performance semantic factoring of giga-scale semantic graph databases. , 2010 .

[23]  Jens Lehmann,et al.  Assessing Linked Data Mappings Using Network Measures , 2012, ESWC.

[24]  J. Eisl,et al.  Co-operative handover in 3G System Architecture Evolution , 2007 .

[25]  Heiko Paulheim,et al.  Identifying Wrong Links between Datasets by Multi-dimensional Outlier Detection , 2014, WoDOOM.

[26]  Jürgen Umbrich,et al.  An empirical survey of Linked Data conformance , 2012, J. Web Semant..

[27]  Martin J. Eppler,et al.  Conceptualizing Information Quality: A Review of Information Quality Frameworks from the Last Ten Years , 2000, IQ.

[28]  Frank van Harmelen,et al.  Finding the Achilles Heel of the Web of Data: Using Network Analysis for Link-Recommendation , 2010, SEMWEB.

[29]  Yuzhong Qu,et al.  Object Link Structure in the Semantic Web , 2010, ESWC.

[30]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[31]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.