Similarity-based Browsing over Linked Open Data

An increasing amount of data is published on the Web according to the Linked Open Data (LOD) principles, and end users would like to browse these data in a flexible manner. In this paper we focus on similarity-based browsing and introduce a novel method for computing the similarity between two entities of a given RDF/S graph. The distinctive characteristics of the proposed metric are that it is generic (it can be used to compare nodes of any kind), that it takes into account the neighborhoods of the nodes, and that it is configurable (with respect to the tradeoff between accuracy and computational complexity). We demonstrate the behavior of the metric using examples from an application over LOD. Finally, we generalize and elaborate on implementation approaches that are harmonized with the distributed nature of LOD and that can be used to compute the most similar entities under neighborhood-based similarity metrics.
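To make the idea of a configurable, neighborhood-based similarity concrete, the following is a minimal sketch and not the metric proposed in the paper. It assumes a Jaccard overlap of the (predicate, object) pairs reachable within `depth` hops of each node, with `depth` standing in for the accuracy vs. cost setting. The function names, the `ex:` identifiers, and the toy triples are all hypothetical.

```python
import heapq
from collections import defaultdict

# Toy RDF-like graph as (subject, predicate, object) triples (hypothetical data).
TRIPLES = [
    ("ex:Athens", "rdf:type", "ex:City"),
    ("ex:Athens", "ex:country", "ex:Greece"),
    ("ex:Heraklion", "rdf:type", "ex:City"),
    ("ex:Heraklion", "ex:country", "ex:Greece"),
    ("ex:Heraklion", "ex:island", "ex:Crete"),
    ("ex:Rome", "rdf:type", "ex:City"),
    ("ex:Rome", "ex:country", "ex:Italy"),
    ("ex:Greece", "ex:continent", "ex:Europe"),
    ("ex:Italy", "ex:continent", "ex:Europe"),
]


def build_index(triples):
    """Map each subject to the set of (predicate, object) pairs it asserts."""
    index = defaultdict(set)
    for s, p, o in triples:
        index[s].add((p, o))
    return index


def neighborhood(index, node, depth):
    """Collect the (predicate, object) pairs reachable from node in <= depth hops."""
    pairs = set()
    visited = {node}
    frontier = {node}
    for _ in range(depth):
        next_frontier = set()
        for n in frontier:
            for p, o in index.get(n, ()):
                pairs.add((p, o))
                if o not in visited:
                    visited.add(o)
                    next_frontier.add(o)
        frontier = next_frontier
    return pairs


def similarity(index, a, b, depth=1):
    """Jaccard overlap of the two neighborhoods (an assumed stand-in metric)."""
    na, nb = neighborhood(index, a, depth), neighborhood(index, b, depth)
    if not na and not nb:
        return 0.0
    return len(na & nb) / len(na | nb)


def most_similar(index, target, k=3, depth=1):
    """Return the k subjects most similar to target as (node, score) pairs."""
    scored = ((n, similarity(index, target, n, depth)) for n in list(index) if n != target)
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


index = build_index(TRIPLES)
```

In this sketch, raising `depth` lets shared context further away (here, the common continent of two countries) contribute to the score, at the price of visiting more of the graph. Calling `most_similar(index, "ex:Athens", k=2, depth=2)` ranks the other subjects against `ex:Athens`. In a distributed LOD setting the in-memory index would be replaced by remote lookups, which this sketch does not attempt.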
