Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes

The Linked Open Data cloud has been increasing in popularity, with DBpedia as a first-class citizen in this cloud that has been widely adopted across many applications. Measuring similarity between resources and identifying their relatedness could be used for various applications such as item-based recommender systems. To this end, several similarity measures such as LDSD (Linked Data Semantic Distance) were proposed. However, some fundamental axioms for similarity measures such as “equal self-similarity”, “symmetry” or “minimality” are violated, and property similarities have been ignored. Moreover, none of the previous studies have provided a comparative study on other similarity measures. In this paper, we present a similarity measure, called Resim (Resource Similarity), based on top of a revised LDSD similarity measure. Resim aims to calculate the similarity of any resources in DBpedia by taking into account the similarity of the properties of these resources as well as satisfying the fundamental axioms. In addition, we evaluate our similarity measure with two state-of-the-art similarity measures (LDSD and Shakti) in terms of calculating the similarities for general resources (i.e., any resources without a domain restriction) in DBpedia and resources for music artist recommendations. Results show that our similarity measure can resolve some of the limitations of state-of-the-art similarity measures and performs better than them for calculating the similarities between general resources and music artist recommendations.

[1]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[2]  Conor Hayes,et al.  Using Linked Data to Build Open, Collaborative Recommender Systems , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[3]  Alexandre Passant,et al.  Measuring Semantic Distance on Linking Data and Using it for Resources Recommendations , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[4]  Ehud Rivlin,et al.  Placing search in context: the concept revisited , 2002, TOIS.

[5]  Alexander Maedche,et al.  Clustering Ontology-Based Metadata in the Semantic Web , 2002, PKDD.

[6]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[7]  Jihoon Yang,et al.  Discovery of Hidden Similarity on Collaborative Filtering to Overcome Sparsity Problem , 2004, Discovery Science.

[8]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[9]  Markus Zanker,et al.  Linked open data to support content-based recommender systems , 2012, I-SEMANTICS '12.

[10]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[11]  José Paulo Leal,et al.  Computing Semantic Relatedness using DBPedia , 2012, SLATE.

[12]  N. F. Noy,et al.  Ontology Development 101: A Guide to Creating Your First Ontology , 2001 .

[13]  Łukasz Strobin,et al.  Evaluating semantic similarity with a new method of path analysis in RDF using genetic algorithms , 2013 .

[14]  Odej Kao,et al.  Adaptation and Evaluation of a Semantic Similarity Measure for DBPedia: A First Experiment , 2012, 2012 Seventh International Workshop on Semantic and Social Media Adaptation and Personalization.

[15]  Alexandre Passant,et al.  dbrec - Music Recommendations Using DBpedia , 2010, SEMWEB.

[16]  Qi Gao,et al.  Analyzing temporal dynamics in Twitter profiles for personalized recommendations in the social web , 2011, WebSci '11.

[17]  Tommaso Di Noia,et al.  Exploiting the web of data in model-based recommender systems , 2012, RecSys.

[18]  Lingling Meng,et al.  A Review of Semantic Similarity Measures in WordNet 1 , 2013 .

[19]  Pasquale Lops,et al.  Linked Open Data-enabled Strategies for Top-N Recommendations , 2014, CBRecSys@RecSys.

[20]  Tommaso Di Noia,et al.  Top-N recommendations from implicit feedback leveraging linked open data , 2013, IIR.

[21]  John G. Breslin,et al.  Aggregated, interoperable and multi-domain user profiles for the social web , 2012, I-SEMANTICS '12.

[22]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Geert-Jan Houben,et al.  Cross-system user modeling and personalization on the Social Web , 2013, User Modeling and User-Adapted Interaction.

[24]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.