An Empirical Study of Vocabulary Relatedness and Its Application to Recommender Systems

When thousands of vocabularies having been published on the SemanticWeb by various authorities, a question arises as to how they are related to each other. Existing work has mainly analyzed their similarity. In this paper, we inspect the more general notion of relatedness, and characterize it from four angles: well-defined semantic relatedness, lexical similarity in contents, closeness in expressivity and distributional relatedness. We present an empirical study of these measures on a large, real data set containing 2,996 vocabularies, and 15 million RDF documents that use them. Then, we propose to apply vocabulary relatedness to the problem of post-selection vocabulary recommendation. We implement such a recommender service as part of a vocabulary search engine, and test its effectiveness against a handcrafted gold standard.

[1]  Mark A. Musen,et al.  Building a biomedical ontology recommender web service , 2010, J. Biomed. Semant..

[2]  Dean Allemang,et al.  The Semantic Web - ISWC 2006, 5th International Semantic Web Conference, ISWC 2006, Athens, GA, USA, November 5-9, 2006, Proceedings , 2006, SEMWEB.

[3]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[4]  Maria Fasli,et al.  A graph-based approach to measuring semantic relatedness in ontologies , 2011, WIMS '11.

[5]  Mathieu d'Aquin,et al.  Extending Open Rating Systems for Ontology Ranking and Reuse , 2010, EKAW.

[6]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[7]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[8]  Ramanathan V. Guha,et al.  User Ratings of Ontologies: Who Will Rate the Raters? , 2005, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[9]  Mark A. Musen,et al.  What Four Million Mappings Can Tell You about Two Hundred Ontologies , 2009, SEMWEB.

[10]  Yun Peng,et al.  Finding and Ranking Knowledge on the Semantic Web , 2005, SEMWEB.

[11]  Graeme Hirst,et al.  Distributional measures of concept-distance: A task-oriented evaluation , 2006, EMNLP.

[12]  Philipp Cimiano,et al.  Knowledge Engineering and Management by the Masses , 2010, Lecture Notes in Computer Science.

[13]  Stefanos D. Kollias,et al.  A String Metric for Ontology Alignment , 2005, SEMWEB.

[14]  Cristian R. Munteanu,et al.  An Approach for the Automatic Recommendation of Ontologies Using Collaborative Knowledge , 2010, KES.

[15]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[16]  Yuzhong Qu,et al.  Term Dependence on the Semantic Web , 2008, SEMWEB.

[17]  Harith Alani,et al.  Ontology ranking based on the analysis of concept structures , 2005, K-CAP '05.

[18]  Zhaohui Zheng,et al.  Learning to model relatedness for news recommendation , 2011, WWW.

[19]  Ian Horrocks,et al.  The Semantic Web – ISWC 2010: 9th International Semantic Web Conference, ISWC 2010, Shanghai, China, November 7-11, 2010, Revised Selected Papers, Part I , 2010, SEMWEB.

[20]  Brian Davis,et al.  Knowledge Engineering and Knowledge Management , 2012, Lecture Notes in Computer Science.

[21]  Laurent Mazuel,et al.  Semantic Relatedness Measure Using Object Properties in an Ontology , 2008, SEMWEB.

[22]  Ian Horrocks,et al.  Ontologies and the semantic web , 2008, CACM.

[23]  Dmitri Loguinov,et al.  IRLbot: scaling to 6 billion pages and beyond , 2008, WWW.

[24]  Enrico Motta,et al.  Capturing Emerging Relations between Schema Ontologies on the Web of Data , 2010, COLD.

[25]  Steffen Staab,et al.  The Semantic Web - ISWC 2008, 7th International Semantic Web Conference, ISWC 2008, Karlsruhe, Germany, October 26-30, 2008. Proceedings , 2008, SEMWEB.

[26]  Jérôme David,et al.  Comparison between Ontology Distances (Preliminary Results) , 2008, SEMWEB.

[27]  Jérôme David,et al.  Ontology Similarity in the Alignment Space , 2010, International Semantic Web Conference.

[28]  Li Ding,et al.  Characterizing the Semantic Web on the Web , 2006, SEMWEB.

[29]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[30]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[31]  Jérôme Euzenat,et al.  A Feature and Information Theoretic Framework for Semantic Similarity and Relatedness , 2010, SEMWEB.

[32]  Christoph Tempich,et al.  Towards a benchmark for Semantic Web reasoners - an analysis of the DAML ontology library , 2003, EON.

[33]  Abraham Bernstein,et al.  The Semantic Web - ISWC 2009, 8th International Semantic Web Conference, ISWC 2009, Chantilly, VA, USA, October 25-29, 2009. Proceedings , 2009, SEMWEB.

[34]  Yuzhong Qu,et al.  How Matchable Are Four Thousand Ontologies on the Semantic Web , 2011, ESWC.

[35]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.