Web-Based Measure of Semantic Relatedness

Semantic relatedness measures quantify the degree in which some words or concepts are related, considering not only similarity but any possible semantic relationship among them. Relatedness computation is of great interest in different areas, such as Natural Language Processing, Information Retrieval, or the Semantic Web. Different methods have been proposed in the past; however, current relatedness measures lack some desirable properties for a new generation of Semantic Web applications: maximum coverage, domain independence, and universality. In this paper, we explore the use of a semantic relatedness measure between words, that uses the Web as knowledge source. This measure exploits the information about frequencies of use provided by existing search engines. Furthermore, taking this measure as basis, we define a new semantic relatedness measure among ontology terms. The proposed measure fulfils the above mentioned desirable properties to be used on the Semantic Web. We have tested extensively this semantic measure to show that it correlates well with human judgment, and helps solving some particular tasks, as word sense disambiguation or ontology matching.

[1]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[2]  Simone Paolo Ponzetto,et al.  WikiRelate! Computing Semantic Relatedness Using Wikipedia , 2006, AAAI.

[3]  Danushka Bollegala,et al.  Measuring semantic similarity between words using web search engines , 2007, WWW '07.

[4]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[5]  Enrico Motta,et al.  Next Generation Semantic Web Applications , 2006, ASWC.

[6]  Eduardo Mena,et al.  Querying the web: a multiontology disambiguation method , 2006, ICWE '06.

[7]  Stavros Christodoulakis,et al.  Ontology-Driven Semantic Ranking for Natural Language Disambiguation in the OntoNL Framework , 2007, ESWC.

[8]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[9]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[10]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[11]  Vipul Kashyap,et al.  Relationships at the Heart of Semantic Web: Modeling, Discovering, and Exploiting Complex Semantic Relationships , 2004 .

[12]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[13]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[14]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[15]  Ted Pedersen,et al.  Maximizing Semantic Relatedness to Perform Word Sense Disambiguation , 2005 .

[16]  Hsin-Hsi Chen,et al.  Novel Association Measures Using Web Search with Double Checking , 2006, ACL.

[17]  Richard A. Harshman,et al.  Indexing by latent semantic indexing , 1990 .

[18]  Enrico Motta,et al.  Evaluating the Semantic Web: A Task-Based Approach , 2007, ISWC/ASWC.

[19]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[20]  Adam Kilgarriff,et al.  Introduction to the Special Issue on the Web as Corpus , 2003, CL.

[21]  Eduardo Mena,et al.  Discovering the Semantics of User Keywords , 2007, J. Univers. Comput. Sci..

[22]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[23]  Stan Szpakowicz,et al.  Roget's thesaurus and semantic similarity , 2012, RANLP.