A New Method for Calculating Word Sense Similarity in WordNet 1

Semantic similarity between word senses is hot topic in many applications of computational linguistics and artificial intelligence, such as word sense disambiguation, information extraction, semantic annotation and ontology learning. Many methods for calculating word sense similarity have been proposed. In recent years the methods based on WordNet have shown its talents and attracted great concern. In the paper, we present a new method in WordNet for calculating word sense similarity, which is noun and is-a relation based. We evaluate our method on the data set of Rubenstein and Goodenough, which is traditional and widely used. The correlation with human judgment is o.8804 in proposed measure, which is more close to human judgments than related works. Experiments show that our new measure significantly outperformed than other existing computational methods.

[1]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[2]  Euripides G. M. Petrakis,et al.  Semantic similarity methods in wordNet and their application to information retrieval on the web , 2005, WIDM '05.

[3]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[4]  Xiaohua Hu,et al.  Integration of semantic-based bipartite graph representation and mutual refinement strategy for biomedical literature clustering , 2006, KDD '06.

[5]  Steffen Staab,et al.  WordNet improves text document clustering , 2003, SIGIR 2003.

[6]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[7]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[8]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[9]  Mark Stevenson,et al.  A Semantic Approach to IE Pattern Induction , 2005, ACL.

[10]  Masoud Rahgozar,et al.  A Knowledge-Based Question Answering System for B2C eCommerce , 2008, Fifth International Conference on Information Technology: New Generations (itng 2008).

[11]  David Sánchez,et al.  Content annotation for the semantic web: an automatic web-based approach , 2011, Knowledge and Information Systems.

[12]  Jorge García Duque,et al.  A flexible semantic inference methodology to reason about user preferences in knowledge-based recommender systems , 2008, Knowl. Based Syst..

[13]  小嶋 秀樹,et al.  Computing lexical cohesion as a tool for text analysis , 1994 .

[14]  David McLean,et al.  An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources , 2003, IEEE Trans. Knowl. Data Eng..

[15]  Ted Pedersen,et al.  Using Measures of Semantic Relatedness for Word Sense Disambiguation , 2003, CICLing.

[16]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[17]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[18]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .