Semantic similarity assessment of words using weighted WordNet

Word and concept similarity assessment is one of the most important elements in natural language processing and information and knowledge retrieval. WordNet, as a popular concept hierarchy, is used in many such applications. Similarity of words in WordNet is also considered in recent researches. Many researches that use WordNet, have calculated similarity between each pair-word by considering depth of subsumer of the words and shortest path between them. In this paper, three novel models to make better semantic word similarity measure have been presented and it was improved by giving weights to the edges of WordNet hierarchy. It was considered that the nearer an edge is to the root in the hierarchy, the less effect it has in calculating the similarity. Therefore, we have offered a new formula for weighting the edges of hierarchy and based on that, we calculated the distance between two words and depth of words; and then tuned parameters of the transfer functions using particle swarm optimization. Experimental results on a common benchmark, created by human judgment, show that the resultant correlation improved; furthermore our formulae were applied to a more realistic application called sentence similarity assessment and it led to the better results.

[1]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[2]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[3]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[4]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[5]  Troels Andreasen,et al.  Perspectives on ontology‐based querying , 2007, Int. J. Intell. Syst..

[6]  Sukanya Manna,et al.  Fuzzy word similarity: A semantic approach using WordNet , 2010, International Conference on Fuzzy Systems.

[7]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[8]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[9]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[10]  M. Kahani,et al.  Semantic role based sentence compression , 2012, 2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE).

[11]  Julio J. Castillo A WordNet-based semantic approach to textual entailment and cross-lingual textual entailment , 2011, Int. J. Mach. Learn. Cybern..

[12]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[13]  Rafal A. Angryk,et al.  Measuring semantic similarity using wordnet-based context vectors , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[14]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[15]  Peng-Yuan Liu,et al.  Application-Oriented Comparison and Evaluation of Six Semantic Similarity Measures Based on Wordnet , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[16]  Wanli Zuo,et al.  An Approach for Calculating Semantic Similarity between Words Using WordNet , 2011, 2011 Second International Conference on Digital Manufacturing & Automation.

[17]  David McLean,et al.  An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources , 2003, IEEE Trans. Knowl. Data Eng..

[18]  Yves Lepage,et al.  Ambiguity spotting using wordnet semantic similarity in support to recommended practice for Software Requirements Specifications , 2011, 2011 7th International Conference on Natural Language Processing and Knowledge Engineering.

[19]  Yun Tian,et al.  Comparison of current semantic similarity methods in WordNet , 2010, 2010 International Conference on Computer Application and System Modeling (ICCASM 2010).

[20]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[21]  David M. W. Powers,et al.  Measuring Semantic Similarity in the Taxonomy of WordNet , 2005, ACSC.

[22]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[23]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[24]  Xiao-Ying Liu,et al.  Measuring Semantic Similarity in Wordnet , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[25]  Max J. Egenhofer,et al.  Determining Semantic Similarity among Entity Classes from Different Ontologies , 2003, IEEE Trans. Knowl. Data Eng..

[26]  Vedat Coskun,et al.  A new semantic similarity measure evaluated in word sense disambiguation , 2005, NODALIDA.

[27]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[28]  Gregory Grefenstette,et al.  Use of syntactic context to produce term association lists for text retrieval , 1992, SIGIR '92.

[29]  Mahmoud Naghibzadeh,et al.  Using WordNet to determine semantic similarity of words , 2010, 2010 5th International Symposium on Telecommunications.

[30]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[31]  Xiaohua Hu,et al.  The Evaluation of Sentence Similarity Measures , 2008, DaWaK.