Evaluating WordNet-based Measures of Lexical Semantic Relatedness

The quantification of lexical semantic relatedness has many applications in NLP, and many different measures have been proposed. We evaluate five of these measures, all of which use WordNet as their central resource, by comparing their performance in detecting and correcting real-word spelling errors. An information-content-based measure proposed by Jiang and Conrath is found superior to those proposed by Hirst and St-Onge, Leacock and Chodorow, Lin, and Resnik. In addition, we explain why distributional similarity is not an adequate proxy for lexical semantic relatedness.

[1]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[2]  Michael Halliday,et al.  Cohesion in English , 1976 .

[3]  Paul Procter,et al.  Longman Dictionary of Contemporary English , 1978 .

[4]  A. Agresti,et al.  Statistical Methods for the Social Sciences , 1979 .

[5]  W. Whitten,et al.  Bidirectional synonym ratings of 464 noun pairs , 1979 .

[6]  W. Nelson Francis,et al.  FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[7]  H. Gross Errors in Linguistic Performance: Slips of the Tongue, Ear, Pen, and Hand , 1983 .

[8]  L. Barsalou,et al.  Ad hoc categories , 1983, Memory & cognition.

[9]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[10]  G. Lakoff,et al.  Women, Fire, and Dangerous Things: What Categories Reveal about the Mind , 1988 .

[11]  Lawerence W. Barsalou Intraconcept similarity and its implications for interconcept similarity , 1989 .

[12]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[13]  Roy Rada,et al.  Ranking documents with a thesaurus , 1989, JASIS.

[14]  G. Lakoff Women, fire, and dangerous things : what categories reveal about the mind , 1989 .

[15]  Michael Hoey,et al.  Patterns of Lexis In Text , 1991 .

[16]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[17]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[18]  Myoung-Ho Kim,et al.  Information Retrieval Based on Conceptual Distance in is-a Hierarchies , 1993, J. Documentation.

[19]  Hideki Kozima,et al.  Similarity between Words Computed by Spreading Activation on an English Dictionary , 1993, EACL.

[20]  Michael Sussna,et al.  Word sense disambiguation for free-text indexing using a massive semantic network , 1993, CIKM '93.

[21]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[22]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[23]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[24]  David St-Onge,et al.  Detecting and Correcting Malapropisms with Lexical Chains , 1995 .

[25]  Della Summers,et al.  Longman Dictionary of Contemporary English , 1995 .

[26]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[27]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[28]  Michael John Sussna,et al.  Text retrieval using inference in semantic metanetworks , 1997 .

[29]  Akira Ito,et al.  Context-sensitive word distance by adaptive scaling of a semantic space , 1997 .

[30]  Martin Chodorow,et al.  Combining local context and wordnet similarity for word sense identification , 1998 .

[31]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[32]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[33]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[34]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.

[35]  Alexander Budanitsky,et al.  Lexical Semantic Relatedness and Its Application in Natural Language Processing , 1999 .

[36]  Philip Resnik,et al.  Measuring Verb Similarity , 2000 .

[37]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[38]  Barbara A. Spellman,et al.  Analogical priming via semantic relations , 2001, Memory & cognition.

[39]  Ehud Rivlin,et al.  Placing search in context: the concept revisited , 2002, TOIS.

[40]  Stan Szpakowicz,et al.  Roget's thesaurus and semantic similarity , 2012, RANLP.

[41]  Ted Pedersen,et al.  Extended Gloss Overlaps as a Measure of Semantic Relatedness , 2003, IJCAI.

[42]  L. Murphy Semantic Relations and the Lexicon , 2003 .

[43]  Ted Pedersen,et al.  Using Measures of Semantic Relatedness for Word Sense Disambiguation , 2003, CICLing.

[44]  Julie Elizabeth Weeds,et al.  Measures and applications of lexical distributional similarity , 2003 .

[45]  Julie Weeds,et al.  Finding Predominant Word Senses in Untagged Text , 2004, ACL.

[46]  Graeme Hirst,et al.  Non-Classical Lexical Semantic Relations , 2004, Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics - CLS '04.

[47]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[48]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[49]  Ido Dagan,et al.  Similarity-Based Models of Word Cooccurrence Probabilities , 1998, Machine Learning.

[50]  David J. Weir,et al.  Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity , 2005, CL.

[51]  Graeme Hirst,et al.  Correcting real-word spelling errors by restoring lexical cohesion , 2005, Natural Language Engineering.

[52]  Rada Mihalcea,et al.  Measuring the Semantic Similarity of Texts , 2005, EMSEE@ACL.

[53]  Wee Sun Lee,et al.  Learning Semantic Classes for Word Sense Disambiguation , 2005, ACL.

[54]  Mark Stevenson,et al.  A Semantic Approach to IE Pattern Induction , 2005, ACL.

[55]  Janyce Wiebe,et al.  Computing Attitude and Affect in Text: Theory and Applications , 2005, The Information Retrieval Series.

[56]  Graeme Hirst,et al.  The Subjectivity of Lexical Cohesion in Text , 2006, Computing Attitude and Affect in Text.

[57]  Marcel Worring,et al.  Adding Semantics to Detectors for Video Retrieval , 2007, IEEE Transactions on Multimedia.