Semantic distance norms computed from an electronic dictionary (WordNet)

WordNet, an electronic dictionary (or lexical database), is a valuable resource for computational and cognitive scientists. Recent work on the computing of semantic distances among nodes (synsets) in WordNet has made it possible to build a large database of semantic distances for use in selecting word pairs for psychological research. The database now contains nearly 50,000 pairs of words that have values for semantic distance, associative strength, and similarity based on co-occurrence. Semantic distance was found to correlate weakly with these other measures but to correlate more strongly with another measure of semantic relatedness, featural similarity. Hierarchical clustering analysis suggested that the knowledge structure underlying semantic distance is similar in gross form to that underlying featural similarity. In experiments in which semantic similarity ratings were used, human participants were able to discriminate semantic distance. Thus, semantic distance as derived from WordNet appears distinct from other measures of word pair relatedness and is psychologically functional. This database may be downloaded fromwww.psychonomic.org/archive/.

[1]  Yoshihiko Nitta,et al.  Co-Occurrence Vectors From Corpora vs. Distance Vectors From Dictionaries , 1994, COLING.

[2]  Alan Schwartz,et al.  Tutorial: Perl, a psychologically efficient reformatting language , 1998 .

[3]  Ken McRae,et al.  Category - Specific semantic deficits , 2008 .

[4]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[5]  Mark S. Seidenberg,et al.  Pre- and postlexical loci of contextual effects on word recognition , 1984, Memory & cognition.

[6]  D. Spence,et al.  Lexical co-occurrence and association strength , 1990 .

[7]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[8]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[9]  J. Deese The structure of associations in language and thought , 1966 .

[10]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[11]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[12]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[13]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[14]  John B. Goodenough,et al.  Contextual correlates of synonymy , 1965, CACM.

[15]  Mark S. Seidenberg,et al.  Semantic feature production norms for a large set of living and nonliving things , 2005, Behavior research methods.

[16]  R. Burchfield Frequency Analysis of English Usage: Lexicon and Grammar. By W. Nelson Francis and Henry Kučera with the assistance of Andrew W. Mackie. Boston: Houghton Mifflin. 1982. x + 561 , 1985 .

[17]  Thomas A. Schreiber,et al.  The University of South Florida free association, rhyme, and word fragment norms , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[18]  W. Nelson Francis,et al.  FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[19]  Erwin A. Esper Analogy and association in linguistics and psychology , 1973 .

[20]  G. Miller On knowing a word. , 1999, Annual review of psychology.

[21]  Mark S. Seidenberg,et al.  On the nature and scope of featural representations of word meaning. , 1997, Journal of experimental psychology. General.

[22]  Christiane Fellbaum,et al.  Nouns in WordNet , 1998 .

[23]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[24]  M. Lucas,et al.  Semantic priming without association: A meta-analytic review , 2000, Psychonomic bulletin & review.

[25]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[26]  I. Fischler Semantic facilitation without association in a lexical decision task , 1977, Memory & cognition.

[27]  J. Gabrieli,et al.  Effects of Semantic and Associative Relatedness on Automatic Priming , 1998 .

[28]  Lorraine K. Tyler,et al.  A Distributed Memory Model of the Associative Boost in Semantic Priming , 1994, Connect. Sci..

[29]  William E. Forrester,et al.  The relationships between judged similarity, judged association, and normative association , 1966 .

[30]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.