An Association Network for Computing Semantic Relatedness

Judging how semantically related two words (or texts) are is a cognitive process. However, previous algorithms for computing semantic relatedness rely largely on co-occurrences within textual windows and do not actively leverage human perceptions of relatedness. To bridge this perceptual gap, we propose using free association as a signal that captures such perceptions. However, free association, because it is manually evaluated, has limited lexical coverage and is inherently sparse. We expand lexical coverage and overcome sparseness by constructing an association network of terms and concepts that combines signals from free association norms with five types of co-occurrences extracted from the rich structures of Wikipedia. Our evaluation shows that simple algorithms on this network give competitive results in computing semantic relatedness between words and between short texts.

Seung-won Hwang | Kenny Q. Zhu | Keyang Zhang
