Paper information - Producing high-dimensional semantic spaces from lexical co-occurrence

A procedure is presented that processes a corpus of text and produces, for each word, a numeric vector containing information about its meaning. This procedure is applied to a large corpus of natural language text taken from Usenet, and the resulting vectors are examined to determine what information they contain. These vectors provide the coordinates in a high-dimensional space in which word relationships can be analyzed. Analyses of both vector similarity and multidimensional scaling demonstrate that the vectors carry significant semantic information. A comparison of vector similarity with human reaction times in a single-word priming experiment is presented. These vectors provide the basis for a representational model of semantic memory, the hyperspace analogue to language (HAL).
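
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of deriving word vectors from lexical co-occurrence. The window size, the distance-based weighting, the separate "before"/"after" dimensions, and the use of cosine similarity are all assumptions made for illustration and are not taken from the text above; the function names `cooccurrence_vectors` and `cosine` are hypothetical.

```python
from collections import defaultdict
import math


def cooccurrence_vectors(tokens, window=2):
    """Map each word to a sparse vector of weighted co-occurrence counts.

    Assumed scheme (not from the abstract): a word pair at distance d
    within the window contributes a weight of window - d + 1, and the
    direction of the neighbour (before/after) is kept as a separate
    dimension.
    """
    vectors = defaultdict(lambda: defaultdict(float))
    for i, word in enumerate(tokens):
        # Pair `word` with each word that follows it inside the window.
        for offset in range(1, window + 1):
            j = i + offset
            if j >= len(tokens):
                break
            weight = window - offset + 1  # closer neighbours count more
            vectors[word][("after", tokens[j])] += weight
            vectors[tokens[j]][("before", word)] += weight
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy corpus standing in for the Usenet text.
tokens = "the cat drinks milk the dog drinks water".split()
vecs = cooccurrence_vectors(tokens, window=2)

sim_cat_dog = cosine(vecs["cat"], vecs["dog"])
sim_cat_milk = cosine(vecs["cat"], vecs["milk"])
```

In this toy corpus, "cat" and "dog" occur in the same contexts (after "the", before "drinks"), so their vectors are more similar to each other than the vectors for "cat" and "milk" are. This illustrates how similarity between co-occurrence vectors can reflect semantic relatedness.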

Curt Burgess | Kevin Lund

[1] C. Osgood,et al. The Measurement of Meaning , 1958 .

[2] S. Ervin-Tripp. Substitution, context, and association , 1970 .

[3] I. Fischler. Semantic facilitation without association in a lexical decision task , 1977, Memory & cognition.

[4] J. H. Neely. Semantic priming and retrieval from lexical memory: Roles of inhibitionless spreading activation and limited-capacity attention. , 1977 .

[5] R N Shepard,et al. Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[6] Roger W. Schvaneveldt,et al. Pathfinder associative networks: studies in knowledge organization , 1990 .

[7] C. Burgess,et al. Semantic and associative priming in the cerebral hemispheres: Some words do, some words don't … sometimes, some places , 1990, Brain and Language.

[8] D. Spence,et al. Lexical co-occurrence and association strength , 1990 .

[9] Uri Zernik,et al. Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[10] H. Schütze,et al. Dimensions of meaning , 1992, Supercomputing '92.

[11] Victor Sadler,et al. Review of Lexical acquisition: exploiting on-line resources to build a lexicon by Uri Zernik. Lawrence Erlbaum Associates 1991. , 1993 .

[12] C. Mair,et al. Using large corpora , 1997 .