Paper Information - Finding Semantic Similarity in Raw Text: the Deese Antonyms

As more and more text becomes readily available in electronic form, much interest is being generated in finding ways of automatically extracting information from subsets of this text. While manual indexing and automatic keyword indexing are well known, both have drawbacks. Recent research on robust syntactic analysis and statistical correlations promises that some of the intuitive advantages of manual indexing can be retained in a fully automatic system. Here I present an experiment performed with my system SEXTANT, which extracts semantically similar words from raw text. Using statistical methods combined with robust syntactic analysis, SEXTANT was able to find many of the intuitive pairings between semantically similar words studied by Deese [Deese, 1964].

Gregory Grefenstette

[1] David A. Evans, et al. A Summary of the CLARIT project, 1991.

[2] Slava M. Katz, et al. Co-Occurrences of Antonymous Adjectives and Their Contexts, 1991, Comput. Linguistics.

[3] George A. Miller, et al. Introduction to WordNet: An On-line Lexical Database, 1990.

[4] Gregory Grefenstette, et al. Use of syntactic context to produce term association lists for text retrieval, 1992, SIGIR '92.

[5] Kenneth Ward Church, et al. Word Association Norms, Mutual Information, and Lexicography, 1989, ACL.

[6] Mary Hart, et al. Automatic indexing using selective NLP and first-order thesauri, 1991, RIAO.

[7] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora, 1992, COLING.

[8] H. Charles Romesburg, et al. Cluster analysis for researchers, 1984.

[9] Donald Hindle, et al. Noun Classification From Predicate-Argument Structures, 1990, ACL.

[10] Gerda Ruge, et al. Experiments on Linguistically-Based Term Associations, 1992, Inf. Process. Manag.

[11] Susan T. Dumais, et al. Enhancing Performance in Latent Semantic Indexing (LSI) Retrieval, 1990.

[12] J. Deese. The associative structure of some common English adjectives, 1964.