论文信息 - A Noun-Predicate Bigram-Based Similarity Measure for Lexical Relations

A Noun-Predicate Bigram-Based Similarity Measure for Lexical Relations

The method outlined in this paper demonstrates that the information-theoretic similarity measure and noun-predicate bigrams are effective methods for creating lists of semantically-related words for lexical database work. Our experiments revealed that instead of serious syntactic analysis, bigrams and morpho-syntactic information sufficed for the feature-based similarity measure. We contend that our method would be even more appreciated if it applied to a raw newswire corpus in which unlisted words in existing dictionaries, such as recently-created words, proper nouns, and syllabic abbreviations, are prevailing.

Insik Cho | Hyopil Shin

[1] Konstantinos Koumpis,et al. Automatic summarization of voicemail messages using lexical and prosodic features , 2005, TSLP.

[2] Ricardo Baeza-Yates,et al. Information Retrieval: Data Structures and Algorithms , 1992 .

[3] Dekang Lin,et al. Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[4] Nancy Ide,et al. Making Senses: Bootstrapping Sense-Tagged Lists of Semantically-Related Words , 2006, CICLing.

[5] Dekang Lin,et al. An Information-Theoretic Definition of Similarity , 1998, ICML.

[6] Roy Rada,et al. Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[7] A. Tversky. Features of Similarity , 1977 .

[8] Jussi Piitulainen,et al. Idiomatic Object Usage and Support Verbs , 1998, COLING-ACL.

[9] Hiyan Alshawi,et al. Training and Scaling Preference Functions for Disambiguation , 1994, Comput. Linguistics.

[10] Caroline Gasperin,et al. Using Syntactic Contexts for Measuring Word Similarity , 2007 .

[11] Gregory Grefenstette,et al. Explorations in automatic thesaurus discovery , 1994 .

[12] Peter D. Turney. Similarity of Semantic Relations , 2006, CL.

[13] Pablo Gamallo,et al. Syntactic-Based Methods for Measuring Word Similarity , 2001, TSD.