Information Retrieval by Means of Word Sense Disambiguation

The increasing problem of information overload can be reduced by the improvement of information access tasks like Information Retrieval. Relevance Feedback plays a key role in this task, and is typically based only on the information extracted from documents judged by the user for a given query. We propose to make use of a thesaurus to complement this information to improve RF. This must be done by means of a Word Sense Disambiguation process that correctly identifies the suitable information from the thesaurus WORDNET. The results of our experiments show that the utilisation of a thesaurus requires Word Sense Disambiguation, and that with this process, Relevance Feedback is substantially improved.

[1]  Gregory Grefenstette,et al.  Cross-Language Information Retrieval , 1998, The Springer International Series on Information Retrieval.

[2]  Adam Kilgarriff,et al.  What is word sense disambiguation good for? , 1997, ArXiv.

[3]  Janyce Wiebe,et al.  Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[4]  Yorick Wilks,et al.  Word Sense Disambiguation using Optimised Combinations of Knowledge Sources , 1998, COLING-ACL.

[5]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[6]  Donna Harman,et al.  The fourth text REtrieval conference , 1996 .

[7]  Manuel de Buenaga Rodríguez,et al.  Using WordNet to Complement Training Information in Text Categorization , 1997, ArXiv.

[8]  Donna K. Harman,et al.  Overview of the Fourth Text REtrieval Conference (TREC-4) , 1995, TREC.

[9]  Stan Matwin,et al.  A WordNet-based Algorithm for Word Sense Disambiguation , 1995, IJCAI.

[10]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[11]  Luis Alfonso Ureña López,et al.  Integrating Linguistic Resources in TC through WSD , 2001, Comput. Humanit..

[12]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[13]  Alfonso Urena Lopez,et al.  Integrating and Evaluating WSD in the Adaptation of a Lexical Database in Text Categorization Task , 1998 .

[14]  Ted Pedersen,et al.  Distinguishing Word Senses in Untagged Text , 1997, EMNLP.

[15]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[16]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.