Word sense disambiguation and information retrieval

It has often been thought that word sense ambiguity is a cause of poor performance in Information Retrieval (IR) systems. The belief is that if ambiguous words can be correctly disambiguated, IR performance will increase. However, recent research into the application of a word sense disambiguator to an IR system failed to show any performance increase. From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR.

[1]  D. Lewis Probabilities of Conditionals and Conditional Probabilities , 1976 .

[2]  Yaacov Choueka,et al.  Disambiguation by short contexts , 1985, Comput. Humanit..

[3]  W ChurchKenneth,et al.  A program for aligning sentences in bilingual corpora , 1993 .

[4]  Yorick Wilks,et al.  Subject-Dependent Co-Occurence and Word Sense Disambiguation , 1991, ACL.

[5]  Donna K. Harman,et al.  Relevance feedback revisited , 1992, SIGIR '92.

[6]  Ron Sacks-Davis,et al.  Using syntactic analysis in a document retrieval system that uses signature files , 1989, SIGIR '90.

[7]  David Yarowsky,et al.  One Sense per Collocation , 1993, HLT.

[8]  David A. Evans,et al.  Clarit-TREC Experiments , 1995, Inf. Process. Manag..

[9]  Marti A. Hearst Noun Homograph Disambiguation Using Local Context in Large Text Corpora , 1991 .

[10]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[11]  T. Ahlswede,et al.  Word Sense Disambiguation by Human Subjects: Computational and Psycholinguistic Applications , 1993, Workshop On The Acquisition Of Lexical Knowledge From Text.

[12]  M. Sanderson,et al.  Sense resolution properties of logical imaging , 1995 .

[13]  Christian Plaunt,et al.  Subtopic structuring for full-length document access , 1993, SIGIR.

[14]  Donna K. Harman,et al.  Overview of the Third Text REtrieval Conference (TREC-3) , 1995, TREC.

[15]  Peter J. L. Wallis,et al.  Information Retrieval based on Paraphrase , 1993 .

[16]  David Cooper,et al.  Document Retrieval Experiments using Indexing Vocabularies of varying Size. I. Variety Generation Symbols Assigned to the Fronts of Index Terms , 1979, J. Documentation.

[17]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[18]  Ezra Black,et al.  An Experiment in Computational Discrimination of English Word Senses , 1988, IBM J. Res. Dev..

[19]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[20]  Alon Itai,et al.  Two Languages Are More Informative Than One , 1991, ACL.

[21]  Fabio Crestani,et al.  Information Retrieval by Logical Imaging , 1995, J. Documentation.

[22]  Kenneth Ward Church,et al.  A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[23]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[24]  George C. Demetriou Lexical Disambiguation Using Constraint Handling In Prolog (CHIP) , 1993, EACL.

[25]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[26]  Alan F. Smeaton,et al.  The Retrieval Effects of Query Expansion on a Feedback Document Retrieval System , 1983, Comput. J..

[27]  J. Jorgensen The psychological reality of word senses , 1990 .

[28]  M. J. Ridley,et al.  An expert system for quality control and duplicate detection in bibliographic databases , 1992 .

[29]  W. Brogden Annual Review of Psychology , 1957 .

[30]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[31]  Alan F. Smeaton,et al.  Using morpho-syntactic language analysis in phrase matching , 1991, RIAO.

[32]  David C. Blair,et al.  Information retrieval and the philosophy of language , 1992, Annu. Rev. Inf. Sci. Technol..

[33]  Mounia Lalmas,et al.  Theories of information and uncertainty for the modelling of information retrieval : an application of situation theory and Dempster-Schafer's theory of evidence , 1996 .

[34]  Herbert Coblans,et al.  Progress in Documentation. , 1972 .

[35]  David D. Lewis,et al.  Representation and Learning in Information Retrieval , 1991 .

[36]  Jan O. Pedersen Information Retrieval Based on Word Senses , 1995 .

[37]  Louise Guthrie,et al.  Lexical Disambiguation using Simulated Annealing , 1992, COLING.

[38]  Sholom M. Weiss,et al.  Towards language independent automated learning of text categorization models , 1994, SIGIR '94.

[39]  Hans-Peter Frei,et al.  Concept based query expansion , 1993, SIGIR.

[40]  Robert Stalnaker Probability and Conditionals , 1970, Philosophy of Science.

[41]  Maurice B. Line,et al.  PROGRESS IN DOCUMENTATION: ‘obsolescence’ and changes in the use of literature with time , 1974 .

[42]  Hinrich Schütze,et al.  Customizing a Lexicon to Better Suit a Computational Task , 1996 .

[43]  Stephen F. Weiss Learning to disambiguate , 1973, Inf. Storage Retr..

[44]  Joel L Fagan,et al.  Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods , 1987 .

[45]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[46]  Fabio Crestani,et al.  Probability Kinematics in Information Retrieval a case study , 1995 .

[47]  Philip J. Hayes,et al.  Intelligent high-volume text processing using shallow, domain-specific techniques , 1992 .

[48]  John Price-Wilkin,et al.  Oxford English Dictionary (2nd ed.) , 1991 .

[49]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[50]  Michael Sussna,et al.  Word sense disambiguation for free-text indexing using a massive semantic network , 1993, CIKM '93.

[51]  C. J. van Rijsbergen,et al.  A Non-Classical Logic for Information Retrieval , 1997, Comput. J..

[52]  Graeme Hirst,et al.  Semantic Interpretation and the Resolution of Ambiguity , 1987, Studies in natural language processing.

[53]  Alan F. Smeaton,et al.  Using WordNet in a Knowledge-Based Approach to Information Retrieval , 1995 .

[54]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[55]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[56]  Edward F. Kelly,et al.  Computer recognition of English word senses , 1975 .

[57]  Natasa Milic-Frayling,et al.  CLARIT TREC-4 Experiments , 1995, TREC.

[58]  Edward T. O'Neill Characteristics of Duplicate Records in OCLC's Online Union Catalog. , 1993 .

[59]  Robert Krovetz,et al.  Viewing morphology as an inference process , 1993, Artif. Intell..

[60]  George A. Miller WordNet: A Lexical Database for English , 1992, HLT.

[61]  Hector Garcia-Molina,et al.  SCAM: A Copy Detection Mechanism for Digital Documents , 1995, DL.

[62]  Mark Sanderson,et al.  NRT: News Retrieval Tool , 1991, Electron. Publ..

[63]  Hector Garcia-Molina,et al.  Copy detection mechanisms for digital documents , 1995, SIGMOD '95.

[64]  Brian M. Slator,et al.  Providing machine tractable dictionary tools , 1990 .

[65]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[66]  J. Simpson,et al.  The Oxford English Dictionary , 1884 .

[67]  Elaine Svenonius,et al.  Automatic recognition of title page names , 1991, Inf. Process. Manag..

[68]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[69]  Chuck Rieger,et al.  Parsing and comprehending with word experts (a theory and its realization) , 1982 .

[70]  Brian F. Chellas Modal Logic: Normal systems of modal logic , 1980 .

[71]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[72]  Craig Stanfill,et al.  Parallel free-text search on the connection machine system , 1986, CACM.

[73]  David Yarowsky,et al.  Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs , 1992, ACL.

[74]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..