A Comparison of Text Retrieval Models

Many retrieval models have been proposed as the basis of text retrieval systems. The three main classes that have been investigated are the exact-match, vector space and probabilistic models. The retrieval effectiveness of strategies based on these models has been evaluated experimentally, but there has been little in the way of comparison in terms of their formal properties. In this paper we introduce a recent form of the probabilistic model based on inference networks, and show how the vector space and exact-match models can be described in this framework. Differences between these models can be explained as differences in the estimation of probabilities, both in the initial search and during relevance feedback.

[1]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[2]  Tadeusz Radecki,et al.  Incorporation of Relevance Feedback into Boolean Retrieval System , 1982, SIGIR.

[3]  Abraham Bookstein,et al.  Explanation and Generalization of Vector Models in Information Retrieval , 1982, SIGIR.

[4]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[5]  G. Salton,et al.  Extended Boolean information retrieval , 1983, CACM.

[6]  Edward A. Fox,et al.  A comparison of two methods for boolean query relevancy feedback , 1984, Inf. Process. Manag..

[7]  W. Bruce Croft Boolean Queries and Term Dependencies in Probabilistic Retrieval Models. , 1986 .

[8]  Vijay V. Raghavan,et al.  On extending the vector space model for Boolean query processing , 1986, SIGIR '86.

[9]  Tadeusz Radecki,et al.  Probabilistic methods for ranking output documents in conventional Boolean retrieval systems , 1988, Inf. Process. Manag..

[10]  Amihai Motro,et al.  VAGUE: a user interface to relational databases that permits vague queries , 1988, TOIS.

[11]  Abraham Bookstein Set oriented retrieval , 1988, SIGIR '88.

[12]  Robert M. Losee,et al.  Integrating Boolean queries in conjunctive normal form with probabilistic retrieval models , 1988, Inf. Process. Manag..

[13]  Clement T. Yu,et al.  Two learning schemes in information retrieval , 1988, SIGIR '88.

[14]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[15]  Gerard Salton,et al.  A Simple Blueprint for Automatic Boolean Query Processing , 1988, Inf. Process. Manag..

[16]  Chris Buckley,et al.  Probabilistic document indexing from relevance feedback data , 1989, SIGIR '90.

[17]  Norbert Fuhr,et al.  Models for retrieval with probabilistic indexing , 1989, Inf. Process. Manag..

[18]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[19]  W. Bruce Croft,et al.  Experiments with query acquisition and use in document retrieval systems , 1989, SIGIR '90.

[20]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[21]  David Maier,et al.  Readings in Object-Oriented Database Systems , 1989 .

[22]  Mark E. Frisse,et al.  Information retrieval from hypertext: update on the dynamic medical handbook project , 1989, Hypertext.

[23]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[24]  Hector Garcia-Molina,et al.  A Probalilistic Relational Data Model , 1990, EDBT.

[25]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[26]  G Salton,et al.  Developments in Automatic Text Retrieval , 1991, Science.

[27]  David D. Lewis,et al.  Representation and Learning in Information Retrieval , 1991 .

[28]  W. Bruce Croft,et al.  The use of phrases and structured queries in information retrieval , 1991, SIGIR '91.

[29]  W. Bruce Croft,et al.  Evaluation of an inference network-based retrieval model , 1991, TOIS.

[30]  W. Bruce Croft,et al.  Efficient probabilistic Inference for text retrieval , 1991, RIAO.

[31]  FRGVijay V. RaghavanThe The Axiomatic Approach for Theory Development in Ir , 1994 .