Effectiveness of Weighted Searching in an Operational IR Environment

We describe an experiment to compare the effectiveness of Boolean retrieval and weighted retrieval on a commercial data collection that contains millions of documents. The experiment was carried out via a front-end connected to an operational conventional host-oriented IR environment. In contrast to previous experiments where weighted retrieval had to be simulated on the host, our host was equipped with a built-in weighted retrieval algorithm. The results of the experiment clearly show that weighted retrieval performs significantly better than Boolean retrieval. On the other hand, no difference in the performance between manually weighted queries and automatically weighted queries could be detected.

[1]  Charles T. Meadow,et al.  Basics of online searching , 1981 .

[2]  E. Michael Keen Some aspects of proximity searching in text retrieval systems , 1992, J. Inf. Sci..

[3]  C. Paice Soft evaluation of Boolean search queries in information retrieval systems , 1984 .

[4]  Tadeusz Radecki Trends in research on information retrieval -- The potential for improvements in conventional Boolean retrieval systems , 1988, Inf. Process. Manag..

[5]  J. D. Bovey,et al.  Weighting, ranking and relevance feedback in a front—end system , 1986, J. Inf. Sci..

[6]  Edward A. Fox,et al.  Practical enhanced Boolean retrieval: Experiences with the smart and sire systems , 1988, Inf. Process. Manag..

[7]  Vijay V. Raghavan,et al.  Extended Boolean query processing in the generalized vector space model , 1989, Inf. Syst..

[8]  Hans-Peter Frei,et al.  Evaluating Weighted Search Terms as Booleau Queries , 1991, Information Retrieval.

[9]  Hans-Peter Frei,et al.  Concept based query expansion , 1993, SIGIR.

[10]  Peter Schäuble,et al.  Determining the effectiveness of retrieval algorithms , 1991, Inf. Process. Manag..

[11]  Donald H. Kraft,et al.  A mathematical model of a weighted boolean retrieval system , 1979, Inf. Process. Manag..

[12]  Peter Schäuble,et al.  The Perils of Interpreting Recall and Precision Values , 1991, Information Retrieval.

[13]  Tadeusz Radecki Reducing the Perils of Merging Boolean and Weighted Retrieval Systems , 1982, J. Documentation.

[14]  Yonggang Qiu ISIR: An Integrated System for Information Retrieval , 1993 .