Optimizing Similarity Using Multi-Query Relevance Feedback

We propose a novel method for automatically adjusting parameters in ranked-output text retrieval systems to improve retrieval performance. A ranked-output text retrieval system implements a ranking function which orders documents, placing documents estimated to be more relevant to the user's query before less relevant ones. The system adjusts its parameters to maximize the match between the system's document ordering and a target ordering. The target ordering is typically given by user feedback on a set of sample queries, but is more generally any document preference relation. We demonstrate the utility of the approach by using it to estimate a similarity measure (scoring the relevance of documents to queries) in a vector space model of information retrieval. Experimental results using several collections indicate that the approach automatically finds a similarity measure which performs equivalently to or better than all “classic” similarity measures studied. It also performs within 1% of an estimated optimal measure (found by exhaustive sampling of the similarity measures). The method is compared to two alternative methods: A Perceptron learning rule motivated by Wong and Yao's (1990) Query Formulation method, and a Least Squared learning rule, motivated by Fuhr and Buckley's (1991) Probabilistic Learning approach. Though both alternatives have useful characteristics, we demonstrate empirically that neither can be used to estimate the parameters of the optimal similarity measure. © 1998 John Wiley & Sons, Inc.

[1]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[2]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[3]  Don R. Swanson,et al.  Probabilistic models for automatic indexing , 1974, J. Am. Soc. Inf. Sci..

[4]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[5]  L. Guttman What is Not What in Statistics , 1977 .

[6]  Michael McGill,et al.  An Evaluation of Factors Affecting Document Ranking by Information Retrieval Systems. , 1979 .

[7]  Gerard Salton,et al.  Automatic term class construction using relevance--A summary of work in automatic pseudoclassification , 1980, Inf. Process. Manag..

[8]  Libena Vokac,et al.  Optimal values of recall and precision , 1982, J. Am. Soc. Inf. Sci..

[9]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[10]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[11]  Donna K. Harman,et al.  An experimental study of factors important in document ranking , 1986, SIGIR '86.

[12]  Vijay V. Raghavan,et al.  A critical analysis of vector space model for information retrieval , 1986 .

[13]  G. Furnas,et al.  Pictures of relevance: a geometric analysis of similarity measures , 1987 .

[14]  I. Borg Multidimensional similarity structure analysis , 1987 .

[15]  Michael D. Gordon Probabilistic and genetic algorithms in document retrieval , 1988, CACM.

[16]  R.J.F. Dow,et al.  Neural net pruning-why and how , 1988, IEEE 1988 International Conference on Neural Networks.

[17]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[18]  Adi Raveh,et al.  A Nonmetric Approach to Linear Discriminant Analysis , 1989 .

[19]  S. K. M. Wong,et al.  Query formulation in linear retrieval models , 1990, J. Am. Soc. Inf. Sci..

[20]  Susan T. Dumais,et al.  Enhancing Performance in Latent Semantic Indexing (LSI) Retrieval , 1990 .

[21]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[22]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[23]  Chris Buckley,et al.  A probabilistic learning approach for document indexing , 1991, TOIS.

[24]  Garrison W. Cottrell,et al.  Latent semantic indexing is an optimal special case of multidimensional scaling , 1992, SIGIR '92.

[25]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[26]  Chris Buckley,et al.  Optimizing Document Indexing and Search Term Weighting Based on Probabilistic Models , 1992, TREC.

[27]  Daniel E. Rose,et al.  Content awareness in a file system interface: implementing the “pile” metaphor for organizing information , 1993, SIGIR.

[28]  Nicholas J. Belkin,et al.  The effect multiple query representations on information retrieval system performance , 1993, SIGIR.

[29]  Yiyu Yao,et al.  Computation of term associations by a neural network , 1993, SIGIR.

[30]  F. Crestani,et al.  Learning strategies for an adaptive information retrieval system using neural networks , 1993, IEEE International Conference on Neural Networks.

[31]  H. White,et al.  Cross-Validation Estimates IMSE , 1993, NIPS 1993.

[32]  Donna Harman,et al.  Overview of the First Text REtrieval Conference. , 1993, SIGIR 1993.

[33]  Jacob Shapiro,et al.  Multiversion information retrieval systems and feedback with mechanism of selection , 1993 .

[34]  Garrison W. Cottrell,et al.  Representing documents using an explicit model of their similarities , 1995 .

[35]  Gerard Salton,et al.  Improving Retrieval Performance by Relevance Feedback , 1997 .