Inferring Query Performance Using Pre-retrieval Predictors

The prediction of query performance is an interesting and important issue in Information Retrieval (IR). Current predictors involve the use of relevance scores, which are time-consuming to compute. Therefore, current predictors are not very suitable for practical applications. In this paper, we study a set of predictors of query performance, which can be generated prior to the retrieval process. The linear and non-parametric correlations of the predictors with query performance are thoroughly assessed on the TREC disk4 and disk5 (minus CR) collections. According to the results, some of the proposed predictors have significant correlation with query performance, showing that these predictors can be useful to infer query performance in practical applications.

[1]  Walter L. Smith Probability and Statistics , 1959, Nature.

[2]  Iadh Ounis,et al.  University of Glasgow at the Web Track: Dynamic Application of Hyperlink Analysis using the Query Scope , 2003, TREC.

[3]  James Allan,et al.  Recent Experiments with INQUERY , 1995, TREC.

[4]  Claudio Carpineto,et al.  Query Difficulty, Robustness, and Selective Application of Query Expansion , 2004, ECIR.

[5]  Stephen E. Robertson,et al.  Okapi at TREC-4 , 1995, TREC.

[6]  Iadh Ounis,et al.  A Query-based Pre-retrieval Model Selection Approach to Information Retrieval , 2004, RIAO.

[7]  Kalervo Järvelin,et al.  Employing the resolution power of search keys , 2001, J. Assoc. Inf. Sci. Technol..

[8]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[9]  Iadh Ounis,et al.  A study of parameter tuning for term frequency normalization , 2003, CIKM '03.

[10]  Eamonn Mullins,et al.  Probability and Statistics. 2nd edn. , 1988 .

[11]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 2 , 2000, Inf. Process. Manag..

[12]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[13]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[14]  W. Bruce Croft,et al.  A general language model for information retrieval , 1999, CIKM '99.

[15]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[16]  W. Bruce Croft,et al.  A general language model for information retrieval (poster abstract) , 1999, SIGIR '99.