Query expansion is a well-known technique that has been shown to improve <i>average</i> retrieval performance. This technique has not been used in many operational systems because of the fact that it can greatly degrade the performance of some individual queries. We show how comparison between language models of the unexpanded and expanded retrieval results can be used to predict when the expanded retrieval has strayed from the original sense of the query. In these cases, the unexpanded results are used while the expanded results are used in the remaining cases (where such straying is not detected). We evaluate this method on a wide variety of TREC collections.
[1]
W. Bruce Croft,et al.
A general language model for information retrieval
,
1999,
CIKM '99.
[2]
R. K. Tuteja,et al.
Characterization of a quantitative-qualitative measure of relative information
,
1984,
Inf. Sci..
[3]
W. Bruce Croft,et al.
A Language Modeling Framework for Selective Query Expansion
,
2004
.
[4]
W. Bruce Croft,et al.
A general language model for information retrieval (poster abstract)
,
1999,
SIGIR '99.
[5]
W. Bruce Croft,et al.
Predicting query performance
,
2002,
SIGIR '02.