Paper information - Term-specific smoothing for the language modeling approach to information retrieval: the importance of a query term

This paper follows a formal approach to information retrieval based on statistical language models. Through some simple reformulations of the basic language modeling approach, we introduce the notion of the importance of a query term. The importance of a query term is an unknown parameter that explicitly models which of the query terms are generated from the relevant documents (the important terms), and which are not (the unimportant terms). The new language modeling approach is shown to explain a number of practical facts of today's information retrieval systems that are not very well explained by the current state of information retrieval theory, including stop words, mandatory terms, coordination level ranking and retrieval using phrases.
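
As a rough illustration of the idea, the sketch below scores a document with a linearly interpolated language model in which every query term has its own smoothing weight. The abstract does not give a formula, so the interpolation form, the function name `score`, the `importance` mapping, and the default weight of 0.5 are all assumptions made for this example, not the paper's stated method. Under these assumptions, a weight of 0 makes a term behave like a stop word (it contributes the same factor to every document), and a weight of 1 makes it mandatory (a document lacking the term scores zero).

```python
from collections import Counter


def score(query_terms, doc_tokens, collection_tokens, importance):
    """Return an assumed P(query | document) with one weight per query term.

    Each term contributes (1 - lam) * P(term | collection) + lam * P(term | doc),
    where lam is that term's importance (hypothetical parameter, in [0, 1]).
    """
    doc_counts = Counter(doc_tokens)
    coll_counts = Counter(collection_tokens)
    doc_len = len(doc_tokens)
    coll_len = len(collection_tokens)

    prob = 1.0
    for term in query_terms:
        lam = importance.get(term, 0.5)  # assumed default for unlisted terms
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_coll = coll_counts[term] / coll_len if coll_len else 0.0
        prob *= (1.0 - lam) * p_coll + lam * p_doc
    return prob


# Toy data, invented for illustration only.
doc_a = ["language", "model", "retrieval"]
doc_b = ["the", "the", "retrieval"]
collection = doc_a + doc_b
query = ["the", "language"]

# "the" is treated as a stop word (weight 0), "language" as mandatory (weight 1).
weights = {"the": 0.0, "language": 1.0}
print(score(query, doc_a, collection, weights))
print(score(query, doc_b, collection, weights))
```

With these toy weights, the second document is ruled out because it lacks the mandatory term, while the stop word has no effect on the relative ranking.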

Djoerd Hiemstra