论文信息 - Introduction to Information Retrieval: Relevance feedback and query expansion

Introduction to Information Retrieval: Relevance feedback and query expansion

SYNONYMY In most collections, the same concept may be referred to using different words. This issue, known as synonymy , has an impact on the recall of most information retrieval (IR) systems. For example, you would want a search for aircraft to match plane (but only for references to an airplane , not a woodworking plane), and for a search on thermodynamics to match references to heat in appropriate discussions. Users often attempt to address this problem themselves by manually refining a query, as was discussed in Section 1.4; in this chapter, we discuss ways in which a system can help with query refinement, either fully automatically or with the user in the loop. The methods for tackling this problem split into two major classes: global methods and local methods. Global methods are techniques for expanding or reformulating query terms independent of the query and results returned from it, so that changes in the query wording will cause the new query to match other semantically similar terms. Global methods include: Query expansion/reformulation with a thesaurus or WordNet (Section 9.2.2) Query expansion via automatic thesaurus generation (Section 9.2.3) Techniques like spelling correction (discussed in Chapter 3) Local methods adjust a query relative to the documents that initially appear to match the query. The basic methods here are: Relevance feedback (Section 9.1) Pseudorelevance feedback, also known as blind relevance feedback (Section 9.1.6) (Global) Indirect relevance feedback (Section 9.1.7)

Hinrich Schütze | Prabhakar Raghavan | Christopher D. Manning

[1] Gerard Salton,et al. Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[2] W. Bruce Croft,et al. Query expansion using local and global document analysis , 1996, SIGIR '96.

[3] Donna K. Harman,et al. Relevance feedback revisited , 1992, SIGIR '92.

[4] Chris Buckley,et al. Learning routing queries in a query zone , 1997, SIGIR '97.