How Precise Does Document Scoring Need to Be?

We explore the implications of tied scores that arise in the document similarity scoring regimes used when a retrieval engine processes queries. Our investigation has two parts: first, we evaluate past TREC runs to determine the prevalence and impact of tied scores, and to understand the alternative treatments that might be used to handle them; and second, we explore the implications of what might be thought of as “deliberate” tied scores, introduced in order to allow faster search. The first part of our investigation shows that while tied scores had the potential to disrupt TREC evaluations, in practice their effect was relatively minor. The second part helps explain why that was so, and shows that quite marked levels of score rounding can be tolerated without greatly affecting the ability to compare systems. The latter finding offers the potential for approximate scoring regimes that provide faster query processing with little or no loss of effectiveness.
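To make the idea of “deliberate” tied scores concrete, the following sketch (illustrative only, not code from the paper; the function names and sample scores are invented) shows how rounding similarity scores to a few digits can collapse nearly equal scores into ties, which the ranker must then break deterministically, here by document identifier:

```python
# Illustrative sketch: quantizing retrieval scores creates tied scores,
# which are then broken deterministically by document identifier.

def round_score(score, digits=2):
    """Quantize a similarity score, deliberately coarsening it."""
    return round(score, digits)

def rank(results, digits=2):
    """Rank (doc_id, score) pairs by rounded score (descending),
    breaking ties by ascending doc_id."""
    return sorted(results,
                  key=lambda pair: (-round_score(pair[1], digits), pair[0]))

results = [("d3", 0.7312), ("d1", 0.7291), ("d2", 0.4100)]
# At full precision d3 outranks d1; rounded to two digits both score 0.73,
# so the tie is broken in favour of the lexicographically smaller doc_id.
print(rank(results))
```

Coarser scores permit faster approximate query evaluation, at the cost of more ties; the paper's finding is that marked rounding of this kind barely affects system comparisons.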
