Paper information - CHEERS: CHeap & Engineered Evaluation of Retrieval Systems

In test-collection-based evaluation of retrieval effectiveness, much research has investigated different directions for economical and semi-automatic evaluation of retrieval systems. Although several methods have been proposed and experimentally evaluated, their accuracy still seems limited. In this paper we present our proposal for a more engineered approach to information retrieval evaluation.

Kevin Roitero
