论文信息 - On Topic Difficulty in IR Evaluation: The Effect of Systems, Corpora, and System Components

On Topic Difficulty in IR Evaluation: The Effect of Systems, Corpora, and System Components

In a test collection setting, topic difficulty can be defined as the average effectiveness of a set of systems for a topic. In this paper we study the effects on the topic difficulty of: (i) the set of retrieval systems; (ii) the underlying document corpus; and (iii) the system components. By generalizing methods recently proposed to study system component factor analysis, we perform a comprehensive analysis on topic difficulty and the relative effects of systems, corpora, and component interactions. Our findings show that corpora have the most significant effect on topic difficulty.

[1] Mike Thelwall,et al. Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[2] Eddy Maddalena,et al. Do Easy Topics Predict Effectiveness Better Than Difficult Topics? , 2017, ECIR.

[3] Ying Zhang,et al. Differences in effectiveness across sub-collections , 2012, CIKM.

[4] Elad Yom-Tov,et al. Estimating the query difficulty for information retrieval , 2010, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[5] Mark Sanderson,et al. Using Collection Shards to Study Retrieval Performance Effect Sizes , 2019, ACM Trans. Inf. Syst..

[6] Josiane Mothe,et al. Query Performance Prediction and Effectiveness Evaluation Without Relevance Judgments: Two Sides of the Same Coin , 2018, SIGIR.

[7] Nicola Ferro,et al. A General Linear Mixed Models Approach to Study System Component Effects , 2016, SIGIR.

[8] Donna K. Harman,et al. Overview of the Reliable Information Access Workshop , 2009, Information Retrieval.

[9] Nicola Ferro,et al. Toward an anatomy of IR system component performances , 2018, J. Assoc. Inf. Sci. Technol..

[10] Stephen E. Robertson,et al. Hits hits TREC: exploring IR evaluation results with network analysis , 2007, SIGIR.

[11] Mark Sanderson,et al. Sub-corpora Impact on System Effectiveness , 2017, SIGIR.

[12] Paul Over,et al. Blind Men and Elephants: Six Approaches to TREC data , 1999, Information Retrieval.