论文信息 - Document features predicting assessor disagreement

Document features predicting assessor disagreement

The notion of relevance differs between assessors, thus giving rise to assessor disagreement. Although assessor disagreement has been frequently observed, the factors leading to disagreement are still an open problem. In this paper we study the relationship between assessor disagreement and various topic independent factors such as readability and cohesiveness. We build a logistic model using reading level and other simple document features to predict assessor disagreement and rank documents by decreasing probability of disagreement. We compare the predictive power of these document-level features with that of a meta-search feature that aggregates a document's ranking across multiple retrieval runs. Our features are shown to be on a par with the meta-search feature, without requiring a large and diverse set of retrieval runs to calculate. Surprisingly, however, we find that the reading level features are negatively correlated with disagreement, suggesting that they are detecting some other aspect of document content.

[1] Emine Yilmaz,et al. Measure-based metasearch , 2005, SIGIR '05.

[2] Ben Carterette,et al. The effect of assessor error on IR system evaluation , 2010, SIGIR.

[3] William Webber,et al. Re-examining the Effectiveness of Manual Review , 2011 .

[4] Ellen M. Voorhees,et al. Variations in relevance judgments and the measurement of retrieval effectiveness , 1998, SIGIR '98.

[5] W. Bruce Croft,et al. Quality-biased ranking of web documents , 2011, WSDM '11.

[6] Peter Bailey,et al. Relevance assessment: are judges exchangeable and does it matter , 2008, SIGIR '08.

[7] Kevyn Collins-Thompson,et al. Predicting reading difficulty with statistical language models , 2005, J. Assoc. Inf. Sci. Technol..

[8] Jianqiang Wang,et al. A user study of relevance judgments for E-Discovery , 2010, ASIST.

[9] William Webber,et al. Effect of written instructions on assessor agreement , 2012, SIGIR '12.

[10] Ben Carterette,et al. Alternative assessor disagreement and retrieval depth , 2012, CIKM '12.