论文信息 - Corrected Evaluation Results of the NTCIR WWW-2, WWW-3, and WWW-4 English Subtasks

Corrected Evaluation Results of the NTCIR WWW-2, WWW-3, and WWW-4 English Subtasks

Unfortunately, the oﬃcial English (sub)task results reported in the NTCIR-14 WWW-2, NTCIR-15 WWW-3, and NTCIR-16 WWW-4 overview papers are incorrect due to noise in the oﬃcial qrels ﬁles; this paper reports results based on the corrected qrels ﬁles. The noise is due to a fatal bug in the backend of our relevance assessment interface. More speciﬁcally, at WWW-2, WWW-3, and WWW-4, two versions of pool ﬁles were created for each English topic: a PRI (“prioritised”) ﬁle, which uses the NTCIRPOOL script to prioritise likely relevant documents, and a RND (“randomised”) ﬁle, which randomises the pooled documents. This was done for the purpose of studying the eﬀect of document ordering for relevance assessors. However, the

[1] Jingtao Zhan. THUIR at the NTCIR-16 WWW-4 Task , 2022 .

[2] T. Sakai,et al. Overview of the NTCIR-16 We Want Web with CENTRE (WWW-4) Task , 2022 .

[3] T. Sakai,et al. SLWWW at the NTCIR-16 WWW-4 Task , 2022 .

[4] Philipp Schaer,et al. repro_eval: A Python Interface to Reproducibility Measures of System-Oriented IR Experiments , 2022, ECIR.

[5] Zhaohao Zeng,et al. SLWWW at the NTCIR-15 WWW-3 Task , 2020 .

[6] Zhicheng Dou,et al. Overview of the NTCIR-15 We Want Web with CENTRE (WWW-3) Task , 2020 .

[7] Yiqun Liu,et al. THUIR at the NTCIR-14 WWW-2 Task , 2019, NTCIR.

[8] Oren Kurland,et al. The Technion at the WWW-3 Task: Cluster-Based Document Retrieval , 2022 .

[9] Xiaochen Zuo. RUCIR at the NTCIR-15 WWW-3 Task , 2022 .

[10] Zhu Liang. NAUIR at the NTCIR-15 WWW-3 Task , 2022 .

[11] Min Zhang,et al. THUIR at the NTCIR-15 WWW-3 Task , 2022 .

[12] Makoto P. Kato,et al. KASYS at the NTCIR-16 WWW-4 Task , 2022 .

[13] Kohei Shinden,et al. KASYS at the NTCIR-15 WWW-3 Task , 2022 .

[14] Andrew Yates. MPII at the NTCIR-14 CENTRE Task , 2022 .