Reproducibility is a Process, not an Achievement: The Replicability of IR Reproducibility Experiments

This paper espouses a view of reproducibility in the computational sciences as a process and not just a point-in-time “achievement”. As a concrete case study, we revisit the Open-Source IR Reproducibility Challenge from 2015 and attempt to replicate those experiments: four years later, are those computational artifacts still functional? Perhaps not surprisingly, we are not able to replicate most of the retrieval runs encapsulated by those artifacts in a modern computational environment. We outline the various idiosyncratic reasons why and distill them into a series of “lessons learned” to help form an emerging set of best practices for the long-term sustainability of reproducibility efforts.
