A collaborative approach to computational reproducibility

Although a standard in natural science, reproducibility has been only episodically applied in experimental computer science. Scientific papers often present a large number of tables, plots and pictures that summarize the obtained results, but then loosely describe the steps taken to derive them. Not only can the methods and the implementation be complex, but also their configuration may require setting many parameters and/or depend on particular system configurations. While many researchers recognize the importance of reproducibility, the challenge of making it happen often outweigh the benefits. Fortunately, a plethora of reproducibility solutions have been recently designed and implemented by the community. In particular, packaging tools (e.g., ReproZip) and virtualization tools (e.g., Docker) are promising solutions towards facilitating reproducibility for both authors and reviewers. To address the incentive problem, we have implemented a new publication model for the Reproducibility Section of Information Systems Journal. In this section, authors submit a reproducibility paper that explains in detail the computational assets from a previous published manuscript in Information Systems.

[1]  Tim Brody,et al.  Evaluating Research Impact through Open Access to Scholarly Communication , 2006 .

[2]  Andrew P. Davison,et al.  Learning from the Past: Approaches for Reproducibility in Computational Neuroscience , 2013 .

[3]  Philippe Bonnet,et al.  Repeatability and workability evaluation of SIGMOD 2011 , 2011, SGMD.

[4]  Dennis Shasha,et al.  ReproZip: Computational Reproducibility With Ease , 2016, SIGMOD Conference.

[5]  S. Lawrence Free online availability substantially increases a paper's impact , 2001, Nature.

[6]  R. Nuzzo How scientists fool themselves – and how they can stop , 2015, Nature.

[7]  C. Begley,et al.  Drug development: Raise standards for preclinical cancer research , 2012, Nature.

[8]  Arian Maleki,et al.  Reproducible Research in Computational Harmonic Analysis , 2009, Computing in Science & Engineering.

[9]  Heather A. Piwowar,et al.  Sharing Detailed Research Data Is Associated with Increased Citation Rate , 2007, PloS one.

[10]  Alex M. Warren Repeatability and Benefaction in Computer Systems Research — A Study and a Modest Proposal , 2015 .

[11]  Steve Hitchcock,et al.  The effect of open access and downloads ('hits') on citation impact: a bibliography of studies , 2004 .

[12]  BichlerMartin,et al.  More than bin packing , 2015 .

[13]  Carl Boettiger,et al.  An introduction to Docker for reproducible research , 2014, OPSR.

[14]  Markus Rupp,et al.  Reproducible research in signal processing , 2009, IEEE Signal Processing Magazine.

[15]  C. Tenopir,et al.  Data Sharing by Scientists: Practices and Perceptions , 2011, PloS one.

[16]  Jelena Kovacevic,et al.  How to Encourage and Publish Reproducible Research , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[17]  Martin Bichler,et al.  Reproducible experiments on dynamic resource allocation in cloud data centers , 2016, Inf. Syst..

[18]  Martin Bichler,et al.  More than bin packing: Dynamic resource allocation strategies in cloud data centers , 2015, Inf. Syst..