Executable Papers for the R Community: The R2 Platform for Reproducible Research

Abstract Reviewing the computational part of scientific papers puts a lot of effort on referees: even if authors provide their data and code the referee often needs to install additional software on his machine and figure out which parts of the code belong to which part of the manuscript. As a result, computational results or often not reviewed at all. We propose a new web service which outsources validation of computational results in executable papers to an independent third party. Our system adapts the well-tested toolbox currently checking R extension packages in software repositories like CRAN to check manuscripts in paper repositories. In addition, paper packages can easily be downloaded from the server and installed to replicate results locally by anyone wishing to do so.

[1]  Markus Rupp,et al.  Reproducible research in signal processing , 2009, IEEE Signal Processing Magazine.

[2]  Kevin R Coombes,et al.  Run batch effects potentially compromise the usefulness of genomic signatures for ovarian cancer. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[3]  A. J. Rossini,et al.  Emacs Speaks Statistics: A Multiplatform, Multipackage Development Environment for Statistical Analysis , 2004 .

[4]  Friedrich Leisch,et al.  Sweave: Dynamic Generation of Statistical Reports Using Literate Data Analysis , 2002, COMPSTAT.

[5]  Ramana V. Davuluri,et al.  Biomedical Informatics for Cancer Research: Springer , 2010 .

[6]  Graham J. Williams,et al.  Rattle: A Data Mining GUI for R , 2009, R J..

[7]  Anil Potti,et al.  An integrated genomic-based approach to individualized treatment of patients with advanced-stage ovarian cancer. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[8]  Victoria Stodden,et al.  Reproducible Research Concepts and Tools for Cancer Bioinformatics , 2010 .

[9]  Scott M. Berry A Statistician Reads the Sports Pages , 2003 .

[10]  K. Coombes,et al.  Deriving chemosensitivity from cell lines: Forensic bioinformatics and reproducible research in high-throughput biology , 2009, 1010.1092.

[11]  Jelena Kovacevic,et al.  Reproducible research in signal processing , 2009, IEEE Signal Process. Mag..

[12]  Donald E. Knuth,et al.  Literate Programming , 1984, Comput. J..

[13]  Achim Zeileis,et al.  On reproducible econometric research , 2009 .

[14]  Kurt Hornik,et al.  Prospects and challenges in R package development , 2011, Comput. Stat..

[15]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[16]  Torsten Hothorn,et al.  Case studies in reproducibility , 2011, Briefings Bioinform..

[17]  David L. Donoho,et al.  WaveLab and Reproducible Research , 1995 .

[18]  Jan de Leeuw,et al.  Reproducible Research: the Bottom Line , 2001 .