Easyreporting simplifies the implementation of Reproducible Research layers in R software

During last years “irreproducibility” became a general problem in omics data analysis due to the use of sophisticated and poorly described computational procedures. For avoiding misleading results, it is necessary to inspect and reproduce the entire data analysis as a unified product. Reproducible Research (RR) provides general guidelines for public access to the analytic data and related analysis code combined with natural language documentation, allowing third-parties to reproduce the findings. We developed easyreporting, a novel R/Bioconductor package, to facilitate the implementation of an RR layer inside reports/tools without requiring any knowledge of the R Markdown language. We describe the main functionalities and illustrate how to create an analysis report using a typical case study concerning the analysis of RNA-seq data. Then, we also show how to trace R functions automatically. Thanks to this latter feature, easyreporting results beneficial for developers to implement procedures that automatically keep track of the analysis steps within Graphical User Interfaces (GUIs). Easyreporting can be useful in supporting the reproducibility of any data analysis project and the implementation of GUIs. It turns out to be very helpful in bioinformatics, where the complexity of the analyses makes it extremely difficult to trace all the steps and parameters used in the study.

[1]  Casey S. Greene,et al.  Recommendations to enhance rigor and reproducibility in biomedical research , 2020, GigaScience.

[2]  Ryan Gosselin,et al.  Current RNA-seq methodology reporting limits reproducibility , 2019, Briefings Bioinform..

[3]  Stephen R. Piccolo,et al.  Tools and techniques for computational reproducibility , 2016, GigaScience.

[4]  Mark Gerstein,et al.  Structuring supplemental materials in support of reproducibility , 2017, Genome Biology.

[5]  Francesco Napolitano repo: an R package for data-centered management of bioinformatic pipelines , 2017, BMC Bioinformatics.

[6]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[7]  Donald E. Knuth,et al.  Literate Programming , 1984, Comput. J..

[8]  Anton Nekrutenko,et al.  Ten Simple Rules for Reproducible Computational Research , 2013, PLoS Comput. Biol..

[9]  C. Angelini,et al.  Distinct Antigen Delivery Systems Induce Dendritic Cells’ Divergent Transcriptional Response: New Insights from a Comparative and Reproducible Computational Analysis , 2017, International journal of molecular sciences.

[10]  Ute Roessner,et al.  Best practice data life cycle approaches for the life sciences , 2017, bioRxiv.

[11]  Claudia Angelini,et al.  HiCeekR: A Novel Shiny App for Hi-C Data Analysis , 2019, Front. Genet..

[12]  A. Marke,et al.  Transforming Climate Finance and Green Investment with Blockchains , 2018 .

[13]  Carl Boettiger,et al.  An introduction to Docker for reproducible research , 2014, OPSR.

[14]  Claudia Angelini,et al.  Advantages and Limits in the Adoption of Reproducible Research and R-Tools for the Analysis of Omic Data , 2015, CIBB.

[15]  Claudia Angelini,et al.  StructuRly: A novel shiny app to produce comprehensive, detailed and interactive plots for population genetic analysis , 2020, PloS one.

[16]  Friedrich Leisch,et al.  Sweave: Dynamic Generation of Statistical Reports Using Literate Data Analysis , 2002, COMPSTAT.

[17]  Jonathan M. Borwein,et al.  Facilitating Reproducibility in Scientific Computing: Principles and Practice , 2015 .

[18]  Francesco Russo,et al.  Advancements in RNASeqGUI towards a Reproducible Analysis of RNA-Seq Experiments , 2016, BioMed research international.

[19]  Claudia Angelini,et al.  RNASeqGUI: a GUI for analysing RNA-Seq data , 2014, Bioinform..

[20]  William Michael Landau,et al.  The drake R package: a pipeline toolkit for reproducibility and high-performance computing , 2018, J. Open Source Softw..

[21]  Marco Beccuti,et al.  Reproducible bioinformatics project: a community for reproducible bioinformatics analysis pipelines , 2017, BMC Bioinformatics.

[22]  Kevin Rue-Albrecht,et al.  iSEE: Interactive SummarizedExperiment Explorer , 2018, F1000Research.