Advancements in RNASeqGUI towards a Reproducible Analysis of RNA-Seq Experiments

We present the advancements and novelties recently introduced in RNASeqGUI, a graphical user interface that helps biologists to handle and analyse large data collected in RNA-Seq experiments. This work focuses on the concept of reproducible research and shows how it has been incorporated in RNASeqGUI to provide reproducible (computational) results. The novel version of RNASeqGUI combines graphical interfaces with tools for reproducible research, such as literate statistical programming, human readable report, parallel executions, caching, and interactive and web-explorable tables of results. These features allow the user to analyse big datasets in a fast, efficient, and reproducible way. Moreover, this paper represents a proof of concept, showing a simple way to develop computational tools for Life Science in the spirit of reproducible research.

[1]  Zhifa Liu,et al.  An R package that automatically collects and archives details for reproducible computing , 2014, BMC Bioinformatics.

[2]  Pooja Mittal,et al.  A novel signaling pathway impact analysis , 2009, Bioinform..

[3]  Nature Protocols , 2006, Nature Cell Biology.

[4]  Francesca Cordero,et al.  oneChannelGUI: a graphical interface to Bioconductor tools, designed for life scientists who are not familiar with R language , 2007, Bioinform..

[5]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[6]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[7]  Cristóbal Fresno,et al.  RDAVIDWebService: a versatile R interface to DAVID , 2013, Bioinform..

[8]  Federico Marini,et al.  Systematically evaluating interfaces for RNA-seq analysis from a life scientist perspective , 2016, Briefings Bioinform..

[9]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[10]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[11]  Roger D Peng,et al.  Reproducible research and Biostatistics. , 2009, Biostatistics.

[12]  Marianna Aprile,et al.  RNA-Seq and human complex diseases: recent accomplishments and future perspectives , 2012, European Journal of Human Genetics.

[13]  Duncan Temple Lang,et al.  RGtk2: A Graphical User Interface Toolkit for R , 2010 .

[14]  James Taylor,et al.  Next-generation sequencing data interpretation: enhancing reproducibility and accessibility , 2012, Nature Reviews Genetics.

[15]  Melanie A. Huntley,et al.  ReportingTools: an automated result processing and presentation toolkit for high-throughput genomic analyses , 2013, Bioinform..

[16]  Brigid Wilson,et al.  Implementing Reproducible Research , 2014 .

[17]  Roger D. Peng,et al.  INTERACTING WITH DATA USING THE FILEHASH PACKAGE FOR R , 2006 .

[18]  Michael Friendly,et al.  HE Plots for Repeated Measures Designs , 2010 .

[19]  Andreas Kiermeier,et al.  Visualizing and Assessing Acceptance Sampling Plans: The R Package AcceptanceSampling , 2008 .

[20]  Robert Gentleman,et al.  Statistical Applications in Genetics and Molecular Biology , 2005 .

[21]  A. Ciccodicola,et al.  Uncovering the Complexity of Transcriptomes with RNA-Seq , 2010, Journal of biomedicine & biotechnology.

[22]  Roger D. Peng,et al.  Caching and Distributing Statistical Analyses in R , 2008 .

[23]  Steve Weston,et al.  Foreach Parallel Adaptor for the 'parallel' Package , 2015 .

[24]  Claudia Angelini,et al.  RNASeqGUI: a GUI for analysing RNA-Seq data , 2014, Bioinform..

[25]  Na Li,et al.  Snow: A Parallel Computing Framework for the R System , 2009, International Journal of Parallel Programming.

[26]  Z. Boka Enhancing reproducibility , 2013, Nature Methods.

[27]  W. Shi,et al.  The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote , 2013, Nucleic acids research.

[28]  Darrel C. Ince,et al.  The case for open computer programs , 2012, Nature.

[29]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[30]  C. Thermes,et al.  Ten years of next-generation sequencing technology. , 2014, Trends in genetics : TIG.

[31]  Robert Gentleman,et al.  Software for Computing and Annotating Genomic Ranges , 2013, PLoS Comput. Biol..

[32]  Fatih Ozsolak,et al.  RNA sequencing: advances, challenges and opportunities , 2011, Nature Reviews Genetics.

[33]  Matthew D. Young,et al.  From RNA-seq reads to differential expression results , 2010, Genome Biology.

[34]  Mattia D'Antonio,et al.  RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application , 2015, BMC Genomics.

[35]  Yihui Xie,et al.  Dynamic Documents with R and knitr , 2015 .

[36]  Peter J. Woolf,et al.  GAGE: generally applicable gene set enrichment for pathway analysis , 2009, BMC Bioinformatics.

[37]  Li Yang,et al.  Conservation of an RNA regulatory map between Drosophila and mammals. , 2011, Genome research.

[38]  B. Di Camillo,et al.  Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis. , 2015, Briefings in functional genomics.

[39]  David G Hendrickson,et al.  Differential analysis of gene regulation at transcript resolution with RNA-seq , 2012, Nature Biotechnology.

[40]  A. Mortazavi,et al.  Computation for ChIP-seq and RNA-seq studies , 2009, Nature Methods.

[41]  Mark Stitt,et al.  RobiNA: a user-friendly, integrated software solution for RNA-Seq-based transcriptomics , 2012, Nucleic Acids Res..

[42]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.