Automated analysis of small RNA datasets with RAPID

Summary: Understanding the role of small RNA (sRNA) in diverse biological processes is of current interest and often approached through sRNA sequencing. However, analysis of these datasets is difficult due to the complexity of biological RNA processing pathways, which differ between species. Therefore, several different parameters should be analyzed, that were found to guide researchers, such as sRNA strand specificity, length distribution, and distribution of base modifications. We present RAPID, a generic sRNA analysis pipeline, which captures information inherent in the datasets and automatically produces numerous visualizations as user-friendly HTML reports, covering multiple categories required for sRNA analysis. RAPID also facilitates an automated comparison of multiple datasets, with different normalization techniques, and integrates differential expression analysis using DESeq2. Availability and Implementation: RAPID under MIT license at https://github.com/SchulzLab/RAPID or as a bioconda recipe https://bioconda.github.io/recipes/rapid/README.html.

[1]  K. Joshi,et al.  Transcriptomics and Epigenomics , 2017 .

[2]  Nicolas Servant,et al.  A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis , 2013, Briefings Bioinform..

[3]  André Fischer,et al.  Oasis: online analysis of small RNA deep sequencing data , 2015, Bioinform..

[4]  M. Axtell ShortStack: comprehensive annotation and quantification of small RNA genes. , 2013, RNA.

[5]  Antonio Rinaldi,et al.  iMir: An integrated pipeline for high-throughput analysis of small non-coding RNA data obtained by smallRNA-Seq , 2013, BMC Bioinformatics.

[6]  J. Preer,et al.  Does Ribosomal DNA Get Out of the Micronuclear Chromosome inParamecium tetraurelia by Means of a Rolling Circle? , 1999, Molecular and Cellular Biology.

[7]  Yuk Yee Leung,et al.  SPAR: small RNA-seq portal for analysis of sequencing experiments , 2018, Nucleic Acids Res..

[8]  Marcel H. Schulz,et al.  Epigenetic regulation of serotype expression antagonizes transcriptome dynamics in Paramecium tetraurelia , 2015, DNA research : an international journal for rapid publication of reports on genes and genomes.

[9]  Michael I. Love,et al.  Differential analysis of count data – the DESeq2 package , 2013 .

[10]  L. Bossi,et al.  Competing endogenous RNAs: a target-centric view of small RNA regulation in bacteria , 2016, Nature Reviews Microbiology.

[11]  T. Berendonk,et al.  Intraspecific Genetic Variation in Paramecium Revealed by Mitochondrial Cytochrome c Oxidase I Sequences , 2006, The Journal of eukaryotic microbiology.

[12]  Marcel H. Schulz,et al.  A Global View of Gene Activity and Alternative Splicing by Deep Sequencing of the Human Transcriptome , 2008, Science.

[13]  Vincent Moulton,et al.  The UEA Small RNA Workbench: A Suite of Computational Tools for Small RNA Analysis. , 2017, Methods in molecular biology.

[14]  Fei Li,et al.  Prediction of piRNAs using transposon interaction and a support vector machine , 2014, BMC Bioinformatics.

[15]  Ulrike Götz,et al.  Primary and secondary siRNA synthesis triggered by RNAs from food bacteria in the ciliate Paramecium tetraurelia , 2015, Nucleic acids research.

[16]  Sebastian D. Mackowiak,et al.  miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades , 2011, Nucleic acids research.

[17]  Marcel H. Schulz,et al.  Two sets of RNAi components are required for heterochromatin formation in trans triggered by truncated transgenes , 2016, Nucleic acids research.

[18]  Luciano Milanesi,et al.  iSmaRT: a toolkit for a comprehensive analysis of small RNA‐Seq data , 2017, Bioinform..

[19]  Emmanuel Barillot,et al.  ncPRO-seq: a tool for annotation and profiling of ncRNAs in sRNA-seq data , 2012, Bioinform..

[20]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[21]  Huan Zhang,et al.  CPSS 2.0: a computational platform update for the analysis of small RNA sequencing data , 2017, Bioinform..

[22]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[23]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[24]  Wei Wang,et al.  piPipes: a set of pipelines for piRNA and transposon analysis via small RNA-seq, RNA-seq, degradome- and CAGE-seq, ChIP-seq and genomic DNA sequencing , 2014, Bioinform..

[25]  Martin J. Aryee,et al.  Survival in Quiescence Requires the Euchromatic Deployment of Clr4/SUV39H by Argonaute-Associated Small RNAs. , 2016, Molecular cell.

[26]  A. Chakraborty,et al.  An overview of pre-ribosomal RNA processing in eukaryotes , 2014, Wiley interdisciplinary reviews. RNA.

[27]  T. Vellai,et al.  Highly efficient RNAi and Cas9-based auto-cloning systems for C. elegans research , 2018, Nucleic acids research.

[28]  Andrew Lonie,et al.  iSRAP – a one-touch research tool for rapid profiling of small RNA-seq data , 2015, Journal of extracellular vesicles.

[29]  Ping Chen,et al.  SePIA: RNA and small RNA sequence processing, integration, and analysis , 2016, BioData Mining.

[30]  Vincent Moulton,et al.  A toolkit for analysing large-scale plant small RNA datasets , 2008, Bioinform..

[31]  Michael Hackenberg,et al.  sRNAtoolbox: an integrated collection of small RNA research tools , 2015, Nucleic Acids Res..

[32]  Qinghua Shi,et al.  mirTools 2.0 for non-coding RNA discovery, profiling, and functional annotation based on high-throughput sequencing , 2013, RNA biology.