miRge - A Multiplexed Method of Processing Small RNA-Seq Data to Determine MicroRNA Entropy

Small RNA RNA-seq for microRNAs (miRNAs) is a rapidly developing field where opportunities still exist to create better bioinformatics tools to process these large datasets and generate new, useful analyses. We built miRge to be a fast, smart small RNA-seq solution to process samples in a highly multiplexed fashion. miRge employs a Bayesian alignment approach, whereby reads are sequentially aligned against customized mature miRNA, hairpin miRNA, noncoding RNA and mRNA sequence libraries. miRNAs are summarized at the level of raw reads in addition to reads per million (RPM). Reads for all other RNA species (tRNA, rRNA, snoRNA, mRNA) are provided, which is useful for identifying potential contaminants and optimizing small RNA purification strategies. miRge was designed to optimally identify miRNA isomiRs and employs an entropy based statistical measurement to identify differential production of isomiRs. This allowed us to identify decreasing entropy in isomiRs as stem cells mature into retinal pigment epithelial cells. Conversely, we show that pancreatic tumor miRNAs have similar entropy to matched normal pancreatic tissues. In a head-to-head comparison with other miRNA analysis tools (miRExpress 2.0, sRNAbench, omiRAs, miRDeep2, Chimira, UEA small RNA Workbench), miRge was faster (4 to 32-fold) and was among the top-two methods in maximally aligning miRNAs reads per sample. Moreover, miRge has no inherent limits to its multiplexing. miRge was capable of simultaneously analyzing 100 small RNA-Seq samples in 52 minutes, providing an integrated analysis of miRNA expression across all samples. As miRge was designed for analysis of single as well as multiple samples, miRge is an ideal tool for high and low-throughput users. miRge is freely available at http://atlas.pathology.jhu.edu/baras/miRge.html.

[1]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[2]  Kyle J. Gaulton,et al.  The miRNA Profile of Human Pancreatic Islets and Beta-Cells and Relationship to Type 2 Diabetes Pathogenesis , 2013, PloS one.

[3]  Anton J. Enright,et al.  Kraken: A set of tools for quality control and analysis of high-throughput sequence data☆ , 2013, Methods.

[4]  R. Pease,et al.  A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine , 1987, Cell.

[5]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.

[6]  Yuanji Zhang,et al.  Lack of detectable oral bioavailability of plant microRNAs after feeding in mice , 2013, Nature Biotechnology.

[7]  Patricia P. Chan,et al.  GtRNAdb: a database of transfer RNA genes detected in genomic sequence , 2008, Nucleic Acids Res..

[8]  Toby C. Cornish,et al.  Lessons from miR-143/145: the importance of cell-type localization of miRNAs , 2014, Nucleic acids research.

[9]  Feng Chen,et al.  A challenge for miRNA: multiple isomiRs in miRNAomics. , 2014, Gene.

[10]  O. Kent,et al.  MicroRNA profiling of diverse endothelial cell types , 2011, BMC Medical Genomics.

[11]  M. Tewari,et al.  MicroRNA profiling: approaches and considerations , 2012, Nature Reviews Genetics.

[12]  Hong-Mei Zhang,et al.  Genome‐wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis , 2012, Human mutation.

[13]  Toby C. Cornish,et al.  A Critical Evaluation of microRNA Biomarkers in Non-Neoplastic Disease , 2014, PloS one.

[14]  C. Holding,et al.  Human embryonic genes re-expressed in cancer cells , 2001, Oncogene.

[15]  M. Laakso,et al.  Genetic regulation of human adipose microRNA expression and its consequences for metabolic traits. , 2013, Human molecular genetics.

[16]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[17]  C. Langford,et al.  5′ isomiR variation is of functional and evolutionary importance , 2014, Nucleic acids research.

[18]  Fabian J Theis,et al.  Next-generation sequencing reveals novel differentially regulated mRNAs, lncRNAs, miRNAs, sdRNAs and a piRNA in pancreatic cancer , 2015, Molecular Cancer.

[19]  Sören Müller,et al.  omiRas: a Web server for differential expression analysis of miRNAs derived from small RNA-Seq data , 2013, Bioinform..

[20]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[21]  Tingming Liang,et al.  Consistent isomiR expression patterns and 3′ addition events in miRNA gene clusters and families implicate functional and evolutionary relationships , 2012, Molecular Biology Reports.

[22]  Owen M. Rennert,et al.  Identification of Differentially Expressed MicroRNAs Across the Developing Human Brain , 2013, Molecular Psychiatry.

[23]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[24]  Sebastian D. Mackowiak,et al.  miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades , 2011, Nucleic acids research.

[25]  Jing Tu,et al.  Entropy-Based Model for MiRNA Isoform Analysis , 2015, PloS one.

[26]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[27]  Hsien-Da Huang,et al.  miRExpress: Analyzing high-throughput sequencing data for profiling microRNA expression , 2009, BMC Bioinformatics.

[28]  Zhigang Xue,et al.  Identification of miRNA Signatures during the Differentiation of hESCs into Retinal Pigment Epithelial Cells , 2012, PloS one.

[29]  R. Sachidanandam,et al.  High-throughput assessment of microRNA activity and function using microRNA sensor and decoy libraries , 2012, Nature Methods.

[30]  Ana M. Aransay,et al.  miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments , 2009, Nucleic Acids Res..

[31]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[32]  Michael Hackenberg,et al.  sRNAtoolbox: an integrated collection of small RNA research tools , 2015, Nucleic Acids Res..

[33]  Anton J. Enright,et al.  Chimira: analysis of small RNA sequencing data and microRNA modifications , 2015, Bioinform..

[34]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.