miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades

microRNAs (miRNAs) are a large class of small non-coding RNAs which post-transcriptionally regulate the expression of a large fraction of all animal genes and are important in a wide range of biological processes. Recent advances in high-throughput sequencing allow miRNA detection at unprecedented sensitivity, but the computational task of accurately identifying the miRNAs in the background of sequenced RNAs remains challenging. For this purpose, we have designed miRDeep2, a substantially improved algorithm which identifies canonical and non-canonical miRNAs such as those derived from transposable elements and informs on high-confidence candidates that are detected in multiple independent samples. Analyzing data from seven animal species representing the major animal clades, miRDeep2 identified miRNAs with an accuracy of 98.6–99.9% and reported hundreds of novel miRNAs. To test the accuracy of miRDeep2, we knocked down the miRNA biogenesis pathway in a human cell line and sequenced small RNAs before and after. The vast majority of the >100 novel miRNAs expressed in this cell line were indeed specifically downregulated, validating most miRDeep2 predictions. Last, a new miRNA expression profiling routine, low time and memory usage and user-friendly interactive graphic output can make miRDeep2 useful to a wide range of researchers.

[1]  V. Ambros,et al.  An Extensive Class of Small RNAs in Caenorhabditis elegans , 2001, Science.

[2]  L. Lim,et al.  An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans , 2001, Science.

[3]  T. Tuschl,et al.  Identification of Novel Genes Coding for Small Expressed RNAs , 2001, Science.

[4]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[5]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[6]  Yves Van de Peer,et al.  Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences , 2004, Bioinform..

[7]  Eugene Berezikov,et al.  Phylogenetic Shadowing and Computational Identification of Human microRNA Genes , 2005, Cell.

[8]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[9]  Vetle I. Torvik,et al.  Mammalian microRNAs derived from genomic repeats. , 2005, Trends in genetics : TIG.

[10]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[11]  Christopher M. Player,et al.  Large-Scale Sequencing Reveals 21U-RNAs and Additional MicroRNAs and Endogenous siRNAs in C. elegans , 2006, Cell.

[12]  I. King Jordan,et al.  A Family of Human MicroRNA Genes from Miniature Inverted-Repeat Transposable Elements , 2007, PloS one.

[13]  N. Rajewsky,et al.  The evolution of gene regulation by transcription factors and microRNAs , 2007, Nature Reviews Genetics.

[14]  S. Cohen,et al.  microRNA functions. , 2007, Annual review of cell and developmental biology.

[15]  Manolis Kellis,et al.  Evolution, biogenesis, expression, and target predictions of a substantially expanded set of Drosophila microRNAs. , 2007, Genome research.

[16]  N. Rajewsky,et al.  A human snoRNA with microRNA-like functions. , 2008, Molecular cell.

[17]  Tyson A. Clark,et al.  HITS-CLIP yields genome-wide insights into brain alternative RNA processing , 2008, Nature.

[18]  Eugene Berezikov,et al.  Functionally distinct regulatory RNAs generated by bidirectional transcription and processing of microRNA loci. , 2008, Genes & development.

[19]  N. Rajewsky,et al.  Discovering microRNAs from deep sequencing data using miRDeep , 2008, Nature Biotechnology.

[20]  David P. Bartel,et al.  Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals , 2008, Nature.

[21]  F. Slack,et al.  Small non-coding RNAs in animal development , 2008, Nature Reviews Molecular Cell Biology.

[22]  D. Haussler,et al.  Posttranscriptional Crossregulation between Drosha and DGCR8 , 2009, Cell.

[23]  C. Mayr,et al.  Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells , 2009, Cell.

[24]  R. Gregory,et al.  Many roads to maturity: microRNA biogenesis pathways and their regulation , 2009, Nature Cell Biology.

[25]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[26]  Zachary Pincus,et al.  Dynamic expression of small non-coding RNAs, including novel microRNAs and piRNAs/21U-RNAs, during Caenorhabditis elegans development , 2009, Genome Biology.

[27]  Alok Bhattacharya,et al.  Analysis of microRNA transcriptome by deep sequencing of small RNA libraries of peripheral blood , 2010, BMC Genomics.

[28]  M. Levine,et al.  A distinct class of small RNAs arises from pre-miRNA–proximal regions in a simple chordate , 2009, Nature Structural &Molecular Biology.

[29]  R. Gregory,et al.  Post-transcriptional control of DGCR8 expression by the Microprocessor. , 2009, RNA.

[30]  M. Levine,et al.  miRTRAP, a computational method for the systematic identification of miRNAs from high throughput sequencing data , 2010, Genome Biology.

[31]  Martin Hirst,et al.  High-resolution profiling and discovery of planarian small RNAs , 2009, Proceedings of the National Academy of Sciences.

[32]  Johan Vallon-Christersson,et al.  The non-coding RNA of the multidrug resistance-linked vault particle encodes multiple regulatory small RNAs , 2009, Nature Cell Biology.

[33]  W. Filipowicz,et al.  Mechanisms of miRNA-mediated post-transcriptional regulation in animal cells. , 2009, Current opinion in cell biology.

[34]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[35]  Geoffrey J. Barton,et al.  Human miRNA Precursors with Box H/ACA snoRNA Features , 2009, PLoS Comput. Biol..

[36]  F. Piano,et al.  Large scale sorting of C. elegans embryos reveals the dynamics of small RNA expression , 2009, Nature Methods.

[37]  Ana M. Aransay,et al.  miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments , 2009, Nucleic Acids Res..

[38]  Peter F. Stadler,et al.  tRNAdb 2009: compilation of tRNA sequences and tRNA genes , 2008, Nucleic Acids Res..

[39]  N. Hayward,et al.  Characterization of the Melanoma miRNAome by Deep Sequencing , 2010, PloS one.

[40]  B. Langmead,et al.  Aligning Short Sequencing Reads with Bowtie , 2010, Current protocols in bioinformatics.

[41]  William Ritchie,et al.  Nuclear-localized tiny RNAs are associated with transcription initiation and splice sites in metazoans , 2010, Nature Structural &Molecular Biology.

[42]  M. Lachmann,et al.  MicroRNA, mRNA, and protein expression link development and aging in human and macaque brain. , 2010, Genome research.

[43]  Lai Wei,et al.  Regulation of microRNA expression and abundance during lymphopoiesis. , 2010, Immunity.

[44]  Kyle Kai-How Farh,et al.  Expanding the microRNA targeting code: functional sites with centered pairing. , 2010, Molecular cell.

[45]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[46]  Hui Zhou,et al.  Deep Sequencing of Human Nuclear and Cytoplasmic Small RNAs Reveals an Unexpectedly Complex Subcellular Distribution of miRNAs and tRNA 3′ Trailers , 2010, PloS one.

[47]  Dereje D. Jima,et al.  Deep sequencing of the small RNA transcriptome of normal and malignant human B cells identifies hundreds of novel microRNAs. , 2010, Blood.

[48]  Alessandra Carbone,et al.  MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data , 2010, Bioinform..

[49]  M. Kimmel,et al.  Conflict of interest statement. None declared. , 2010 .

[50]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[51]  C. Nusbaum,et al.  Mammalian microRNAs: experimental evaluation of novel and previously annotated genes. , 2010, Genes & development.

[52]  W. Lei,et al.  Genome-wide identification of micro-ribonucleic acids associated with human endometrial receptivity in natural and stimulated cycles by deep sequencing. , 2011, Fertility and sterility.

[53]  Ana M. Aransay,et al.  miRanalyzer: an update on the detection and analysis of microRNAs in high-throughput sequencing experiments , 2011, Nucleic Acids Res..

[54]  Li Lin,et al.  Identification of miRNomes in human liver and hepatocellular carcinoma reveals miR-199a/b-3p as therapeutic target for hepatocellular carcinoma. , 2011, Cancer cell.