PmiRDiscVali: an integrated pipeline for plant microRNA discovery and validation

BackgroundMicroRNAs (miRNAs) constitute a well-known small RNA (sRNA) species with important regulatory roles. To date, several bioinformatics tools have been developed for large-scale prediction of miRNAs based on high-throughput sequencing data. However, some of these tools become invalid without reference genomes, while some tools cannot supply user-friendly outputs. Besides, most of the current tools focus on the importance of secondary structures and sRNA expression patterns for miRNA prediction, while they do not pay attention to miRNA processing for reliability check.ResultsHere, we reported a pipeline PmiRDiscVali for plant miRNA discovery and partial validation. This pipeline integrated the popular tool miRDeep-P for plant miRNA prediction, making PmiRDiscVali compatible for both reference-based and de novo predictions. To check the prediction reliability, we adopted the concept that the miRNA processing intermediates could be tracked by degradome sequencing (degradome-seq) during the development of PmiRDiscVali. A case study was performed by using the public sequencing data of Dendrobium officinale, in order to show the clear and concise presentation of the prediction results.ConclusionSummarily, the integrated pipeline PmiRDiscVali, featured with degradome-seq data-based validation and vivid result presentation, should be useful for large-scale identification of plant miRNA candidates.

[1]  M. Levine,et al.  miRTRAP, a computational method for the systematic identification of miRNAs from high throughput sequencing data , 2010, Genome Biology.

[2]  Eugene Berezikov,et al.  Evolution of microRNA diversity and regulation in animals , 2011, Nature Reviews Genetics.

[3]  Yincong Zhou,et al.  miRNA Digger: a comprehensive pipeline for genome-wide novel miRNA mining , 2016, Scientific Reports.

[4]  Huizhong Wang,et al.  Bioinformatics resources for deciphering the biogenesis and action pathways of plant small RNAs , 2017, Rice.

[5]  Jiyuan An,et al.  miRPlant: an integrated tool for identification of plant miRNA from RNA sequencing data , 2014, BMC Bioinformatics.

[6]  C. Nelson,et al.  miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data , 2012, Nucleic acids research.

[7]  Sebastian D. Mackowiak,et al.  miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades , 2011, Nucleic acids research.

[8]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[9]  Alessandra Carbone,et al.  MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data , 2010, Bioinform..

[10]  D. Bartel,et al.  Criteria for Annotation of Plant MicroRNAs , 2008, The Plant Cell Online.

[11]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[12]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[13]  Scott A. Givan,et al.  Expression of Arabidopsis MIRNA Genes1[w] , 2005, Plant Physiology.

[14]  Ana M. Aransay,et al.  miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments , 2009, Nucleic Acids Res..

[15]  Huizhong Wang,et al.  A transcriptome-wide, organ-specific regulatory map of Dendrobium officinale, an important traditional Chinese orchid herb , 2016, Scientific Reports.

[16]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[17]  Chon-Kit Kenneth Chan,et al.  Analysis of RNA-Seq Data Using TopHat and Cufflinks. , 2016, Methods in molecular biology.

[18]  Ping Wu,et al.  High-throughput degradome sequencing can be used to gain insights into microRNA precursor metabolism. , 2010, Journal of experimental botany.

[19]  Lei Li,et al.  miRDeep-P: a computational tool for analyzing the microRNA transcriptome in plants , 2011, Bioinform..

[20]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[21]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[22]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[23]  N. Rajewsky,et al.  Discovering microRNAs from deep sequencing data using miRDeep , 2008, Nature Biotechnology.

[24]  Sanghyuk Lee,et al.  MicroRNA genes are transcribed by RNA polymerase II , 2004, The EMBO journal.

[25]  G. Ruvkun,et al.  A uniform system for microRNA annotation. , 2003, RNA.

[26]  D. Bartel,et al.  MicroRNAS and their regulatory roles in plants. , 2006, Annual review of plant biology.

[27]  Huizhong Wang,et al.  Tracking microRNA Processing Signals by Degradome Sequencing Data Analysis , 2018, Front. Genet..

[28]  Xiaoxia Ma,et al.  The use of high-throughput sequencing methods for plant microRNA research , 2015, RNA biology.