MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans

BackgroundMicroRNAs (miRNAs) are recognized as one of the most important families of non-coding RNAs that serve as important sequence-specific post-transcriptional regulators of gene expression. Identification of miRNAs is an important requirement for understanding the mechanisms of post-transcriptional regulation. Hundreds of miRNAs have been identified by direct cloning and computational approaches in several species. However, there are still many miRNAs that remain to be identified due to lack of either sequence features or robust algorithms to efficiently identify them.ResultsWe have evaluated features valuable for pre-miRNA prediction, such as the local secondary structure differences of the stem region of miRNA and non-miRNA hairpins. We have also established correlations between different types of mutations and the secondary structures of pre-miRNAs. Utilizing these features and combining some improvements of the current pre-miRNA prediction methods, we implemented a computational learning method SVM (support vector machine) to build a high throughput and good performance computational pre-miRNA prediction tool called MiRFinder. The tool was designed for genome-wise, pair-wise sequences from two related species. The method built into the tool consisted of two major steps: 1) genome wide search for hairpin candidates and 2) exclusion of the non-robust structures based on analysis of 18 parameters by the SVM method. Results from applying the tool for chicken/human and D. melanogaster/D. pseudoobscura pair-wise genome alignments showed that the tool can be used for genome wide pre-miRNA predictions.ConclusionThe MiRFinder can be a good alternative to current miRNA discovery software. This tool is available at http://www.bioinformatics.org/mirfinder/.

[1]  P. Rouzé,et al.  Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Santosh K. Mishra,et al.  De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures , 2007, Bioinform..

[3]  Ola R. Snøve,et al.  Reliable prediction of Drosha processing sites improves microRNA gene prediction. , 2007, Bioinformatics.

[4]  Edouard Bertrand,et al.  Exportin-5 Mediates Nuclear Export of Minihelix-containing RNAs* , 2003, The Journal of Biological Chemistry.

[5]  Ron Shamir,et al.  Accurate identification of alternatively spliced exons using support vector machine , 2005, Bioinform..

[6]  B. Bass,et al.  A Role for the RNase III Enzyme DCR-1 in RNA Interference and Germ Line Development in Caenorhabditis elegans , 2001, Science.

[7]  William Stafford Noble,et al.  Assessing computational tools for the discovery of transcription factor binding sites , 2005, Nature Biotechnology.

[8]  Fei Li,et al.  Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine , 2005, BMC Bioinformatics.

[9]  Sanghyuk Lee,et al.  MicroRNA genes are transcribed by RNA polymerase II , 2004, The EMBO journal.

[10]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[13]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[14]  K. Czaplinski,et al.  Exportin 5 is a RanGTP-dependent dsRNA-binding protein that mediates nuclear export of pre-miRNAs. , 2004, RNA.

[15]  B. Cullen,et al.  Exportin-5 mediates the nuclear export of pre-microRNAs and short hairpin RNAs. , 2003, Genes & development.

[16]  C. Burge,et al.  The microRNAs of Caenorhabditis elegans. , 2003, Genes & development.

[17]  U. Kutay,et al.  Nuclear Export of MicroRNA Precursors , 2004, Science.

[18]  Jin-Wu Nam,et al.  Genomics of microRNA. , 2006, Trends in genetics : TIG.

[19]  L. Lim,et al.  An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans , 2001, Science.

[20]  A. Pasquinelli,et al.  A Cellular Function for the RNA-Interference Enzyme Dicer in the Maturation of the let-7 Small Temporal RNA , 2001, Science.

[21]  Fei Li,et al.  MicroRNA identification based on sequence and structure alignment , 2005, Bioinform..

[22]  Mihaela Zavolan,et al.  Identification of Clustered Micrornas Using an Ab Initio Prediction Method , 2022 .

[23]  B. Patterson,et al.  Letter to the editor. , 2018, Journal of professional nursing : official journal of the American Association of Colleges of Nursing.

[24]  C. Sander,et al.  Identification of microRNAs of the herpesvirus family , 2005, Nature Methods.

[25]  V. Ambros,et al.  An Extensive Class of Small RNAs in Caenorhabditis elegans , 2001, Science.

[26]  T. Tuschl,et al.  Identification of Novel Genes Coding for Small Expressed RNAs , 2001, Science.

[27]  D. Bartel,et al.  Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. , 2004, Molecular cell.

[28]  G. Hannon,et al.  C . elegans involved in developmental timing in Dicer functions in RNA interference and in synthesis of small RNA , 2001 .

[29]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[30]  V. Kim,et al.  The nuclear RNase III Drosha initiates microRNA processing , 2003, Nature.

[31]  B. Cullen,et al.  Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs. , 2004, RNA.

[32]  G. Rubin,et al.  Computational identification of Drosophila microRNA genes , 2003, Genome Biology.

[33]  Byoung-Tak Zhang,et al.  Human microRNA prediction through a probabilistic co-learning model of sequence and structure , 2005, Nucleic acids research.

[34]  F. Slack,et al.  Control of developmental timing by small temporal RNAs: a paradigm for RNA‐mediated regulation of gene expression , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[35]  Peter F. Stadler,et al.  Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data , 2006, ISMB.