Prediction of microRNA Hairpins using One-Class Support Vector Machines

MicroRNAs(miRNAs) are small molecular non- coding RNAs that have important roles in the post-transcriptional mechanism of animals and plants. They are commonly 21-25 nucleotides (nt) long and derived from 60-90 nt RNA hairpin structures, called miRNA hairpins. A larger amount of sequence segments in the human genome have been computationally identified with such a 60-90 nt hairpin, however a majority of them are not miRNA hairpins. Most computational methods so far for predicting miRNA hairpins were based on a two-class classifier to distinguish between miRNA hairpins and the sequence segments with a hairpin structures. The difficulty of these methods is how to select hairpins as the negative examples of miRNA hairpins in the classifier-training datasets since only few miRNA hairpins are available. Therefore, their classifier may be mis-trained due to some false negative examples of the training dataset. In this paper, we introduce a one-class support vector machine (SVM) method to predict miRNA hairpins from the hairpin structures. Different from existing methods for predicting miRNA hairpins, the one-class SVM model is trained only on the information of the miRNA class. We also illustrated some examples of predicting miRNA hairpins in human chromosomes 10, 15, and 21 where our method overcomes the above disadvantage of existing two- class methods.

[1]  Fei Li,et al.  Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine , 2005, BMC Bioinformatics.

[2]  K. Czaplinski,et al.  Exportin 5 is a RanGTP-dependent dsRNA-binding protein that mediates nuclear export of pre-miRNAs. , 2004, RNA.

[3]  R. Shiekhattar,et al.  The Microprocessor complex mediates the genesis of microRNAs , 2004, Nature.

[4]  Adam Kowalczyk,et al.  One class SVM for yeast regulation prediction , 2002, SKDD.

[5]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[6]  Sam Griffiths-Jones,et al.  The microRNA Registry , 2004, Nucleic Acids Res..

[7]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[8]  V. Ambros The functions of animal microRNAs , 2004, Nature.

[9]  A. Hatzigeorgiou,et al.  A combined computational-experimental approach predicts human microRNA targets. , 2004, Genes & development.

[10]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  Molly Megraw,et al.  Support Vector Machines for Predicting microRNA Hairpins , 2006, BIOCOMP.

[13]  B. Cullen,et al.  Recognition and cleavage of primary microRNA precursors by the nuclear processing enzyme Drosha , 2005, The EMBO journal.

[14]  U. Kutay,et al.  Nuclear Export of MicroRNA Precursors , 2004, Science.

[15]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[16]  Jin-Hua Han,et al.  MicroRNA: Biological and Computational Perspective , 2016, Genomics, proteomics & bioinformatics.

[17]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[18]  Ola R. Snøve,et al.  Reliable prediction of Drosha processing sites improves microRNA gene prediction. , 2007, Bioinformatics.

[19]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[20]  J. Krol,et al.  Structural Features of MicroRNA (miRNA) Precursors and Their Relevance to miRNA Biogenesis and Small Interfering RNA/Short Hairpin RNA Design* , 2004, Journal of Biological Chemistry.

[21]  G. Hannon,et al.  Processing of primary microRNAs by the Microprocessor complex , 2004, Nature.

[22]  G. Church,et al.  Computational and experimental identification of C. elegans microRNAs. , 2003, Molecular cell.

[23]  V. Kim,et al.  The nuclear RNase III Drosha initiates microRNA processing , 2003, Nature.

[24]  Thomas S. Huang,et al.  One-class SVM for learning in image retrieval , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).