Identifying Mammalian MicroRNA Targets Based on Supervised Distance Metric Learning

MicroRNAs (miRNAs) have been emerged as a novel class of endogenous posttranscriptional regulators in a variety of animal and plant species. One challenge facing miRNA research is to accurately identify the target mRNAs, because of the very limited sequence complementarity between miRNAs and their target sites, and the scarcity of experimentally validated targets to guide accurate prediction. In this paper, we propose a new method called SuperMirTar that exploits super vised distance learning to predict miRNA targets. Specifically, we use the experimentally supported miRNA-mRNA pairs as a training set to learn a distance metric function that minimizes the distances between miRNAs and mRNAs with validated interactions, then use the learned function to calculate the distances of test miRNA-mRNA interactions, and those with smaller distances than a predefined threshold are regarded as true interactions. We carry out performance comparison between the proposed approach and seven existing methods on independent datasets; the results show that our method achieves superior performance and can effectively narrow the gap between the number of predicted miRNA targets and the number of experimentally validated ones.

[1]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[2]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.

[3]  B. Frey,et al.  Bayesian Learning of MicroRNA Targets from Sequence and Expression Data , 2006 .

[4]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[5]  Chi-Ying F. Huang,et al.  miRTarBase: a database curates experimentally validated microRNA–target interactions , 2010, Nucleic Acids Res..

[6]  Yoshihiro Yamanishi,et al.  Supervised Bipartite Graph Inference , 2008, NIPS.

[7]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[8]  Charles A. Micchelli,et al.  When is there a representer theorem? Vector versus matrix regularizers , 2008, J. Mach. Learn. Res..

[9]  Pål Sætrom,et al.  Predicting the efficacy of short oligonucleotides in antisense and RNAi experiments with boosted genetic programming , 2004, Bioinform..

[10]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[11]  N. Rajewsky,et al.  Widespread changes in protein synthesis induced by microRNAs , 2008, Nature.

[12]  Brendan J. Frey,et al.  Comparing Sequence and Expression for Predicting microRNA Targets Using GenMIR3 , 2007, Pacific Symposium on Biocomputing.

[13]  Brendan J. Frey,et al.  Detecting MicroRNA Targets by Linking Sequence, MicroRNA and Gene Expression Data , 2006, RECOMB.

[14]  Hui Liu,et al.  Detecting microarray data supported microRNA-mRNA interactions , 2010, Int. J. Data Min. Bioinform..

[15]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[16]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[17]  A. Hatzigeorgiou,et al.  A combined computational-experimental approach predicts human microRNA targets. , 2004, Genes & development.

[18]  Dmitrij Frishman,et al.  TargetSpy: a supervised machine learning approach for microRNA target prediction , 2010, BMC Bioinformatics.

[19]  Xiaowei Wang,et al.  Sequence analysis Prediction of both conserved and nonconserved microRNA targets in animals , 2007 .

[20]  Dang D. Long,et al.  mirWIP: microRNA target prediction based on microRNA-containing ribonucleoprotein–enriched transcripts , 2008, Nature Methods.

[21]  Louise C. Showe,et al.  Naïve Bayes for microRNA target predictions - machine learning for microRNA targets , 2007, Bioinform..

[22]  A. Hatzigeorgiou,et al.  A guide through present computational approaches for the identification of mammalian microRNA targets , 2006, Nature Methods.

[23]  Xiaowei Wang,et al.  Systematic identification of microRNA functions by combining target prediction and expression profiling , 2006, Nucleic acids research.

[24]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[25]  Martin Reczko,et al.  The database of experimentally supported targets: a functional update of TarBase , 2008, Nucleic Acids Res..

[26]  Brendan J. Frey,et al.  Bayesian Inference of MicroRNA Targets from Sequence and Expression Data , 2007, J. Comput. Biol..

[27]  A. Ballabio,et al.  MicroRNA target prediction by expression analysis of host genes. , 2009, Genome research.

[28]  Martti T. Tammi,et al.  MicroTar: predicting microRNA targets from RNA duplexes , 2006, BMC Bioinformatics.

[29]  Sanghamitra Bandyopadhyay,et al.  TargetMiner: microRNA target prediction with systematic identification of tissue-specific negative examples , 2009, Bioinform..

[30]  Colin N. Dewey,et al.  A Genome-Wide Map of Conserved MicroRNA Targets in C. elegans , 2006, Current Biology.

[31]  Jack J. Dongarra,et al.  JLAPACK-compiling LAPACK Fortran to Java , 1999, Sci. Program..

[32]  Yvonne Tay,et al.  A Pattern-Based Method for the Identification of MicroRNA Binding Sites and Their Corresponding Heteroduplexes , 2006, Cell.

[33]  Vesselin Baev,et al.  MicroInspector: a web tool for detection of miRNA binding sites in an RNA sequence , 2005, Nucleic Acids Res..