Comparison and Integration of Target Prediction Algorithms for microRNA Studies

microRNAs are short RNA fragments that have the capacity of regulating hundreds of target gene expression. Currently, due to lack of high-throughput experimental methods for miRNA target identification, a collection of computational target prediction approaches have been developed. However, these approaches deal with different features or factors are weighted differently resulting in diverse range of predictions. The prediction accuracy remains uncertain. In this paper, three commonly used target prediction algorithms are evaluated and further integrated using algorithm combination, ranking aggregation and Bayesian Network classification. Our results revealed that each individual prediction algorithm displays its advantages as was shown on different test data sets. Among different integration strategies, the application of Bayesian Network classifier on the features calculated from multiple prediction methods significantly improved target prediction accuracy.

[1]  Anton J. Enright,et al.  MicroRNA targets in Drosophila , 2003, Genome Biology.

[2]  A. Hatzigeorgiou,et al.  A guide through present computational approaches for the identification of mammalian microRNA targets , 2006, Nature Methods.

[3]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[4]  Fons J. Verbeek,et al.  miRNA target prediction through mining of miRNA relationships , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[5]  C. Barbato,et al.  Computational Challenges in miRNA Target Predictions: To Be or Not to Be a True Target? , 2009, Journal of biomedicine & biotechnology.

[6]  Byoung-Tak Zhang,et al.  miTarget: microRNA target gene prediction using a support vector machine , 2006, BMC Bioinformatics.

[7]  Vasyl Pihur,et al.  RankAggreg, an R package for weighted rank aggregation , 2009, BMC Bioinformatics.

[8]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[9]  Anton J. Enright,et al.  Prediction of microRNA targets. , 2007, Drug discovery today.

[10]  Mihaela Zavolan,et al.  Inference of miRNA targets using evolutionary conservation and pathway analysis , 2007, BMC Bioinformatics.

[11]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[12]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[13]  Vasyl Pihur,et al.  Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach , 2007, Bioinform..

[14]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[15]  Vasudevan Seshadri,et al.  Translational control by the 3'-UTR: the ends specify the means. , 2003, Trends in biochemical sciences.

[16]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[17]  C. Spillane,et al.  Prediction and validation of microRNA targets in animal genomes , 2007, Journal of Biosciences.

[18]  A. Hatzigeorgiou,et al.  TarBase: A comprehensive database of experimentally supported animal microRNA targets. , 2005, RNA.

[19]  P. Schuster,et al.  Complete suboptimal folding of RNA and the stability of secondary structures. , 1999, Biopolymers.

[20]  Louise C. Showe,et al.  Naïve Bayes for microRNA target predictions - machine learning for microRNA targets , 2007, Bioinform..

[21]  Ian Witten,et al.  Data Mining , 2000 .

[22]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[23]  Jie Ding,et al.  Integration of Ranked Lists via Cross Entropy Monte Carlo with Applications to mRNA and microRNA Studies , 2009, Biometrics.

[24]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.

[25]  Martin Reczko,et al.  The database of experimentally supported targets: a functional update of TarBase , 2008, Nucleic Acids Res..