Combining Supervised and Unsupervised Learning for Improved miRNA Target Prediction

MicroRNAs (miRNAs) are short non-coding RNAs which bind to mRNAs and regulate their expression. MiRNAs have been found to be associated with initiation and progression of many complex diseases. Investigating miRNAs and their targets can thus help develop new therapies by designing anti-miRNA oligonucleotides. While existing computational approaches can predict miRNA targets, these predictions have low accuracy. In this paper, we propose a two-step approach to refine the results of sequence-based prediction algorithms. The first step, which is based on our previous work, uses an ensemble learning approach that combines multiple existing methods. The second step utilizes support vector machine (SVM) classifiers in one- and two-class modes to infer miRNA-mRNA interactions based on both binding features, as well as network features extracted from gene regulatory network. Experimental results using two real data sets from TCGA indicate that the use of two-class SVM classification significantly improves the precision of miRNA-mRNA prediction.

[1]  Anjali J. Koppal,et al.  Supplementary data: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites , 2010 .

[2]  Louise C. Showe,et al.  Naïve Bayes for microRNA target predictions - machine learning for microRNA targets , 2007, Bioinform..

[3]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[4]  Doron Betel,et al.  The microRNA.org resource: targets and expression , 2007, Nucleic Acids Res..

[5]  Jan Krüger,et al.  RNAhybrid: microRNA target prediction easy, fast and flexible , 2006, Nucleic Acids Res..

[6]  Ola Snøve,et al.  Weighted sequence motifs as an improved seeding step in microRNA target prediction algorithms. , 2005, RNA.

[7]  Jiuyong Li,et al.  Ensemble Methods for MiRNA Target Prediction from Expression Data , 2015, PloS one.

[8]  B. Frey,et al.  Using expression profiling data to identify human microRNA targets , 2007, Nature Methods.

[9]  C. Sander,et al.  Analysis of microRNA-target interactions across diverse cancer types , 2013, Nature Structural &Molecular Biology.

[10]  C. Croce,et al.  Targeting microRNAs in cancer: rationale, strategies and challenges , 2010, Nature Reviews Drug Discovery.

[11]  Xiaowei Wang,et al.  miRDB: an online resource for microRNA target prediction and functional annotations , 2014, Nucleic Acids Res..

[12]  Anders Krogh,et al.  Signatures of RNA binding proteins globally coupled to effective microRNA target sites. , 2010, Genome research.

[13]  Harsh Dweep,et al.  miRWalk database for miRNA-target interactions. , 2014, Methods in molecular biology.

[14]  A. Luttun,et al.  Quantification of miRNA-mRNA Interactions , 2012, PloS one.

[15]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[16]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[17]  Don R. Hush,et al.  Network constraints and multi-objective optimization for one-class classification , 1996, Neural Networks.

[18]  Jiuyong Li,et al.  Inferring microRNA-mRNA causal regulatory relationships from expression data , 2013, Bioinform..

[19]  Nectarios Koziris,et al.  TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support , 2011, Nucleic Acids Res..

[20]  Louise C. Showe,et al.  Learning from positive examples when the negative class is undetermined- microRNA gene identification , 2008, Algorithms for Molecular Biology.

[21]  Behzad Zamani,et al.  Prediction of microRNA target genes using an efficient genetic algorithm-based decision tree , 2015, FEBS open bio.

[22]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[23]  Albert Y. Zomaya,et al.  A Review of Ensemble Methods in Bioinformatics , 2010, Current Bioinformatics.

[24]  Gianluca Bontempi,et al.  minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information , 2008, BMC Bioinformatics.

[25]  Giovanni Seni,et al.  Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions , 2010, Ensemble Methods in Data Mining.

[26]  Enrico Macii,et al.  miREE: miRNA recognition elements ensemble , 2011, BMC Bioinformatics.

[27]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[28]  Sanghamitra Bandyopadhyay,et al.  TargetMiner: microRNA target prediction with systematic identification of tissue-specific negative examples , 2009, Bioinform..

[29]  Mahmood Fathy,et al.  Identifying functional cancer-specific miRNA-mRNA interactions in testicular germ cell tumor. , 2016, Journal of theoretical biology.

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  Brendan J. Frey,et al.  Bayesian Inference of MicroRNA Targets from Sequence and Expression Data , 2007, J. Comput. Biol..

[32]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[33]  Fredrik Johansson,et al.  A comparative study of conservation and variation scores , 2010, BMC Bioinformatics.

[34]  Minghua Deng,et al.  A Lasso regression model for the construction of microRNA-target regulatory networks , 2011, Bioinform..

[35]  Jeffrey A. Thompson,et al.  Common features of microRNA target prediction tools , 2014, Front. Genet..

[36]  Huiqing Liu,et al.  Identifying mRNA targets of microRNA dysregulated in cancer: with application to clear cell Renal Cell Carcinoma , 2010, BMC Systems Biology.

[37]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[38]  Timos K. Sellis,et al.  miRGen 2.0: a database of microRNA genomic information and regulation , 2009, Nucleic Acids Res..

[39]  Gabriele Sales,et al.  MAGIA, a web-based tool for miRNA and Genes Integrated Analysis , 2010, Nucleic Acids Res..

[40]  Kathleen Steinhöfel,et al.  MicroRNA Target Prediction Based Upon Metastable RNA Secondary Structures , 2015, IWBBIO.

[41]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[42]  I. Van der Auwera,et al.  Integrated miRNA and mRNA expression profiling of the inflammatory breast cancer subtype , 2010, British Journal of Cancer.

[43]  Tongbin Li,et al.  miRecords: an integrated resource for microRNA–target interactions , 2008, Nucleic Acids Res..

[44]  Sungroh Yoon,et al.  Ensemble learning can significantly improve human microRNA target prediction. , 2014, Methods.