Prediction of microRNAs involved in immune system diseases through network based features

MicroRNAs are a class of small non-coding regulatory RNA molecules that modulate the expression of several genes at post-transcriptional level and play a vital role in disease pathogenesis. Recent research shows that a range of miRNAs are involved in the regulation of immunity and its deregulation results in immune mediated diseases such as cancer, inflammation and autoimmune diseases. Computational discovery of these immune miRNAs using a set of specific features is highly desirable. In the current investigation, we present a SVM based classification system which uses a set of novel network based topological and motif features in addition to the baseline sequential and structural features to predict immune specific miRNAs from other non-immune miRNAs. The classifier was trained and tested on a balanced set of equal number of positive and negative examples to show the discriminative power of our network features. Experimental results show that our approach achieves an accuracy of 90.2% and outperforms the classification accuracy of 63.2% reported using the traditional miRNA sequential and structural features. The proposed classifier was further validated with two immune disease sub-class datasets related to multiple sclerosis microarray data and psoriasis RNA-seq data with higher accuracy. These results indicate that our classifier which uses network and motif features along with sequential and structural features will lead to significant improvement in classifying immune miRNAs and hence can be applied to identify other specific classes of miRNAs as an extensible miRNA classification system.

[1]  C. Croce Causes and consequences of microRNA dysregulation in cancer , 2009, Nature Reviews Genetics.

[2]  Xiaowei Wang miRDB: a microRNA target prediction and functional annotation database with a wiki interface. , 2008, RNA.

[3]  Arturo Casadevall,et al.  Host-Pathogen Interactions: Basic Concepts of Microbial Commensalism, Colonization, Infection, and Disease , 2000, Infection and Immunity.

[4]  Jun-ichi Satoh,et al.  Comprehensive analysis of human microRNA target networks , 2011, BioData Mining.

[5]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[6]  Hans-Peter Lenhof,et al.  Multiple Sclerosis: MicroRNA Expression Profiles Accurately Differentiate Patients with Relapsing-Remitting Disease from Healthy Controls , 2009, PloS one.

[7]  Wayne Tam,et al.  MicroRNAs of the immune system , 2010, Annals of the New York Academy of Sciences.

[8]  Qianjin Feng,et al.  Enhanced Performance of Brain Tumor Classification via Tumor Region Augmentation and Partition , 2015, PloS one.

[9]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[10]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[11]  Jan Krüger,et al.  RNAhybrid: microRNA target prediction easy, fast and flexible , 2006, Nucleic Acids Res..

[12]  Chun-Wei Hsu,et al.  Characterization of microRNA‐regulated protein‐protein interaction network , 2008, Proteomics.

[13]  Jiuyong Li,et al.  Identifying miRNAs, targets and functions , 2012, Briefings Bioinform..

[14]  Xiaolong Wang,et al.  miRNA-dis: microRNA precursor identification based on distance structure status pairs. , 2015, Molecular bioSystems.

[15]  H. Dweep,et al.  miRWalk2.0: a comprehensive atlas of microRNA-target interactions , 2015, Nature Methods.

[16]  Sabine C. Mueller,et al.  miRNAs can be generally associated with human pathologies as exemplified for miR-144* , 2014, BMC Medicine.

[17]  S. Cohen,et al.  microRNA functions. , 2007, Annual review of cell and developmental biology.

[18]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[19]  Weixiong Zhang,et al.  Deep sequencing of small RNAs from human skin reveals major alterations in the psoriasis miRNAome. , 2011, Human molecular genetics.

[20]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[21]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[22]  Archana Prabahar and Jeyakumar Natarajan,et al.  Prediction of miRNA in Human MHC that Encodes Different Immunological Functions Using Support Vector Machines , 2013 .

[23]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[24]  Yun Xiao,et al.  Prioritizing Candidate Disease miRNAs by Topological Features in the miRNA Target–Dysregulated Network: Case Study of Prostate Cancer , 2011, Molecular Cancer Therapeutics.

[25]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[26]  Louise C. Showe,et al.  Bioinformatics Original Paper Combining Multi-species Genomic Data for Microrna Identification Using a Naı¨ve Bayes Classifier , 2022 .

[27]  Yadong Wang,et al.  miR2Disease: a manually curated database for microRNA deregulation in human disease , 2008, Nucleic Acids Res..

[28]  Mingzhi Liao,et al.  Predicting human microRNA precursors based on an optimized feature subset generated by GA-SVM. , 2011, Genomics.

[29]  H. Horvitz,et al.  MicroRNA expression profiles classify human cancers , 2005, Nature.

[30]  C. Mattingly,et al.  The Comparative Toxicogenomics Database (CTD). , 2003, Environmental health perspectives.

[31]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[32]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[33]  Karin Breuer,et al.  InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation , 2012, Nucleic Acids Res..

[34]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[35]  R. Islam,et al.  MiRANN: a reliable approach for improved classification of precursor microRNA using Artificial Neural Network model. , 2012, Genomics.

[36]  Lina Chen,et al.  Identification of breast cancer patients based on human signaling network motifs , 2013, Scientific Reports.

[37]  Mark Gerstein,et al.  Information assessment on predicting protein-protein interactions , 2004, BMC Bioinformatics.

[38]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[39]  M. A. Shoorehdeli,et al.  HomoTarget: a new algorithm for prediction of microRNA targets in Homo sapiens. , 2013, Genomics.

[40]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[41]  Doron Betel,et al.  The microRNA.org resource: targets and expression , 2007, Nucleic Acids Res..

[42]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[43]  Holger Sültmann,et al.  Circulating miRNAs are correlated with tumor progression in prostate cancer , 2011, International journal of cancer.

[44]  Min Zhu,et al.  Identifying functional miRNA-mRNA regulatory modules with correspondence latent dirichlet allocation , 2010, Bioinform..

[45]  Wen-Hsiung Li,et al.  MicroRNA regulation of human protein protein interaction network. , 2007, RNA.

[46]  Peng Jiang,et al.  MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features , 2007, Nucleic Acids Res..

[47]  Byoung-Tak Zhang,et al.  Human microRNA prediction through a probabilistic co-learning model of sequence and structure , 2005, Nucleic acids research.

[48]  Mathieu Blanchette,et al.  Computational prediction of the localization of microRNAs within their pre-miRNA , 2013, Nucleic acids research.

[49]  Junhyong Kim,et al.  Self Containment, a Property of Modular RNA Structures, Distinguishes microRNAs , 2008, PLoS Comput. Biol..

[50]  Louxin Zhang,et al.  Counting motifs in the human interactome , 2013, Nature Communications.

[51]  Kan He,et al.  The prediction of the porcine pre-microRNAs in genome-wide based on support vector machine (SVM) and homology searching , 2012, BMC Genomics.

[52]  Ming Zhao,et al.  MicroRNAs regulate immune system via multiple targets. , 2014, Discovery medicine.

[53]  S. Ahmed,et al.  MicroRNA, a new paradigm for understanding immunoregulation, inflammation, and autoimmune diseases. , 2011, Translational research : the journal of laboratory and clinical medicine.

[54]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[55]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[56]  Nectarios Koziris,et al.  DIANA-microT web server: elucidating microRNA functions through target prediction , 2009, Nucleic Acids Res..

[57]  Zihua Hu,et al.  Identification of interacting transcription factors regulating tissue gene expression in human , 2010, BMC Genomics.

[58]  Lubo Popel,et al.  Combining the Principal Components Method with Di erent Learning Algorithms , 2000 .

[59]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[60]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[61]  Peer Bork,et al.  KEGG Atlas mapping for global analysis of metabolic pathways , 2008, Nucleic Acids Res..

[62]  Yvonne Tay,et al.  A Pattern-Based Method for the Identification of MicroRNA Binding Sites and Their Corresponding Heteroduplexes , 2006, Cell.

[63]  Qizhi Yao,et al.  MicroRNAs: Control and Loss of Control in Human Physiology and Disease , 2009, World Journal of Surgery.

[64]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[65]  Ashwin Srinivasan,et al.  Prediction of novel precursor miRNAs using a context-sensitive hidden Markov model (CSHMM) , 2010, BMC Bioinformatics.

[66]  Zelmina Lubovac,et al.  Combining functional and topological properties to identify core modules in protein interaction networks , 2006, Proteins.

[67]  Jing Chen,et al.  Disease candidate gene identification and prioritization using protein interaction networks , 2009, BMC Bioinformatics.

[68]  Sebastian Wernicke,et al.  FANMOD: a tool for fast network motif detection , 2006, Bioinform..

[69]  Marie-Paule Lefranc,et al.  IMGT, the international ImMunoGeneTics database , 1999, Nucleic Acids Res..