Knowledge-based Subtractive Integration of mRNA and miRNA Expression Profiles to Differentiate Myelodysplastic Syndrome

The goal of our work is to integrate conventional mRNA expression profiles with miRNA expressions using the knowledge of their validated or predicted interactions in order to improve class prediction in genetically determined diseases. The raw mRNA and miRNA expression features become enriched or replaced by new aggregated features that model the mRNA-miRNA interaction. The proposed subtractive integration method is directly motivated by the inhibition/degradation models of gene expression regulation. The method aggregates mRNA and miRNA expressions by subtracting a proportion of miRNA expression values from their respective target mRNAs. The method is used to model the outcome or development of myelodysplastic syndrome, a blood cell production disease often progressing to leukemia. The reached results demonstrate that the integration improves classification performance when dealing with mRNA and miRNA profiles of comparable predictive power.

[1]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[2]  V. Ambros,et al.  The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14 , 1993, Cell.

[3]  Jay L. Brewster,et al.  The microarray revolution: Perspectives from educators , 2004, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[4]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[5]  Ju Han Kim,et al.  Synergistic effect of different levels of genomic data for cancer clinical outcome prediction , 2012, J. Biomed. Informatics.

[6]  Nahum Sonenberg,et al.  The mechanics of miRNA-mediated gene silencing: a look under the hood of miRISC , 2012, Nature Structural &Molecular Biology.

[7]  Norbert Gretz,et al.  miRWalk - Database: Prediction of possible miRNA binding sites by "walking" the genes of three genomes , 2011, J. Biomed. Informatics.

[8]  Ryan D. Morin,et al.  Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. , 2008, BioTechniques.

[9]  Juan Liu,et al.  A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules , 2011, Bioinform..

[10]  H. Votavova,et al.  Distinctive microRNA expression profiles in CD34+ bone marrow cells from patients with myelodysplastic syndrome , 2011, European Journal of Human Genetics.

[11]  Tu Bao Ho,et al.  Finding microRNA regulatory modules in human genome using rule induction , 2008, BMC Bioinformatics.

[12]  F. Welch,et al.  Causes and Consequences , 2017, Nature.

[13]  T. Okamoto,et al.  Evaluation of online miRNA resources for biomedical applications , 2012, Genes to cells : devoted to molecular & cellular mechanisms.

[14]  Shi-Hua Zhang,et al.  Identifying multi-layer gene regulatory modules from multi-dimensional genomic data , 2012, Bioinform..

[15]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[16]  Christian A. Rees,et al.  Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  George A Calin,et al.  mRNA/microRNA gene expression profile in microsatellite unstable colorectal cancer , 2007, Molecular Cancer.

[18]  Panayiotis V. Benos,et al.  mirConnX: condition-specific mRNA-microRNA network integrator , 2011, Nucleic Acids Res..

[19]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[20]  Danish Sayed,et al.  MicroRNAs in development and disease. , 2011, Physiological reviews.

[21]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[22]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[23]  D. Starczynowski,et al.  Deregulation of microRNAs in myelodysplastic syndrome , 2012, Leukemia.

[24]  Xiaowei Wang,et al.  Sequence analysis Prediction of both conserved and nonconserved microRNA targets in animals , 2007 .

[25]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[26]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[27]  Eva Budinska,et al.  A distinct expression of various gene subsets in CD34+ cells from patients with early and advanced myelodysplastic syndrome. , 2010, Leukemia research.

[28]  Daniela M Witten,et al.  Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data , 2009, Statistical applications in genetics and molecular biology.

[29]  Xinxia Peng,et al.  Computational identification of hepatitis C virus associated microRNA-mRNA regulatory modules in human livers , 2009, BMC Genomics.

[30]  C. Croce Causes and consequences of microRNA dysregulation in cancer , 2009, Nature Reviews Genetics.