A Systemic Analysis of Transcriptomic and Epigenomic Data To Reveal Regulation Patterns for Complex Disease

Integrating diverse genomics data can provide a global view of the complex biological processes related to the human complex diseases. Although substantial efforts have been made to integrate different omics data, there are at least three challenges for multi-omics integration methods: (i) How to simultaneously consider the effects of various genomic factors, since these factors jointly influence the phenotypes; (ii) How to effectively incorporate the information from publicly accessible databases and omics datasets to fully capture the interactions among (epi)genomic factors from diverse omics data; and (iii) Until present, the combination of more than two omics datasets has been poorly explored. Current integration approaches are not sufficient to address all of these challenges together. We proposed a novel integrative analysis framework by incorporating sparse model, multivariate analysis, Gaussian graphical model, and network analysis to address these three challenges simultaneously. Based on this strategy, we performed a systemic analysis for glioblastoma multiforme (GBM) integrating genome-wide gene expression, DNA methylation, and miRNA expression data. We identified three regulatory modules of genomic factors associated with GBM survival time and revealed a global regulatory pattern for GBM by combining the three modules, with respect to the common regulatory factors. Our method can not only identify disease-associated dysregulated genomic factors from different omics, but more importantly, it can incorporate the information from publicly accessible databases and omics datasets to infer a comprehensive interaction map of all these dysregulated genomic factors. Our work represents an innovative approach to enhance our understanding of molecular genomic mechanisms underlying human complex diseases.

[1]  D. Nam,et al.  WNT signaling in glioblastoma and therapeutic opportunities , 2016, Laboratory Investigation.

[2]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature Reviews Genetics.

[3]  Anupam Gupta,et al.  Discovering pathways by orienting edges in protein interaction networks , 2010, Nucleic acids research.

[4]  D. Sackerer,et al.  Novel tumor antigens identified by autologous antibody screening of childhood medulloblastoma cDNA libraries , 2003, International journal of cancer.

[5]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[6]  Aedín C. Culhane,et al.  Dimension reduction techniques for the integrative analysis of multi-omics data , 2016, Briefings Bioinform..

[7]  Li Liu,et al.  Decreased Expression of MiRNA-204-5p Contributes to Glioma Progression and Promotes Glioma Cell Growth, Migration and Invasion , 2015, PloS one.

[8]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[9]  G. Zadeh,et al.  A microRNA Link to Glioblastoma Heterogeneity , 2012, Cancers.

[10]  Jian Huang,et al.  Gene network-based cancer prognosis analysis with sparse boosting. , 2012, Genetics research.

[11]  Haiyan Huang,et al.  Review on statistical methods for gene network reconstruction using expression data. , 2014, Journal of theoretical biology.

[12]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[13]  Lei He,et al.  miR-340 suppresses glioblastoma multiforme , 2015, Oncotarget.

[14]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[15]  Daoyang Zhou,et al.  miR-20a mediates temozolomide-resistance in glioblastoma cells via negatively regulating LRIG1 expression. , 2015, Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie.

[16]  Volkhard Helms,et al.  Integrative network-based approach identifies key genetic elements in breast invasive carcinoma , 2015, BMC Genomics.

[17]  Z. Grieg,et al.  Wnt inhibition is dysregulated in gliomas and its re-establishment inhibits proliferation and tumor sphere formation. , 2016, Experimental cell research.

[18]  Tim Beißbarth,et al.  pwOmics: an R package for pathway-based integration of time-series omics data using public database knowledge , 2015, Bioinform..

[19]  J. Sinsheimer,et al.  Expression Quantitative Trait Loci: Replication, Tissue- and Sex-Specificity in Mice , 2010, Genetics.

[20]  Ping Wang,et al.  MiR-152 functions as a tumor suppressor in glioblastoma stem cells by targeting Krüppel-like factor 4. , 2014, Cancer letters.

[21]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[22]  Most Mauluda Akhtar,et al.  Bioinformatic tools for microRNA dissection , 2015, Nucleic acids research.

[23]  G. Bianconi,et al.  Differential network entropy reveals cancer system hallmarks , 2012, Scientific Reports.

[24]  Chun-mei Chen,et al.  Over-expression of ARHI decreases tumor growth, migration, and invasion in human glioma , 2014, Medical Oncology.

[25]  H. Kawami,et al.  Identification of an Alternatively Spliced RNA for the Ras Suppressor RSU-1 in Human Gliomas , 2002, Journal of Neuro-Oncology.

[26]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[27]  B. Kholodenko,et al.  Computational Approaches for Analyzing Information Flow in Biological Networks , 2012, Science Signaling.

[28]  V. Seifert,et al.  Inhibition of tissue factor/protease-activated receptor-2 signaling limits proliferation, migration and invasion of malignant glioma cells , 2010, Neuroscience.

[29]  G. Reifenberger,et al.  Frequent biallelic inactivation and transcriptional silencing of the DIRAS3 gene at 1p31 in oligodendroglial tumors with 1p loss , 2008, International journal of cancer.

[30]  Hao Jiang,et al.  Co-suppression of miR-221/222 cluster suppresses human glioma cell growth by targeting p27kip1 in vitro and in vivo. , 2009, International journal of oncology.

[31]  Xiaochuan Sun,et al.  Tumor suppressor miR-181c attenuates proliferation, invasion, and self-renewal abilities in glioblastoma , 2015, Neuroreport.

[32]  Zachary D. Smith,et al.  DNA methylation: roles in mammalian development , 2013, Nature Reviews Genetics.

[33]  D. Coppola,et al.  Identification and Characterization of Putative Tumor Suppressor NGB, a GTP-Binding Protein That Interacts with the Neurofibromatosis 2 Protein , 2007, Molecular and Cellular Biology.

[34]  C. Brennan,et al.  Recruited Cells Can Become Transformed and Overtake PDGF-Induced Murine Gliomas In Vivo during Tumor Progression , 2011, PloS one.

[35]  Juan F. Poyatos,et al.  The Balance of Weak and Strong Interactions in Genetic Networks , 2011, PloS one.

[36]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[37]  Charlotte Soneson,et al.  Integrative analysis of gene expression and copy number alterations using canonical correlation analysis , 2010, BMC Bioinformatics.

[38]  C. Farber,et al.  Identification of a gene module associated with BMD through the integration of network analysis and genome‐wide association data , 2010, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[39]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature reviews genetics.

[40]  Y. Mori,et al.  miR-203 Inhibits Frizzled-2 Expression via CD82/KAI1 Expression in Human Lung Carcinoma Cells , 2015, PloS one.

[41]  Wei Chen,et al.  FastGGM: An Efficient Algorithm for the Inference of Gaussian Graphical Model in Biological Networks , 2016, PLoS Comput. Biol..

[42]  Bin Wang,et al.  Oncogenic miR-20a and miR-106a enhance the invasiveness of human glioma stem cells by directly targeting TIMP-2 , 2014, Oncogene.

[43]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[44]  Haiyuan Yu,et al.  Network-based methods for human disease gene prediction. , 2011, Briefings in functional genomics.

[45]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[46]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[47]  Jing Wang,et al.  WEB-based GEne SeT AnaLysis Toolkit (WebGestalt): update 2013 , 2013, Nucleic Acids Res..

[48]  S. Pineda,et al.  Integration Analysis of Three Omics Data Using Penalized Regression Methods: An Application to Bladder Cancer , 2015, PLoS genetics.

[49]  Ernest Fraenkel,et al.  SAMNetWeb: identifying condition-specific networks linking signaling and transcription , 2015, Bioinform..

[50]  Judy H. Cho,et al.  Identification of association between disease and multiple markers via sparse partial least‐squares regression , 2011, Genetic epidemiology.

[51]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[52]  Larry A. Wasserman,et al.  The huge Package for High-dimensional Undirected Graph Estimation in R , 2012, J. Mach. Learn. Res..

[53]  Kwanjeera Wanichthanarak,et al.  MetaMapR: pathway independent metabolomic network analysis incorporating unknowns , 2015, Bioinform..

[54]  S. Keleş,et al.  Sparse partial least squares regression for simultaneous dimension reduction and variable selection , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[55]  Giovanni Scardoni,et al.  Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data , 2012, Bioinform..

[56]  Roded Sharan,et al.  SPINE: a framework for signaling-regulatory pathway inference from cause-effect experiments , 2007, ISMB/ECCB.

[57]  Peter Langfelder,et al.  Genetic analysis of DNA methylation and gene expression levels in whole blood of healthy human subjects , 2012, BMC Genomics.

[58]  Jens Timmer,et al.  Networks: On the relation of bi- and multivariate measures , 2015, Scientific Reports.

[59]  Wei Huang,et al.  Integrative genome analysis reveals an oncomir/oncogene cluster regulating glioblastoma survivorship , 2010, Proceedings of the National Academy of Sciences.

[60]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[61]  N. Rajewsky,et al.  The evolution of gene regulation by transcription factors and microRNAs , 2007, Nature Reviews Genetics.

[62]  E. Fraenkel,et al.  Integrating Proteomic, Transcriptional, and Interactome Data Reveals Hidden Components of Signaling and Regulatory Networks , 2009, Science Signaling.

[63]  H. H. Andersen,et al.  MicroRNA Expression Signatures Determine Prognosis and Survival in Glioblastoma Multiforme—a Systematic Overview , 2014, Molecular Neurobiology.

[64]  E. Lander Initial impact of the sequencing of the human genome , 2011, Nature.

[65]  David Warde-Farley,et al.  Dynamic modularity in protein interaction networks predicts breast cancer outcome , 2009, Nature Biotechnology.

[66]  Fabian J. Theis,et al.  Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data , 2011, BMC Systems Biology.

[67]  Mariano J. Alvarez,et al.  Identification of Causal Genetic Drivers of Human Disease through Systems-Level Analysis of Regulatory Networks , 2014, Cell.

[68]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[69]  C. Farber,et al.  Future of Osteoporosis Genetics: Enhancing Genome‐Wide Association Studies , 2009, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.