Incorporating prior information into differential network analysis using non‐paranormal graphical models

Motivation: Understanding how gene regulatory networks change under different cellular states is important for revealing insights into network dynamics. Gaussian graphical models, which assume that the data follow a joint normal distribution, have been used recently to infer differential networks. However, the distributions of the omics data are non‐normal in general. Furthermore, although much biological knowledge (or prior information) has been accumulated, most existing methods ignore the valuable prior information. Therefore, new statistical methods are needed to relax the normality assumption and make full use of prior information. Results: We propose a new differential network analysis method to address the above challenges. Instead of using Gaussian graphical models, we employ a non‐paranormal graphical model that can relax the normality assumption. We develop a principled model to take into account the following prior information: (i) a differential edge less likely exists between two genes that do not participate together in the same pathway; (ii) changes in the networks are driven by certain regulator genes that are perturbed across different cellular states and (iii) the differential networks estimated from multi‐view gene expression data likely share common structures. Simulation studies demonstrate that our method outperforms other graphical model‐based algorithms. We apply our method to identify the differential networks between platinum‐sensitive and platinum‐resistant ovarian tumors, and the differential networks between the proneural and mesenchymal subtypes of glioblastoma. Hub nodes in the estimated differential networks rediscover known cancer‐related regulator genes and contain interesting predictions. Availability and Implementation: The source code is at https://github.com/Zhangxf‐ccnu/pDNA Contact: szuouyl@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Andrea Califano,et al.  Rewiring makes the difference , 2011, Molecular systems biology.

[2]  Larry A. Wasserman,et al.  The Nonparanormal: Semiparametric Estimation of High Dimensional Undirected Graphs , 2009, J. Mach. Learn. Res..

[3]  Pradeep Ravikumar,et al.  Graphical models via univariate exponential family distributions , 2013, J. Mach. Learn. Res..

[4]  Larry A. Wasserman,et al.  High Dimensional Semiparametric Gaussian Copula Graphical Models. , 2012, ICML 2012.

[5]  Xing-Ming Zhao,et al.  Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information , 2012, Bioinform..

[6]  Sourav Bandyopadhyay,et al.  Rewiring of Genetic Networks in Response to DNA Damage , 2010, Science.

[7]  Nico Pfeifer,et al.  Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery , 2015, Bioinform..

[8]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[9]  S. Dilruba,et al.  Platinum-based drugs: past, present and future , 2016, Cancer Chemotherapy and Pharmacology.

[10]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[11]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[12]  Quanquan Gu,et al.  Identifying gene regulatory network rewiring using latent differential graphical models , 2016, Nucleic acids research.

[13]  Luonan Chen,et al.  Part mutual information for quantifying direct associations in networks , 2016, Proceedings of the National Academy of Sciences.

[14]  Su-In Lee,et al.  Node-based learning of multiple Gaussian graphical models , 2013, J. Mach. Learn. Res..

[15]  Yin Liu,et al.  Incorporating prior knowledge into Gene Network Study , 2013, Bioinform..

[16]  Marco Grzegorczyk,et al.  Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks , 2006, Bioinform..

[17]  Cheng Li,et al.  Lessons from a decade of integrating cancer copy number alterations with gene expression profiles , 2012, Briefings Bioinform..

[18]  D. Brat,et al.  Transforming Fusions of FGFR and TACC Genes in Human Glioblastoma , 2012, Science.

[19]  Rahul Kumar,et al.  CancerDR: Cancer Drug Resistance Database , 2013, Scientific Reports.

[20]  Xingming Zhao,et al.  A survey on computational approaches to identifying disease biomarkers based on molecular networks. , 2014, Journal of theoretical biology.

[21]  Su-In Lee,et al.  Pathway Graphical Lasso , 2015, AAAI.

[22]  A. Barabasi,et al.  Network link prediction by global silencing of indirect correlations , 2013, Nature Biotechnology.

[23]  Xing-Ming Zhao,et al.  Network-based biomarkers for complex diseases. , 2014, Journal of theoretical biology.

[24]  Xing-Ming Zhao,et al.  NARROMI: a noise and redundancy reduction technique improves accuracy of gene regulatory network inference , 2013, Bioinform..

[25]  Maurizio Botta,et al.  An update on dual Src/Abl inhibitors. , 2012, Future Medicinal Chemistry.

[26]  Susmita Datta,et al.  A statistical framework for differential network analysis from microarray data , 2010, BMC Bioinformatics.

[27]  V. Beral,et al.  Rethinking ovarian cancer II: reducing mortality from high-grade serous ovarian cancer , 2015, Nature Reviews Cancer.

[28]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[29]  Mathieu Blanchette,et al.  The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts , 2014, Genome Biology.

[30]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[31]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[32]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[33]  A. G. de la Fuente From 'differential expression' to 'differential networking' - identification of dysfunctional regulatory networks in diseases. , 2010, Trends in genetics : TIG.

[34]  Su-In Lee,et al.  Learning graphical models with hubs , 2014, J. Mach. Learn. Res..

[35]  Kim-Anh Do,et al.  DINGO: differential network analysis in genomics , 2015, Bioinform..

[36]  Hong Yan,et al.  Differential network analysis from cross-platform gene expression data , 2016, Scientific Reports.

[37]  Su-In Lee,et al.  Identifying Network Perturbation in Cancer , 2016, bioRxiv.

[38]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[39]  Xingming Zhao,et al.  Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks , 2014, Nucleic acids research.

[40]  Muriel Médard,et al.  Network deconvolution as a general method to distinguish direct dependencies in networks , 2013, Nature Biotechnology.

[41]  Larry A. Wasserman,et al.  Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models , 2010, NIPS.

[42]  H. Zou,et al.  Regularized rank-based estimation of high-dimensional nonparanormal graphical models , 2012, 1302.3082.

[43]  T. Ideker,et al.  Differential network biology , 2012, Molecular systems biology.

[44]  Andrew H. Beck,et al.  EMDomics: a robust and powerful method for the identification of genes differentially expressed between heterogeneous classes , 2015, Bioinform..

[45]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[46]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[47]  Ulf Leser,et al.  Comparative assessment of differential network analysis methods , 2016, Briefings Bioinform..

[48]  Xiao-Fei Zhang,et al.  Determining minimum set of driver nodes in protein-protein interaction networks , 2015, BMC Bioinformatics.

[49]  T. Cai,et al.  Direct estimation of differential networks. , 2014, Biometrika.

[50]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[51]  Andrew E. Sloan,et al.  Molecular Subtypes of Glioblastoma Are Relevant to Lower Grade Glioma , 2014, PloS one.

[52]  Juan F. Poyatos,et al.  Rewiring of Genetic Networks in Response to Modification of Genetic Background , 2014, bioRxiv.

[53]  A. Palma,et al.  Metabolic/Proteomic Signature Defines Two Glioblastoma Subtypes With Different Clinical Outcome , 2016, Scientific Reports.

[54]  Daniel Marbach,et al.  Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases , 2016, Nature Methods.

[55]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[56]  Albert-László Barabási,et al.  Scale-Free Networks: A Decade and Beyond , 2009, Science.

[57]  Cun-Hui Zhang,et al.  A group bridge approach for variable selection , 2009, Biometrika.

[58]  Ruibin Xi,et al.  Differential Network Analysis via the Lasso Penalized D-Trace Loss , 2015, 1511.09188.

[59]  Lei Wang,et al.  FGFR1/3 Tyrosine Kinase Fusions Define a Unique Molecular Subtype of Non–Small Cell Lung Cancer , 2014, Clinical Cancer Research.

[60]  D. Haussler,et al.  The Somatic Genomic Landscape of Glioblastoma , 2013, Cell.

[61]  J. Xia,et al.  OCGene: a database of experimentally verified ovarian cancer-related genes with precomputed regulation information , 2015, Cell Death and Disease.

[62]  S. Gabriel,et al.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. , 2010, Cancer cell.

[63]  P. Drew,et al.  The analysis of doxorubicin resistance in human breast cancer cells using antibody microarrays , 2006, Molecular Cancer Therapeutics.