Knowledge-fused differential dependency network models for detecting significant rewiring in biological networks

BackgroundModeling biological networks serves as both a major goal and an effective tool of systems biology in studying mechanisms that orchestrate the activities of gene products in cells. Biological networks are context-specific and dynamic in nature. To systematically characterize the selectively activated regulatory components and mechanisms, modeling tools must be able to effectively distinguish significant rewiring from random background fluctuations. While differential networks cannot be constructed by existing knowledge alone, novel incorporation of prior knowledge into data-driven approaches can improve the robustness and biological relevance of network inference. However, the major unresolved roadblocks include: big solution space but a small sample size; highly complex networks; imperfect prior knowledge; missing significance assessment; and heuristic structural parameter learning.ResultsTo address these challenges, we formulated the inference of differential dependency networks that incorporate both conditional data and prior knowledge as a convex optimization problem, and developed an efficient learning algorithm to jointly infer the conserved biological network and the significant rewiring across different conditions. We used a novel sampling scheme to estimate the expected error rate due to “random” knowledge. Based on that scheme, we developed a strategy that fully exploits the benefit of this data-knowledge integrated approach. We demonstrated and validated the principle and performance of our method using synthetic datasets. We then applied our method to yeast cell line and breast cancer microarray data and obtained biologically plausible results. The open-source R software package and the experimental data are freely available at http://www.cbil.ece.vt.edu/software.htm.ConclusionsExperiments on both synthetic and real data demonstrate the effectiveness of the knowledge-fused differential dependency network in revealing the statistically significant rewiring in biological networks. The method efficiently leverages data-driven evidence and existing biological knowledge while remaining robust to the false positive edges in the prior knowledge. The identified network rewiring events are supported by previous studies in the literature and also provide new mechanistic insight into the biological systems. We expect the knowledge-fused differential dependency network analysis, together with the open-source R package, to be an important and useful bioinformatics tool in biological network analyses.

[1]  L. Migliore,et al.  Mutation Research / Fundamental and Molecular Mechanisms of Mutagenesis , 2014 .

[2]  K. Shiozaki,et al.  Yeast signaling pathways in the oxidative stress response. , 2005, Mutation research.

[3]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[4]  Rune Linding,et al.  Navigating cancer network attractors for tumor-specific therapy , 2012, Nature Biotechnology.

[5]  N Jones,et al.  Regulation of yAP‐1 nuclear localization in response to oxidative stress , 1997, The EMBO journal.

[6]  P. Tseng Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization , 2001 .

[7]  K. Murphy,et al.  Bcl-2 inhibits Bax translocation from cytosol to mitochondria during drug-induced apoptosis of human tumor cells , 2000, Cell Death and Differentiation.

[8]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[9]  Zoubin Ghahramani,et al.  Modeling T-cell activation using gene expression profiling and state-space models , 2004, Bioinform..

[10]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[11]  Margaret Werner-Washburne,et al.  A multiple network learning approach to capture system-wide condition-specific responses , 2011, Bioinform..

[12]  Ye Tian,et al.  Knowledge-guided differential dependency network learning for detecting structural changes in biological networks , 2011, BCB '11.

[13]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[14]  Frank Emmert-Streib,et al.  Assessment Method for a Power Analysis to Identify Differentially Expressed Pathways , 2012, PloS one.

[15]  Antonio Reverter,et al.  Beyond differential expression: the quest for causal mutations and effector molecules , 2012, BMC Genomics.

[16]  H. Brooks,et al.  Expression of osmotic stress-related genes in tissues of normal and hyposmotic rats. , 2003, American journal of physiology. Renal physiology.

[17]  Robert Clarke,et al.  Differential dependency network analysis to identify condition-specific topological changes in biological networks , 2009, Bioinform..

[18]  D. Agard,et al.  Estrogen receptor pathways to AP-1 , 2000, The Journal of Steroid Biochemistry and Molecular Biology.

[19]  S. Horvath,et al.  Variations in DNA elucidate molecular networks that cause disease , 2008, Nature.

[20]  Amy C. Kelly,et al.  Saccharomyces cerevisiae , 2013, Prion.

[21]  Horst Bunke,et al.  Inexact graph matching for structural pattern recognition , 1983, Pattern Recognit. Lett..

[22]  References , 1971 .

[23]  Allen Chong,et al.  Discovery of estrogen receptor α target genes and response elements in breast tumor cells , 2004, Genome Biology.

[24]  Yue Joseph Wang,et al.  Learning Structural Changes of Gaussian Graphical Models in Controlled Experiments , 2010, UAI.

[25]  C. Klinge Estrogen receptor interaction with estrogen response elements. , 2001, Nucleic acids research.

[26]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[27]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[28]  Yin Liu,et al.  Incorporating prior knowledge into Gene Network Study , 2013, Bioinform..

[29]  Jayaram Raghuram,et al.  Comparative analysis of methods for detecting interacting loci , 2011, BMC Genomics.

[30]  Alexandre d'Aspremont,et al.  Model Selection Through Sparse Max Likelihood Estimation Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data , 2022 .

[31]  Y. Yamasaki,et al.  Estrogen regulation of the insulin-like growth factor I gene transcription involves an AP-1 enhancer. , 1994, The Journal of biological chemistry.

[32]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[33]  A. Quintanilha,et al.  Hydrogen peroxide-induced carbonylation of key metabolic enzymes in Saccharomyces cerevisiae: the involvement of the oxidative stress response regulators Yap1 and Skn7. , 2002, Free radical biology & medicine.

[34]  T. Ideker,et al.  Differential network biology , 2012, Molecular systems biology.

[35]  C. Grant,et al.  Glutathione and catalase provide overlapping defenses for protection against hydrogen peroxide in the yeast Saccharomyces cerevisiae. , 1998, Biochemical and biophysical research communications.

[36]  A. Genz,et al.  Computation of Multivariate Normal and t Probabilities , 2009 .

[37]  Nir Friedman,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004, Science.

[38]  D J Jamieson,et al.  Oxidative stress responses of the yeast Saccharomyces cerevisiae , 1998, Yeast.

[39]  B. Komm,et al.  Genome-Wide Analysis of Estrogen Receptor α DNA Binding and Tethering Mechanisms Identifies Runx1 as a Novel Tethering Factor in Receptor-Mediated Transcriptional Activation , 2010, Molecular and Cellular Biology.

[40]  C. Borner,et al.  Bcl-2 prolongs cell survival after Bax-induced release of cytochrome c , 1998, Nature.

[41]  Brian P. Dalrymple,et al.  Regulatory impact factors: unraveling the transcriptional regulation of complex traits from expression data , 2010, Bioinform..

[42]  Robert Clarke,et al.  Dynamic modelling of oestrogen signalling and cell fate in breast cancer cells , 2011, Nature Reviews Cancer.

[43]  Q. Wei,et al.  The calcineurin B subunit induces TNF-related apoptosis-inducing ligand (TRAIL) expression via CD11b-NF-κB pathway in RAW264.7 macrophages. , 2012, Biochemical and biophysical research communications.

[44]  BMC Systems Biology , 2007 .

[45]  E. Lander,et al.  Remodeling of yeast genome expression in response to environmental changes. , 2001, Molecular biology of the cell.

[46]  T. Shen,et al.  Elevated extracellular glucose and uncontrolled type 1 diabetes enhance NFAT5 signaling and disrupt the transverse tubular network in mouse skeletal muscle , 2012, Experimental biology and medicine.

[47]  Frank Emmert-Streib,et al.  The Chronic Fatigue Syndrome: A Comparative Pathway Analysis , 2007, J. Comput. Biol..

[48]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[49]  C. Paumi,et al.  The Rho1 GTPase Acts Together With a Vacuolar Glutathione S-Conjugate Transporter to Protect Yeast Cells From Oxidative Stress , 2011, Genetics.

[50]  J. Garin,et al.  Yap1 and Skn7 Control Two Specialized Oxidative Stress Response Regulons in Yeast* , 1999, The Journal of Biological Chemistry.

[51]  Antonio Reverter,et al.  A Differential Wiring Analysis of Expression Data Correctly Identifies the Gene Containing the Causal Mutation , 2009, PLoS Comput. Biol..

[52]  Maria Petrou,et al.  Incorporating prior knowledge in ICA , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[53]  B. Biteau,et al.  Oxidative stress responses in yeast , 2003 .

[54]  B. Shneiderman,et al.  Nuclear envelope dystrophies show a transcriptional fingerprint suggesting disruption of Rb-MyoD pathways in muscle regeneration. , 2006, Brain : a journal of neurology.

[55]  Signal flow between CWI/TOR and CWI/RAS in budding yeast under conditions of oxidative stress and glucose starvation , 2010, Communicative & integrative biology.

[56]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[57]  Andrea Califano,et al.  Rewiring makes the difference , 2011, Molecular systems biology.

[58]  Sourav Bandyopadhyay,et al.  Rewiring of Genetic Networks in Response to DNA Damage , 2010, Science.

[59]  Amr Ahmed,et al.  Recovering time-varying networks of dependencies in social and biological studies , 2009, Proceedings of the National Academy of Sciences.

[60]  J. Bergh,et al.  Definition of clinically distinct molecular subtypes in estrogen receptor-positive breast carcinomas through genomic grade. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[61]  Subha Madhavan,et al.  DDN: a caBIG® analytical tool for differential network analysis , 2011, Bioinform..

[62]  Michael F. Ochs,et al.  Knowledge-based data analysis comes of age , 2010, Briefings Bioinform..

[63]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[64]  Susmita Datta,et al.  A statistical framework for differential network analysis from microarray data , 2010, BMC Bioinformatics.

[65]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[66]  Komudi Singh,et al.  Oxidant-induced cell death mediated by a Rho Gtpase in Saccharomyces cerevisiae. , 2008 .

[67]  Honglin Li,et al.  Apoptosis in the skeletal muscle of untreated children with juvenile dermatomyositis: impact of duration of untreated disease. , 2007, Clinical immunology.