Signaling network prediction by the Ontology Fingerprint enhanced Bayesian network

BackgroundDespite large amounts of available genomic and proteomic data, predicting the structure and response of signaling networks is still a significant challenge. While statistical method such as Bayesian network has been explored to meet this challenge, employing existing biological knowledge for network prediction is difficult. The objective of this study is to develop a novel approach that integrates prior biological knowledge in the form of the Ontology Fingerprint to infer cell-type-specific signaling networks via data-driven Bayesian network learning; and to further use the trained model to predict cellular responses.ResultsWe applied our novel approach to address the Predictive Signaling Network Modeling challenge of the fourth (2009) Dialog for Reverse Engineering Assessment's and Methods (DREAM4) competition. The challenge results showed that our method accurately captured signal transduction of a network of protein kinases and phosphoproteins in that the predicted protein phosphorylation levels under all experimental conditions were highly correlated (R2 = 0.93) with the observed results. Based on the evaluation of the DREAM4 organizer, our team was ranked as one of the top five best performers in predicting network structure and protein phosphorylation activity under test conditions.ConclusionsBayesian network can be used to simulate the propagation of signals in cellular systems. Incorporating the Ontology Fingerprint as prior biological knowledge allows us to efficiently infer concise signaling network structure and to accurately predict cellular responses.

[1]  M. White,et al.  Ras Interaction with Two Distinct Binding Domains in Raf-1 5 Be Required for Ras Transformation (*) , 1996, The Journal of Biological Chemistry.

[2]  A. Catling,et al.  Rac-PAK Signaling Stimulates Extracellular Signal-Regulated Kinase (ERK) Activation by Regulating Formation of MEK1-ERK Complexes , 2002, Molecular and Cellular Biology.

[3]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[4]  Chiara Sabatti,et al.  Network component analysis: Reconstruction of regulatory signals in biological systems , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[6]  P. Dent,et al.  Ras-induced activation of Raf-1 is dependent on tyrosine phosphorylation , 1996, Molecular and cellular biology.

[7]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[8]  D. Pe’er Bayesian Network Analysis of Signaling Networks: A Primer , 2005, Science's STKE.

[9]  D. Heckerman,et al.  A Bayesian Approach to Causal Discovery , 2006 .

[10]  C. Daub,et al.  BMC Systems Biology , 2007 .

[11]  Nancy Cartwright,et al.  From Metaphysics to Method: Comments on Manipulability and the Causal Markov Condition , 2006, The British Journal for the Philosophy of Science.

[12]  J. Jackson,et al.  Structural Requirements for PAK Activation by Rac GTPases* , 1998, The Journal of Biological Chemistry.

[13]  David Maxwell Chickering,et al.  Learning Bayesian Networks is , 1994 .

[14]  C. Marshall,et al.  Requirement of Ras-GTP-Raf complexes for activation of Raf-1 by protein kinase C. , 1998, Science.

[15]  D J Glass,et al.  Differentiation stage-specific inhibition of the Raf-MEK-ERK pathway by Akt. , 1999, Science.

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  L. Loew,et al.  Systems Analysis of Ran Transport , 2002, Science.

[18]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[19]  Ravi Iyengar,et al.  Models of Spatially Restricted Biochemical Reaction Systems* , 2009, Journal of Biological Chemistry.

[20]  R. Iyengar,et al.  Modeling cell signaling networks. , 2004, Biology of the cell.

[21]  D. Lauffenburger,et al.  Networks Inferred from Biochemical Data Reveal Profound Differences in Toll-like Receptor and Inflammatory Signaling between Normal and Transformed Hepatocytes* , 2010, Molecular & Cellular Proteomics.

[22]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[23]  Jason A. Papin,et al.  Reconstruction of cellular signalling networks and analysis of their properties , 2005, Nature Reviews Molecular Cell Biology.

[24]  Ron Bose,et al.  Phosphoproteomic analysis of Her2/neu signaling and inhibition. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Lauffenburger,et al.  Physicochemical modelling of cell signalling pathways , 2006, Nature Cell Biology.

[26]  Steve Horvath,et al.  Glycerol kinase deficiency alters expression of genes involved in lipid metabolism, carbohydrate metabolism, and insulin signaling , 2007, European Journal of Human Genetics.

[27]  Herbert A. Simon,et al.  Causality in Bayesian Belief Networks , 1993, UAI.

[28]  Drew Endy,et al.  Stimulus Design for Model Selection and Validation in Cell Signaling , 2008, PLoS Comput. Biol..

[29]  Michael Boehnke,et al.  Evaluation of genome-wide association study results through development of ontology fingerprints , 2009, Bioinform..

[30]  Nancy Cartwright,et al.  Against Modularity, the Causal Markov Condition, and Any Link Between the Two: Comments on Hausman and Woodward , 2002, The British Journal for the Philosophy of Science.

[31]  D. Lauffenburger,et al.  Discrete logic modelling as a means to link protein signalling networks with functional analysis of mammalian signal transduction , 2009, Molecular systems biology.

[32]  Ron Shamir,et al.  A Probabilistic Methodology for Integrating Knowledge and Experiments on Biological Networks , 2006, J. Comput. Biol..

[33]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[34]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[35]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[36]  Eugene Charniak,et al.  Bayesian Networks without Tears , 1991, AI Mag..

[37]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[38]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[39]  Guenther Witzany,et al.  Biocommunication and natural genome editing. , 2009, World journal of biological chemistry.

[40]  T. Roberts,et al.  Tangled Webs: Evidence of Cross-Talk Between c-Raf-1 and Akt , 1999, Science's STKE.

[41]  G. Church,et al.  Genome-Scale Metabolic Model of Helicobacter pylori 26695 , 2002, Journal of bacteriology.

[42]  K. Lu Pinning down cell signaling, cancer and Alzheimer's disease. , 2004, Trends in biochemical sciences.

[43]  D. Lauffenburger,et al.  A Systems Model of Signaling Identifies a Molecular Basis Set for Cytokine-Induced Apoptosis , 2005, Science.

[44]  K. Moelling,et al.  Phosphorylation and regulation of Raf by Akt (protein kinase B). , 1999, Science.

[45]  Jacob J Hughey,et al.  Computational modeling of mammalian signaling networks , 2010, Wiley interdisciplinary reviews. Systems biology and medicine.

[46]  David Madigan,et al.  A Note on Equivalence Classes of Directed Acyclic Independence Graphs , 1993, Probability in the Engineering and Informational Sciences.

[47]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[48]  Peter J. Woolf,et al.  Bayesian analysis of signaling networks governing embryonic stem cell fate decisions , 2005, Bioinform..

[49]  Douglas A. Lauffenburger,et al.  Common effector processing mediates cell-specific responses to stimuli , 2007, Nature.

[50]  R. Iyengar,et al.  Signaling Networks The Origins of Cellular Multitasking , 2000, Cell.

[51]  Baoju Zhang,et al.  Discovery of lung cancer pathways using Reverse Phase Protein Microarray and prior-knowledge based Bayesian networks , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[52]  Eberhard O Voit,et al.  Theoretical Biology and Medical Modelling , 2022 .

[53]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[54]  B. Palsson,et al.  Reconstructing metabolic flux vectors from extreme pathways: defining the alpha-spectrum. , 2003, Journal of theoretical biology.

[55]  B. Kholodenko Cell-signalling dynamics in time and space , 2006, Nature Reviews Molecular Cell Biology.

[56]  D. Lauffenburger,et al.  The Response of Human Epithelial Cells to TNF Involves an Inducible Autocrine Cascade , 2006, Cell.

[57]  Akhilesh Pandey,et al.  Comparisons of tyrosine phosphorylated proteins in cells expressing lung cancer-specific alleles of EGFR and KRAS , 2008, Proceedings of the National Academy of Sciences.

[58]  B. Palsson,et al.  Reconstructing metabolic flux vectors from extreme pathways: defining the α-spectrum , 2003 .

[59]  P. Ja,et al.  Inference in Bayesian Networks , 1999, AI Mag..

[60]  Eberhard O. Voit,et al.  Biochemical systems theory and metabolic control theory: 1. fundamental similarities and differences , 1987 .

[61]  A. Califano,et al.  Dialogue on Reverse‐Engineering Assessment and Methods , 2007, Annals of the New York Academy of Sciences.

[62]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[63]  T. Jaakkola,et al.  Bayesian Network Approach to Cell Signaling Pathway Modeling , 2002, Science's STKE.

[64]  Julio Saez-Rodriguez,et al.  Flexible informatics for linking experimental data to mathematical models via DataRail , 2008, Bioinform..

[65]  George Casella,et al.  Implementations of the Monte Carlo EM Algorithm , 2001 .

[66]  Korbinian Strimmer,et al.  Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process , 2007, BMC Bioinformatics.