Network inference using steady-state data and Goldbeter-Koshland kinetics

MOTIVATION Network inference approaches are widely used to shed light on regulatory interplay between molecular players such as genes and proteins. Biochemical processes underlying networks of interest (e.g. gene regulatory or protein signalling networks) are generally nonlinear. In many settings, knowledge is available concerning relevant chemical kinetics. However, existing network inference methods for continuous, steady-state data are typically rooted in statistical formulations, which do not exploit chemical kinetics to guide inference. RESULTS Herein, we present an approach to network inference for steady-state data that is rooted in non-linear descriptions of biochemical mechanism. We use equilibrium analysis of chemical kinetics to obtain functional forms that are in turn used to infer networks using steady-state data. The approach we propose is directly applicable to conventional steady-state gene expression or proteomic data and does not require knowledge of either network topology or any kinetic parameters. We illustrate the approach in the context of protein phosphorylation networks, using data simulated from a recent mechanistic model and proteomic data from cancer cell lines. In the former, the true network is known and used for assessment, whereas in the latter, results are compared against known biochemistry. We find that the proposed methodology is more effective at estimating network topology than methods based on linear models. AVAILABILITY mukherjeelab.nki.nl/CODE/GK_Kinetics.zip CONTACT c.j.oates@warwick.ac.uk; s.mukherjee@nki.nl SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Wei-Po Lee,et al.  Computational methods for discovering gene networks from expression data , 2009, Briefings Bioinform..

[2]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[3]  Sung Ho Yoon,et al.  Ensemble learning of genetic networks from time-series expression data , 2007, Bioinform..

[4]  Chris. J. Oates,et al.  On the relationship between ODEs and DBNs , 2012 .

[5]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[6]  J. Pearl Causal inference in statistics: An overview , 2009 .

[7]  D. Lauffenburger,et al.  Input–output behavior of ErbB signaling pathways as revealed by a mass action model trained against dynamic data , 2009, Molecular systems biology.

[8]  Erwin P. Gianchandani,et al.  Correction: Dynamic Analysis of Integrated Signaling, Metabolic, and Regulatory Networks , 2008, PLoS Computational Biology.

[9]  Sach Mukherjee,et al.  Structural inference using nonlinear dynamics , 2012 .

[10]  D. Bernardo,et al.  A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches , 2009, Cell.

[11]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[12]  Korbinian Strimmer,et al.  Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process , 2007, BMC Bioinformatics.

[13]  J. Rosenthal,et al.  Harris recurrence of Metropolis-within-Gibbs and trans-dimensional Markov chains , 2006, math/0702412.

[14]  K. Coombes,et al.  A Technical Assessment of the Utility of Reverse Phase Protein Arrays for the Study of the Functional Proteome in Non-microdissected Human Breast Cancers , 2010, Clinical Proteomics.

[15]  Peter A. J. Hilbers,et al.  Computing the Stochastic Dynamics of Phosphorylation Networks , 2010, J. Comput. Biol..

[16]  Jun S. Liu,et al.  Model selection principles in misspecified models , 2010, 1005.5483.

[17]  Terence Hwa,et al.  Transcriptional regulation by the numbers: models. , 2005, Current opinion in genetics & development.

[18]  Mark Girolami,et al.  Statistical analysis of nonlinear dynamical systems using differential geometric sampling methods , 2011, Interface Focus.

[19]  M. Girolami,et al.  Inferring Signaling Pathway Topologies from Multiple Perturbation Measurements of Specific Biochemical Species , 2010, Science Signaling.

[20]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[21]  Kathryn B. Laskey,et al.  Population Markov Chain Monte Carlo , 2004, Machine Learning.

[22]  I. Chou,et al.  Recent developments in parameter estimation and structure identification of biochemical and genomic systems. , 2009, Mathematical biosciences.

[23]  P. Heagerty,et al.  Misspecified maximum likelihood estimates and generalised linear mixed models , 2001 .

[24]  B. Kholodenko Cell-signalling dynamics in time and space , 2006, Nature Reviews Molecular Cell Biology.

[25]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[26]  Satoru Miyano,et al.  Inferring gene networks from time series microarray data using dynamic Bayesian networks , 2003, Briefings Bioinform..

[27]  Luay Nakhleh,et al.  Kinome siRNA-phosphoproteomic screen identifies networks regulating AKT signaling , 2011, Oncogene.

[28]  James E. Ferrell,et al.  Substrate Competition as a Source of Ultrasensitivity in the Inactivation of Wee1 , 2007, Cell.

[29]  E. Gilles,et al.  Computational modeling of the dynamics of the MAP kinase cascade activated by surface and internalized EGF receptors , 2002, Nature Biotechnology.

[30]  D. Koshland,et al.  An amplified sensitivity arising from covalent modification in biological systems. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[31]  W. Wong,et al.  Learning Causal Bayesian Network Structures From Experimental Data , 2008 .

[32]  Gheorghe Craciun,et al.  Identifiability of chemical reaction networks , 2008 .

[33]  Bernhard Schölkopf,et al.  Identifiability of Causal Graphs using Functional Models , 2011, UAI.

[34]  Erwin P. Gianchandani,et al.  Dynamic Analysis of Integrated Signaling, Metabolic, and Regulatory Networks , 2008, PLoS Comput. Biol..

[35]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[36]  Holger Fröhlich,et al.  Dynamic deterministic effects propagation networks: learning signalling pathways from longitudinal protein array data , 2010, Bioinform..

[37]  V. Leskovac Comprehensive Enzyme Kinetics , 2003 .

[38]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[39]  Bor-Sen Chen,et al.  Identifying Functional Mechanisms of Gene and Protein Regulatory Networks in Response to a Broader Range of Environmental Stresses , 2010, Comparative and functional genomics.

[40]  Arnaud Doucet,et al.  On the Utility of Graphics Cards to Perform Massively Parallel Simulation of Advanced Monte Carlo Methods , 2009, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[41]  Sach Mukherjee,et al.  Network inference using informative priors , 2008, Proceedings of the National Academy of Sciences.

[42]  C J Oates,et al.  Network Inference and Biological Dynamics. , 2011, The annals of applied statistics.

[43]  Paul T. Spellman,et al.  Integrating biological knowledge into variable selection: an empirical Bayes approach with an application in cancer biology , 2011, BMC Bioinformatics.

[44]  Wen-Lin Kuo,et al.  A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. , 2006, Cancer cell.

[45]  Edward R. Morrissey,et al.  On reverse engineering of gene interaction networks using time course data with repeated measurements , 2010, Bioinform..

[46]  Steven M. Hill Sparse graphical models for cancer signalling , 2012 .

[47]  Kevin P. Murphy,et al.  Exact Bayesian structure learning from uncertain interventions , 2007, AISTATS.

[48]  Irene Cantone A yeast synthetic network for In-vivo Reverse-engineering and Modelling Assessment (IRMA) , 2009 .