Topological augmentation to infer hidden processes in biological systems

Motivation: A common problem in understanding a biochemical system is to infer its correct structure or topology. This topology consists of all relevant state variables—usually molecules and their interactions. Here we present a method called topological augmentation to infer this structure in a statistically rigorous and systematic way from prior knowledge and experimental data. Results: Topological augmentation starts from a simple model that is unable to explain the experimental data and augments its topology by adding new terms that capture the experimental behavior. This process is guided by representing the uncertainty in the model topology through stochastic differential equations whose trajectories contain information about missing model parts. We first apply this semiautomatic procedure to a pharmacokinetic model. This example illustrates that a global sampling of the parameter space is critical for inferring a correct model structure. We also use our method to improve our understanding of glutamine transport in yeast. This analysis shows that transport dynamics is determined by glutamine permeases with two different kinds of kinetics. Topological augmentation can not only be applied to biochemical systems, but also to any system that can be described by ordinary differential equations. Availability and implementation: Matlab code and examples are available at: http://www.csb.ethz.ch/tools/index. Contact: mikael.sunnaker@bsse.ethz.ch; andreas.wagner@ieu.uzh.ch Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Jens Nielsen,et al.  Integrated multilaboratory systems biology reveals differences in protein metabolism between two reference yeast strains. , 2010, Nature communications.

[2]  U. Sauer,et al.  Identification of furfural as a key toxin in lignocellulosic hydrolysates and evolution of a tolerant yeast strain , 2008, Microbial biotechnology.

[3]  Marc Hafner,et al.  Efficient characterization of high-dimensional parameter spaces for systems biology , 2011, BMC Systems Biology.

[4]  Nir Friedman,et al.  Inferring quantitative models of regulatory networks from expression data , 2004, ISMB/ECCB.

[5]  C T Verrips,et al.  Dynamic optimal control of homeostasis: an integrative system approach for modeling of the central nitrogen metabolism in Saccharomyces cerevisiae. , 2000, Metabolic engineering.

[6]  H. Madsen,et al.  Using Stochastic Differential Equations for PK/PD Model Development , 2005, Journal of Pharmacokinetics and Pharmacodynamics.

[7]  M. Girolami,et al.  Inferring signaling pathway topologies from multiple perturbation measurements of specific biochemical species. , 2010, Science signaling.

[8]  Edda Klipp,et al.  ModelMage: a tool for automatic model generation, selection and management. , 2008, Genome informatics. International Conference on Genome Informatics.

[9]  J. Stelling,et al.  Ensemble modeling for analysis of cell signaling dynamics , 2007, Nature Biotechnology.

[10]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[11]  E. Dubois,et al.  Nitrogen Catabolite Repression-Sensitive Transcription as a Readout of Tor Pathway Regulation: The Genetic Background, Reporter Gene and GATA Factor Assayed Determine the Outcomes , 2009, Genetics.

[12]  Jinnie M. Garrett,et al.  The Saccharomyces cerevisiae YCC5(YCL025c) Gene Encodes an Amino Acid Permease, Agp1, Which Transports Asparagine and Glutamine , 1998, Journal of bacteriology.

[13]  Nicola Zamboni,et al.  High-throughput, accurate mass metabolome profiling of cellular extracts by flow injection-time-of-flight mass spectrometry. , 2011, Analytical chemistry.

[14]  C. Kaiser,et al.  Activity-dependent reversible inactivation of the general amino acid permease. , 2006, Molecular biology of the cell.

[15]  Jacob Hofman-Bang,et al.  Nitrogen catabolite repression in Saccharomyces cerevisiae , 1999, Molecular biotechnology.

[16]  Michael D. Schmidt,et al.  Automated refinement and inference of analytical models for metabolic networks , 2011, Physical biology.

[17]  J. Schreve,et al.  GNP1, the high-affinity glutamine permease of S. cerevisiae , 1996, Current Genetics.

[18]  U. Sauer,et al.  Cross-platform comparison of methods for quantitative metabolomics of primary metabolism. , 2009, Analytical chemistry.

[19]  D. Gatz,et al.  The standard error of a weighted mean concentration—I. Bootstrapping vs other methods , 1995 .

[20]  H. Madsen,et al.  Non-Linear Mixed-Effects Models with Stochastic Differential Equations: Implementation of an Estimation Algorithm , 2005, Journal of Pharmacokinetics and Pharmacodynamics.

[21]  B. Øksendal Stochastic differential equations : an introduction with applications , 1987 .

[22]  Marie Davidian,et al.  Non-linear mixed-effects models , 2008 .

[23]  A. Wagner,et al.  Automatic Generation of Predictive Dynamic Models Reveals Nuclear Phosphorylation as the Key Msn2 Control Mechanism , 2013, Science Signaling.

[24]  Uwe Sauer,et al.  TCA cycle activity in Saccharomyces cerevisiae is a function of the environmentally determined specific growth and glucose uptake rates. , 2004, Microbiology.

[25]  Joerg M. Buescher,et al.  Ultrahigh performance liquid chromatography-tandem mass spectrometry method for fast and robust quantification of anionic and aromatic metabolites. , 2010, Analytical chemistry.

[26]  Aurélien Mazurie,et al.  Gene networks inference using dynamic Bayesian networks , 2003, ECCB.

[27]  Fang-Xiang Wu,et al.  Modeling Gene Expression from Microarray Expression Data with State-Space Equations , 2003, Pacific Symposium on Biocomputing.

[28]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[29]  Michael P. H. Stumpf,et al.  Simulation-based model selection for dynamical systems in systems and population biology , 2009, Bioinform..

[30]  Michael C. Jewett,et al.  Linking high-resolution metabolic flux phenotypes and transcriptional regulation in yeast modulated by the global regulator Gcn4p , 2009, Proceedings of the National Academy of Sciences.

[31]  B. Regenberg,et al.  Dip5p mediates high-affinity and high-capacity transport of L-glutamate and L-aspartate in Saccharomyces cerevisiae , 1998, Current Genetics.

[32]  J. Gancedo,et al.  Concentrations of intermediary metabolites in yeast. , 1973, Biochimie.

[33]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[34]  Zoubin Ghahramani,et al.  A Bayesian approach to reconstructing genetic regulatory networks with hidden factors , 2005, Bioinform..

[35]  M. Gutmann,et al.  Approximate Bayesian Computation , 2019, Annual Review of Statistics and Its Application.

[36]  Amy C. Kelly,et al.  Saccharomyces cerevisiae , 2013, Prion.