MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics

Mathematical models of metabolic networks utilize simulation to study system-level mechanisms and functions. Various approaches have been used to model the steady-state behavior of metabolic networks using genome-scale reconstructions, but formulating dynamic models from such reconstructions continues to be a key challenge. Here, we present the Mass Action Stoichiometric Simulation Python (MASSpy) package, an open-source computational framework for dynamic modeling of metabolism. MASSpy utilizes mass action kinetics and detailed chemical mechanisms to build dynamic models of complex biological processes. MASSpy adds dynamic modeling tools to the COnstraint-Based Reconstruction and Analysis Python (COBRApy) package to provide a unified framework for constraint-based and kinetic modeling of metabolic networks. MASSpy supports high-performance dynamic simulation through its integration of libRoadRunner, the Systems Biology Markup Language (SBML) simulation engine. Three case studies demonstrate how to use MASSpy: 1) to simulate dynamics of detailed mechanisms of enzyme regulation; 2) to generate an ensemble of kinetic models using Monte Carlo sampling to approximate missing numerical values of parameters and to quantify uncertainty; and 3) to overcome issues that arise when integrating experimental data with the computation of functional states of detailed biological mechanisms. MASSpy represents a powerful tool to address challenges that arise in dynamic modeling of metabolic networks at both small and large scales.

Author Summary

Genome-scale reconstructions of metabolism appeared shortly after the first genome sequences became available. Constraint-based models are widely used to compute steady-state properties of such reconstructions, but the attainment of dynamic models has remained elusive. We thus developed the MASSpy software package, a framework that enables the construction, simulation, and visualization of dynamic metabolic models. MASSpy is based on mass action kinetics for each elementary step in an enzymatic reaction mechanism. MASSpy unites existing software packages within its framework to provide the user with various modeling tools in one package. MASSpy integrates community standards to facilitate the exchange of models, giving modelers the freedom to use the software for different aspects of their own modeling workflows. Furthermore, MASSpy contains methods for generating and simulating ensembles of models, and for explicitly accounting for biological uncertainty. MASSpy has already demonstrated success in a classroom setting. We anticipate that the suite of modeling tools incorporated into MASSpy will enhance the ability of the modeling community to construct and interrogate complex dynamic models of metabolism.
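
To make the two central ideas concrete (a mass action rate law for each elementary step, and an ensemble of models drawn by Monte Carlo sampling), the following is a minimal sketch in plain Python. It does not use the MASSpy API. The mechanism E + S <-> ES -> E + P, the function names, the initial concentrations, the log-uniform sampling ranges, and the fixed-step Runge-Kutta integrator are all assumptions chosen for illustration only.

```python
import random


def mass_action_rhs(x, k):
    """Time derivatives for E + S <-> ES -> E + P.

    Each elementary step gets its own mass action rate expression.
    x = [E, S, ES, P]; k = (kf, kr, kcat). All names are hypothetical.
    """
    e, s, es, p = x
    kf, kr, kcat = k
    v_bind = kf * e * s - kr * es  # reversible substrate binding
    v_cat = kcat * es              # irreversible catalytic step
    return [-v_bind + v_cat, -v_bind, v_bind - v_cat, v_cat]


def rk4_simulate(x0, k, t_end, n_steps):
    """Integrate the ODEs with a fixed-step classical Runge-Kutta scheme."""
    h = t_end / n_steps
    x = list(x0)
    for _ in range(n_steps):
        k1 = mass_action_rhs(x, k)
        k2 = mass_action_rhs([xi + 0.5 * h * d for xi, d in zip(x, k1)], k)
        k3 = mass_action_rhs([xi + 0.5 * h * d for xi, d in zip(x, k2)], k)
        k4 = mass_action_rhs([xi + h * d for xi, d in zip(x, k3)], k)
        x = [
            xi + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for xi, a, b, c, d in zip(x, k1, k2, k3, k4)
        ]
    return x


def sample_ensemble(n_models, seed=0):
    """Draw rate constants log-uniformly from an assumed range [0.1, 10]."""
    rng = random.Random(seed)
    return [
        tuple(10.0 ** rng.uniform(-1.0, 1.0) for _ in range(3))
        for _ in range(n_models)
    ]


# Assumed initial state: 1 unit of free enzyme, 10 units of substrate.
x0 = [1.0, 10.0, 0.0, 0.0]
ensemble = sample_ensemble(20)
finals = [rk4_simulate(x0, k, t_end=5.0, n_steps=5000) for k in ensemble]

# The spread of product across the ensemble is a crude uncertainty measure.
product = [x[3] for x in finals]
print("product at t = 5: min %.3f, max %.3f" % (min(product), max(product)))
```

Each sampled parameter set defines one member of the ensemble, and the range of simulated product concentrations reflects how strongly the unknown rate constants affect the outcome. A production workflow would typically rely on an adaptive stiff solver rather than a fixed-step scheme.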
