An algorithm for the reduction of genome-scale metabolic network models to meaningful core models

BackgroundConstraint-based analysis of genome-scale metabolic models has become a key methodology to gain insights into functions, capabilities, and properties of cellular metabolism. Since their inception, the size and complexity of genome-scale metabolic reconstructions has significantly increased, with a concomitant increase in computational effort required for their analysis. Many stoichiometric methods cannot be applied to large networks comprising several thousand reactions. Furthermore, basic principles of an organism’s metabolism can sometimes be easier studied in smaller models focusing on central metabolism. Therefore, an automated and unbiased reduction procedure delivering meaningful core networks from well-curated genome-scale reconstructions is highly desirable.ResultsHere we present NetworkReducer, a new algorithm for an automated reduction of metabolic reconstructions to obtain smaller models capturing the central metabolism or other metabolic modules of interest. The algorithm takes as input a network model and a list of protected elements and functions (phenotypes) and applies a pruning step followed by an optional compression step. Network pruning removes elements of the network that are dispensable for the protected functions and delivers a subnetwork of the full system. Loss-free network compression further reduces the network size but not the complexity (dimension) of the solution space. As a proof of concept, we applied NetworkReducer to the iAF1260 genome-scale model of Escherichia coli (2384 reactions, 1669 internal metabolites) to obtain a reduced model that (i) allows the same maximal growth rates under aerobic and anaerobic conditions as in the full model, and (ii) preserves a protected set of reactions representing the central carbon metabolism. The reduced representation comprises 85 metabolites and 105 reactions which we compare to a manually derived E. coli core model. As one particular strength of our approach, NetworkReducer derives a condensed biomass synthesis reaction that is consistent with the full genome-scale model. In a second case study, we reduced a genome-scale model of the cyanobacterium Synechocystis sp. PCC 6803 to obtain a small metabolic module comprising photosynthetic core reactions and the Calvin-Benson cycle allowing synthesis of both biomass and a biofuel (ethanol).ConclusionAlthough only genome-scale models provide a complete description of an organism’s metabolic capabilities, an unbiased stoichiometric reduction of large-scale metabolic models is highly useful. We are confident that the NetworkReducer algorithm provides a valuable tool for the application of computationally expensive analyses, for educational purposes, as well to identify core models for kinetic modeling and isotopic tracer experiments.

[1]  B. Palsson,et al.  Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods , 2012, Nature Reviews Microbiology.

[2]  Jeffrey D Orth,et al.  What is flux balance analysis? , 2010, Nature Biotechnology.

[3]  F. Srienc,et al.  Elementary mode analysis: a useful metabolic pathway analysis tool for characterizing cellular metabolism , 2009, Applied Microbiology and Biotechnology.

[4]  S. Klamt,et al.  Cyanobacterial biofuels: new insights and strain design strategies revealed by computational modeling , 2014, Microbial Cell Factories.

[5]  Melanie I. Stefan,et al.  Molecules for memory: modelling CaMKII , 2007, BMC Systems Biology.

[6]  Steffen Klamt,et al.  Stoichiometric and Constraint-Based Analysis of Biochemical Reaction Networks , 2014, Large-Scale Networks in Engineering and Life Sciences.

[7]  Nicola Zamboni,et al.  Novel biological insights through metabolomics and 13C-flux analysis. , 2009, Current opinion in microbiology.

[8]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[9]  Udo Reichl,et al.  Large-Scale Networks in Engineering and Life Sciences , 2014, Modeling and Simulation in Science, Engineering and Technology.

[10]  R. Mahadevan,et al.  The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. , 2003, Metabolic engineering.

[11]  Adam M. Feist,et al.  A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011 , 2011, Molecular systems biology.

[12]  Bonnie Berger,et al.  An exact arithmetic toolbox for a consistent and reproducible structural analysis of metabolic network models , 2014, Nature Communications.

[13]  Adam M. Feist,et al.  Basic and applied uses of genome-scale metabolic network reconstructions of Escherichia coli , 2013, Molecular systems biology.

[14]  Steffen Klamt,et al.  Computation of elementary modes: a unifying framework and the new binary approach , 2004, BMC Bioinformatics.

[15]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Lake-Ee Quek,et al.  Reducing Recon 2 for steady-state flux analysis of HEK cell culture. , 2014, Journal of biotechnology.

[17]  Ronan M. T. Fleming,et al.  Reconstruction and Use of Microbial Metabolic Networks: the Core Escherichia coli Metabolic Model as an Educational Guide. , 2010, EcoSal Plus.

[18]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[19]  Steffen Klamt,et al.  Structural and functional analysis of cellular networks with CellNetAnalyzer , 2007, BMC Systems Biology.

[20]  Ralf Steuer,et al.  Flux Balance Analysis of Cyanobacterial Metabolism: The Metabolic Network of Synechocystis sp. PCC 6803 , 2013, PLoS Comput. Biol..

[21]  C. Daub,et al.  BMC Systems Biology , 2007 .