Literature mining supports a next-generation modeling approach to predict cellular byproduct secretion

The metabolic byproducts secreted by growing cells can be easily measured and provide a window into the state of a cell; they have been essential to the development of microbiology1, cancer biology2, and biotechnology3. Progress in computational modeling of cells has made it possible to predict metabolic byproduct secretion with bottom-up reconstructions of metabolic networks. However, owing to a lack of data, it has not been possible to validate these predictions across a wide range of strains and conditions. Through literature mining, we were able to generate a database of Escherichia coli strains and their experimentally measured byproduct secretions. We simulated these strains in six historical genome-scale models of E. coli, and we report that the predictive power of the models has increased as they have expanded in size and scope. Next-generation models of metabolism and gene expression are even more capable than previous models, but parameterization poses new challenges.

[1]  Ali Ebrahim,et al.  Multi-omic data integration enables discovery of hidden biological regularities , 2016, Nature Communications.

[2]  Christoph Herwig,et al.  Quantitative feature extraction from the Chinese hamster ovary bioprocess bibliome using a novel meta-analysis workflow. , 2016, Biotechnology advances.

[3]  Tudor Groza,et al.  Navigating the Phenotype Frontier: The Monarch Initiative , 2016, Genetics.

[4]  Ioannis Ch. Paschalidis,et al.  Mapping the landscape of metabolic goals of a cell , 2016, Genome Biology.

[5]  J. Keasling,et al.  Synthetic and systems biology for microbial production of commodity chemicals , 2016, npj Systems Biology and Applications.

[6]  Yuxuan Wang,et al.  Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming , 2016, PLoS Comput. Biol..

[7]  Philip Miller,et al.  BiGG Models: A platform for integrating, standardizing and sharing genome-scale models , 2015, Nucleic Acids Res..

[8]  Adam M. Feist,et al.  Next-generation genome-scale models for metabolic engineering. , 2015, Current opinion in biotechnology.

[9]  T. Hwa,et al.  Overflow metabolism in E. coli results from efficient proteome allocation , 2015, Nature.

[10]  S. Lee,et al.  Systems strategies for developing industrial microbial strains , 2015, Nature Biotechnology.

[11]  Edward J. O'Brien,et al.  Computing the functional proteome: recent progress and future prospects for genome-scale models. , 2015, Current opinion in biotechnology.

[12]  Bernhard O. Palsson,et al.  Escher: A Web Application for Building, Sharing, and Embedding Data-Rich Visualizations of Biological Pathways , 2015, PLoS Comput. Biol..

[13]  James D. Winkler,et al.  The LASER database: Formalizing design rules for metabolic engineering , 2015, Metabolic engineering communications.

[14]  Erika Check Hayden,et al.  Synthetic biologists seek standards for nascent field , 2015, Nature.

[15]  Elizabeth Brunk,et al.  Model-driven discovery of underground metabolic functions in Escherichia coli , 2015, Proceedings of the National Academy of Sciences.

[16]  Erika Check Hayden Synthetic biology called to order. , 2015, Nature.

[17]  Zachary A. King,et al.  Constraint-based models predict metabolic and associated cellular functions , 2014, Nature Reviews Genetics.

[18]  Edward J. O'Brien,et al.  Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction , 2013, Molecular systems biology.

[19]  M. Jiang,et al.  Enhancement of succinate production by metabolically engineered Escherichia coli with co-expression of nicotinic acid phosphoribosyltransferase and pyruvate carboxylase , 2013, Applied Microbiology and Biotechnology.

[20]  Joshua A. Lerman,et al.  COBRApy: COnstraints-Based Reconstruction and Analysis for Python , 2013, BMC Systems Biology.

[21]  Neil Swainston,et al.  Improving metabolic flux predictions using absolute gene expression data , 2012, BMC Systems Biology.

[22]  B. Palsson,et al.  Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods , 2012, Nature Reviews Microbiology.

[23]  Jeffrey D. Orth,et al.  In silico method for modelling metabolism and gene product expression at genome scale , 2012, Nature Communications.

[24]  E. Ferreira,et al.  Stringent response of Escherichia coli: revisiting the bibliome using literature mining , 2011, Microbial Informatics and Experimentation.

[25]  Jay D Keasling,et al.  Metabolic engineering of microbial pathways for advanced biofuels production. , 2011, Current opinion in biotechnology.

[26]  Alexander Skupin,et al.  Carbohydrate-active enzymes exemplify entropic principles in metabolism , 2011, Molecular systems biology.

[27]  Adam M. Feist,et al.  A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011 , 2011, Molecular systems biology.

[28]  James M Clomburg,et al.  Engineered reversal of the β-oxidation cycle for the synthesis of fuels and chemicals , 2011, Nature.

[29]  A. Burgard,et al.  Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol. , 2011, Nature chemical biology.

[30]  Cong T. Trinh,et al.  Redesigning Escherichia coli Metabolism for Anaerobic Production of Isobutanol , 2011, Applied and Environmental Microbiology.

[31]  Dan S. Tawfik,et al.  The moderately efficient enzyme: evolutionary and physicochemical trends shaping enzyme parameters. , 2011, Biochemistry.

[32]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[33]  V. Hatzimanikatis,et al.  Manipulating redox and ATP balancing for improved production of succinate in E. coli. , 2011, Metabolic engineering.

[34]  James M Clomburg,et al.  Metabolic engineering of Escherichia coli for the production of succinate from glycerol. , 2010, Metabolic engineering.

[35]  Tom M. Conrad,et al.  Omic data from evolved E. coli are consistent with computed optimal growth from genome-scale models , 2010, Molecular systems biology.

[36]  Adam M. Feist,et al.  Model-driven evaluation of the production potential for growth-coupled products of Escherichia coli. , 2010, Metabolic engineering.

[37]  A. Demain History of Industrial Biotechnology , 2010 .

[38]  Jeffrey D Orth,et al.  What is flux balance analysis? , 2010, Nature Biotechnology.

[39]  Xueli Zhang,et al.  Fermentation of Glycerol to Succinate by Metabolically Engineered Strains of Escherichia coli , 2010, Applied and Environmental Microbiology.

[40]  B. Teusink,et al.  Shifts in growth strategies reflect tradeoffs in cellular economics , 2009, Molecular systems biology.

[41]  H. Mori,et al.  Systematic phenome analysis of Escherichia coli multiple-knockout mutants reveals hidden reactions in central carbon metabolism , 2009, Molecular systems biology.

[42]  James C. Liao,et al.  Engineering the isobutanol biosynthetic pathway in Escherichia coli by comparison of three aldehyde reductase/alcohol dehydrogenase genes , 2009, Applied Microbiology and Biotechnology.

[43]  J. Liao,et al.  Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels , 2008, Nature.

[44]  Xueli Zhang,et al.  Production of l-alanine by metabolically engineered Escherichia coli , 2007, Applied Microbiology and Biotechnology.

[45]  U. Sauer,et al.  Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli , 2007, Molecular systems biology.

[46]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[47]  J. Piškur,et al.  How did Saccharomyces evolve to become a good brewer? , 2006, Trends in genetics : TIG.

[48]  B. Palsson Systems Biology: Properties of Reconstructed Networks , 2006 .

[49]  Ka-Yiu San,et al.  Efficient Succinic Acid Production from Glucose through Overexpression of Pyruvate Carboxylase in an Escherichia coli Alcohol Dehydrogenase and Lactate Dehydrogenase Mutant , 2008, Biotechnology progress.

[50]  S. Lee,et al.  Metabolic Engineering of Escherichia coli for Enhanced Production of Succinic Acid, Based on Genome Comparison and In Silico Gene Knockout Simulation , 2005, Applied and Environmental Microbiology.

[51]  F. Blattner,et al.  In silico design and adaptive evolution of Escherichia coli for production of lactic acid. , 2005, Biotechnology and bioengineering.

[52]  G. Bennett,et al.  Novel pathway engineering design of the anaerobic central metabolic pathway in Escherichia coli to increase succinate yield and productivity. , 2005, Metabolic engineering.

[53]  A. Burgard,et al.  Optknock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization , 2003, Biotechnology and bioengineering.

[54]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[55]  Shengde Zhou,et al.  Functional Replacement of the Escherichia colid-(−)-Lactate Dehydrogenase Gene (ldhA) with the l-(+)-Lactate Dehydrogenase Gene (ldhL) from Pediococcus acidilactici , 2003, Applied and Environmental Microbiology.

[56]  J. A. Barnett,et al.  Beginnings of microbiology and biochemistry: the contribution of yeast research. , 2003, Microbiology.

[57]  B. Palsson,et al.  Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth , 2002, Nature.

[58]  D. Kaplan Biological molecules at large , 2002, Nature.

[59]  M. A. Eiteman,et al.  Effects of Growth Mode and Pyruvate Carboxylase on Succinic Acid Production by Metabolically Engineered Strains of Escherichia coli , 2002, Applied and Environmental Microbiology.

[60]  B. Palsson,et al.  In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data , 2001, Nature Biotechnology.

[61]  G. Cecchini,et al.  Anaerobic Expression of Escherichia coli Succinate Dehydrogenase: Functional Replacement of Fumarate Reductase in the Respiratory Chain during Anaerobic Growth , 1998, Journal of bacteriology.

[62]  C. S. Millard,et al.  A novel fermentation pathway in anEscherichia coli mutant producing succinic acid, acetic acid, and ethanol , 1998, Applied biochemistry and biotechnology.

[63]  L. Stols,et al.  Production of succinic acid through overexpression of NAD(+)-dependent malic enzyme in an Escherichia coli mutant , 1997, Applied and environmental microbiology.

[64]  L. Stols,et al.  Expression ofAscaris suum malic enzyme in a mutantEscherichia coli allows production of succinic acid from glucose , 1997 .

[65]  S. Lee Bacterial polyhydroxyalkanoates , 1996, Biotechnology and bioengineering.

[66]  B. Palsson,et al.  Stoichiometric interpretation of Escherichia coli glucose catabolism under various oxygenation rates , 1993, Applied and environmental microbiology.

[67]  D. Clark,et al.  The fermentation pathways of Escherichia coli. , 1989, FEMS microbiology reviews.

[68]  L. Ingram,et al.  Genetic engineering of ethanol production in Escherichia coli , 1987, Applied and environmental microbiology.