OptFill: A Tool for Infeasible Cycle-Free Gapfilling of Stoichiometric Metabolic Models

Summary: Stoichiometric metabolic modeling, particularly with genome-scale models (GSMs), is now an indispensable tool for systems biology. Model reconstruction typically involves collecting information from public databases; however, incomplete systems knowledge leaves gaps in any reconstruction. Current gapfilling tools draw on databases of biochemical functionalities to resolve gaps on a per-metabolite basis and can provide multiple solutions, but they cannot avoid thermodynamically infeasible cycles (TICs), so their solutions invariably require lengthy manual curation. To address these limitations, this work introduces OptFill, an optimization-based multi-step method that performs TIC-avoiding whole-model gapfilling. We applied OptFill to three fictional prokaryotic models of increasing size and to a published GSM of Escherichia coli, iJR904. These applications yielded holistic gapfilling solutions free of infeasible cycles. In addition, OptFill can be adapted to automate the identification of inherent TICs in any GSM. Overall, OptFill can address critical issues in the automated development of high-quality GSMs.
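To make the notion of a TIC concrete, the following is a minimal sketch on a toy network. It is not OptFill itself: the three-reaction network, the reaction labels, and the helper functions are hypothetical and chosen only for illustration. With no exchange reactions present, any nonzero flux vector that keeps every metabolite mass balanced (S v = 0) carries flux around a closed loop with no net input or output, which is the kind of cycle a TIC-avoiding gapfilling method must exclude.

```python
# Toy illustration only: the network and helper names are hypothetical,
# not taken from OptFill.

# Rows are metabolites A, B, C.
# Columns are reactions R1: A -> B, R2: B -> C, R3: C -> A.
S = [
    [-1,  0,  1],  # A
    [ 1, -1,  0],  # B
    [ 0,  1, -1],  # C
]


def is_steady_state(S, v, tol=1e-9):
    """Return True if S v = 0, i.e. every metabolite is mass balanced."""
    return all(abs(sum(s * f for s, f in zip(row, v))) <= tol for row in S)


def is_tic(S, v, tol=1e-9):
    """Return True if v is a nonzero, mass-balanced flux vector.

    Assumes S contains internal reactions only (no exchange reactions),
    so such a vector can only be flux circulating in a closed loop.
    """
    return any(abs(f) > tol for f in v) and is_steady_state(S, v, tol)


cycle_flux = [1.0, 1.0, 1.0]   # flux through R1, R2, and R3 together
linear_flux = [1.0, 1.0, 0.0]  # R3 inactive, so the loop is broken

print(is_tic(S, cycle_flux))   # True
print(is_tic(S, linear_flux))  # False
```

In this toy case, adding R3 from a database to a model that already contains R1 and R2 would close the loop; a per-metabolite gapfilling approach has no way to notice this, whereas a whole-model check of this kind does.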
