A review of computational tools for design and reconstruction of metabolic pathways

Metabolic pathways reflect an organism's chemical repertoire and hence their elucidation and design have been a primary goal in metabolic engineering. Various computational methods have been developed to design novel metabolic pathways while taking into account several prerequisites such as pathway stoichiometry, thermodynamics, host compatibility, and enzyme availability. The choice of the method is often determined by the nature of the metabolites of interest and preferred host organism, along with computational complexity and availability of software tools. In this paper, we review different computational approaches used to design metabolic pathways based on the reaction network representation of the database (i.e., graph or stoichiometric matrix) and the search algorithm (i.e., graph search, flux balance analysis, or retrosynthetic search). We also put forth a systematic workflow that can be implemented in projects requiring pathway design and highlight current limitations and obstacles in computational pathway design.

[1]  Olivier Martin,et al.  MetaNetX/MNXref – reconciliation of metabolites and biochemical reactions to bring together genome-scale metabolic networks , 2015, Nucleic Acids Res..

[2]  Jacques van Helden,et al.  In response to 'Can sugars be produced from fatty acids? A test case for pathway analysis tools' , 2009, Bioinform..

[3]  Pablo Carbonell,et al.  A retrosynthetic biology approach to therapeutics: from conception to delivery. , 2012, Current opinion in biotechnology.

[4]  B. Griffin,et al.  Network Context and Selection in the Evolution to Enzyme Specificity , 2014 .

[5]  Jan Brezovsky,et al.  Computational tools for designing and engineering biocatalysts. , 2009, Current opinion in chemical biology.

[6]  Anupam Chowdhury,et al.  Designing overall stoichiometric conversions and intervening metabolic reactions , 2015, Scientific Reports.

[7]  Susumu Goto,et al.  Metabolic pathway reconstruction strategies for central metabolism and natural product biosynthesis , 2016, Biophysics and physicobiology.

[8]  Ryo Takeuchi,et al.  Computational redesign of a mononuclear zinc metalloenzyme for organophosphate hydrolysis. , 2012, Nature chemical biology.

[9]  Di Wu,et al.  A Computational Approach To Design and Evaluate Enzymatic Reaction Pathways: Application to 1-Butanol Production from Pyruvate , 2011, J. Chem. Inf. Model..

[10]  C. Smolke,et al.  Complete biosynthesis of opioids in yeast , 2015, Science.

[11]  Johann Gasteiger,et al.  Combining Chemoinformatics with Bioinformatics: In Silico Prediction of Bacterial Flavor-Forming Pathways by a Chemical Systems Biology Approach “Reverse Pathway Engineering” , 2014, PloS one.

[12]  Lydia E. Kavraki,et al.  An Algorithm for Efficient Identification of Branched Metabolic Pathways , 2011, J. Comput. Biol..

[13]  Jens Meiler,et al.  New algorithms and an in silico benchmark for computational enzyme design , 2006, Protein science : a publication of the Protein Society.

[14]  Dong-Sheng Cao,et al.  RxnFinder: biochemical reaction search engines using molecular structures, molecular fragments and reaction similarity , 2011, Bioinform..

[15]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[16]  J Vanco [The Beilstein CrossFire Information System and its use in pharmaceutical chemistry]. , 2003, Ceska a Slovenska farmacie : casopis Ceske farmaceuticke spolecnosti a Slovenske farmaceuticke spolecnosti.

[17]  C. Austin,et al.  Improving the Human Hazard Characterization of Chemicals: A Tox21 Update , 2013, Environmental health perspectives.

[18]  Frances H. Arnold,et al.  Olefin Cyclopropanation via Carbene Transfer Catalyzed by Engineered Cytochrome P450 Enzymes , 2013, Science.

[19]  Keith E. J. Tyo,et al.  Evaluating enzymatic synthesis of small molecule drugs. , 2016, Metabolic engineering.

[20]  Lydia E. Kavraki,et al.  Finding metabolic pathways using atom tracking , 2010, Bioinform..

[21]  J Craig Venter,et al.  Digital-to-biological converter for on-demand production of biologics , 2017, Nature Biotechnology.

[22]  Chiam Yu Ng,et al.  Metabolic modeling of clostridia: current developments and applications. , 2016, FEMS microbiology letters.

[23]  J. Y. Yen,et al.  Finding the K Shortest Loopless Paths in a Network , 2007 .

[24]  Yiran Huang,et al.  A Method for Finding Metabolic Pathways Using Atomic Group Tracking , 2017, PloS one.

[25]  Francisco J. Planes,et al.  Refining carbon flux paths using atomic trace data , 2014, Bioinform..

[26]  Ronan M. T. Fleming,et al.  Consistent Estimation of Gibbs Energy Using Component Contributions , 2013, PLoS Comput. Biol..

[27]  Hyun Uk Kim,et al.  Production of bulk chemicals via novel metabolic pathways in microorganisms. , 2013, Biotechnology advances.

[28]  Costas D Maranas,et al.  OptStrain: a computational framework for redesign of microbial production systems. , 2004, Genome research.

[29]  David Eppstein,et al.  Finding the k Shortest Paths , 1999, SIAM J. Comput..

[30]  Hsien-Da Huang,et al.  FMM: a web server for metabolic pathway reconstruction and comparative analysis , 2009, Nucleic Acids Res..

[31]  Alexandra M. Newman,et al.  Practical Guidelines for Solving Difficult Mixed Integer Linear , 2013 .

[32]  Oliver Fiehn,et al.  MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics , 2015, Journal of Cheminformatics.

[33]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[34]  Sunwon Park,et al.  Prediction of novel synthetic pathways for the production of desired chemicals , 2010, BMC Systems Biology.

[35]  Günter Klambauer,et al.  DeepTox: Toxicity Prediction using Deep Learning , 2016, Front. Environ. Sci..

[36]  Pablo Carbonell,et al.  Enumerating metabolic pathways for the production of heterologous target chemicals in chassis organisms , 2012, BMC Systems Biology.

[37]  Elmar Heinzle,et al.  Network design and analysis for multi-enzyme biocatalysis , 2017, BMC Bioinformatics.

[38]  S. Chávez,et al.  The Presence of Glutamate Dehydrogenase Is a Selective Advantage for the Cyanobacterium Synechocystis sp. Strain PCC 6803 under Nonexponential Growth Conditions , 1999, Journal of bacteriology.

[39]  C. Maranas,et al.  IPRO: an iterative computational protein library redesign and optimization procedure. , 2006, Biophysical journal.

[40]  C. Maranas,et al.  A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains , 2016, Nature Communications.

[41]  Pablo Carbonell,et al.  A retrosynthetic biology approach to metabolic pathway design for therapeutic production , 2011, BMC Systems Biology.

[42]  Amanda L. Smith,et al.  Computational protein design enables a novel one-carbon assimilation pathway , 2015, Proceedings of the National Academy of Sciences.

[43]  D. Baker,et al.  The coming of age of de novo protein design , 2016, Nature.

[44]  Peter D. Karp,et al.  A systematic comparison of the MetaCyc and KEGG pathway databases , 2013, BMC Bioinformatics.

[45]  Robert Petryszak,et al.  UniChem: a unified chemical structure cross-referencing and identifier tracking system , 2013, Journal of Cheminformatics.

[46]  Xin Gao,et al.  MRE: a web tool to suggest foreign enzymes for the biosynthesis pathway design with competing endogenous reactions in mind , 2016, Nucleic Acids Res..

[47]  Pablo Carbonell,et al.  Molecular signatures-based prediction of enzyme promiscuity , 2010, Bioinform..

[48]  Pablo Carbonell,et al.  XTMS: pathway design in an eXTended metabolic space , 2014, Nucleic Acids Res..

[49]  Jean-Loup Faulon,et al.  Expanding Biosensing Abilities through Computer-Aided Design of Metabolic Pathways. , 2016, ACS synthetic biology.

[50]  Robin Osterhout,et al.  Development of a commercial scale process for production of 1,4-butanediol from sugar. , 2016, Current opinion in biotechnology.

[51]  Oliver Kohlbacher,et al.  MetaRoute: fast search for relevant metabolic routes for interactive network navigation and visualization , 2008, Bioinform..

[52]  Stefan Schuster,et al.  Systems biology Metatool 5.0: fast and flexible elementary modes analysis , 2006 .

[53]  Christoph Steinbeck,et al.  Rhea—a manually curated resource of biochemical reactions , 2011, Nucleic Acids Res..

[54]  H. Wood,et al.  Life with CO or CO2 and H2 as a source of carbon and energy , 1991, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[55]  Linda J. Broadbelt,et al.  Efficient searching and annotation of metabolic networks using chemical similarity , 2015, Bioinform..

[56]  Morteza Saheb Zamani,et al.  FogLight: an efficient matrix-based approach to construct metabolic pathways by search space reduction , 2016, Bioinform..

[57]  J. Reed,et al.  Large-Scale Bi-Level Strain Design Approaches and Mixed-Integer Programming Solution Techniques , 2011, PloS one.

[58]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[59]  Jiabo Li,et al.  New features that improve the pharmacophore tools from Accelrys. , 2011, Current computer-aided drug design.

[60]  James C Liao,et al.  Ensemble Modeling for Robustness Analysis in engineering non-native metabolic pathways. , 2014, Metabolic engineering.

[61]  P. Karp,et al.  Computational prediction of human metabolic pathways from the complete human genome , 2004, Genome Biology.

[62]  Yutaka Saito,et al.  An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds , 2012, BMC Bioinformatics.

[63]  Richard B. Sessions,et al.  CCBuilder: an interactive web-based tool for building, designing and assessing coiled-coil protein assemblies , 2014, Bioinform..

[64]  Marwin H. S. Segler,et al.  Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. , 2017, Chemistry.

[65]  Jotun Hein,et al.  Rahnuma: hypergraph-based tool for metabolic pathway prediction and network comparison , 2009, Bioinform..

[66]  V. Hatzimanikatis,et al.  ATLAS of Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and Metabolic Engineering Studies. , 2016, ACS synthetic biology.

[67]  Susumu Goto,et al.  PathPred: an enzyme-catalyzed metabolic pathway prediction server , 2010, Nucleic Acids Res..

[68]  Yang Liu,et al.  Route Designer: A Retrosynthetic Analysis Tool Utilizing Automated Retrosynthetic Rule Generation , 2009, J. Chem. Inf. Model..

[69]  J. van Helden,et al.  Metabolic pathfinding using RPAIR annotation. , 2009, Journal of molecular biology.

[70]  Tong Li,et al.  The Iterative Protein Redesign and Optimization (IPRO) suite of programs , 2015, J. Comput. Chem..

[71]  Wolfram Liebermeister,et al.  The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization , 2016, PLoS Comput. Biol..

[72]  A. Burgard,et al.  Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol. , 2011, Nature chemical biology.

[73]  Costas D. Maranas,et al.  MetRxn: a knowledgebase of metabolites and reactions spanning metabolic models and databases , 2012, BMC Bioinformatics.

[74]  Christodoulos A Floudas,et al.  Protein WISDOM: a workbench for in silico de novo design of biomolecules. , 2013, Journal of visualized experiments : JoVE.

[75]  Christoph Kaleta,et al.  Metabolic Pathway Analysis : from small to genome-scale networks , 2011 .

[76]  Peter D. Karp,et al.  The EcoCyc database: reflecting new knowledge about Escherichia coli K-12 , 2016, Nucleic Acids Res..

[77]  Christoph Steinbeck,et al.  Reaction Decoder Tool (RDT): extracting features from chemical reactions , 2016, Bioinform..

[78]  F M Richards,et al.  Construction of new ligand binding sites in proteins of known structure. II. Grafting of a buried transition metal binding site into Escherichia coli thioredoxin. , 1991, Journal of molecular biology.

[79]  Huimin Zhao,et al.  Protein design for pathway engineering. , 2014, Journal of structural biology.

[80]  Hossein Fazelinia,et al.  Extending Iterative Protein Redesign and Optimization (IPRO) in protein library design for ligand specificity. , 2007, Biophysical journal.

[81]  Jennifer L Reed,et al.  Metabolic assessment of E. coli as a Biofactory for commercial products. , 2016, Metabolic engineering.

[82]  F. Richards,et al.  Construction of new ligand binding sites in proteins of known structure. I. Computer-aided modeling of sites with pre-defined geometry. , 1991, Journal of molecular biology.

[83]  Paul N. Devine,et al.  Biocatalytic Asymmetric Synthesis of Chiral Amines from Ketones Applied to Sitagliptin Manufacture , 2010, Science.

[84]  Lei Shi,et al.  SABIO-RK—database for biochemical reaction kinetics , 2011, Nucleic Acids Res..

[85]  Benjamin Müller,et al.  The SCIP Optimization Suite 5.0 , 2017, 2112.08872.

[86]  Rainer Schrader,et al.  Metabolic pathway analysis web service (Pathway Hunter Tool at CUBIC) , 2005, Bioinform..

[87]  Philip Miller,et al.  BiGG Models: A platform for integrating, standardizing and sharing genome-scale models , 2015, Nucleic Acids Res..

[88]  J. Pleiss Protein design in metabolic engineering and synthetic biology. , 2011, Current opinion in biotechnology.

[89]  Kent McClymont,et al.  Metabolic tinker: an online tool for guiding the design of synthetic metabolic pathways , 2013, Nucleic acids research.

[90]  Pablo Carbonell,et al.  Compound toxicity screening and structure-activity relationship modeling in Escherichia coli. , 2012, Biotechnology and bioengineering.

[91]  Lin Wang,et al.  Standardizing biomass reactions and ensuring complete mass balance in genome‐scale metabolic models , 2017, Bioinform..

[92]  Wolfram Liebermeister,et al.  Pathway Thermodynamics Highlights Kinetic Obstacles in Central Metabolism , 2014, PLoS Comput. Biol..

[93]  K. Prather,et al.  De novo biosynthetic pathways: rational design of microbial chemical factories. , 2008, Current opinion in biotechnology.

[94]  Peter D. Karp,et al.  Optimal metabolic route search based on atom mappings , 2014, Bioinform..

[95]  Gemma L. Holliday,et al.  EC-BLAST: A Tool to Automatically Search and Compare Enzyme Reactions , 2014, Nature Methods.

[96]  V. Hatzimanikatis,et al.  Design of computational retrobiosynthesis tools for the design of de novo synthetic pathways. , 2015, Current opinion in chemical biology.

[97]  Costas D. Maranas,et al.  CLCA: Maximum Common Molecular Substructure Queries within the MetRxn Database , 2014, J. Chem. Inf. Model..

[98]  De Yonker,et al.  A New Approach to Protein Design: Grafting of a Buried Transition Metal Binding Site into Escherichia coli Thioredoxin , 1992 .

[99]  Edda Klipp,et al.  Modular rate laws for enzymatic reactions: thermodynamics, elasticities and implementation , 2010, Bioinform..

[100]  M Kanehisa,et al.  Organizing and computing metabolic pathway data in terms of binary relations. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[101]  Matthew D. Jankowski,et al.  Group contribution method for thermodynamic analysis of complex metabolic networks. , 2008, Biophysical journal.

[102]  Dietmar Schomburg,et al.  BKM-react, an integrated biochemical reaction database , 2011, BMC Biochemistry.

[103]  T. Sjögren,et al.  Structural basis for ligand promiscuity in cytochrome P450 3A4 , 2006, Proceedings of the National Academy of Sciences.

[104]  S. Rhee,et al.  AraCyc: A Biochemical Pathway Database for Arabidopsis1 , 2003, Plant Physiology.

[105]  Jean-Charles Portais,et al.  FindPath: a Matlab solution for in silico design of synthetic metabolic pathways , 2014, Bioinform..

[106]  B. Waclaw,et al.  Lower glycolysis carries a higher flux than any biochemically possible alternative , 2014, Nature Communications.

[107]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo-and Bioinformatics , 2003, J. Chem. Inf. Comput. Sci..

[108]  Alfonso Jaramillo,et al.  DESHARKY: automatic design of metabolic pathways for optimal cell growth , 2008, Bioinform..

[109]  Ali R. Zomorrodi,et al.  A kinetic model of Escherichia coli core metabolism satisfying multiple sets of mutant flux data. , 2014, Metabolic engineering.

[110]  Rainer Breitling,et al.  Computational tools for the synthetic design of biochemical pathways , 2012, Nature Reviews Microbiology.

[111]  Francisco J. Planes,et al.  Path finding methods accounting for stoichiometry in metabolic networks , 2011, Genome Biology.

[112]  Adam M. Feist,et al.  Generation of an atlas for commodity chemical production in Escherichia coli and a novel pathway prediction algorithm, GEM-Path. , 2014, Metabolic engineering.

[113]  Kai Zhao,et al.  MRSD: a web server for Metabolic Route Search and Design , 2011, Bioinform..

[114]  Shoshana J. Wodak,et al.  Metabolic PathFinding: inferring relevant pathways in biochemical networks , 2005, Nucleic Acids Res..

[115]  Hal S Alper,et al.  Rewiring yeast sugar transporter preference through modifying a conserved protein motif , 2013, Proceedings of the National Academy of Sciences.

[116]  Natalia N. Ivanova,et al.  1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life , 2017, Nature Biotechnology.

[117]  Steffen Klamt,et al.  Hypergraphs and Cellular Networks , 2009, PLoS Comput. Biol..

[118]  Pablo Carbonell,et al.  RetroPath2.0: A retrosynthesis workflow for metabolic engineers. , 2018, Metabolic engineering.

[119]  Dong-Qing Wei,et al.  Theoretical Studies of Intracellular Concentration of Micro-organisms’ Metabolites , 2017, Scientific Reports.

[120]  Lynda B. M. Ellis,et al.  The University of Minnesota Biocatalysis/Biodegradation Database: improving public access , 2009, Nucleic Acids Res..

[121]  Pablo Carbonell,et al.  Computer-aided design for metabolic engineering. , 2014, Journal of biotechnology.

[122]  Thomas Bernard,et al.  Reconciliation of metabolites and biochemical reactions for metabolic networks , 2012, Briefings Bioinform..

[123]  Derek N Woolfson,et al.  CCBuilder 2.0: Powerful and accessible coiled‐coil modeling , 2017, Protein science : a publication of the Protein Society.

[124]  Juho Rousu,et al.  BMC Systems Biology BioMed Central Methodology article , 2009 .

[125]  Angel Rubio,et al.  Computing the shortest elementary flux modes in genome-scale metabolic networks , 2009, Bioinform..

[126]  Chunhui Li,et al.  Exploring the diversity of complex metabolic networks , 2005, Bioinform..

[127]  Egon L. Willighagen,et al.  The Chemical Translation Service—a web-based tool to improve standardization of metabolomic reports , 2010, Bioinform..

[128]  Antje Chang,et al.  BRENDA, the enzyme information system in 2011 , 2010, Nucleic Acids Res..

[129]  Limsoon Wong,et al.  CMPF: Class-switching minimized pathfinding in metabolic networks , 2012, BMC Bioinformatics.

[130]  Ron Milo,et al.  eQuilibrator—the biochemical thermodynamics calculator , 2011, Nucleic Acids Res..

[131]  Pierre Dupont,et al.  Systems biology Advance Access publication March 12, 2010 , 2009 .

[132]  Christoph Kaleta,et al.  Response to comment on 'Can sugars be produced from fatty acids? A test case for pathway analysis tools' , 2009, Bioinform..

[133]  Rick L. Stevens,et al.  High-throughput generation, optimization and analysis of genome-scale metabolic models , 2010, Nature Biotechnology.

[134]  Lynda B. M. Ellis,et al.  The University of Minnesota Pathway Prediction System: multi-level prediction and visualization , 2011, Nucleic Acids Res..

[135]  R. Milo,et al.  Glycolytic strategy as a tradeoff between energy yield and protein cost , 2013, Proceedings of the National Academy of Sciences.

[136]  Jennifer L Reed,et al.  MapMaker and PathTracer for tracking carbon in genome-scale metabolic models. , 2016, Biotechnology journal.