Prediction of novel synthetic pathways for the production of desired chemicals

BackgroundThere have been several methods developed for the prediction of synthetic metabolic pathways leading to the production of desired chemicals. In these approaches, novel pathways were predicted based on chemical structure changes, enzymatic information, and/or reaction mechanisms, but the approaches generating a huge number of predicted results are difficult to be applied to real experiments. Also, some of these methods focus on specific pathways, and thus are limited to expansion to the whole metabolism.ResultsIn the present study, we propose a system framework employing a retrosynthesis model with a prioritization scoring algorithm. This new strategy allows deducing the novel promising pathways for the synthesis of a desired chemical together with information on enzymes involved based on structural changes and reaction mechanisms present in the system database. The prioritization scoring algorithm employing Tanimoto coefficient and group contribution method allows examination of structurally qualified pathways to recognize which pathway is more appropriate. In addition, new concepts of binding site covalence, estimation of pathway distance and organism specificity were taken into account to identify the best synthetic pathway. Parameters of these factors can be evolutionarily optimized when a newly proven synthetic pathway is registered. As the proofs of concept, the novel synthetic pathways for the production of isobutanol, 3-hydroxypropionate, and butyryl-CoA were predicted. The prediction shows a high reliability, in which experimentally verified synthetic pathways were listed within the top 0.089% of the identified pathway candidates.ConclusionsIt is expected that the system framework developed in this study would be useful for the in silico design of novel metabolic pathways to be employed for the efficient production of chemicals, fuels and materials.

[1]  Robert E. Bixby,et al.  Commentary - Progress in Linear Programming , 1994, INFORMS J. Comput..

[2]  N. Pace A molecular view of microbial diversity and the biosphere. , 1997, Science.

[3]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[4]  J. Liao,et al.  Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels , 2008, Nature.

[5]  J. Thornton,et al.  Homology, pathway distance and chromosomal localization of the small molecule metabolism enzymes in Escherichia coli. , 2002, Journal of molecular biology.

[6]  Peter D. Karp,et al.  The Pathway Tools software , 2002, ISMB.

[7]  P N Judson,et al.  Knowledge-based expert systems for toxicity and metabolism prediction: DEREK, StAR and METEOR. , 1999, SAR and QSAR in environmental research.

[8]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[9]  Johnathan E. Holladay,et al.  Top Value Added Chemicals From Biomass. Volume 1 - Results of Screening for Potential Candidates From Sugars and Synthesis Gas , 2004 .

[10]  Enrique Querol,et al.  Analysis of phenetic trees based on metabolic capabilites across the three domains of life. , 2004, Journal of molecular biology.

[11]  M. Mavrovouniotis Estimation of standard Gibbs energy changes of biotransformations. , 1991, The Journal of biological chemistry.

[12]  Lynda B. M. Ellis,et al.  Microbial Pathway Prediction: A Functional Group Approach , 2003, J. Chem. Inf. Comput. Sci..

[13]  Masanori Arita,et al.  Metabolic reconstruction using shortest paths , 2000, Simul. Pract. Theory.

[14]  Gilles Klopman,et al.  META. 1. A Program for the Evaluation of Metabolic Transformation of Chemicals , 1994, J. Chem. Inf. Comput. Sci..

[15]  F. Darvas,et al.  Predicting metabolic pathways by logic programming , 1988 .

[16]  M. Mavrovouniotis Group contributions for estimating standard gibbs energies of formation of biochemical compounds in aqueous solution , 1990, Biotechnology and bioengineering.

[17]  Robert E. Bixby,et al.  Progress in Linear Programming , 1993 .

[18]  Johann Gasteiger,et al.  Handbook of Chemoinformatics , 2003 .

[19]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[20]  Michel Dumontier,et al.  CO: A chemical ontology for identification of functional groups and semantic comparison of small molecules , 2005, FEBS letters.

[21]  S. Wodak,et al.  Inferring meaningful pathways in weighted metabolic networks. , 2006, Journal of molecular biology.

[22]  David Kendrick,et al.  GAMS, a user's guide , 1988, SGNM.

[23]  Chunhui Li,et al.  Exploring the diversity of complex metabolic networks , 2005, Bioinform..

[24]  Pritish Kumar Varadwaj,et al.  Functional group based Ligand binding affinity scoring function at atomic environmental level , 2009, Bioinformation.

[25]  James M. Hogle,et al.  Functional group placement in protein binding sites: a comparison of GRID and MCSS , 2001, J. Comput. Aided Mol. Des..

[26]  D. Aguilar,et al.  Analysis of Phenetic Trees Based on Metabolic Capabilites Across the Three Domains of Life , 2004 .

[27]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[28]  Johann Gasteiger,et al.  Computer‐Assisted Planning of Organic Syntheses: The Second Generation of Programs , 1996 .

[29]  James E. Bray,et al.  Enzymology and Molecular Biology of Carbonyl Metabolism , 2005 .

[30]  Ferenc Csizmadia JChem: Java Applets and Modules Supporting Chemical Database Handling from Web Browsers , 2000, J. Chem. Inf. Comput. Sci..

[31]  David Sankoff,et al.  Edit Distances for Genome Comparisons Based on Non-Local Operations , 1992, CPM.

[32]  Linda J. Broadbelt,et al.  Computational discovery of biochemical routes to specialty chemicals , 2004 .

[33]  Imran Shah,et al.  Heurstic search for metabolic engineering: de novo synthesis of vanillin , 2005, Comput. Chem. Eng..

[34]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[35]  スティーブン ジェイ. ゴート,et al.  3-hydroxypropionic acid, and other organic compounds , 2001 .

[36]  M. Xian,et al.  Biosynthetic pathways for 3-hydroxypropionic acid production , 2009, Applied Microbiology and Biotechnology.

[37]  S. Rao,et al.  PathMiner: predicting metabolic pathways by heuristic search , 2003, Bioinform..

[38]  Alfonso Jaramillo,et al.  DESHARKY: automatic design of metabolic pathways for optimal cell growth , 2008, Bioinform..

[39]  Xin Yao,et al.  Evolutionary Optimization , 2002 .

[40]  P Willett,et al.  Grouping of coefficients for the calculation of inter-molecular similarity and dissimilarity using 2D fragment bit-strings. , 2002, Combinatorial chemistry & high throughput screening.

[41]  K. Prather,et al.  De novo biosynthetic pathways: rational design of microbial chemical factories. , 2008, Current opinion in biotechnology.

[42]  Elena Deza,et al.  Dictionary of distances , 2006 .

[43]  Roger S. Holmes,et al.  Enzymology and Molecular Biology of Carbonyl Metabolism 5 , 1995, Advances in Experimental Medicine and Biology.

[44]  Costas D Maranas,et al.  OptStrain: a computational framework for redesign of microbial production systems. , 2004, Genome research.