A retrosynthetic biology approach to metabolic pathway design for therapeutic production

BackgroundSynthetic biology is used to develop cell factories for production of chemicals by constructively importing heterologous pathways into industrial microorganisms. In this work we present a retrosynthetic approach to the production of therapeutics with the goal of developing an in situ drug delivery device in host cells. Retrosynthesis, a concept originally proposed for synthetic chemistry, iteratively applies reversed chemical transformations (reversed enzyme-catalyzed reactions in the metabolic space) starting from a target product to reach precursors that are endogenous to the chassis. So far, a wider adoption of retrosynthesis into the manufacturing pipeline has been hindered by the complexity of enumerating all feasible biosynthetic pathways for a given compound.ResultsIn our method, we efficiently address the complexity problem by coding substrates, products and reactions into molecular signatures. Metabolic maps are represented using hypergraphs and the complexity is controlled by varying the specificity of the molecular signature. Furthermore, our method enables candidate pathways to be ranked to determine which ones are best to engineer. The proposed ranking function can integrate data from different sources such as host compatibility for inserted genes, the estimation of steady-state fluxes from the genome-wide reconstruction of the organism's metabolism, or the estimation of metabolite toxicity from experimental assays. We use several machine-learning tools in order to estimate enzyme activity and reaction efficiency at each step of the identified pathways. Examples of production in bacteria and yeast for two antibiotics and for one antitumor agent, as well as for several essential metabolites are outlined.ConclusionsWe present here a unified framework that integrates diverse techniques involved in the design of heterologous biosynthetic pathways through a retrosynthetic approach in the reaction signature space. Our engineering methodology enables the flexible design of industrial microorganisms for the efficient on-demand production of chemical compounds with therapeutic applications.

[1]  Robert D. Carr,et al.  The Signature Molecular Descriptor. 4. Canonizing Molecules Using Extended Valence Sequences , 2004, J. Chem. Inf. Model..

[2]  Alan Villalobos,et al.  Designing genes for successful protein expression. , 2011, Methods in enzymology.

[3]  F. Sato,et al.  Microbial production of plant benzylisoquinoline alkaloids , 2008, Proceedings of the National Academy of Sciences.

[4]  Christopher D Reeves,et al.  Combinatorial biosynthesis for drug development. , 2007, Current opinion in microbiology.

[5]  Ronan M. T. Fleming,et al.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 , 2007, Nature Protocols.

[6]  Keith F. Tipton,et al.  History of the enzyme nomenclature system , 2000, Bioinform..

[7]  Masanori Arita,et al.  Metabolic reconstruction using shortest paths , 2000, Simul. Pract. Theory.

[8]  Rainer Schrader,et al.  Metabolic pathway analysis web service (Pathway Hunter Tool at CUBIC) , 2005, Bioinform..

[9]  Peter D. Karp,et al.  Machine learning methods for metabolic pathway prediction , 2010 .

[10]  Dan S. Tawfik,et al.  Enzyme promiscuity: a mechanistic and evolutionary perspective. , 2010, Annual review of biochemistry.

[11]  Jean-Loup Faulon,et al.  Predicting protein-protein interactions using signature products , 2005, Bioinform..

[12]  M. Mavrovouniotis Estimation of standard Gibbs energy changes of biotransformations. , 1991, The Journal of biological chemistry.

[13]  Soha Hassoun,et al.  Probabilistic pathway construction. , 2011, Metabolic engineering.

[14]  Imran Shah,et al.  Heurstic search for metabolic engineering: de novo synthesis of vanillin , 2005, Comput. Chem. Eng..

[15]  J. Keasling Manufacturing Molecules Through Metabolic Engineering , 2010, Science.

[16]  W. Patrick,et al.  Artificial gene amplification reveals an abundance of promiscuous resistance determinants in Escherichia coli , 2010, Proceedings of the National Academy of Sciences.

[17]  John R Carney,et al.  Combinatorial polyketide biosynthesis by de novo design and rearrangement of modular polyketide synthase genes , 2005, Nature Biotechnology.

[18]  Pablo Carbonell,et al.  Molecular signatures-based prediction of enzyme promiscuity , 2010, Bioinform..

[19]  Lydia E. Kavraki,et al.  Finding metabolic pathways using atom tracking , 2010, Bioinform..

[20]  Jean-Loup Faulon,et al.  Genome scale enzyme–metabolite and drug–target interaction predictions using the signature molecular descriptor , 2008 .

[21]  P. Wexler The U.S. National Library of Medicine's Toxicology and Environmental Health Information Program. , 2004, Toxicology.

[22]  Andreas Bender,et al.  Reaction Network Generation , 2010 .

[23]  Zachary L. Fowler,et al.  Increased Malonyl Coenzyme A Biosynthesis by Tuning the Escherichia coli Metabolic Network and Its Application to Flavanone Production , 2009, Applied and Environmental Microbiology.

[24]  Robert M. Williams,et al.  Taxol biosynthesis: Taxane 13α-hydroxylase is a cytochrome P450-dependent monooxygenase , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[26]  S. Horinouchi,et al.  Production of Plant-Specific Flavanones by Escherichia coli Containing an Artificial Gene Cluster , 2003, Applied and Environmental Microbiology.

[27]  R. Croteau,et al.  Molecular cloning of a cytochrome P450 taxane 10 beta-hydroxylase cDNA from Taxus and functional expression in yeast. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[28]  R. Croteau,et al.  Cloning and functional expression of a cDNA encoding geranylgeranyl diphosphate synthase from Taxus canadensis and assessment of the role of this prenyltransferase in cells induced for taxol production. , 1998, Archives of biochemistry and biophysics.

[29]  Kenji Watanabe,et al.  Engineered biosynthesis of an ansamycin polyketide precursor in Escherichia coli , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Jason Weston,et al.  Mismatch string kernels for discriminative protein classification , 2004, Bioinform..

[31]  Jean-Loup Faulon,et al.  The signature molecular descriptor. 3. Inverse-quantitative structure-activity relationship of ICAM-1 inhibitory peptides. , 2003, Journal of molecular graphics & modelling.

[32]  Ronan M. T. Fleming,et al.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 , 2007, Nature Protocols.

[33]  J. Gasteiger,et al.  Chemoinformatics: A Textbook , 2003 .

[34]  V. Lacroix,et al.  An Introduction to Metabolic Networks and Their Structural Analysis , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[35]  Jotun Hein,et al.  Rahnuma: hypergraph-based tool for metabolic pathway prediction and network comparison , 2009, Bioinform..

[36]  J. Martín,et al.  A Novel Epimerization System in Fungal Secondary Metabolism Involved in the Conversion of Isopenicillin N into Penicillin N inAcremonium chrysogenum * , 2002, The Journal of Biological Chemistry.

[37]  Hugo Gramajo,et al.  Production of the Potent Antibacterial Polyketide Erythromycin C in Escherichia coli , 2005, Applied and Environmental Microbiology.

[38]  Applicability and limitation of OSARs for the toxicity of electrophilic chemicals. , 2003, Environmental science & technology.

[39]  Markus J. Herrgård,et al.  Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. , 2004, Genome research.

[40]  Oliver Kohlbacher,et al.  MetaRoute: fast search for relevant metabolic routes for interactive network navigation and visualization , 2008, Bioinform..

[41]  Alfonso Jaramillo,et al.  DESHARKY: automatic design of metabolic pathways for optimal cell growth , 2008, Bioinform..

[42]  Jean-Loup Faulon,et al.  Using product kernels to predict protein interactions. , 2008, Advances in biochemical engineering/biotechnology.

[43]  Andreas Bender,et al.  Handbook of Chemoinformatics Algorithms , 2010 .

[44]  Pablo Carbonell,et al.  Compound toxicity screening and structure-activity relationship modeling in Escherichia coli. , 2012, Biotechnology and bioengineering.

[45]  Leen Stougie,et al.  Enumerating Precursor Sets of Target Metabolites in a Metabolic Network , 2008, WABI.

[46]  Matthew D. Jankowski,et al.  Group contribution method for thermodynamic analysis of complex metabolic networks. , 2008, Biophysical journal.

[47]  D. Frense Taxanes: perspectives for biotechnological production , 2007, Applied Microbiology and Biotechnology.

[48]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[49]  R. Croteau,et al.  Taxol biosynthesis: taxane 13 alpha-hydroxylase is a cytochrome P450-dependent monooxygenase. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Kenji Watanabe,et al.  Diversification of echinomycin molecular structure by way of chemoenzymatic synthesis and heterologous expression of the engineered echinomycin biosynthetic pathway. , 2009, Current opinion in chemical biology.

[51]  Markus Leber,et al.  Automatic assignment of reaction operators to enzymatic reactions , 2009, Bioinform..

[52]  A. Zeng,et al.  Structural synthetic biotechnology: from molecular structure to predictable design for industrial strain development. , 2010, Trends in biotechnology.

[53]  Ahmad S. Khalil,et al.  Synthetic biology: applications come of age , 2010, Nature Reviews Genetics.

[54]  Antje Chang,et al.  BRENDA, AMENDA and FRENDA the enzyme information system: new content and tools in 2009 , 2008, Nucleic Acids Res..

[55]  Ronan M. T. Fleming,et al.  Quantitative assignment of reaction directionality in constraint-based models of metabolism: application to Escherichia coli. , 2009, Biophysical chemistry.

[56]  Keith E. J. Tyo,et al.  Isoprenoid Pathway Optimization for Taxol Precursor Overproduction in Escherichia coli , 2010, Science.

[57]  J. Sohng,et al.  Heterologous production of ribostamycin derivatives in engineered Escherichia coli. , 2010, Research in microbiology.

[58]  Jason A. Papin,et al.  Applications of genome-scale metabolic reconstructions , 2009, Molecular systems biology.

[59]  M. Elowitz,et al.  Build life to understand it , 2010, Nature.

[60]  Peter D. Karp,et al.  Evaluation of computational metabolic-pathway predictions for Helicobacter pylori , 2002, Bioinform..

[61]  R. Croteau,et al.  A cDNA Clone for Taxadiene Synthase, the Diterpene Cyclase That Catalyzes the Committed Step of Taxol Biosynthesis , 1996, The Journal of Biological Chemistry.

[62]  Clay C C Wang,et al.  Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli , 2006, Nature chemical biology.

[63]  Steffen Klamt,et al.  Hypergraphs and Cellular Networks , 2009, PLoS Comput. Biol..

[64]  Dylan Alexander,et al.  Combinatorial biosynthesis of novel antibiotics related to daptomycin , 2006, Proceedings of the National Academy of Sciences.

[65]  Robert M. Long,et al.  Genetic engineering of taxol biosynthetic genes in Saccharomyces cerevisiae. , 2006, Biotechnology and bioengineering.

[66]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[67]  C. Daub,et al.  BMC Systems Biology , 2007 .

[68]  Chunhui Li,et al.  Exploring the diversity of complex metabolic networks , 2005, Bioinform..

[69]  R. Croteau,et al.  Molecular cloning of a 10-deacetylbaccatin III-10-O-acetyl transferase cDNA from Taxus and functional expression in Escherichia coli. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[70]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[71]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[72]  A. Burgard,et al.  Optknock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization , 2003, Biotechnology and bioengineering.

[73]  R. Croteau,et al.  Overproduction, in Escherichia coli, of soluble taxadiene synthase, a key enzyme in the Taxol biosynthetic pathway. , 1998, Protein expression and purification.

[74]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[75]  G. Stephanopoulos,et al.  Tuning genetic control through promoter engineering. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[76]  S. Lee,et al.  Metabolic flux analysis and metabolic engineering of microorganisms. , 2008, Molecular bioSystems.

[77]  H. Kipen,et al.  The National Library of Medicine's Toxicology and Environmental Health Information Program , 1997 .

[78]  R. Croteau,et al.  Cytochrome P450-catalyzed hydroxylation of taxa-4(5),11(12)-diene to taxa-4(20),11(12)-dien-5alpha-ol: the first oxygenation step in taxol biosynthesis. , 1996, Chemistry & biology.

[79]  Blaine A. Pfeifer,et al.  Current status, strategies, and potential for the metabolic engineering of heterologous polyketides in Escherichia coli , 2008, Biotechnology Letters.

[80]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[81]  Mudita Singhal,et al.  COPASI - a COmplex PAthway SImulator , 2006, Bioinform..

[82]  Sunwon Park,et al.  Prediction of novel synthetic pathways for the production of desired chemicals , 2010, BMC Systems Biology.

[83]  Johann Gasteiger,et al.  New Applications of Computers in Chemistry , 1979 .

[84]  Timothy S. Ham,et al.  Production of the antimalarial drug precursor artemisinic acid in engineered yeast , 2006, Nature.

[85]  Marten Veenhuis,et al.  An Engineered Yeast Efficiently Secreting Penicillin , 2009, PloS one.

[86]  M. Koffas,et al.  Metabolic engineering for plant natural product biosynthesis in microbes. , 2008, Current opinion in biotechnology.

[87]  Yang Liu,et al.  Route Designer: A Retrosynthetic Analysis Tool Utilizing Automated Retrosynthetic Rule Generation , 2009, J. Chem. Inf. Model..