Pathway design using de novo steps through uncharted biochemical spaces

Existing retrosynthesis tools generally traverse production routes from a source to a sink metabolite using known enzymes or de novo steps. Generally, important considerations such as blending known transformations with putative steps, complexity of pathway topology, mass conservation, cofactor balance, thermodynamic feasibility, microbial chassis selection, and cost are largely dealt with in a posteriori fashion. The computational procedure we present here designs bioconversion routes while simultaneously considering any combination of the aforementioned design criteria. First, we track and codify as rules all reaction centers using a prime factorization-based encoding technique (rePrime). Reaction rules and known biotransformations are then simultaneously used by the pathway design algorithm (novoStoic) to trace both metabolites and molecular moieties through balanced bio-conversion strategies. We demonstrate the use of novoStoic in bypassing steps in existing pathways through putative transformations, assembling complex pathways blending both known and putative steps toward pharmaceuticals, and postulating ways to biodegrade xenobiotics.Existing pathway design tools make use of existing reactions from databases or successively apply retrosynthetic rules. novoStoic provides an integrated optimization-based framework combining known reactions with novel steps in pathway design allowing for constraints on thermodynamic feasibility, product yield, pathway length and number of novel steps.

[1]  Wenjun Zhang,et al.  Identification of phenylalanine 3-hydroxylase for meta-tyrosine biosynthesis. , 2011, Biochemistry.

[2]  Rainer Schrader,et al.  Metabolic pathway analysis web service (Pathway Hunter Tool at CUBIC) , 2005, Bioinform..

[3]  S. E. Lantz,et al.  Phylogenetic and physiological comparisons of PAH-degrading bacteria from geographically diverse soils , 1997, Antonie van Leeuwenhoek.

[4]  James Dugundji,et al.  An algebraic model of constitutional chemistry as a basis for chemical computer programs , 1973 .

[5]  Jack Liang,et al.  Practical chiral alcohol manufacture using ketoreductases. , 2010, Current opinion in chemical biology.

[6]  M. Shiaris,et al.  Metabolism of naphthalene, fluorene, and phenanthrene: preliminary characterization of a cloned gene cluster from Pseudomonas putida NCIB 9816 , 1994, Journal of bacteriology.

[7]  Adam M. Feist,et al.  Generation of an atlas for commodity chemical production in Escherichia coli and a novel pathway prediction algorithm, GEM-Path. , 2014, Metabolic engineering.

[8]  Oliver Kohlbacher,et al.  MetaRoute: fast search for relevant metabolic routes for interactive network navigation and visualization , 2008, Bioinform..

[9]  Angel Rubio,et al.  Computing the shortest elementary flux modes in genome-scale metabolic networks , 2009, Bioinform..

[10]  Hal S Alper,et al.  Rewiring yeast sugar transporter preference through modifying a conserved protein motif , 2013, Proceedings of the National Academy of Sciences.

[11]  P B RUSSELL,et al.  New route to phenylephrine. , 1961, Journal of pharmaceutical sciences.

[12]  K. Iwashita,et al.  A new method for isolation of S-adenosylmethionine (SAM)-accumulating yeast , 2006, Applied Microbiology and Biotechnology.

[13]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[14]  Olivier Martin,et al.  MetaNetX/MNXref – reconciliation of metabolites and biochemical reactions to bring together genome-scale metabolic networks , 2015, Nucleic Acids Res..

[15]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[16]  Amanda L. Smith,et al.  Computational protein design enables a novel one-carbon assimilation pathway , 2015, Proceedings of the National Academy of Sciences.

[17]  Dietmar Schomburg,et al.  BKM-react, an integrated biochemical reaction database , 2011, BMC Biochemistry.

[18]  P. Facchini,et al.  Biosynthesis of amphetamine analogs in plants. , 2012, Trends in plant science.

[19]  Dan S. Tawfik,et al.  Enzyme promiscuity: evolutionary and mechanistic aspects. , 2006, Current opinion in chemical biology.

[20]  Steffen Klamt,et al.  Genome-scale strain designs based on regulatory minimal cut sets , 2015, Bioinform..

[21]  Stefan Kramer,et al.  Data-driven extraction of relative reasoning rules to limit combinatorial explosion in biodegradation pathway prediction , 2008, Bioinform..

[22]  Yi Liu,et al.  RosettaDesign server for protein design , 2006, Nucleic Acids Res..

[23]  Antje Chang,et al.  BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA , 2012, Nucleic Acids Res..

[24]  V. Hatzimanikatis,et al.  Thermodynamics-based metabolic flux analysis. , 2007, Biophysical journal.

[25]  Mukund S. Chorghade,et al.  A Practical Synthesis of (R)-(−)-Phenylephrine Hydrochloride , 1998 .

[26]  Pablo Carbonell,et al.  Semisupervised Gaussian Process for Automated Enzyme Search. , 2016, ACS synthetic biology.

[27]  Pablo Carbonell,et al.  XTMS: pathway design in an eXTended metabolic space , 2014, Nucleic Acids Res..

[28]  A. K. Haritash,et al.  Biodegradation aspects of polycyclic aromatic hydrocarbons (PAHs): a review. , 2009, Journal of hazardous materials.

[29]  Susumu Goto,et al.  PathPred: an enzyme-catalyzed metabolic pathway prediction server , 2010, Nucleic Acids Res..

[30]  Paul N. Devine,et al.  Biocatalytic Asymmetric Synthesis of Chiral Amines from Ketones Applied to Sitagliptin Manufacture , 2010, Science.

[31]  Andrew Currin,et al.  Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently , 2014, Chemical Society reviews.

[32]  John A. Bumpus,et al.  Fungi in Bioremediation , 2001 .

[33]  Radhakrishnan Mahadevan,et al.  Redesigning metabolism based on orthogonality principles , 2017, Nature Communications.

[34]  Ju Chu,et al.  Intracellular expression of Vitreoscilla hemoglobin improves S-adenosylmethionine production in a recombinant Pichia pastoris , 2007, Applied Microbiology and Biotechnology.

[35]  Zhiyong Lu,et al.  On expert curation and scalability: UniProtKB/Swiss-Prot as a case study , 2017, Bioinform..

[36]  Hongtao Yu,et al.  ENVIRONMENTAL CARCINOGENIC POLYCYCLIC AROMATIC HYDROCARBONS: PHOTOCHEMISTRY AND PHOTOTOXICITY , 2002, Journal of environmental science and health. Part C, Environmental carcinogenesis & ecotoxicology reviews.

[37]  Vassily Hatzimanikatis,et al.  Computational framework for predictive biodegradation , 2009, Biotechnology and bioengineering.

[38]  Yang Liu,et al.  Route Designer: A Retrosynthetic Analysis Tool Utilizing Automated Retrosynthetic Rule Generation , 2009, J. Chem. Inf. Model..

[39]  Lorna J. Hepworth,et al.  Constructing Biocatalytic Cascades: In Vitro and in Vivo Approaches to de Novo Multi-Enzyme Pathways , 2017 .

[40]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[41]  A. Burgard,et al.  Optknock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization , 2003, Biotechnology and bioengineering.

[42]  Kent McClymont,et al.  Metabolic tinker: an online tool for guiding the design of synthetic metabolic pathways , 2013, Nucleic acids research.

[43]  S. Atsumi,et al.  Expanding ester biosynthesis in Escherichia coli. , 2014, Nature chemical biology.

[44]  Pradeep Kumar,et al.  Enantioselective synthesis of (R)-phenylephrine hydrochloride , 2003 .

[45]  Susumu Goto,et al.  Data, information, knowledge and principle: back to metabolism in KEGG , 2013, Nucleic Acids Res..

[46]  Frances H Arnold,et al.  Expanding the enzyme universe: accessing non-natural reactions by mechanism-guided directed evolution. , 2015, Angewandte Chemie.

[47]  Antony J. Williams,et al.  ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology. , 2016, Chemical research in toxicology.

[48]  Anupam Chowdhury,et al.  Designing overall stoichiometric conversions and intervening metabolic reactions , 2015, Scientific Reports.

[49]  David Baker,et al.  Bridging the gaps in design methodologies by evolutionary optimization of the stability and proficiency of designed Kemp eliminase KE59 , 2012, Proceedings of the National Academy of Sciences.

[50]  Wipanoot Baison,et al.  An alternative synthesis of ()-phenylephrine hydrochloride , 2014 .

[51]  スティーブン ジェイ. ゴート,et al.  Alanine 2,3 aminomutase , 2003 .

[52]  Ronan M. T. Fleming,et al.  Consistent Estimation of Gibbs Energy Using Component Contributions , 2013, PLoS Comput. Biol..

[53]  Derek Y. Chiang,et al.  MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery , 2010, Nucleic acids research.

[54]  Patrik R. Jones,et al.  Carboxylic acid reductase is a versatile enzyme for the conversion of fatty acids into fuels and chemical commodities , 2012, Proceedings of the National Academy of Sciences.

[55]  Frances H. Arnold,et al.  Olefin Cyclopropanation via Carbene Transfer Catalyzed by Engineered Cytochrome P450 Enzymes , 2013, Science.

[56]  Costas D. Maranas,et al.  OptForce: An Optimization Procedure for Identifying All Genetic Manipulations Leading to Targeted Overproductions , 2010, PLoS Comput. Biol..

[57]  Costas D. Maranas,et al.  MetRxn: a knowledgebase of metabolites and reactions spanning metabolic models and databases , 2012, BMC Bioinformatics.

[58]  Costas D. Maranas,et al.  CLCA: Maximum Common Molecular Substructure Queries within the MetRxn Database , 2014, J. Chem. Inf. Model..

[59]  Yoshihiro Yamanishi,et al.  E-zyme: predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs , 2009, Bioinform..

[60]  Kyongbum Lee,et al.  Utilizing elementary mode analysis, pathway thermodynamics, and a genetic algorithm for metabolic flux determination and optimal metabolic network design , 2010, BMC Systems Biology.

[61]  Mukund S. Chorghade,et al.  A Practical Synthesis of (R)-(-)-Phenylephrine Hydrochloride. , 1999 .

[62]  Ron Milo,et al.  eQuilibrator—the biochemical thermodynamics calculator , 2011, Nucleic Acids Res..

[63]  Anne Morgat,et al.  Updates in Rhea – an expert curated resource of biochemical reactions , 2017, Nucleic Acids Res..

[64]  Hiroshi Habe,et al.  Genetics of Polycyclic Aromatic Hydrocarbon Metabolism in Diverse Aerobic Bacteria , 2003, Bioscience, biotechnology, and biochemistry.

[65]  A. Burgard,et al.  Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol. , 2011, Nature chemical biology.

[66]  J. Liao,et al.  Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels , 2008, Nature.

[67]  B. Griffin,et al.  Network Context and Selection in the Evolution to Enzyme Specificity , 2014 .

[68]  C. Maranas,et al.  IPRO: an iterative computational protein library redesign and optimization procedure. , 2006, Biophysical journal.

[69]  James C. Liao,et al.  Expanding metabolism for biosynthesis of nonnatural alcohols , 2008, Proceedings of the National Academy of Sciences.

[70]  Francisco J. Planes,et al.  Path finding methods accounting for stoichiometry in metabolic networks , 2011, Genome Biology.

[71]  A H van Gennip,et al.  Specific genetic deficiencies of the A and B isoenzymes of monoamine oxidase are characterized by distinct neurochemical and clinical phenotypes. , 1996, The Journal of clinical investigation.