Metabolic disassembler for understanding and predicting the biosynthetic units of natural products

BackgroundNatural products are the source of various functional materials such as medicines, and understanding their biosynthetic pathways can provide information that is helpful for their effective production through the synthetic biology approach. A number of studies have aimed to predict biosynthetic pathways from their chemical structures in a retrosynthesis manner; however, sometimes the calculation finishes without reaching the starting material from the target molecule. In order to address this problem, the method to find suitable starting materials is required.ResultsIn this study, we developed a predictive workflow named the Metabolic Disassembler that automatically disassembles the target molecule structure into relevant biosynthetic units (BUs), which are the substructures that correspond to the starting materials in the biosynthesis pathway. This workflow uses a biosynthetic unit library (BUL), which contains starting materials, key intermediates, and their derivatives. We obtained the starting materials from the KEGG PATHWAY database, and 765 BUs were registered in the BUL. We then examined the proposed workflow to optimize the combination of the BUs. To evaluate the performance of the proposed Metabolic Disassembler workflow, we used 943 molecules that are included in the secondary metabolism maps of KEGG PATHWAY. About 95.8% of them (903 molecules) were correctly disassembled by our proposed workflow. For comparison, we also implemented a genetic algorithm-based workflow, and found that the accuracy was only about 52.0%. In addition, for 90.7% of molecules, our workflow finished the calculation within one minute.ConclusionsThe Metabolic Disassembler enabled the effective disassembly of natural products in terms of both correctness and computational time. It also outputs automatically highlighted color-coded substructures corresponding to the BUs to help users understand the calculation results. The users do not have to specify starting molecules in advance, and can input any target molecule, even if it is not in databases. Our workflow will be very useful for understanding and predicting the biosynthesis of natural products.

[1]  Yasuo Tabei,et al.  Metabolome-scale de novo pathway reconstruction using regioisomer-sensitive graph alignments , 2015, Bioinform..

[2]  M. Wink Medicinal Natural Products. A Biosynthetic Approach , 2002 .

[3]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[4]  Akhil Kumar,et al.  Pathway design using de novo steps through uncharted biochemical spaces , 2018, Nature Communications.

[5]  Stephen R. Heller,et al.  InChI - the worldwide chemical structure identifier standard , 2013, Journal of Cheminformatics.

[6]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[7]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[8]  Susumu Goto,et al.  PathPred: an enzyme-catalyzed metabolic pathway prediction server , 2010, Nucleic Acids Res..

[9]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[10]  S. Ōmura,et al.  Avermectins, New Family of Potent Anthelmintic Agents: Producing Organism and Fermentation , 1979, Antimicrobial Agents and Chemotherapy.

[11]  A. Fleming,et al.  Classics in infectious diseases: on the antibacterial action of cultures of a penicillium, with special reference to their use in the isolation of B. influenzae by Alexander Fleming, Reprinted from the British Journal of Experimental Pathology 10:226-236, 1929. , 1980, Reviews of infectious diseases.

[12]  Marc Parizeau,et al.  DEAP: evolutionary algorithms made easy , 2012, J. Mach. Learn. Res..

[13]  A. Fleming,et al.  On the antibacterial action of cultures of a penicillium, with special reference to their use in the isolation of B. influenzæ , 1929 .

[14]  Pablo Carbonell,et al.  Molecular structures enumeration and virtual screening in the chemical space with RetroPath2.0 , 2017, bioRxiv.

[15]  Suxia Han,et al.  fied, along with their associated protein‐protein interaction networks and Kyoto Encyclopedia of Genes and Genomes , 2019 .

[16]  Yu Tian,et al.  PrecursorFinder: a customized biosynthetic precursor explorer , 2019, Bioinform..

[17]  市瀬 浩志 Medicinal Natural Products A Biosynthetic Approach (3rd Edition), Paul M.Dewick著, John Wiley&Sons, B5変型, 550頁, $170.00 , 2009 .

[18]  Pablo Carbonell,et al.  XTMS: pathway design in an eXTended metabolic space , 2014, Nucleic Acids Res..

[19]  Christoph Steinbeck,et al.  Rhea—a manually curated resource of biochemical reactions , 2011, Nucleic Acids Res..

[20]  Tae Yong Kim,et al.  ReactPRED: a tool to predict and analyze biochemical reactions , 2016, Bioinform..

[21]  Soha Hassoun,et al.  Probabilistic pathway construction. , 2011, Metabolic engineering.

[22]  Pablo Carbonell,et al.  RetroPath2.0: A retrosynthesis workflow for metabolic engineers. , 2018, Metabolic engineering.