Taxonomy-focused Natural Product Databases for Carbon-13 NMR-based Dereplication

The recent revival of the study of organic natural products as renewable sources of medicinal drugs, cosmetics, dyes, and materials motivated the creation of general purpose structural databases. Dereplication, the efficient identification of already reported compounds, relies on the grouping of structural, taxonomic and spectroscopic databases that focus on a particular taxon (species, genus, family, order, etc.). A set of freely available python scripts, CNMR_Predict, is proposed for the quick supplementation of taxon oriented search results from the naturaL prOducTs occUrrences database (LOTUS, lotus.naturalproducts.net) with predicted carbon-13 nuclear magnetic resonance data from the ACD/Labs CNMR predictor and DB software (acdlabs.com) to provide easily searchable databases. The database construction process is illustrated using Brassica rapa as a taxon example.

[1]  Jean-Marc Nuzillard,et al.  Identification of natural metabolites in mixture: a pattern recognition strategy based on (13)C NMR. , 2014, Analytical chemistry.

[2]  Jean-Marc Nuzillard,et al.  Dereplication strategies in natural product research: How many tools and methodologies behind the same concept? , 2015, Phytochemistry Reviews.

[3]  G. Bifulco,et al.  Elucidating the Relative and Absolute Configuration of Organic Compounds by Quantum Mechanical Approaches , 2020 .

[4]  S. Rogers,et al.  Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches , 2021, Natural product reports.

[5]  C. Steinbeck,et al.  Open Natural Products Research: Curation and Dissemination of Biological Occurrences of Chemical Structures through Wikidata , 2021 .

[6]  S. Kuhn,et al.  The Three Pillars of Natural Product Dereplication. Alkaloids from the Bulbs of Urceolina peruviana (C. Presl) J.F. Macbr. as a Preliminary Test Case , 2021, Molecules.

[7]  G. Zengin,et al.  Conventional versus green extraction techniques — a comparative perspective , 2021 .

[8]  F. Saubion,et al.  MixONat, a software for the dereplication of mixtures based on 13C NMR spectroscopy. , 2020, Analytical chemistry.

[9]  Christoph Steinbeck,et al.  NMRShiftDB -- compound identification and structure elucidation support through a free community-built web database. , 2004, Phytochemistry.

[10]  Stephen R. Heller,et al.  InChI, the IUPAC International Chemical Identifier , 2015, Journal of Cheminformatics.

[11]  D. Schaufelberger,et al.  Dereplication of phorbol bioactives: Lyngbya majuscula and Croton cuneatus. , 1990, Journal of natural products.

[12]  P. Joseph-Nathan,et al.  Vibrational Circular Dichroism Absolute Configuration of Natural Products From 2015 to 2019 , 2021 .

[13]  P. Richomme,et al.  13C-NMR dereplication of Garcinia extracts: Predicted chemical shifts as reliable databases. , 2018, Fitoterapia.

[14]  Maria Sorokina,et al.  Review on natural products databases: where to find data in 2020 , 2020, Journal of Cheminformatics.

[15]  Kohulan Rajan,et al.  COCONUT online: Collection of Open Natural Products database , 2021, Journal of Cheminformatics.

[16]  T. Wenzel Strategies for using NMR spectroscopy to determine absolute configuration , 2017 .

[17]  Justin J. J. van der Hooft,et al.  Reproducible molecular networking of untargeted mass spectrometry data using GNPS , 2020, Nature Protocols.

[18]  The LOTUS Initiative for Open Natural Products Research: Knowledge Management through Wikidata , 2021 .

[19]  Jean-Marc Nuzillard,et al.  Computer-Aided 13C NMR Chemical Profiling of Crude Natural Extracts without Fractionation. , 2017, Journal of natural products.

[20]  J. Nuzillard,et al.  Tailoring the nuclear Overhauser effect for the study of small and medium-sized molecules by solvent viscosity manipulation. , 2021, Progress in nuclear magnetic resonance spectroscopy.

[21]  W. Robien The Advantage of Automatic Peer-Reviewing of 13C-NMR Reference Data Using the CSEARCH-Protocol † , 2021, Molecules.