Computational methods for NMR and MS for structure elucidation III: More advanced approaches

Abstract The structural assignment of natural products, even with the very sophisticated one-dimensional and two-dimensional (1D and 2D) spectroscopic methods available today, is still a tedious and time-consuming task. Mass spectrometry (MS) is generally used for molecular mass determination, molecular formula generation and MS/MSn fragmentation patterns of molecules. In the meantime, nuclear magnetic resonance (NMR) spectroscopy provides spectra (e. g. 1 H, 13C and correlation spectra) whose interpretation allows the structure determination of known or unknown compounds. With the advance of high throughput studies, like metabolomics, the fast and automated identification or annotation of natural products became highly demanded. Some growing tools to meet this demand apply computational methods for structure elucidation. These methods act on characteristic parameters in the structural determination of small molecules. We have numbered and herein present existing and reputed computational methods for peak picking analysis, resonance assignment, nuclear Overhauser effect (NOE) assignment, combinatorial fragmentation and structure calculation and prediction. Fully automated programs in structure determination are also mentioned, together with their integrated algorithms used to elucidate the structure of a metabolite. The use of these automated tools has helped to significantly reduce errors introduced by manual processing and, hence, accelerated the structure identification or annotation of compounds.

[1]  W. Bremser Hose — a novel substructure code , 1978 .

[2]  Joshua Lederberg,et al.  Applications of Artificial Intelligence for Organic Chemistry: The DENDRAL Project , 1980 .

[3]  Craig A. Shelley,et al.  Case, a computer model of the structure elucidation process , 1981 .

[4]  Dennis H. Smith,et al.  Applications of artificial intelligence for chemical inference. 37. GENOA: a computer program for structure elucidation utilizing overlapping and alternative substructures , 1981 .

[5]  J. Dubois,et al.  Computer-aided elucidation of structures by carbon-13 nuclear magnetic resonance The DARC-EPIOS Method: Characterization of Ordered Substructures by Correlating the Chemical Shifts of Their Bonded Carbon Atoms , 1984 .

[6]  Kimito Funatsu,et al.  Further development of structure generation in the automated structure elucidation system CHEMICS , 1988, J. Chem. Inf. Comput. Sci..

[7]  S. Sasaki,et al.  Introduction of Two-Dimensional NMR Spectral Information to an Automated Structure Elucidation System, CHEMICS. Utilization of 2D-INADEQUATE Information. , 1989 .

[8]  Kimito Funatsu,et al.  Introduction of two-dimensional NMR spectral information to an automated structure elucidation system, CHEMICS. Utilization of 2D-INADEQUATE information , 1989, J. Chem. Inf. Comput. Sci..

[9]  W. Bremser,et al.  SpecInfo—A multidimensional spectroscopic interpretation system , 1991 .

[10]  J. Nuzillard,et al.  Logic for structure determination , 1991 .

[11]  M. Munk,et al.  The role of two-dimensional nuclear magnetic resonance spectroscopy in computer-enhanced structure elucidation , 1991 .

[12]  D. L. Pavia,et al.  Introduction to spectroscopy : a guide for students of organic chemistry , 1996 .

[13]  J. Nuzillard,et al.  Clerodane diterpenoids and an ursane triterpenoid from Salvia haenkei. Computer-assisted structural elucidation , 1997 .

[14]  M. Munk Computer-Based Structure Determination: Then and Now , 1998, Journal of chemical information and computer sciences.

[15]  M. Billeter,et al.  Automated peak picking and peak integration in macromolecular NMR spectra using AUTOPSY. , 1998, Journal of magnetic resonance.

[16]  Morton E. Munk Computer-Based Structure Determination: Then and Now , 1998, J. Chem. Inf. Comput. Sci..

[17]  Christoph Steinbeck,et al.  SENECA: A Platform-Independent, Distributed, and Parallel System for Computer-Assisted Structure Elucidation in Organic Chemistry , 2001, J. Chem. Inf. Comput. Sci..

[18]  Martin Billeter,et al.  MUNIN: Application of three-way decomposition to the analysis of heteronuclear NMR relaxation data** , 2001, Journal of biomolecular NMR.

[19]  Antony J. Williams,et al.  Computer‐assisted structure elucidation of natural products with limited 2D NMR data: application of the StrucEluc system , 2003 .

[20]  M. Elyashberg,et al.  Quindolinocryptotackieine: the elucidation of a novel indoloquinoline alkaloid structure through the use of computer‐assisted structure elucidation and 2D NMR , 2003 .

[21]  Yves Gibon,et al.  GMD@CSB.DB: the Golm Metabolome Database , 2005, Bioinform..

[22]  J. Nuzillard,et al.  Automatic Structure Elucidation through Data Base Search and 2D NMR Spectral Analysis , 2006 .

[23]  Xiang Wan,et al.  CISA: Combined NMR Resonance Connectivity Information Determination and Sequential Assignment , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[24]  Guohui Lin,et al.  CISA: Combined NMR Resonance Connectivity Information Determination and Sequential Assignment , 2007, TCBB.

[25]  Antony J. Williams,et al.  Computer-Assisted Structure Verification and Elucidation Tools in NMR-based Structure Elucidation , 2008 .

[26]  Ari Rantanen,et al.  FiD: a software for ab initio structural identification of product ions from tandem mass spectrometric data. , 2008, Rapid communications in mass spectrometry : RCM.

[27]  Matthias Müller-Hannemann,et al.  In silico fragmentation for computer assisted identification of metabolite mass spectra , 2010, BMC Bioinformatics.

[28]  Xin Gao,et al.  PICKY: a novel SVD-based NMR spectra peak picking method , 2009, Bioinform..

[29]  G. Kummerlöwe,et al.  Residual dipolar couplings as a powerful tool for constitutional analysis: the unexpected formation of tricyclic compounds. , 2011, Angewandte Chemie.

[30]  Egon L. Willighagen,et al.  Elemental composition determination based on MSn , 2011, Bioinform..

[31]  Zhi Liu,et al.  WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering , 2012, Bioinform..

[32]  Morton E. Munk,et al.  INFERCNMR: A 13C NMR Interpretive Library Search System , 2012, J. Chem. Inf. Model..

[33]  M. Kurz,et al.  Isolation and structural elucidation of armeniaspirols A-C: potent antibiotics against gram-positive pathogens. , 2012, Chemistry.

[34]  Antony J. Williams,et al.  Elucidating ‘undecipherable’ chemical structures using computer‐assisted structure elucidation approaches , 2012, Magnetic resonance in chemistry : MRC.

[35]  Emma L. Schymanski,et al.  Small Molecule Identification with MOLGEN and Mass Spectrometry , 2013, Metabolites.

[36]  Bertrand Plainchont,et al.  Recent advances in the structure elucidation of small organic molecules by the LSD software , 2013, Magnetic resonance in chemistry : MRC.

[37]  Sebastian Böcker,et al.  Computational mass spectrometry for small molecules , 2013, Journal of Cheminformatics.

[38]  Takeaki Uno,et al.  Chemical Structure Elucidation from 13C NMR Chemical Shifts: Efficient Data Processing Using Bipartite Matching and Maximal Clique Algorithms , 2014, J. Chem. Inf. Model..

[39]  S. Böcker,et al.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID , 2015, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Antony J. Williams,et al.  Computer–Based Structure Elucidation from Spectral Data: The Art of Solving Problems , 2015 .

[41]  Mikhail E. Elyashberg,et al.  Identification and structure elucidation by NMR spectroscopy , 2015 .

[42]  O. Fiehn,et al.  Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics. , 2015, Trends in analytical chemistry : TRAC.

[43]  Emma L. Schymanski,et al.  MetFrag relaunched: incorporating strategies beyond in silico fragmentation , 2016, Journal of Cheminformatics.

[44]  Kristian Fog Nielsen,et al.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking , 2016, Nature Biotechnology.

[45]  Sebastian Böcker,et al.  Searching molecular structure databases using tandem MS data: are we there yet? , 2017, Current opinion in chemical biology.

[46]  Jian Ji,et al.  Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics , 2018, Metabolites.

[47]  Noureddin Sadawi,et al.  ChemDistiller: an engine for metabolite annotation in mass spectrometry , 2018, Bioinform..

[48]  Hiroshi Mamitsuka,et al.  Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches , 2018, Briefings Bioinform..