Identification of H-NMR Spectra of Xyloglucan Oligosaccharides: A Comparative Study of Artificial Neural Networks and Bayesian Classification Using Nonparametric Density Estimation

Proton nuclear magnetic resonance (1H-NMR) is a widely used tool for chemical structural analysis. However, 1H-NMR spectra suffer from natural aberrations that render computer-assisted automated identification of these spectra difficult, and at times impossible. Previous efforts have successfully implemented instrument dependent or conditional identification of these spectra. In this paper, we report the first instrument independent computer-assisted automated identification system for a group of complex carbohydrates known as the xyloglucan oligosaccharides. The developed system is also implemented on the world wide web (this http URL) as part of an identification package called the CCRC-Net and is intended to recognize any submitted 1H-NMR spectrum of these structures with reasonable signal-to-noise ratio, recorded on any 500 MHz NMR instrument. The system uses Artificial Neural Networks (ANNs) technology and is insensitive to the instrument and environment-dependent variations in 1H-NMR spectroscopy. In this paper, comparative results of the ANN engine versus a multidimensional Bayes' classifier is also presented.

[1]  J H Prestegard,et al.  Nuclear magnetic resonance in the era of structural genomics. , 2001, Biochemistry.

[2]  Peter de B. Harrington,et al.  Optimal Associative Memory for Background Correction of Spectra , 1994 .

[3]  Keinosuke Fukunaga,et al.  Chapter 6 – NONPARAMETRIC DENSITY ESTIMATION , 1990 .

[4]  Sabine Van Huffel,et al.  Enhanced resolution based on minimum variance estimation and exponential data modeling , 1993, Signal Process..

[5]  Faramarz Valafar,et al.  Artificial neural networks in chemotype analysis of Cryptococcus neoformans , 1998, Proceedings of the 1998 2nd International Conference Biomedical Engineering Days.

[6]  J. T. Magee,et al.  On mass spectrometer instrument standardization and interlaboratory calibration transfer using neural networks , 1997 .

[7]  Faramarz Valafar,et al.  Prediction of a patient's response to a specific drug treatment using artificial neural networks , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[8]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[9]  Marcus Greferath,et al.  NMR IMAGING OF OBJECTS CONTAINING SIMILAR SUBSTRUCTURES , 1993 .

[10]  John Hardin,et al.  Predicting the effectiveness of hydroxyurea in individual sickle cell anemia patients , 2000, Artif. Intell. Medicine.

[11]  Faramarz Valafar,et al.  Comparative studies of two neural network architectures for modeling of human speech production , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[12]  Pantelis Angelidis Spectrum estimation and the Fourier transform in imaging and spectroscopy , 1996 .

[13]  Peter de B. Harrington,et al.  Quantitative comparison of bidirectional and optimal associative memories for background prediction of spectra , 1995 .

[14]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[15]  Faramarz Valafar,et al.  CCRC-Net: an Internet-based spectral database for complex carbohydrates using artificial neural networks search engines , 1999 .

[16]  J H Prestegard,et al.  Variation of molecular alignment as a means of resolving orientational ambiguities in protein structures from dipolar couplings. , 2000, Journal of magnetic resonance.

[17]  Faramarz Valafar,et al.  Parallel, self organizing, consensus neural networks , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[18]  F. Howe,et al.  In vivo 31p mrs: absolute concentrations, signal‐to‐noise and prior knowledge , 1995, NMR in biomedicine.

[19]  Faramarz Valafar,et al.  A Comparative Study of Artificial Neural Networks Using Reinforcement Learning and Multidimensional Bayesian Classification Using Parzen Density Estimation for Identification of GC-EIMS Spectra of Partially Methylated Alditol Acetates on the World Wide Web , 1999, IC-AI.

[20]  J H Prestegard,et al.  A dipolar coupling based strategy for simultaneous resonance assignment and structure determination of protein backbones. , 2001, Journal of the American Chemical Society.

[21]  Faramarz Valafar,et al.  Distributed global optimization (DGO) , 1996, Proceedings of International Conference on Neural Networks (ICNN'96).