Empirical pattern recognition/expert system for molecular weight estimation of low resolution mass spectra

Abstract A fast, personal-computer based method of estimating molecular weights of organic compounds from low resolution mass spectra has been redesigned and implemented with a rule-based expert system. It has a sequential design with a pattern recognition classifier followed by filter and molecular weight estimator modules for each of six classes. The classes are nonhalobenzenes, chlorobenzenes, bromo- and bromochloroalkanes/alkenes, mono- and di-chloroalkanes/alkenes, tri-, tetra- and pentachloroalkanes/alkenes and unknowns. The classifier was derived from 106 NIST/EPA/MSDC reference spectra. The filters employ computed series of allowed molecular weights and selected base peaks for each class, except unknown, to reduce misclassification. Empirical linear corrections from the training spectra are applied to two mass spectral features, MAXMASS and HIMAX1, to yield estimates and lower limits to the molecular weights. Extensive testing of the system was conducted with 32 test, 99 randomly chosen and 37 field gas chromatographic-mass spectrometric (GC-MS) spectra and results were compared to those from STIRS. The median absolute deviations from the true molecular weights of the test, random and field GC-MS spectra with the expert system were all 1 dalton (average 5.6, 7.3, 5.9 daltons, respectively). This approach also was evaluated with 400 spectra of volatile and nonvolatile compounds of pharmaceutical interest. The median and average absolute deviations from the true molecular weights of the 400 spectra were 2 and 10 daltons. Classification of the evaluation spectra, including many incomplete spectra, was very good with accuracies of 97 (test, random and pharmaceutical) and 95% (field GC-MS).

[1]  Silvia Heuerding,et al.  Simple tools for the computer-aided interpretation of mass spectra , 1993 .

[2]  Stephen E. Stein,et al.  Large scale evaluation of a pattern recognition/expert system for mass spectral molecular weight estimation , 1993 .

[3]  Donald R. Scott,et al.  Rapid and accurate method for estimating molecular weights of organic compounds from low resolution mass spectra , 1992 .

[4]  Donald R. Scott Classification of binary mass spectra of toxic compounds with an inductive expert system and comparison with SIMCA class modeling , 1988 .

[5]  Fred W. McLafferty,et al.  Retrieval and interpretative computer programs for mass spectrometry , 1985, J. Chem. Inf. Comput. Sci..

[6]  Donald R. Scott,et al.  EXPERT SYSTEM FOR ESTIMATES OF MOLECULAR WEIGHTS OF VOLATILE ORGANIC COMPOUNDS FROM LOW-RESOLUTION MASS SPECTRA , 1991 .

[7]  F. McLafferty,et al.  Computer prediction of molecular weights from mass spectra , 1981 .

[8]  Donald R. Scott,et al.  Pattern recognition/expert system for mass spectra of volatile toxic and other organic compounds , 1992 .

[9]  Donald R. Scott 1ST-CLASS (version 3.52) and FUSION (version 1.17) expert shell systems , 1990 .

[10]  Donald R. Scott,et al.  Determination of chemical classes from mass spectra of toxic organic compounds by SIMCA pattern recognition and information theory , 1986 .

[11]  Donald R. Scott Classification and identification of mass spectra of toxic compounds with an inductive rule-building expert system and information theory , 1989 .

[12]  Donald R. Scott Improved method for estimating molecular weights of volatile organic compounds from low resolution mass spectra , 1991 .