Pattern recognition/expert system for mass spectra of volatile toxic and other organic compounds

Abstract A pattern recognition/expert system for classifying, estimating the molecular weights and identifying the low resolution mass spectra of toxic and other volatile organic compounds was developed with a rule-building expert shell program. The system was designed for low concentration spectra by using high intensity and high Shannon information content masses. A sequential filter design was used with an initial classification module followed by molecular weight and base peak filters and identification modules. Ternary encoded reference spectra of 74 target and 31 other compounds were used to establish the five target and the one unknown classes. The target classes were nonhalobenzenes; chlorobenzenes; bromo- and bromochloroalkanes/alkenes; mono- and dichloroalkanes/alkenes; and tri-, tetra- and pentachloroalkanes/alkenes. A molecular weight estimator provided molecular weight estimates even for unclassified spectra. The resulting system was extensively tested with reference spectra of 31 potential air pollutants, 100 randomly selected compounds, 100 GC-MS field spectra and with 54 000 NIST spectra. Classification and identification accuracies for training, test, random and field spectra were 100 and 100; 94 and 100; 76 and 100; and 90 and 96%, respectively. The median absolute deviations of the estimated molecular weights from the true values for the same sets were 0, 7, 7 and 1.

[1]  F. McLafferty,et al.  Computer prediction of molecular weights from mass spectra , 1981 .

[2]  Probabilities of peak occurrence and information content of the NBS/EPA/MSDC mass-spectral data base , 1990 .

[3]  W. Dunn,et al.  Comparison of SIMCA pattern recognition and library search identification of hazardous compounds from mass spectra , 1992 .

[4]  Donald R. Scott 1ST-CLASS (version 3.52) and FUSION (version 1.17) expert shell systems , 1990 .

[5]  Donald R. Scott,et al.  Determination of chemical classes from mass spectra of toxic organic compounds by SIMCA pattern recognition and information theory , 1986 .

[6]  Fred W. McLafferty,et al.  Retrieval and interpretative computer programs for mass spectrometry , 1985, J. Chem. Inf. Comput. Sci..

[7]  W. Dunn,et al.  Classification and identification of hazardous organic compounds in ambient air by pattern recognition of mass spectral data , 1987 .

[8]  Donald R. Scott,et al.  EXPERT SYSTEM FOR ESTIMATES OF MOLECULAR WEIGHTS OF VOLATILE ORGANIC COMPOUNDS FROM LOW-RESOLUTION MASS SPECTRA , 1991 .

[9]  Donald R. Scott Classification of binary mass spectra of toxic compounds with an inductive expert system and comparison with SIMCA class modeling , 1988 .

[10]  M. Koehler,et al.  Application of pattern recognition to mass spectral data of toxic organic compounds in ambient air , 1987 .

[11]  Donald R. Scott Classification and identification of mass spectra of toxic compounds with an inductive rule-building expert system and information theory , 1989 .

[12]  Donald R. Scott Improved method for estimating molecular weights of volatile organic compounds from low resolution mass spectra , 1991 .