A quantitative structure- property relationship of gas chromatographic/mass spectrometric retention data of 85 volatile organic compounds as air pollutant materials by multivariate methods

A quantitative structure-property relationship (QSPR) study is suggested for the prediction of retention times of volatile organic compounds. Various kinds of molecular descriptors were calculated to represent the molecular structure of compounds. Modeling of retention times of these compounds as a function of the theoretically derived descriptors was established by multiple linear regression (MLR) and artificial neural network (ANN). The stepwise regression was used for the selection of the variables which gives the best-fitted models. After variable selection ANN, MLR methods were used with leave-one-out cross validation for building the regression models. The prediction results are in very good agreement with the experimental values. MLR as the linear regression method shows good ability in the prediction of the retention times of the prediction set. This provided a new and effective method for predicting the chromatography retention index for the volatile organic compounds.

[1]  Zhide Hu,et al.  QSPR prediction of GC retention indices for nitrogen-containing polycyclic aromatic compounds from heuristically computed molecular descriptors. , 2005, Talanta.

[2]  Julio Caballero,et al.  Linear and nonlinear QSAR study of N-hydroxy-2-[(phenylsulfonyl)amino]acetamide derivatives as matrix metalloproteinase inhibitors. , 2006, Bioorganic & medicinal chemistry.

[3]  Gérard Dreyfus,et al.  Neural networks - methodology and applications , 2005 .

[4]  Bahram Hemmateenejad,et al.  QSAR study of the calcium channel antagonist activity of some recently synthesized dihydropyridine derivatives. An application of genetic algorithm for variable selection in MLR and PLS methods , 2002 .

[5]  Gilles Marcou,et al.  Predicting the Predictability: A Unified Approach to the Applicability Domain Problem of QSAR Models , 2009, J. Chem. Inf. Model..

[6]  M. Cronin,et al.  Application of the modelling power approach to variable subset selection for GA-PLS QSAR models. , 2008, Analytica chimica acta.

[7]  T. Hassard,et al.  Applied Linear Regression , 2005 .

[8]  J. Doucet,et al.  QSAR models for 2-amino-6-arylsulfonylbenzonitriles and congeners HIV-1 reverse transcriptase inhibitors based on linear and nonlinear regression methods. , 2009, European journal of medicinal chemistry.

[9]  R. Jayanty,et al.  Evaluation of aluminum canisters for the collection and storage of air toxics , 1990 .

[10]  M. Koupparis,et al.  Gas chromatographic quantitative structure-retention relationships of trimethylsilylated anabolic androgenic steroids by multiple linear regression and partial least squares. , 2009, Journal of chromatography. A.

[11]  Malik Beshir Malik,et al.  Applied Linear Regression , 2005, Technometrics.

[12]  Wen‐Jun Zhang,et al.  Comparison of different methods for variable selection , 2001 .

[13]  Joachim D. Pleil,et al.  Sample integrity of trace level volatile organic compounds in ambient air stored in SUMMA® polished canisters , 1986 .

[14]  R. Leardi,et al.  Genetic algorithms applied to feature selection in PLS regression: how and when to use them , 1998 .

[15]  Yizeng Liang,et al.  QSPR study of GC retention indices for saturated esters on seven stationary phases based on novel topological indices. , 2007, Talanta.

[16]  Ziding Zhang,et al.  Studies on log P, retention time and QSAR of 2-substituted phenylnitronyl nitroxides as free radical scavengers , 2007 .

[17]  A. Katritzky,et al.  QSPR correlation and predictions of GC retention indexes for methyl-branched hydrocarbons produced by insects. , 2000, Analytical chemistry.

[18]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[19]  S. Weisberg Applied Linear Regression, 2nd Edition. , 1987 .

[20]  Eslam Pourbasheer,et al.  Application of genetic algorithm-support vector machine (GA-SVM) for prediction of BK-channels activity. , 2009, European journal of medicinal chemistry.

[21]  Maykel Pérez González,et al.  QSAR studies about cytotoxicity of benzophenazines with dual inhibition toward both topoisomerases I and II: 3D-MoRSE descriptors and statistical considerations about variable selection. , 2006, Bioorganic & medicinal chemistry.

[22]  T. Hancock,et al.  A performance comparison of modern statistical techniques for molecular descriptor selection and retention prediction in chromatographic QSRR studies , 2005 .

[23]  D. A. Bender,et al.  Determination of a Wide Range of Volatile Organic Compounds in Ambient Air Using Multisorbent Adsorption/Thermal Desorption and Gas Chromatography/Mass Spectrometry , 1998 .

[24]  Mehdi Jalali-Heravi,et al.  Use of Computer-Assisted Methods for the Modeling of the Retention Time of a Variety of Volatile Organic Compounds: A PCA-MLR-ANN Approach , 2004, J. Chem. Inf. Model..

[25]  J. Torrecilla,et al.  Estimation of toxicity of ionic liquids in Leukemia Rat Cell Line and Acetylcholinesterase enzyme by principal component analysis, neural networks and multiple lineal regressions. , 2009, Journal of hazardous materials.

[26]  D. Sin,et al.  Development of an analytical technique and stability evaluation of 143 C3-C12 volatile organic compounds in Summa canisters by gas chromatography-mass spectrometry. , 2001, The Analyst.

[27]  Károly Héberger,et al.  Prediction of Anti-HIV-1 Activity of a Series of Tetrapyrrole Molecules , 2003, J. Chem. Inf. Comput. Sci..

[28]  Yves Chauvin,et al.  Backpropagation: theory, architectures, and applications , 1995 .

[29]  Yong Pan,et al.  Predicting the auto-ignition temperatures of organic compounds from molecular structure using support vector machine. , 2009, Journal of hazardous materials.

[30]  R. Leardi Application of a genetic algorithm to feature selection under full validation conditions and to outlier detection , 1994 .

[31]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[32]  K. K. Hii Chemistry Central Journal , 2007 .

[33]  Bahram Hemmateenejad,et al.  QSPR models for half-wave reduction potential of steroids: a comparative study between feature selection and feature extraction from subsets of or entire set of descriptors. , 2009, Analytica chimica acta.

[34]  G. Miller,et al.  Analytical Method for Determination of Trace Organics in Gas Samples Collected by Canister , 1991 .

[35]  Feng Luan,et al.  Prediction of retention time of a variety of volatile organic compounds based on the heuristic method and support vector machine , 2005 .

[36]  Jahan B. Ghasemi,et al.  Prediction of gas chromatography/electron capture detector retention times of chlorinated pesticides, herbicides, and organohalides by multivariate chemometrics methods. , 2007, Analytica chimica acta.

[37]  K. Funatsu,et al.  GA strategy for variable selection in QSAR studies: GAPLS and D-optimal designs for predictive QSAR model , 1998 .

[38]  David M. Haaland,et al.  Partial least-squares methods for spectral analyses. 2. Application to simulated and glass spectral data , 1988 .