Exploring QSTR analysis of the toxicity of phenols and thiophenols using machine learning methods.

There is an increasing need for the rapid safety assessment of chemicals by both industries and regulatory agencies throughout the world. In silico techniques are practical alternatives in the environmental hazard assessment. In this background, quantitative structure-toxicity relationship (QSTR) analysis has been performed on toxicity of phenols and thiophenols to Photobacterium phosphoreum. The techniques of classification and regression trees (CART) and least squares support vector regressions (LS-SVR) were applied successfully as variable selection and mapping tools, respectively. Four descriptors selected by the CART technique have been used as inputs of the LS-SVR for prediction of toxicities. The best model explains 91.8% leave-one-out predicted variance and 93.0% external predicted variance. The predictive performance of the CART-LS-SVR model was significantly better than the previous reported models based on CoMFA/CoMSIA and stepwise MLR techniques, suggesting that the present methodology may be useful to predict of toxicity, safety and risk assessment of chemicals.

[1]  Quantitative Structure–Activity Relationship for Prediction of the Toxicity of Phenols on Photobacterium phosphoreum , 2012, Bulletin of Environmental Contamination and Toxicology.

[2]  Mohammad Asadollahi Baboli Effect of Weight Updates Functions in QSAR/QSPR Modeling using Artificial Neural Network , 2011 .

[3]  Yalei Zhang,et al.  Hydrophobicity-dependent QSARs to predict the toxicity of perfluorinated carboxylic acids and their mixtures. , 2011, Environmental toxicology and pharmacology.

[4]  Jiaqi Shi,et al.  Acute toxicity and n-octanol/water partition coefficients of substituted thiophenols: determination and QSAR analysis. , 2012, Ecotoxicology and environmental safety.

[5]  D. W. Osten,et al.  Selection of optimal regression models via cross‐validation , 1988 .

[6]  Shane Weaver,et al.  The importance of the domain of applicability in QSAR modeling. , 2008, Journal of molecular graphics & modelling.

[7]  Andrew P Worth,et al.  Quantitative structure-activity-activity and quantitative structure-activity investigations of human and rodent toxicity. , 2006, Chemosphere.

[8]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[9]  M. Asadollahi-Baboli Quantitative structure–activity relationship analysis of human neutrophil elastase inhibitors using shuffling classification and regression trees and adaptive neuro-fuzzy inference systems , 2012, SAR and QSAR in environmental research.

[10]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[11]  M. Pavan,et al.  The role of the European Chemicals Bureau in promoting the regulatory use of (Q)SAR methods , 2007, SAR and QSAR in environmental research.

[12]  Manuela Pavan,et al.  Review of (Quantitative) Structure–Activity Relationships for Acute Aquatic Toxicity , 2008 .