A new quantitative structure-property relationship model to predict bioconcentration factors of polychlorinated biphenyls (PCBs) in fishes using E-state index and topological descriptors.

Abstract A quantitative structure–property relationship (QSPR) study for predicting the logarithm of bioconcentration factors (Log  BCF ) of polychlorinated biphenyls (PCBs) is presented in this work. For this, the descriptors were obtained using only the Simplified Molecular Input Line Entry System (SMILES) strings in the free web server Parameter Client. The model was built using the Partial Least Squares (PLS) regression method. The best model presented five descriptors (one E-state index and four topological descriptors) and a high quality for fit, internal, and external predictions. The leave- N -out (LNO) cross validation and the y-randomization test showed the model is robust and has no shown chance correlation. With a second test set, the model was compared to other models and presented a root mean square error (RMSE) very close to the best model. The mechanistic interpretation was corroborated by other works in the literature and by the descriptors' theory. Thus, the results meet the five Organization for Economic Co-operation and Development (OECD) principles for validation of QSA(P)R models, and it is expected the model can effectively predict the BCF values in fishes of the PCB congeners without highly reliable experimental BCF.

[1]  Robert S. Boethling,et al.  Improved method for estimating bioconcentration/bioaccumulation factor from octanol/water partition coefficient , 1999 .

[2]  Randall D. Tobias,et al.  Chemometrics: A Practical Guide , 1998, Technometrics.

[3]  P. White,et al.  Polychlorinated biphenyls (PCBs) contamination and aryl hydrocarbon receptor (AhR) agonist activity of Omega-3 polyunsaturated fatty acid supplements: implications for daily intake of dioxins and PCBs. , 2010, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[4]  Pablo R Duchowicz,et al.  QSAR analysis for quinoxaline-2-carboxylate 1,4-di-N-oxides as anti-mycobacterial agents. , 2009, Journal of molecular graphics & modelling.

[5]  Safe,et al.  Toxic equivalency factors (TEFs) for PCBs, PCDDs, PCDFs for humans and wildlife. , 1998, Environmental health perspectives.

[6]  Eliana Zandonade,et al.  Proposição, validação e análise dos modelos que correlacionam estrutura química e atividade biológica , 2001 .

[7]  Manuela Pavan,et al.  Review of Literature‐Based Quantitative Structure–Activity Relationship Models for Bioconcentration , 2008 .

[8]  Eduardo Borges de Melo,et al.  Multivariate QSAR study on the antimutagenic activity of flavonoids against 3-NFA on Salmonella typhimurium TA98. , 2010, European journal of medicinal chemistry.

[9]  A. Aguiar,et al.  Lead and PCBs as Risk Factors for Attention Deficit/Hyperactivity Disorder , 2010, Environmental health perspectives.

[10]  Y. Li,et al.  Estimation of bioconcentration factors using molecular electro-topological state and flexibility , 2008, SAR and QSAR in environmental research.

[11]  S. Wold,et al.  Statistical Validation of QSAR Results , 1995 .

[12]  Igor V Tetko,et al.  Computing chemistry on the web. , 2005, Drug discovery today.

[13]  Haralambos Sarimveis,et al.  Optimization of biaryl piperidine and 4-amino-2-biarylurea MCH1 receptor antagonists using QSAR modeling, classification techniques and virtual screening , 2007, J. Comput. Aided Mol. Des..

[14]  Frank A. P. C. Gobas,et al.  A review of bioconcentration factor (BCF) and bioaccumulation factor (BAF) assessments for organic chemicals in aquatic organisms , 2006 .

[15]  P. Gramatica,et al.  Modelling and prediction of soil sorption coefficients of non-ionic organic pesticides by molecular descriptors. , 2000, Chemosphere.

[16]  Paola Gramatica,et al.  CHEMOMETRIC METHODS AND THEORETICAL MOLECULAR DESCRIPTORS IN PREDICTIVE QSAR MODELING OF THE ENVIRONMENTAL BEHAVIOR OF ORGANIC POLLUTANTS , 2010 .

[17]  Shu-Shen Liu,et al.  A new predictive model for the bioconcentration factors of polychlorinated biphenyls (PCBs) based on the molecular electronegativity distance vector (MEDV). , 2008, Chemosphere.

[18]  S. Ceccatelli,et al.  The effects of PCBs and dioxins on child health , 2006, Acta paediatrica (Oslo, Norway : 1992). Supplement.

[19]  Prediction of the Bioconcentration Factor of PCBs in Fish Using the Molecular Connectivity Index and Fragment Constant Models , 2005, Water environment research : a research publication of the Water Environment Federation.

[20]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[21]  H. Santos,et al.  Toxicokinetics of Waterborne Trivalent Arsenic in the Freshwater Bivalve Corbicula fluminea , 2009, Archives of environmental contamination and toxicology.

[22]  M. Erickson,et al.  Applications of polychlorinated biphenyls , 2011, Environmental science and pollution research international.

[23]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[24]  Márcia M. C. Ferreira,et al.  Basic validation procedures for regression models in QSAR and QSPR studies: theory and application , 2009 .

[25]  J. Díaz-Ferrero,et al.  Persistent organic pollutants (PCDD/Fs, dioxin-like PCBs, marker PCBs, and PBDEs) in health supplements on the Spanish market. , 2010, Chemosphere.

[26]  E. B. Melo Multivariate SAR/QSAR of 3-aryl-4-hydroxyquinolin-2(1H)-one derivatives as type I fatty acid synthase (FAS) inhibitors. , 2010 .

[27]  Paola Gramatica,et al.  The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models , 2003 .

[28]  J. C. Penteado,et al.  O LEGADO DAS BIFENILAS POLICLORADAS (PCBs) , 2001 .

[29]  Mark T. D. Cronin,et al.  The better predictive model: High q2 for the training set or low root mean square error of prediction for the test set? , 2005 .

[30]  Roberto Todeschini,et al.  Molecular descriptors for chemoinformatics , 2009 .

[31]  G. Ross The public health implications of polychlorinated biphenyls (PCBs) in the environment. , 2004, Ecotoxicology and environmental safety.

[32]  Igor V. Tetko,et al.  Virtual Computational Chemistry Laboratory – Design and Description , 2005, J. Comput. Aided Mol. Des..

[33]  Paola Gramatica,et al.  Principles of QSAR models validation: internal and external , 2007 .

[34]  Euzébio G. Barbosa,et al.  LQTA-QSAR: A New 4D-QSAR Methodology , 2009, J. Chem. Inf. Model..

[35]  Lili Shi,et al.  Using electrotopological state indices to model the depuration rates of polychlorinated biphenyls in mussels of Elliptio complanata. , 2010, Journal of environmental sciences.

[36]  T. Yoshimura Yusho in Japan. , 2003, Industrial health.

[37]  Shu-Shen Liu,et al.  QSPR model for bioconcentration factors of nonpolar organic compounds using molecular electronegativity distance vector descriptors , 2010, Molecular Diversity.

[38]  Roland Weber,et al.  Review Article: Persistent organic pollutants and landfills - a review of past experiences and future challenges , 2011, Waste management & research : the journal of the International Solid Wastes and Public Cleansing Association, ISWA.

[39]  Davide Ballabio,et al.  Evaluation of model predictive ability by external validation techniques , 2010 .

[40]  L. Knudsen,et al.  Studying placental transfer of highly purified non-dioxin-like PCBs in two models of the placental barrier. , 2011, Placenta.

[41]  K. Roy,et al.  Chemometric QSAR Modeling and In Silico Design of Antioxidant NO Donor Phenols , 2010, Scientia pharmaceutica.

[42]  Emilio Benfenati,et al.  A new hybrid system of QSAR models for predicting bioconcentration factors (BCF). , 2008, Chemosphere.

[43]  E. Benfenati,et al.  QSPR modeling bioconcentration factor (BCF) by balance of correlations. , 2009, European journal of medicinal chemistry.

[44]  R. Teófilo,et al.  Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression , 2009 .

[45]  Svetoslav H. Slavov,et al.  Quantitative structure–activity relationship modeling of bioconcentration factors of polychlorinated biphenyls , 2010 .

[46]  C. J. Everett,et al.  Relationship of polychlorinated biphenyls with type 2 diabetes and hypertension. , 2011, Journal of environmental monitoring : JEM.

[47]  M. M. Ferreira,et al.  Multivariate QSAR study of 4,5-dihydroxypyrimidine carboxamides as HIV-1 integrase inhibitors. , 2009, European journal of medicinal chemistry.

[48]  Douglas J. Klein,et al.  Modeling the bioconcentration factors and bioaccumulation factors of polychlorinated biphenyls with posetic quantitative super-structure/activity relationships (QSSAR) , 2006, Molecular Diversity.