QSAR model as a random event: A case of rat toxicity.

Quantitative structure-property/activity relationships (QSPRs/QSARs) can be used to predict physicochemical and/or biochemical behavior of substances which were not studied experimentally. Typically predicted values for chemicals in the training set are accurate since they were used to build the model. QSPR/QSAR models must be validated before they are used in practice. Unfortunately, the majority of the suggested approaches of the validation of QSPR/QSAR models are based on consideration of geometrical features of clusters of data points in the plot of experimental versus calculated values of an endpoint. We believe these geometrical criteria can be more useful if they are analyzed for several splits into the training and test sets. In this way, one can estimate the reproducibility of the model with various splits and better evaluate model reliability. The probability of the correct prediction of an endpoint for external validation set (in the series of the above-mentioned splits) can provide an useful way to evaluate the domain of applicability of the model.

[1]  G. Gini,et al.  QSAR modeling of anxiolytic activity taking into account the presence of keto- and enol-tautomers by balance of correlations with ideal slopes , 2011 .

[2]  E. Benfenati,et al.  QSAR modelling of carcinogenicity by balance of correlations , 2009, Molecular Diversity.

[3]  George Kollias,et al.  Molecular Modeling on Pyrimidine-Urea Inhibitors of TNF-α Production: An Integrated Approach Using a Combination of Molecular Docking, Classification Techniques, and 3D-QSAR CoMSIA , 2012, J. Chem. Inf. Model..

[4]  A. Toropova,et al.  QSAR models for HEPT derivates as NNRTI inhibitors based on Monte Carlo method. , 2014, European journal of medicinal chemistry.

[5]  Emilio Benfenati,et al.  QSPR modeling of octanol/water partition coefficient of antineoplastic agents by balance of correlations. , 2010, European journal of medicinal chemistry.

[6]  Eduardo A. Castro,et al.  QSAR Study and Molecular Design of Open-Chain Enaminones as Anticonvulsant Agents , 2011, International journal of molecular sciences.

[7]  G. Gini,et al.  The average numbers of outliers over groups of various splits into training and test sets: A criterion of the reliability of a QSPR? A case of water solubility , 2012 .

[8]  On reducing correlations between topological indices , 2005 .

[9]  I. Gutman,et al.  Comparison of QSPR Models Based on Hydrogen-Filled Graphs and on Graphs of Atomic Orbitals , 2005 .

[10]  R. Pohjanvirta,et al.  Toxic equivalency factors do not predict the acute toxicities of dioxins in rats. , 1995, European journal of pharmacology.

[11]  Giuseppina C. Gini,et al.  CORAL: Quantitative structure–activity relationship models for estimating toxicity of organic compounds in rats , 2011, J. Comput. Chem..

[12]  Giuseppina C. Gini,et al.  Coral: QSPR modeling of rate constants of reactions between organic aromatic pollutants and hydroxyl radical , 2012, J. Comput. Chem..

[13]  E Benfenati,et al.  SMILES-based optimal descriptors: QSAR modeling of carcinogenicity by balance of correlations with ideal slopes. , 2010, European journal of medicinal chemistry.

[14]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[15]  Jerzy Leszczynski,et al.  SMILES‐based optimal descriptors: QSAR analysis of fullerene‐based HIV‐1 PR inhibitors by means of balance of correlations , 2009, J. Comput. Chem..

[16]  George Kollias,et al.  Ligand-based virtual screening procedure for the prediction and the identification of novel β-amyloid aggregation inhibitors using Kohonen maps and Counterpropagation Artificial Neural Networks. , 2011, European journal of medicinal chemistry.

[17]  K Roy,et al.  On the use of the metric rm² as an effective tool for validation of QSAR models in computational drug design and predictive toxicology. , 2012, Mini reviews in medicinal chemistry.

[18]  David Weininger,et al.  SMILES, 3. DEPICT. Graphical depiction of chemical structures , 1990, J. Chem. Inf. Comput. Sci..

[19]  Andrey A Toropov,et al.  CORAL software: prediction of carcinogenicity of drugs by means of the Monte Carlo method. , 2014, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[20]  Jerzy Leszczynski,et al.  QSAR Modeling of Acute Toxicity for Nitrobenzene Derivatives Towards Rats: Comparative Analysis by MLRA and Optimal Descriptors , 2007 .

[21]  Eduardo A. Castro,et al.  QSAR treatment on a new class of triphenylmethyl-containing compounds as potent anticancer agents , 2011 .

[22]  E. Benfenati,et al.  QSPR modeling bioconcentration factor (BCF) by balance of correlations. , 2009, European journal of medicinal chemistry.

[23]  George Kollias,et al.  Predictive QSAR workflow for the in silico identification and screening of novel HDAC inhibitors , 2009, Molecular Diversity.

[24]  Emilio Benfenati,et al.  QSAR modelling toxicity toward rats of inorganic substances by means of CORAL , 2010 .

[25]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[26]  Humayun Kabir,et al.  Comparative Studies on Some Metrics for External Validation of QSPR Models , 2012, J. Chem. Inf. Model..

[27]  Pablo R Duchowicz,et al.  A comparative QSAR on 1,2,5-thiadiazolidin-3-one 1,1-dioxide compounds as selective inhibitors of human serine proteinases. , 2011, Journal of molecular graphics & modelling.

[28]  Vladimir Poroikov,et al.  QSAR Modelling of Rat Acute Toxicity on the Basis of PASS Prediction , 2011, Molecular informatics.

[29]  Jerzy Leszczynski,et al.  QSAR modeling of acute toxicity by balance of correlations. , 2008, Bioorganic & medicinal chemistry.

[30]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[31]  I. Gutman,et al.  Relation between second and third geometric–arithmetic indices of trees , 2011 .

[32]  Jerzy Leszczynski,et al.  Optimal descriptor as a translator of eclectic data into prediction of cytotoxicity for metal oxide nanoparticles under different conditions. , 2015, Ecotoxicology and environmental safety.

[33]  Kunal Roy,et al.  Introduction of rm2(rank) metric incorporating rank-order predictions as an additional tool for validation of QSAR/QSPR models , 2012 .

[34]  G Melagraki,et al.  Ligand and structure based virtual screening strategies for hit-finding and optimization of hepatitis C virus (HCV) inhibitors. , 2011, Current medicinal chemistry.

[35]  I. Rusyn,et al.  Use of in Vitro HTS-Derived Concentration–Response Data as Biological Descriptors Improves the Accuracy of QSAR Models of in Vivo Toxicity , 2010, Environmental health perspectives.

[36]  Eduardo A. Castro,et al.  QSAR on aryl-piperazine derivatives with activity on malaria , 2012 .