QSAR-modeling of toxicity of organometallic compounds by means of the balance of correlations for InChI-based optimal descriptors

Quantitative structure–activity relationships (QSAR) for toxicity toward rats (pLD50) have been built by means of optimal descriptors. Comparison of the optimal descriptors calculated using the International Chemical Identifier (InChI) with the optimal descriptors calculated using the simplified molecular input line entry system (SMILES) has shown that the InChI-based models give more accurate prediction for the abovementioned toxicity of organometallic compounds. These models were obtained by means of the balance of correlation: one subset of the training set (subtraining set) plays role of the training; the second subset (calibration set) plays role of the preliminary check of the models. It has been shown that the balance of correlations is a more robust predictor for the toxicity than the classic scheme (training set—test set: without the calibration set). Three splits into the subtraining set, calibration set, and test set were examined.

[1]  Emilio Benfenati,et al.  SMILES in QSPR/QSAR Modeling: results and perspectives. , 2007, Current drug discovery technologies.

[2]  Alessio Micheli,et al.  Prediction of the Glass Transition Temperature of (Meth)Acrylic Polymers Containing Phenyl Groups by Recursive Neural Network , 2007 .

[3]  Andrey A. Toropov,et al.  Modeling of acyclic carbonyl compounds normal boiling points by correlation weighting of nearest neighboring codes , 2002 .

[4]  K Krishnan,et al.  Quantitative structure-pharmacokinetic relationship modelling. , 2001, The Science of the total environment.

[5]  E. Benfenati,et al.  QSAR modelling of aldehyde toxicity against a protozoan, Tetrahymena pyriformis by optimization of correlation weights of nearest neighboring codes , 2004 .

[6]  E Benfenati,et al.  Additive SMILES-based optimal descriptors in QSAR modelling bee toxicity: Using rare SMILES attributes to define the applicability domain. , 2008, Bioorganic & medicinal chemistry.

[7]  I. Gutman,et al.  Comparison of QSPR Models Based on Hydrogen-Filled Graphs and on Graphs of Atomic Orbitals , 2005 .

[8]  Kunal Roy,et al.  QSAR modeling for lipid peroxidation inhibition potential of flavonoids using topological and structural parameters , 2008 .

[9]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[10]  I. Gutman,et al.  Simplified molecular input line entry system (SMILES) as an alternative for constructing quantitative structure-property relationships (QSPR) , 2005 .

[11]  A. Toropova,et al.  Prediction of heteroaromatic amine mutagenicity by means of correlation weighting of atomic orbital graphs of local invariants , 2001 .

[12]  David Weininger,et al.  SMILES, 3. DEPICT. Graphical depiction of chemical structures , 1990, J. Chem. Inf. Comput. Sci..

[13]  Henry S Rzepa,et al.  Enhancement of the chemical semantic web through the use of InChI identifiers. , 2005, Organic & biomolecular chemistry.

[14]  Tomasz Puzyn,et al.  QSPR-based estimation of the atmospheric persistence for chloronaphthalene congeners , 2008 .

[15]  Andrey A. Toropov,et al.  The graph of atomic orbitals and its basic properties. 2. Zagreb indices , 2005 .

[16]  Andrey A. Toropov,et al.  The graph of atomic orbitals and its basic properties. 1. Wiener index , 2005 .

[17]  David Vidal,et al.  LINGO, an Efficient Holographic Text Based Method To Calculate Biophysical Properties and Intermolecular Similarities , 2005, J. Chem. Inf. Model..

[18]  Haralambos Sarimveis,et al.  A Novel RBF Neural Network Training Methodology to Predict Toxicity to Vibrio Fischeri , 2006, Molecular Diversity.

[19]  E. Castro,et al.  3D-chiral (2.5) atom-based TOMOCOMD-CARDD descriptors: theory and QSAR applications to central chirality codification , 2008 .

[20]  A. Toropova,et al.  Nearest neighboring code and hydrogen bond index in labeled hydrogen-filled graph and graph of atomic orbitals: application to model of normal boiling points of haloalkanes , 2004 .

[21]  Quantitative structure-activity relationship studies: β-adrenergic blocking activity of 1-(2,4-disubstituted phenoxy)-3-aminopropan-2-ols , 1990 .

[22]  Jerzy Leszczynski,et al.  QSAR modeling of acute toxicity by balance of correlations. , 2008, Bioorganic & medicinal chemistry.

[23]  Tomasz Puzyn,et al.  Calculation of Quantum-Mechanical Descriptors for QSPR at the DFT Level: Is It Necessary? , 2008, J. Chem. Inf. Model..

[24]  Kunal Roy,et al.  QSTR with Extended Topochemical Atom Indices. 10. Modeling of Toxicity of Organic Chemicals to Humans Using Different Chemometric Tools , 2008, Chemical biology & drug design.

[25]  Andrey A. Toropov,et al.  QSAR modeling of toxicity on optimization of correlation weights of Morgan extended connectivity , 2002 .

[26]  E. Benfenati,et al.  Optimisation of correlation weights of SMILES invariants for modelling oral quail toxicity. , 2007, European journal of medicinal chemistry.

[27]  Jerzy Leszczynski,et al.  The effects of characteristics of substituents on toxicity of the nitroaromatics: HiT QSAR study , 2008, J. Comput. Aided Mol. Des..

[28]  Haralambos Sarimveis,et al.  A Novel QSAR Model for Evaluating and Predicting the Inhibition Activity of Dipeptidyl Aspartyl Fluoromethylketones , 2006 .

[29]  Emilio Benfenati,et al.  QSAR modelling of aldehyde toxicity by means of optimisation of correlation weights of nearest neighbouring codes , 2004 .

[30]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[31]  Kunal Roy,et al.  Comparative QSAR Studies of CYP1A2 Inhibitor Flavonoids Using 2D and 3D Descriptors , 2008, Chemical biology & drug design.

[32]  Jerzy Leszczynski,et al.  Structure-toxicity relationships of nitroaromatic compounds , 2006, Molecular Diversity.

[33]  Alexander Wlodawer,et al.  Application of InChI to curate, index, and query 3‐D structures , 2005, Proteins.

[34]  Jerzy Leszczynski,et al.  QSAR Modeling of Acute Toxicity for Nitrobenzene Derivatives Towards Rats: Comparative Analysis by MLRA and Optimal Descriptors , 2007 .

[35]  David Vidal,et al.  A Novel Search Engine for Virtual Screening of Very Large Databases , 2006, J. Chem. Inf. Model..

[36]  Haralambos Sarimveis,et al.  Prediction of intrinsic viscosity in polymer-solvent combinations using a QSPR model , 2006 .

[37]  J. Leszczynski,et al.  The effect of nitroaromatics' composition on their toxicity in vivo: novel, efficient non-additive 1D QSAR analysis. , 2008, Chemosphere.

[38]  Henrik Boström,et al.  Discrimination between modes of toxic action of phenols using rule based methods , 2006, Molecular Diversity.

[39]  Alan Talevi,et al.  New QSPR study for the prediction of aqueous solubility of drug-like compounds. , 2008, Bioorganic & medicinal chemistry.

[40]  Kirill Degtyarenko,et al.  "Good Annotation Practice" for Chemical Data in Biology , 2007, Silico Biol..

[41]  Eduardo A. Castro,et al.  Partial Order Ranking for the aqueous toxicity of aromatic mixtures , 2008 .