Application of quantum topological molecular similarity descriptors in QSPR study of the O‐methylation of substituted phenols

The usefulness of a novel type of electronic descriptors called quantum topological molecular similarity (QTMS) indices for describing the quantitative effects of molecular electronic environments on the O‐methylation kinetic of substituted phenols has been investigated. QTMS theory produces for each molecule a matrix of descriptors, containing bond (or structure) information in one dimension and electronic effects in another dimension, instead of other methods producing a vector of descriptors for each molecule. A collection of chemometrics tools including principal component analysis (PCA), partial least squares (PLS), and genetic algorithms (GA) were used to model the structure‐kinetic data. PCA separated the bond and descriptor effects, and PLS modeled the effects of these parameters on the rate constant data, and GA selected the most relevant subset of variables. The model performances were validated by both cross‐validation and external validation. The results indicated that the proposed models could explain about 95% of variances in the rate constant data. The significant effects of variables on the reaction kinetic were identified by calculating variable important in projection (VIP). It was found that the rate constant of esterification of phenols is highly influenced by the electronic properties of the C2C1OH fragment of the parent molecule. Indeed, the C2X and C4X bonds (corresponding to ortho and para substituents) were found as highly influential parameters. All of the eight calculated QTMS indices were found significant however, λ1, λ2, λ3, ϵ, and K(r) were detected as highly influential parameters. © 2007 Wiley Periodicals, Inc. J Comput Chem, 2008

[1]  Mahdi Nooshyar,et al.  Application of artificial neural networks for predicting the aqueous acidity of various phenols using QSAR , 2006, Journal of molecular modeling.

[2]  Mehdi Jalali-Heravi,et al.  Use of Computer-Assisted Methods for the Modeling of the Retention Time of a Variety of Volatile Organic Compounds: A PCA-MLR-ANN Approach , 2004, J. Chem. Inf. Model..

[3]  M. Karelson,et al.  Quantum-Chemical Descriptors in QSAR/QSPR Studies. , 1996, Chemical reviews.

[4]  Douglas M. Hawkins,et al.  The Problem of Overfitting , 2004, J. Chem. Inf. Model..

[5]  Tudor I. Oprea,et al.  An automated PLS search for biologically relevant QSAR descriptors , 2004, J. Comput. Aided Mol. Des..

[6]  Wu Lei,et al.  QSAR study on N-containing corrosion inhibitors: Quantum chemical approach assisted by topological index , 2005 .

[7]  C. Hansch,et al.  Radical toxicity of phenols: A reference point for obtaining perspective in the formulation of QSAR , 2001, Medicinal research reviews.

[8]  Paul L. A. Popelier,et al.  Quantum topological molecular similarity. Part 4. A QSAR study of cell growth inhibitory properties of substituted (E)-1-phenylbut-1-en-3-ones , 2002 .

[9]  J. Platts Theoretical prediction of hydrogen bond basicity , 2000 .

[10]  R. M. Muir,et al.  Correlation of Biological Activity of Phenoxyacetic Acids with Hammett Substituent Constants and Partition Coefficients , 1962, Nature.

[11]  Paul L. A. Popelier,et al.  Quantum molecular similarity. 1. BCP space , 1999 .

[12]  B. Rode,et al.  Quantitative Electronic Structure-Activity Relations: The Influence of Basis Set Selection on Prediction Quality , 2003 .

[13]  Bahram Hemmateenejad,et al.  Optimal QSAR analysis of the carcinogenic activity of drugs by correlation ranking and genetic algorithm‐based PCR , 2004 .

[14]  C. Hansch,et al.  Comparative QSAR and the radical toxicity of various functional groups. , 2002, Chemical reviews.

[15]  R. Bader Atoms in molecules : a quantum theory , 1990 .

[16]  Shijin Ren,et al.  Comparative Assessment of Multiresponse Regression Methods for Predicting the Mechanisms of Toxic Action of Phenols , 2003, J. Chem. Inf. Comput. Sci..

[17]  David W. Salt,et al.  Variable Selection—Spoilt for Choice? , 2005 .

[18]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[19]  Correlation of Reaction Rates with a Substituent Constant Scale Derived from Calculated Electron Densities for a Computer Control Algorithm , 1993 .

[20]  B. Rasulev,et al.  A QSAR toxicity study of a series of alkaloids with the lycoctonine skeleton. , 2004, Molecules.

[21]  Bahram Hemmateenejad,et al.  Ant colony optimisation: a powerful tool for wavelength selection , 2006 .

[22]  Zhen Xi,et al.  Development of a general quantum‐chemical descriptor for steric effects: Density functional theory based QSAR study of herbicidal sulfonylurea analogues , 2006, J. Comput. Chem..

[23]  Kazutoshi Tanabe,et al.  Is large‐scale ab initio Hartree‐Fock calculation chemically accurate? Toward improved calculation of biological molecule properties , 1999 .

[24]  Julio Caballero,et al.  Bayesian-regularized genetic neural networks applied to the modeling of non-peptide antagonists for the human luteinizing hormone-releasing hormone receptor. , 2006, Journal of molecular graphics & modelling.

[25]  P. Popelier,et al.  Estimation of pKa using quantum topological molecular similarity descriptors: application to carboxylic acids, anilines and phenols. , 2004, The Journal of organic chemistry.

[26]  L. Eriksson Multi- and megavariate data analysis , 2006 .

[27]  Ying Liu,et al.  A Comparative Study on Feature Selection Methods for Drug Discovery , 2004, J. Chem. Inf. Model..

[28]  Bahram Hemmateenejad,et al.  Accurate prediction of the blood–brain partitioning of a large set of solutes using ab initio calculations and genetic neural network modeling , 2006, J. Comput. Chem..

[29]  D. B. Hibbert Genetic algorithms in chemistry , 1993 .

[30]  R. Leardi Genetic algorithms in chemometrics and chemistry: a review , 2001 .

[31]  R. Parthasarathi,et al.  Chemical Reactivity Profiles of Two Selected Polychlorinated Biphenyls , 2003 .

[32]  Paul L. A. Popelier,et al.  Ester hydrolysis rate constant prediction from quantum topological molecular similarity descriptors , 2003 .

[33]  Paul L. A. Popelier,et al.  Quantum Molecular Similarity. 3. QTMS Descriptors , 2001, J. Chem. Inf. Comput. Sci..

[34]  C. Jun,et al.  Performance of some variable selection methods when multicollinearity is present , 2005 .

[35]  P. Popelier,et al.  The cytotoxicity of ortho alkyl substituted 4-X-phenols: a QSAR based on theoretical bond lengths and electron densities. , 2006, Bioorganic & medicinal chemistry letters.

[36]  Edmund R. Malinowski,et al.  Factor Analysis in Chemistry , 1980 .

[37]  P. Popelier,et al.  Quantum chemical topology (QCT) descriptors as substitutes for appropriate Hammett constants. , 2005, Organic & biomolecular chemistry.

[38]  Bahram Hemmateenejad,et al.  Genetic Algorithm Applied to the Selection of Factors in Principal Component-Artificial Neural Networks: Application to QSAR Study of Calcium Channel Antagonist Activity of 1, 4-Dihydropyridines (Nifedipine Analogous) , 2003, J. Chem. Inf. Comput. Sci..

[39]  Bahram Hemmateenejad,et al.  Application of ab initio theory to QSAR study of 1,4‐dihydropyridine‐based calcium channel blockers using GA‐MLR and PC‐GA‐ANN procedures , 2004, J. Comput. Chem..

[40]  Paul L. A. Popelier,et al.  Quantum topological molecular similarity. Part 5. Further development with an application to the toxicity of polychlorinated dibenzo-p-dioxinsThe IUPAC name for dibenzo-p-dioxin is dibenzo[b,e][1,4]dioxin.(PCDDs) , 2002 .

[41]  P. Popelier,et al.  QSAR models based on quantum topological molecular similarity. , 2006, European journal of medicinal chemistry.

[42]  D. Hoekman Exploring QSAR Fundamentals and Applications in Chemistry and Biology, Volume 1. Hydrophobic, Electronic and Steric Constants, Volume 2 J. Am. Chem. Soc. 1995, 117, 9782 , 1996 .