QSPR Prediction of pKa for Benzoic Acids in Different Solvents

A Computational Neural Network (CNN) derived model is proposed for the pKa prediction of benzoic acids in different solvents. The system studied contains 519 pKa values corresponding to 136 benzoic acids determined in water and in 8 organic solvents. The benzoic acids were described by the usual molecular descriptors and the solvents by a number of physical properties and by several parameters of the most widely used polarity solvent scales. The model is composed of seven descriptors – five of them corresponding to the solute and the other two to the solvents – and was validated by an external prediction set. The three sets of values needed in the analysis, training, prediction, and cross-validation, have the same squared correlation coefficient (0.998) and Root-Mean Square Error (RMSE) (0.21) values. The robustness of the model is also given for the statistical results for small subsets, such as ortho- and non-ortho-substituted acids, those obtained in protic or aprotic solvents, those obtained in each solvent, and even those for the set of pKa values of one specific acid in several solvents. The descriptors encode information about the chemical nature of the solutes and the solvents that is clearly related to the interactions present in the dissociation process in solution. The derived model also has the ability to predict pKa values of a larger set of aromatic neutral acids, containing both phenols and benzoic acids.

[1]  A. Chalk,et al.  A Temperature-Dependent Quantum Mechanical/Neural Net Model for Vapor Pressure , 2001, J. Chem. Inf. Comput. Sci..

[2]  E. Kosower The Effect of Solvent on Spectra. I. A New Empirical Measure of Solvent Polarity: Z-Values , 1958 .

[3]  D. Manallack,et al.  Statistics using neural networks: chance effects. , 1993, Journal of medicinal chemistry.

[4]  D. Chipman Computation of p K a from Dielectric Continuum Theory , 2002 .

[5]  Johann Gasteiger,et al.  Prediction of pKa Values for Aliphatic Carboxylic Acids and Alcohols with Empirical Atomic Charge Descriptors , 2006, J. Chem. Inf. Model..

[6]  O. Pytela,et al.  Chemometric Analysis of Substituent Effects. XII. Application of Relationship Between 2- and 4-Substitution of Benzene Ring to Study ortho Effect in Selected Compounds with Different Reaction Centres , 1999 .

[7]  O. Pytela,et al.  Ortho effect in dissociation of benzoic acids with electron-accceptor substituents using the AISE theory; relation to para substitution and solvent , 2002 .

[8]  Javier Catalán,et al.  Progress towards a generalized solvent polarity scale: The solvatochromism of 2‐(dimethylamino)‐7‐nitrofluorene and its homomorph 2‐fluoro‐7‐nitrofluorene , 1995 .

[9]  Q. Guo,et al.  First-principle predictions of absolute pKa's of organic acids in dimethyl sulfoxide solution. , 2004, Journal of the American Chemical Society.

[10]  O. Pytela,et al.  Reparametrization and/or Determination of Hammett, Inductive, Mesomeric and AISE Substituent Constants for Five Substituents: N+(CH3)3, CH2N+(CH3)3, CH2Py+, CH2SO2CH3 and PO(OCH3)2 , 2004 .

[11]  J. Kulhanek,et al.  Solvation and steric hindrance in methyl-substituted benzoic acids , 1998 .

[12]  Igor I. Baskin,et al.  Quantitative structure–conditions–property relationship studies. Neural network modelling of the acid hydrolysis of esters , 2002 .

[13]  Matthew D. Wessel,et al.  Prediction of Reduced Ion Mobility Constants from Structural Information Using Multiple Linear Regression Analysis and Computational Neural Networks , 1994 .

[14]  I. Kolthoff,et al.  Substituent effects on dissociation of benzoic acids and heteroconjugation of benzoates with p-bromophenol in acetonitrile, N,N-dimethylformamide, and dimethyl sulfoxide. Intramolecular hydrogen bonding in o-hydroxybenzoic acids and their anions , 1971 .

[15]  Brian T. Luke,et al.  Evolutionary Programming Applied to the Development of Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships , 1994, J. Chem. Inf. Comput. Sci..

[16]  P. Jurs,et al.  Studies of Chemical Structure-Biological Activity Relations Using Pattern Recognition , 1979 .

[17]  Citra Estimating the pKa of phenols, carboxylic acids and alcohols from semi-empirical quantum chemical methods , 1999, Chemosphere.

[18]  J. Abboud,et al.  Critical compilation of scales of solvent parameters. Part I. Pure, non-hydrogen bond donor solvents , 1999 .

[19]  Peter C. Jurs,et al.  Automated Descriptor Selection for Quantitative Structure-Activity Relationships Using Generalized Simulated Annealing , 1995, J. Chem. Inf. Comput. Sci..

[20]  J. Sales,et al.  Neural Network Based QSPR Study for Predicting pKa of Phenols in Different Solvents , 2007 .

[21]  Ruisheng Zhang,et al.  QSAR Study of Ethyl 2-[(3-Methyl-2, 5-dioxo(3-pyrrolinyl))amino]-4-(trifluoromethyl) pyrimidine-5-carboxylate: An Inhibitor of AP-1 and NF-B Mediated Gene Expression Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[22]  O. Pytela A New Iterative Method of Construction of the Hammett Acidity Function , 1997 .

[23]  Jon W. Ball,et al.  Quantitative structure‐activity relationships for toxicity of phenols using regression analysis and computational neural networks , 1994 .

[24]  V. A. Palyulin,et al.  Construction of Neural-Network Structure–Condition–Property Relationships: Modeling of Physicochemical Properties of Hydrocarbons , 2002 .

[25]  Johan Ulander,et al.  High-throughput pKa screening and prediction amenable for ADME profiling , 2006, Expert opinion on drug metabolism & toxicology.

[26]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[27]  K. Dimroth,et al.  Über Pyridinium‐N‐phenol‐betaine und ihre Verwendung zur Charakterisierung der Polarität von Lösungsmitteln , 1963 .

[28]  G. I. Almerindo,et al.  Ionization of Organic Acids in Dimethyl Sulfoxide Solution: A Theoretical Ab Initio Calculation of the pKa Using a New Parametrization of the Polarizable Continuum Model , 2004 .

[29]  I. Kolthoff,et al.  Resolution of acid strength in tert-butyl alcohol and isopropyl alcohol of substituted benzoic acids, phenols, and aliphatic carboxylic acids , 1979 .

[30]  Emanuela Gancia,et al.  Estimation of pKa Using Semiempirical Molecular Orbital Methods. Part 2: Application to Amines, Anilines and Various Nitrogen Containing Heterocyclic Compounds. , 2002 .

[31]  V. Gutmann,et al.  Coordination reactions in non aqueous solutions - The role of the donor strength , 1966 .

[32]  R. Taft,et al.  The solvatochromic comparison method. 2. The .alpha.-scale of solvent hydrogen-bond donor (HBD) acidities , 1976 .

[33]  E. Bosch,et al.  Hammett–Taft and Drago models in the prediction of acidity constant values of neutral and cationic acids in methanol† , 1999 .

[34]  E. Bosch,et al.  Dissociation constants of neutral and charged acids in methyl alcohol. The acid strength resolution , 1998 .

[35]  Ralf-Uwe Ebert,et al.  Application of Neural Networks to Modeling and Estimating Temperature-Dependent Liquid Viscosity of Organic Compounds , 2001, J. Chem. Inf. Comput. Sci..

[36]  E. Bosch,et al.  Acidity in methanol–water , 2001 .

[37]  O. Pytela,et al.  Chemometrical Analysis of Substituent Effects. III. Additivity of Substituent Effects in Dissociation of 3,4-Disubstituted Benzoic Acids in Organic Solvents , 1994 .

[38]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .