QSAR Models for the Prediction of Binding Affinities to Human Serum Albumin Using the Heuristic Method and a Support Vector Machine

The binding affinities to human serum albumin for 94 diverse drugs and drug-like compounds were modeled with the descriptors calculated from the molecular structure alone using a quantitative structure-activity relationship (QSAR) technique. The heuristic method (HM) and support vector machine (SVM) were utilized to construct the linear and nonlinear prediction models, leading to a good correlation coefficient (R2) of 0.86 and 0.94 and root-mean-square errors (rms) of 0.212 and 0.134 albumin drug binding affinity units, respectively. Furthermore, the models were evaluated by a 10 compound external test set, yielding R2 of 0.71 and 0.89 and rms error of 0.430 and 0.222. The specific information described by the heuristic linear model could give some insights into the factors that are likely to govern the binding affinity of the compounds and be used as an aid to the drug design process; however, the prediction results of the nonlinear SVM model seem to be better than that of the HM.

[1]  S. Ekins,et al.  Three-dimensional quantitative structure activity relationship computational approaches for prediction of human in vitro intrinsic clearance. , 2000, The Journal of pharmacology and experimental therapeutics.

[2]  J. Topliss,et al.  QSAR model for drug human oral bioavailability. , 2000, Journal of medicinal chemistry.

[3]  J. Lavandera,et al.  Cheminformatic models to predict binding affinities to human serum albumin. , 2001, Journal of medicinal chemistry.

[4]  D. B. Boyd Quantum Chemistry Program Exchange. , 1999, Journal of molecular graphics & modelling.

[5]  Robert A. Lordo,et al.  Learning from Data: Concepts, Theory, and Methods , 2001, Technometrics.

[6]  Ritu Jain,et al.  QSPR Analysis of Flash Points , 2001, J. Chem. Inf. Comput. Sci..

[7]  G. Sudlow,et al.  Further characterization of specific drug binding sites on human serum albumin. , 1976, Molecular pharmacology.

[8]  M. Karelson,et al.  QSPR: the correlation and quantitative prediction of chemical and physical properties from structure , 1995 .

[9]  V. Vapnik Estimation of Dependences Based on Empirical Data , 2006 .

[10]  Alan R. Katritzky,et al.  Theoretical Descriptors for the Correlation of Aquatic Toxicity of Environmental Pollutants by Quantitative Structure-Toxicity Relationships , 2001, J. Chem. Inf. Comput. Sci..

[11]  Ruisheng Zhang,et al.  Prediction of the Isoelectric Point of an Amino Acid Based on GA-PLS and SVMs , 2004, J. Chem. Inf. Model..

[12]  D. Butina,et al.  Predicting ADME properties in silico: methods and models. , 2002, Drug discovery today.

[13]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[14]  Feng Luan,et al.  Diagnosing Breast Cancer Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[15]  D. Manallack,et al.  Neural networks in drug discovery: Have they lived up to their promise? , 1999 .

[16]  D C Carter,et al.  Structure of human serum albumin. , 1990, Science.

[17]  S. Walker,et al.  Pharmaceutical innovation by the seven UK-owned pharmaceutical companies (1964-1985). , 1988, British journal of clinical pharmacology.

[18]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[19]  Ruisheng Zhang,et al.  QSAR Study of Ethyl 2-[(3-Methyl-2, 5-dioxo(3-pyrrolinyl))amino]-4-(trifluoromethyl) pyrimidine-5-carboxylate: An Inhibitor of AP-1 and NF-B Mediated Gene Expression Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[20]  Milan Randic,et al.  Quantitative Structure-Activity Relationship of Flavonoid Analogues. 3. Inhibition of p56lck Protein Tyrosine Kinase , 2000, J. Chem. Inf. Comput. Sci..

[21]  Ruisheng Zhang,et al.  An Accurate QSPR Study of O-H Bond Dissociation Energy in Substituted Phenols Based on Support Vector Machines , 2004, J. Chem. Inf. Model..

[22]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[23]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[24]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[25]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[26]  Zhide Hu,et al.  Interaction of isofraxidin with human serum albumin. , 2004, Bioorganic & medicinal chemistry.

[27]  Davide Anguita,et al.  Hyperparameter design criteria for support vector classifiers , 2003, Neurocomputing.

[28]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[29]  M. Cronin,et al.  Computational methods for the prediction of drug toxicity. , 2000, Current opinion in drug discovery & development.

[30]  G. Sudlow,et al.  The characterization of two specific drug binding sites on human serum albumin. , 1975, Molecular pharmacology.

[31]  Lemont B. Kier,et al.  Modeling Drug Albumin Binding Affinity with E-State Topological Structure Representation , 2003, J. Chem. Inf. Comput. Sci..

[32]  Ruisheng Zhang,et al.  Support Vector Machines-Based Quantitative Structure-Property Relationship for the Prediction of Heat Capacity , 2004, J. Chem. Inf. Model..

[33]  Lemont B. Kier,et al.  QSAR modeling of β-lactam binding to human serum proteins , 2003, J. Comput. Aided Mol. Des..

[34]  Ruisheng Zhang,et al.  Study of the Quantitative Structure-Mobility Relationship of Carboxylic Acids in Capillary Electrophoresis Based on Support Vector Machines , 2004, J. Chem. Inf. Model..

[35]  D C Carter,et al.  Three-dimensional structure of human serum albumin. , 1989, Science.

[36]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .

[37]  John G. Topliss,et al.  QSAR Model for Drug Human Oral Bioavailability1 , 2000 .

[38]  D. C. Carter,et al.  Atomic structure and chemistry of human serum albumin , 1993, Nature.

[39]  Ying Li,et al.  Binding of the bioactive component daphnetin to human serum albumin demonstrated using tryptophan fluorescence quenching. , 2004, Macromolecular bioscience.

[40]  J. Blake,et al.  Chemoinformatics - predicting the physicochemical properties of 'drug-like' molecules. , 2000, Current opinion in biotechnology.