Prediction of Human Cytochrome P450 Inhibition Using Support Vector Machines

Cytochrome P450s (CYPs) play a major role in the metabolism of drugs utilized in human health care. Inhibition of these enzymes by a drug may result in unwanted drug-drug interactions when two or more drugs are coadministered. Therefore, CYP inhibition should be investigated as early as possible in lead discovery and lead optimization. In silico approaches are highly desirable to assess large data sets or virtual compounds. Here we present the application of support vector machines (SVMs) to predict the potency of structurally diverse drug-like molecules to inhibit the human cytochromes P450 3A4 (CYP3A4) and 2D6 (CYP2D6). Different descriptor sets were used to cover various aspects of molecular properties, including physico-chemical properties derived from the 2D structure, the interactions of the molecule with its environment, and properties derived from quantum-mechanical calculations. Support vector classifiers were trained to distinguish between strong, medium, and weak inhibitors. For both isoenzymes, independent test set compounds were correctly re-classified with an accuracy of approximately 70%. The data sets were also used to generate support vector regression models. The best models were able to predict the log IC 5 0 values of the test set compounds with a squared correlation coefficient of R 2 = 0.67 (CYP3A4, corresponding RMSE of 0.36 log units) and R 2 =0.66 (CYP2D6, corresponding RMSE of 0.44 log units). Our results show that SVMs are a very powerful tool to predict CYP inhibition liability from calculated physico-chemical properties without invoking any information about the active site of the enzyme. The models can, for instance, be utilized to flag problematic compounds in an early step or to guide further synthesis efforts in a later stage of a research project.

[1]  Eric F. Johnson,et al.  The Structure of Human Microsomal Cytochrome P450 3A4 Determined by X-ray Crystallography to 2.05-Å Resolution* , 2004, Journal of Biological Chemistry.

[2]  J. Murray,et al.  Statistical analysis of the molecular surface electrostatic potential: an approach to describing noncovalent interactions in condensed phases , 1998 .

[3]  D. Winkler,et al.  Rapid prediction of chemical metabolism by human UDP-glucuronosyltransferase isoforms using quantum chemical descriptors derived with the electronegativity equalization method. , 2004, Journal of medicinal chemistry.

[4]  Steven L. Dixon,et al.  Use of Robust Classification Techniques for the Prediction of Human Cytochrome P450 2D6 Inhibition , 2003, J. Chem. Inf. Comput. Sci..

[5]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[6]  D. Lewis,et al.  Homology modelling of CYP3A4 from the CYP2C5 crystallographic template: analysis of typical CYP3A4 substrate interactions , 2004, Xenobiotica; the fate of foreign compounds in biological systems.

[7]  Lemont B. Kier,et al.  An Electrotopological-State Index for Atoms in Molecules , 1990, Pharmaceutical Research.

[8]  Anders Karlén,et al.  Conformer- and alignment-independent model for predicting structurally diverse competitive CYP2C9 inhibitors. , 2004, Journal of medicinal chemistry.

[9]  Yoonkyung Lee,et al.  Classification of Multiple Cancer Types by Multicategory Support Vector Machines Using Gene Expression Data , 2003, Bioinform..

[10]  Marti A. Hearst Intelligent Connections: Battling with GA-Joe. , 1998 .

[11]  Peter Lind,et al.  Support Vector Machines for the Estimation of Aqueous Solubility , 2003, J. Chem. Inf. Comput. Sci..

[12]  Sean Ekins,et al.  Generation and validation of rapid computational filters for cyp2d6 and cyp3a4. , 2003, Drug metabolism and disposition: the biological fate of chemicals.

[13]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[14]  Gabriele Cruciani,et al.  Predicting drug metabolism: a site of metabolism prediction tool applied to the cytochrome P450 2C9. , 2003, Journal of medicinal chemistry.

[15]  T. Andersson,et al.  Analysis of selective regions in the active sites of human cytochromes P450, 2C8, 2C9, 2C18, and 2C19 homology models using GRID/CPCA. , 2001, Journal of medicinal chemistry.

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  Igor V. Pletnev,et al.  Drug Discovery Using Support Vector Machines. The Case Studies of Drug-likeness, Agrochemical-likeness, and Enzyme Inhibition Predictions , 2003, J. Chem. Inf. Comput. Sci..

[18]  Iñaki Tuñón,et al.  GEPOL: An improved description of molecular surfaces. III. A new algorithm for the computation of a solvent‐excluding surface , 1994, J. Comput. Chem..

[19]  S. Ekins,et al.  Three and four dimensional-quantitative structure activity relationship (3D/4D-QSAR) analyses of CYP2D6 inhibitors. , 1999, Pharmacogenetics.

[20]  Nico P E Vermeulen,et al.  Homology modeling of rat and human cytochrome P450 2D (CYP2D) isoforms and computational rationalization of experimental ligand-binding specificities. , 2003, Journal of medicinal chemistry.

[21]  Paul A. Smith,et al.  Comparison of Linear and Nonlinear Classification Algorithms for the Prediction of Drug and Chemical Metabolism by Human UDP-Glucuronosyltransferase Isoforms , 2003, J. Chem. Inf. Comput. Sci..

[22]  S. Surapaneni,et al.  Role of drug metabolism in drug discovery and development , 2001, Medicinal research reviews.

[23]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[24]  A. Alex,et al.  Novel approach to predicting P450-mediated drug metabolism: development of a combined protein and pharmacophore model for CYP2D6. , 1999, Journal of medicinal chemistry.

[25]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[26]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[27]  Ismael Zamora,et al.  Model based on GRID-derived descriptors for estimating CYP3A4 enzyme stability of potential drug candidates , 2004, J. Comput. Aided Mol. Des..

[28]  Sean B. Holden,et al.  Support Vector Machines for ADME Property Classification , 2003 .

[29]  Gisbert Schneider,et al.  A fast virtual screening filter for cytochrome P450 3A4 inhibition liability of compound libraries , 2002 .

[30]  R J Riley,et al.  Fully automated analysis of activities catalysed by the major human liver cytochrome P450 (CYP) enzymes: assessment of human CYP inhibition potential. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[31]  Slobodan Petar Rendic,et al.  Human cytochrome P450 enzymes: a status report summarizing their reactions, substrates, inducers, and inhibitors. , 1997, Drug metabolism reviews.

[32]  György M Keseru,et al.  A neural network based virtual screening of cytochrome P450 3A4 inhibitors. , 2002, Bioorganic & medicinal chemistry letters.

[33]  Sean Ekins,et al.  In vitro and pharmacophore insights into CYP3A enzymes. , 2003, Trends in pharmacological sciences.

[34]  D. Lewis,et al.  Human cytochromes P450 associated with the phase 1 metabolism of drugs and other xenobiotics: a compilation of substrates and inhibitors of the CYP1, CYP2 and CYP3 families. , 2003, Current medicinal chemistry.

[35]  Kiyomi Ito,et al.  Database analyses for the prediction of in vivo drug-drug interactions from in vitro data. , 2004, British journal of clinical pharmacology.

[36]  Gordon C K Roberts,et al.  Residues Glutamate 216 and Aspartate 301 Are Key Determinants of Substrate Specificity and Product Regioselectivity in Cytochrome P450 2D6* , 2003, The Journal of Biological Chemistry.

[37]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[38]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[39]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .

[40]  L. Wienkers,et al.  Covalent alteration of the CYP3A4 active site: evidence for multiple substrate binding domains. , 2001, Archives of biochemistry and biophysics.

[41]  Daniel Rinaldi,et al.  Calcul théorique des polarisabilités électroniques moléculaires. Comparaison des différentes méthodes , 1974 .

[42]  Jeffrey P. Jones,et al.  A refined 3-dimensional QSAR of cytochrome P450 2C9: computational predictions of drug interactions. , 2000, Journal of medicinal chemistry.

[43]  Barry C Jones,et al.  Development of a combined protein and pharmacophore model for cytochrome P450 2C9. , 2002, Journal of medicinal chemistry.

[44]  G. R. Famini,et al.  Using theoretical descriptors in quantitative structure–property relationships: some distribution equilibria , 1998 .

[45]  W L Jorgensen,et al.  Prediction of drug solubility from Monte Carlo simulations. , 2000, Bioorganic & medicinal chemistry letters.

[46]  P. Carrupt,et al.  Molecular fields in quantitative structure–permeation relationships: the VolSurf approach , 2000 .

[47]  Gunnar Rätsch,et al.  Active Learning with Support Vector Machines in the Drug Discovery Process , 2003, J. Chem. Inf. Comput. Sci..

[48]  Bernd Beck,et al.  VESPA: A new, fast approach to electrostatic potential‐derived atomic charges from semiempirical methods , 1997 .

[49]  G. Schürer,et al.  Accurate parametrized variational calculations of the molecular electronic polarizability by NDDO‐based methods , 1999 .

[50]  Lan Mu,et al.  A model based QSPR analysis of the unified non-specific solvent polarity scale , 1998 .

[51]  Grazyna D. Szklarz,et al.  Molecular modeling of cytochrome P450 3A4 , 1997, J. Comput. Aided Mol. Des..

[52]  Jose Cosme,et al.  Crystal Structures of Human Cytochrome P450 3A4 Bound to Metyrapone and Progesterone , 2004, Science.

[53]  Ismael Zamora,et al.  Discriminant and quantitative PLS analysis of competitive CYP2C9 inhibitors versus non-inhibitors using alignment independent GRIND descriptors , 2002, J. Comput. Aided Mol. Des..

[54]  Gordon C K Roberts,et al.  Validation of model of cytochrome P450 2D6: an in silico tool for predicting metabolism and inhibition. , 2004, Journal of medicinal chemistry.

[55]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[56]  Danielson Pb,et al.  The cytochrome P450 superfamily: biochemistry, evolution and drug metabolism in humans. , 2002 .

[57]  Miguel Figueroa,et al.  Competitive learning with floating-gate circuits , 2002, IEEE Trans. Neural Networks.

[58]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[59]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.