Computational Prediction of the Isoform Specificity of Cytochrome P450 Substrates by an Improved Bayesian Method

Cytochrome P450 (CYP) is the most important drug-metabolizing enzyme in human beings. Each CYP isoform is able to metabolize a large number of compounds, and if patients take more than one drugs during the treatment, it is possible that some drugs would be metabolized by the same CYP isoform, leading to potential drug-drug interactions and side effects. Therefore, it is necessary to investigate the isoform specificity of CYP substrates. In this study, we constructed a data set consisting of 10 major CYP isoforms associated with 776 substrates, and used machine learning methods to construct the predictive models based on the features of structural and physicochemical properties of substrates. We also proposed a new method called Improved Bayesian method, which is suitable for small data sets and is able to construct more stable and accurate predictive models compared with other traditional machine learning models. Based on this method, the predictive performance of our method got the accuracy of 86% for the independent test, which was significantly better to the existing models. We believe that our proposed method will facilitate the understanding of drug metabolisms and help the large-scale analysis of drug-drug interactions.

[1]  Q. Zou,et al.  Gene2vec: gene subsequence embedding for prediction of mammalian N6-methyladenosine sites from mRNA , 2018, RNA.

[2]  Jihong Guan,et al.  Group-sparse Modeling Drug-kinase Networks for Predicting Combinatorial Drug Sensitivity in Cancer Cells , 2018, Current Bioinformatics.

[3]  Lei Chen,et al.  Identification of Drug-Drug Interactions Using Chemical Interactions , 2017 .

[4]  Feng Liu,et al.  Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data , 2017, BMC Bioinformatics.

[5]  Ying Ju,et al.  Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy , 2016, BMC Systems Biology.

[6]  Ren Long,et al.  iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework , 2016, Bioinform..

[7]  Liujuan Cao,et al.  A novel features ranking metric with application to scalable visual and bioinformatics data classification , 2016, Neurocomputing.

[8]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[9]  Phung Anh Nguyen,et al.  Interactions between traditional Chinese medicine and western drugs in Taiwan: A population-based study , 2015, Comput. Methods Programs Biomed..

[10]  Daisuke Kihara,et al.  Three-Dimensional Compound Comparison Methods and Their Application in Drug Discovery , 2015, Molecules.

[11]  Daisuke Kihara,et al.  Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0 , 2015, Bioinform..

[12]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[13]  Tao Zhang,et al.  Classification Models for Predicting Cytochrome P450 Enzyme‐Substrate Selectivity , 2012, Molecular informatics.

[14]  N. Mishra Computational modeling of P450s for toxicity prediction , 2011, Expert opinion on drug metabolism & toxicology.

[15]  Igor V Tetko,et al.  A comparison of different QSAR approaches to modeling CYP450 1A2 inhibition , 2011, J. Chem. Inf. Model..

[16]  Tao Zhang,et al.  Long-Range Effects of a Peripheral Mutation on the Enzymatic Activity of Cytochrome P450 1A2 , 2011, J. Chem. Inf. Model..

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Anthony E. Klon,et al.  Machine learning algorithms for the prediction of hERG and CYP450 binding in drug development , 2010, Expert opinion on drug metabolism & toxicology.

[19]  K. Chou,et al.  Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features , 2010, PloS one.

[20]  Michael Schroeder,et al.  SuperCYP: a comprehensive database on Cytochrome P450 enzymes including a tool for analysis of CYP-drug interactions , 2009, Nucleic Acids Res..

[21]  Jürgen Pleiss,et al.  The Cytochrome P450 Engineering Database: integration of biochemical properties , 2009, BMC Biochemistry.

[22]  Johann Gasteiger,et al.  Comparison of Multilabel and Single-Label Classification Applied to the Prediction of the Isoform Specificity of Cytochrome P450 Substrates , 2009, J. Chem. Inf. Model..

[23]  Ruili Huang,et al.  Comprehensive Characterization of Cytochrome P450 Isozyme Selectivity across Chemical Libraries , 2009, Nature Biotechnology.

[24]  Paavo Honkakoski,et al.  Inhibition and induction of human cytochrome P450 enzymes: current status , 2008, Archives of Toxicology.

[25]  J. Pachecka,et al.  Cytochrome P450 polymorphism--molecular, metabolic, and pharmacogenetic aspects. II. Participation of CYP isoenzymes in the metabolism of endogenous substances and drugs. , 2008, Acta poloniae pharmaceutica.

[26]  Takayuki Ito,et al.  Novel Hierarchical Classification and Visualization Method for Multiobjective Optimization of Drug Properties: Application to Structure-Activity Relationship Analysis of Cytochrome P450 Metabolism , 2008, J. Chem. Inf. Model..

[27]  Douglas R. Henry,et al.  Evaluation of descriptors and classification schemes to predict cytochrome substrates in terms of chemical information , 2008, J. Comput. Aided Mol. Des..

[28]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[29]  F. Guengerich,et al.  Cytochrome P450s and other enzymes in drug metabolism and toxicity , 2006, The AAPS Journal.

[30]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[31]  Jürgen Pleiss,et al.  The Cytochrome P450 Engineering Database: a navigation and prediction tool for the cytochrome P450 protein family , 2007, Bioinform..

[32]  Johann Gasteiger,et al.  Ligand-Based Models for the Isoform Specificity of Cytochrome P450 3A4, 2D6, and 2C9 Substrates , 2007, J. Chem. Inf. Model..

[33]  L. Hall,et al.  The Molecular Connectivity Chi Indexes and Kappa Shape Indexes in Structure‐Property Modeling , 2007 .

[34]  Thomas Fox,et al.  Machine learning techniques for in silico modeling of drug metabolism. , 2006, Current topics in medicinal chemistry.

[35]  I. Poggesi,et al.  Computational approaches for predicting CYP-related metabolism properties in the screening of new drugs. , 2006, European journal of medicinal chemistry.

[36]  Jürgen Pleiss,et al.  Multiple molecular dynamics simulations of human p450 monooxygenase CYP2C9: The molecular basis of substrate binding and regioselectivity toward warfarin , 2006, Proteins.

[37]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[38]  Hwangseo Park,et al.  Structural and dynamical basis of broad substrate specificity, catalytic mechanism, and inhibition of cytochrome P450 3A4. , 2005, Journal of the American Chemical Society.

[39]  Adrian J Mulholland,et al.  Electronic structure of compound I in human isoforms of cytochrome P450 from QM/MM modeling. , 2005, Journal of the American Chemical Society.

[40]  Yu Zong Chen,et al.  Prediction of Cytochrome P450 3A4, 2D6, and 2C9 Inhibitors and Substrates by Using Support Vector Machines , 2005, J. Chem. Inf. Model..

[41]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  R. Friesner,et al.  Ab initio quantum chemical and mixed quantum mechanics/molecular mechanics (QM/MM) methods for studying enzymatic catalysis. , 2005, Annual review of physical chemistry.

[43]  D. Lewis,et al.  57 varieties: the human cytochromes P450. , 2004, Pharmacogenomics.

[44]  F Peter Guengerich,et al.  Cytochromes P450, drugs, and diseases. , 2003, Molecular interventions.

[45]  G. Szklarz,et al.  Molecular Modeling of Cytochrome P450 1A1: Enzyme-Substrate Interactions and Substrate Binding Affinities , 2002, Journal of biomolecular structure & dynamics.

[46]  Sean Ekins,et al.  Pharmacophore modeling of cytochromes P450. , 2002, Advanced drug delivery reviews.

[47]  S. Ekins,et al.  Pharmacophore and three-dimensional quantitative structure activity relationship methods for modeling cytochrome p450 active sites. , 2001, Drug metabolism and disposition: the biological fate of chemicals.

[48]  L. Lesko,et al.  In vivo drug‐drug interaction studies‐ A survey of all new molecular entities approved from 1987 to 1997 , 2000, Clinical pharmacology and therapeutics.

[49]  M. Relling,et al.  Pharmacogenomics: translating functional genomics into rational therapeutics. , 1999, Science.

[50]  Patrick Marroum,et al.  In vitro metabolic interaction studies: Experience of the Food and Drug Administration , 1999, Clinical pharmacology and therapeutics.

[51]  M H Tarbit,et al.  Structural determinants of cytochrome P450 substrate specificity, binding affinity and catalytic rate. , 1998, Chemico-biological interactions.

[52]  Y. Sugiyama,et al.  Prediction of pharmacokinetic alterations caused by drug-drug interactions: metabolic interaction in the liver. , 1998, Pharmacological reviews.

[53]  Dennis A. Smith,et al.  Properties of cytochrome P450 isoenzymes and their substrates Part 2: properties of cytochrome P450 substrates , 1997 .

[54]  T. Halgren Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 , 1996, J. Comput. Chem..

[55]  W. M. Haynes CRC Handbook of Chemistry and Physics , 1990 .

[56]  C. Tomes CHEMISTRY AND PHYSICS , 1903 .