Ensemble Rule‐Based Classification of Substrates of the Human ABC‐Transporter ABCB1 Using Simple Physicochemical Descriptors

Within the last decades, the detailed knowledge on the impact of membrane bound drug efflux transporters of the ATP binding cassette (ABC) protein family on the pharmacological profile of drugs has enormously increased. Especially, ABCB1 (P‐glycoprotein, P‐gp, MDR1) has attracted particular interest in medicinal chemistry, since it determines the clinical efficacy, side effects and toxicity risks of drug candidates. Based on this, the development of in silico models that provide rapid and cost‐effective screening tools for the classification of substrates and nonsubstrates of ABCB1 is an urgent need in contemporary ADMET profiling. A characteristic hallmark feature of this transporter is its polyspecific ligand recognition pattern. In this study we describe a method for classifying ABCB1 ligands in terms of simple, conjunctive rules (RuleFit) based on interpretable ADMET features. The retrieved results showed that models based on large, very diverse data sets gave better classification performance than models based on smaller, more homogenous training sets. The best model achieved gave a correct classification rate of 0.90 for an external validation set. Furthermore, from the interpretation of the best performing model it could be concluded that in comparison to nonsubstrates ABCB1 substrates generally show a higher number of hydrogen‐bond acceptors, are more flexible and exhibit higher logP values.

[1]  M. Gottesman,et al.  Multidrug resistance in cancer: role of ATP–dependent transporters , 2002, Nature Reviews Cancer.

[2]  Gerhard Klebe,et al.  Use of 3D QSAR Models for Database Screening: A Feasibility Study , 2008, J. Chem. Inf. Model..

[3]  Thomas J. Raub,et al.  P-glycoprotein recognition of substrates and circumvention through rational drug design. , 2006, Molecular pharmaceutics.

[4]  Gerhard F Ecker,et al.  Inhibitors of p-glycoprotein--lead identification and optimisation. , 2005, Mini reviews in medicinal chemistry.

[5]  K. Linton,et al.  The ATP switch model for ABC transporters , 2004, Nature Structural &Molecular Biology.

[6]  Balázs Sarkadi,et al.  The role of ABC transporters in drug absorption, distribution, metabolism, excretion and toxicity (ADME-Tox). , 2008, Drug discovery today.

[7]  Remigijus Didziapetris,et al.  Classification Analysis of P-Glycoprotein Substrate Specificity , 2003, Journal of drug targeting.

[8]  K. Baumann Distance Profiles (DiP): A translationally and rotationally invariant 3D structure descriptor capturing steric properties of molecules , 2002 .

[9]  Miguel A. Cabrera,et al.  A topological substructural approach for the prediction of P-glycoprotein substrates. , 2006, Journal of pharmaceutical sciences.

[10]  Yiyu Cheng,et al.  Identifying P-Glycoprotein Substrates Using a Support Vector Machine Optimized by a Particle Swarm , 2007, J. Chem. Inf. Model..

[11]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[12]  David E. Leahy,et al.  Automated QSPR through Competitive Workflow , 2005, J. Comput. Aided Mol. Des..

[13]  H. Kubinyi,et al.  A scoring scheme for discriminating between drugs and nondrugs. , 1998, Journal of medicinal chemistry.

[14]  Robert Pearlstein,et al.  Understanding the structure-activity relationship of the human ether-a-go-go-related gene cardiac K+ channel. A model for bad behavior. , 2003, Journal of medicinal chemistry.

[15]  Bogdan E. Popescu,et al.  Gradient Directed Regularization for Linear Regression and Classi…cation , 2004 .

[16]  E. Lien,et al.  Structure–activity relationship: analyses of p‐glycoprotein substrates and inhibitors , 2003, Journal of clinical pharmacy and therapeutics.

[17]  Roy J. Vaz,et al.  Antitargets : prediction and prevention of drug side effects , 2008 .

[18]  A. Seelig A general pattern for substrate recognition by P-glycoprotein. , 1998, European journal of biochemistry.

[19]  Vijay K Gombar,et al.  Predicting P-glycoprotein substrates by a quantitative structure-activity relationship model. , 2004, Journal of pharmaceutical sciences.

[20]  J. F. Wang,et al.  Prediction of P-Glycoprotein Substrates by a Support Vector Machine Approach , 2004, J. Chem. Inf. Model..

[21]  A. Seelig,et al.  Structure-activity relationship of P-glycoprotein substrates and modifiers. , 2000, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[22]  Xin Chen,et al.  Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents , 2004, J. Chem. Inf. Model..

[23]  Alexander Golbraikh,et al.  Combinatorial QSAR Modeling of P-Glycoprotein Substrates , 2006, J. Chem. Inf. Model..

[24]  Bogdan E. Popescu,et al.  PREDICTIVE LEARNING VIA RULE ENSEMBLES , 2008, 0811.1679.

[25]  G Ecker,et al.  Structural requirements for activity of propafenone-type modulators in P-glycoprotein-mediated multidrug resistance. , 1996, Molecular pharmacology.

[26]  Gerhard F. Ecker,et al.  In silico prediction of substrate properties for ABC-multidrug transporters , 2008 .

[27]  Ajay N. Jain,et al.  Accurate and Interpretable Computational Modeling of Chemical Mutagenicity , 2008, J. Chem. Inf. Model..