Selective Fusion of Heterogeneous Classifiers for Predicting Substrates of Membrane Transporters

Membrane transporters play a crucial role in determining fate of administered drugs in a biological system. Early identification of plausible transporters for a drug molecule can provide insights into its therapeutic, pharmacokinetic, and toxicological profiles. In the present study, predictive models for classifying small molecules into substrates and nonsubstrates of various pharmaceutically important membrane transporters were developed using quantitative structure-activity relationship (QSAR) and proteochemometric (PCM) approaches. For this purpose, 4575 substrate interactions for these transporters were collected from the Metabolism and Transport Database (Metrabase) and the literature. The transporters selected for this study include (i) six efflux transporters, viz., breast cancer resistance protein (BCRP/ABCG2), P-glycoprotein (P-gp/MDR1), and multidrug resistance proteins (MRP1, MRP2, MRP3, and MRP4), and (ii) seven influx transporters, viz., organic cation transporter (OCT1/SO22A1), peptide transporter (PEPT1/SO15A1), apical sodium-bile acid transporter (ASBT/NTCP2), and organic anion transporting peptides (OATP1A2/SO1A2, OATP1B/SO1B1, OATP1B3/SO1B3, and OATP2B1/SO2B1). Various types of descriptors and machine learning methods (classifiers) were evaluated for the development of robust predictive models. Additionally, ensemble models were developed by bagging of homogeneous classifiers and selective fusion of heterogeneous classifiers. It was observed that the latter approach improves the accuracy of substrate/nonsubstrate prediction for transporters (average correct classification rate of more than 0.80 for external validation). Moreover, structural fragments important in determining the substrate specificity across the various transporters were identified. To demonstrate these fragments on the query molecule, contour maps were generated. The prediction efficacy of the developed models was illustrated by a good correlation between the reported logBB value of a molecule and its predicted substrate propensity for blood-brain barrier transporters. Conclusively, this comprehensive modeling analysis can be efficiently employed for the prediction of membrane transporters of a drug, thereby providing insights into its pharmacological profile.

[1]  P. Prusis,et al.  Visually Interpretable Models of Kinase Selectivity Related Features Derived from Field-Based Proteochemometrics , 2013, J. Chem. Inf. Model..

[2]  Prabha Garg,et al.  Classification of Breast Cancer Resistant Protein (BCRP) Inhibitors and Non-Inhibitors Using Machine Learning Approaches. , 2015, Combinatorial chemistry & high throughput screening.

[3]  James G. Nourse,et al.  Reoptimization of MDL Keys for Use in Drug Discovery , 2002, J. Chem. Inf. Comput. Sci..

[4]  Dong-Sheng Cao,et al.  PyDPI: Freely Available Python Package for Chemoinformatics, Bioinformatics, and Chemogenomics Studies , 2013, J. Chem. Inf. Model..

[5]  L Zhang,et al.  Emerging Transporters of Clinical Importance: An Update From the International Transporter Consortium , 2013, Clinical pharmacology and therapeutics.

[6]  J. Platts,et al.  Correlation and prediction of a large blood-brain distribution data set--an LFER study. , 2001, European journal of medicinal chemistry.

[7]  Dan Li,et al.  ADMET evaluation in drug discovery. 13. Development of in silico prediction models for P-glycoprotein substrates. , 2014, Molecular pharmaceutics.

[8]  Zsolt Bikádi,et al.  Predicting substrates of the human breast cancer resistance protein using a support vector machine method , 2013, BMC Bioinformatics.

[9]  L. Hall,et al.  Modeling Blood-Brain Barrier Partitioning Using the Electrotopological State. , 2002 .

[10]  Ola Spjuth,et al.  Chemoinformatics Taking Biology into Account: Proteochemometrics , 2011 .

[11]  Marlene T. Kim,et al.  Developing Enhanced Blood–Brain Barrier Permeability Models: Integrating External Bio-Assay Data in QSAR Modeling , 2015, Pharmaceutical Research.

[12]  S. Cole,et al.  Multidrug resistance proteins: role of P-glycoprotein, MRP1, MRP2, and BCRP (ABCG2) in tissue defense. , 2005, Toxicology and applied pharmacology.

[13]  Hui Zhang,et al.  A prediction model of substrates and non-substrates of breast cancer resistance protein (BCRP) developed by GA-CG-SVM method , 2011, Comput. Biol. Medicine.

[14]  C. Luo,et al.  Computational models for predicting interactions with membrane transporters. , 2013, Current Medicinal Chemistry.

[15]  P. Garg,et al.  An improved approach for predicting drug-target interaction: proteochemometrics to molecular docking. , 2016, Molecular bioSystems.

[16]  Fabian Müller,et al.  Transporters and Drug-Drug Interactions: Important Determinants of Drug Disposition and Effects , 2013, Pharmacological Reviews.

[17]  Wolfgang Löscher,et al.  Role of drug efflux transporters in the brain for drug disposition and treatment of brain diseases , 2005, Progress in Neurobiology.

[18]  Peteris Prusis,et al.  Organic Compounds with Melanocortin Receptor Subtypes , 2004 .

[19]  M. Niemi,et al.  Membrane transporters in drug development , 2010, Nature Reviews Drug Discovery.

[20]  Tong Zhang,et al.  Solving large scale linear prediction problems using stochastic gradient descent algorithms , 2004, ICML.

[21]  Jos H. Beijnen,et al.  Increased sensitivity to anticancer drugs and decreased inflammatory response in mice lacking the multidrug resistance-associated protein , 1997, Nature Medicine.

[22]  Tomislav Šmuc,et al.  Accurate models for P-gp drug recognition induced from a cancer cell line cytotoxicity screen. , 2013, Journal of medicinal chemistry.

[23]  P. Artursson,et al.  Computational Prospecting for Drug–Transporter Interactions , 2013, Clinical pharmacology and therapeutics.

[24]  P. Garg,et al.  The role of multidrug resistance protein (MRP-1) as an active efflux transporter on blood–brain barrier (BBB) permeability , 2017, Molecular Diversity.

[25]  Sereina Riniker,et al.  Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods , 2013, Journal of Cheminformatics.

[26]  A. Tropsha,et al.  Human Intestinal Transporter Database: QSAR Modeling and Virtual Profiling of Drug Uptake, Efflux and Interactions , 2013, Pharmaceutical Research.

[27]  Shufeng Zhou,et al.  Role of multidrug resistance associated proteins in drug development. , 2008, Drug discoveries & therapeutics.

[28]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[29]  Andreas Bender,et al.  Metrabase: a cheminformatics and bioinformatics database for small molecule transporter data analysis and (Q)SAR modeling , 2015, Journal of Cheminformatics.

[30]  Christel A. S. Bergström,et al.  Computational modeling to predict the functions and impact of drug transporters , 2015, In Silico Pharmacology.

[31]  S. Cole,et al.  Substrate recognition and transport by multidrug resistance protein 1 (ABCC1) , 2006, FEBS letters.

[32]  Leslie Z Benet,et al.  Intestinal drug transporters: an overview. , 2013, Advanced drug delivery reviews.

[33]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery, 7. Prediction of Oral Absorption by Correlation and Classification , 2007, J. Chem. Inf. Model..

[34]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[35]  Prabha Garg,et al.  Role of breast cancer resistance protein (BCRP) as active efflux transporter on blood-brain barrier (BBB) permeability , 2014, Molecular Diversity.

[36]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[37]  F. Sharom ABC multidrug transporters: structure, function and role in chemoresistance. , 2008, Pharmacogenomics.

[38]  Vasanthanathan Poongavanam,et al.  Fingerprint-based in silico models for the prediction of P-glycoprotein substrates and inhibitors , 2012, Bioorganic & medicinal chemistry.

[39]  D. Sprous,et al.  QSAR in the pharmaceutical research setting: QSAR models for broad, large problems. , 2010, Current topics in medicinal chemistry.

[40]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[41]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[42]  N. Shaik,et al.  Drug efflux transporters in the CNS. , 2003, Advanced drug delivery reviews.

[43]  H. Samonigg,et al.  Clinical role of multidrug resistance protein 1 expression in chemotherapy resistance in early-stage breast cancer: the Austrian Breast and Colorectal Cancer Study Group. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.