Applying machine learning techniques for ADME-Tox prediction: a review

Introduction: Pharmacokinetics involves the study of absorption, distribution, metabolism, excretion and toxicity of xenobiotics (ADME-Tox). In this sense, the ADME-Tox profile of a bioactive compound can impact its efficacy and safety. Moreover, efficacy and safety were considered some of the major causes of clinical failures in the development of new chemical entities. In this context, machine learning (ML) techniques have been often used in ADME-Tox studies due to the existence of compounds with known pharmacokinetic properties available for generating predictive models. Areas covered: This review examines the growth in the use of some ML techniques in ADME-Tox studies, in particular supervised and unsupervised techniques. Also, some critical points (e.g., size of the data set and type of output variable) must be considered during the generation of models that relate ADME-Tox properties and biological activity. Expert opinion: ML techniques have been successfully employed in pharmacokinetic studies, helping the complex process of designing new drug candidates from the use of reliable ML models. An application of this procedure would be the prediction of ADME-Tox properties from studies of quantitative structure–activity relationships or the discovery of new compounds from a virtual screening using filters based on results obtained from ML techniques.

[1]  Sunil S. Jambhekar Physicochemical and Biopharmaceutical Properties of Drug Substances and Pharmacokinetics , 2009 .

[2]  Bruno Sarmento,et al.  Models to predict intestinal absorption of therapeutic peptides and proteins. , 2012, Current drug metabolism.

[3]  Jie Shen,et al.  admetSAR: A Comprehensive Source and Free Tool for Assessment of Chemical ADMET Properties , 2012, J. Chem. Inf. Model..

[4]  Adriano D Andricopulo,et al.  Consensus hologram QSAR modeling for the prediction of human intestinal absorption. , 2012, Bioorganic & medicinal chemistry letters.

[5]  Xiaoling Li,et al.  In silico model of drug permeability across sublingual mucosa. , 2013, Archives of oral biology.

[6]  Chang-Ying Ma,et al.  In silico prediction of mitochondrial toxicity by using GA-CG-SVM approach. , 2009, Toxicology in vitro : an international journal published in association with BIBRA.

[7]  Philippe Vayer,et al.  Toward in silico structure-based ADMET prediction in drug discovery. , 2012, Drug discovery today.

[8]  Alexander Tropsha,et al.  k Nearest Neighbors QSAR Modeling as a Variational Problem: Theory and Applications , 2005, J. Chem. Inf. Model..

[9]  R. Cramer Partial Least Squares (PLS): Its strengths and limitations , 1993 .

[10]  Ingo Muegge,et al.  DemQSAR: predicting human volume of distribution and clearance of drugs , 2011, J. Comput. Aided Mol. Des..

[11]  Taravat Ghafourian,et al.  QSAR models for the prediction of plasma protein binding. , 2013, BioImpacts : BI.

[12]  Andreas Bender,et al.  Similarity Searching of Chemical Databases Using Atom Environment Descriptors (MOLPRINT 2D): Evaluation of Performance , 2004, J. Chem. Inf. Model..

[13]  Veerabahu Shanmugasundaram,et al.  Estimation of Aqueous Solubility of Organic Compounds with QSPR Approach , 2002, Pharmaceutical Research.

[14]  R. Shinde,et al.  Insights into the permeability of drugs and drug-likemolecules from MI-QSAR and HQSAR studies , 2012, Journal of Molecular Modeling.

[15]  Thomas Sander,et al.  OSIRIS, an Entirely in-House Developed Drug Discovery Informatics System , 2009, J. Chem. Inf. Model..

[16]  Igor V. Tetko,et al.  Estimation of Aqueous Solubility of Chemical Compounds Using E-State Indices , 2001, J. Chem. Inf. Comput. Sci..

[17]  Cheng Chen,et al.  MI-QSAR models for prediction of corneal permeability of organic compounds , 2006, Acta Pharmacologica Sinica.

[18]  Hui Zhang,et al.  An integrated scheme for feature selection and parameter setting in the support vector machine modeling and its application to the prediction of pharmacokinetic properties of drugs , 2009, Artif. Intell. Medicine.

[19]  Jarmo Huuskonen,et al.  Estimation of Aqueous Solubility for a Diverse Set of Organic Compounds Based on Molecular Topology , 2000, J. Chem. Inf. Comput. Sci..

[20]  R. Enriz,et al.  The legacy of the past, the reality of the present and the hopes of the future , 2005 .

[21]  C. Selassie,et al.  History of Quantitative Structure–Activity Relationships , 2010 .

[22]  Johann Gasteiger,et al.  Prediction of Aqueous Solubility of Organic Compounds Based on a 3D Structure Representation , 2003, J. Chem. Inf. Comput. Sci..

[23]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[24]  Stephan Schmidt,et al.  Significance of protein binding in pharmacokinetics and pharmacodynamics. , 2010, Journal of pharmaceutical sciences.

[25]  Adriano D. Andricopulo,et al.  A fragment-based approach for the in silico prediction of blood-brain barrier permeation , 2012 .

[26]  Lijuan Chen,et al.  Results of molecular docking as descriptors to predict human serum albumin binding affinity. , 2012, Journal of molecular graphics & modelling.

[27]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[28]  A. Tropsha,et al.  kappa Nearest neighbors QSAR modeling as a variational problem: theory and applications. , 2005, Journal of chemical information and modeling.

[29]  John C Lindon,et al.  Development of quantitative structure-metabolism (QSMR) relationships for substituted anilines based on computational chemistry , 2013, Xenobiotica; the fate of foreign compounds in biological systems.

[30]  Björn Krüger,et al.  The holistic integration of virtual screening in drug discovery. , 2013, Drug discovery today.

[31]  Emilio Xavier Esposito,et al.  Membrane‐Interaction Quantitative Structure—Activity Relationship (MI‐QSAR) Analyses of Skin Penetration Enhancers. , 2008 .

[32]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[33]  Wei Li,et al.  Three-class classification models of logS and logP derived by using GA–CG–SVM approach , 2009, Molecular Diversity.

[34]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 4. Prediction of Aqueous Solubility Based on Atom Contribution Approach , 2004, J. Chem. Inf. Model..

[35]  D. Sprous,et al.  QSAR in the pharmaceutical research setting: QSAR models for broad, large problems. , 2010, Current topics in medicinal chemistry.

[36]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[37]  Yuquan Wei,et al.  Prediction models of human plasma protein binding rate and oral bioavailability derived by using GA-CG-SVM method. , 2008, Journal of pharmaceutical and biomedical analysis.

[38]  Scott Boyer,et al.  Reaction Site Mapping of Xenobiotic Biotransformations , 2007, J. Chem. Inf. Model..

[39]  Tomasz Arodz,et al.  Computational methods in developing quantitative structure-activity relationships (QSAR): a review. , 2006, Combinatorial chemistry & high throughput screening.

[40]  R. Schoenwald,et al.  Pharmacokinetics in Drug Discovery and Development , 2002 .

[41]  I. Khanna,et al.  Drug discovery in pharmaceutical industry: productivity challenges and trends. , 2012, Drug discovery today.

[42]  J C Gertrudes,et al.  Machine learning techniques and drug design. , 2012, Current medicinal chemistry.

[43]  R D Benz,et al.  (Q)SAR Modeling and Safety Assessment in Regulatory Review , 2012, Clinical pharmacology and therapeutics.

[44]  O. Engkvist,et al.  Beyond size, ionization state, and lipophilicity: influence of molecular topology on absorption, distribution, metabolism, excretion, and toxicity for druglike compounds. , 2012, Journal of medicinal chemistry.

[45]  Tatiana Nikolskaya,et al.  Prediction of Organ Toxicity Endpoints by QSAR Modeling Based on Precise Chemical‐Histopathology Annotations , 2012, Chemical biology & drug design.

[46]  John Ferguson,et al.  The Use of Chemical Potentials as Indices of Toxicity , 1939 .

[47]  Andreas Bender,et al.  The challenges involved in modeling toxicity data in silico: a review. , 2012, Current pharmaceutical design.

[48]  Darko Butina,et al.  Modeling Aqueous Solubility , 2003, J. Chem. Inf. Comput. Sci..

[49]  A. Beresford,et al.  The emerging importance of predictive ADME simulation in drug discovery. , 2002, Drug discovery today.

[50]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[51]  I. Cockburn,et al.  The changing structure of the pharmaceutical industry. , 2004, Health affairs.

[52]  T. Baillie,et al.  Metabolism and toxicity of drugs. Two decades of progress in industrial drug metabolism. , 2008, Chemical research in toxicology.

[53]  Sheng Tian,et al.  ADME evaluation in drug discovery. 9. Prediction of oral bioavailability in humans based on molecular properties and structural fingerprints. , 2011, Molecular pharmaceutics.

[54]  Rebecca Denton,et al.  A rapid computational filter for predicting the rate of human renal clearance. , 2010, Journal of molecular graphics & modelling.

[55]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[56]  Hui Zhang,et al.  Support vector machine and pharmacophore-based prediction models of multidrug-resistance protein 2 (MRP2) inhibitors. , 2009, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[57]  K. Luthman,et al.  Caco-2 monolayers in experimental and theoretical predictions of drug transport. , 2001, Advanced drug delivery reviews.

[58]  Dong-Sheng Cao,et al.  Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity , 2010 .

[59]  Adriano D Andricopulo,et al.  Pharmacokinetic properties and in silico ADME modeling in drug discovery. , 2013, Medicinal chemistry (Shariqah (United Arab Emirates)).

[60]  Jianzhong Liu,et al.  Prediction and mechanistic interpretation of human oral drug absorption using MI-QSAR analysis. , 2007, Molecular pharmaceutics.

[61]  Hua Yu,et al.  A Novel Chemometric Method for the Prediction of Human Oral Bioavailability , 2012, International journal of molecular sciences.

[62]  Jerzy Leszczynski,et al.  CORAL: QSPR model of water solubility based on local and global SMILES attributes. , 2013, Chemosphere.

[63]  H. Abdi,et al.  Principal component analysis , 2010 .

[64]  Marc Decramer,et al.  Who can pay for innovative medicines? , 2013, European Respiratory Journal.

[65]  Hao Sun,et al.  Structure‐based Drug Metabolism Predictions for Drug Design , 2010, Chemical biology & drug design.

[66]  A. J. Hopfinger,et al.  Predicting Blood–Brain Barrier Partitioning of Organic Molecules Using Membrane–Interaction QSAR Analysis , 2002, Pharmaceutical Research.

[67]  Sean B. Holden,et al.  Support Vector Machines for ADME Property Classification , 2003 .

[68]  Adriano D. Andricopulo,et al.  PK/DB: database for pharmacokinetic properties and predictive in silico ADME models , 2008, Bioinform..

[69]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[70]  Adriano D Andricopulo,et al.  Hologram QSAR model for the prediction of human oral bioavailability. , 2007, Bioorganic & medicinal chemistry.

[71]  Berith F. Jensen,et al.  In silico prediction of cytochrome P450 2D6 and 3A4 inhibition using Gaussian kernel weighted k-nearest neighbor and extended connectivity fingerprints, including structural fragment analysis of inhibitors versus noninhibitors. , 2007, Journal of medicinal chemistry.

[72]  Andreas Zell,et al.  Kernel Functions for Attributed Molecular Graphs – A New Similarity‐Based Approach to ADME Prediction in Classification and Regression , 2006 .

[73]  T Scior,et al.  How to recognize and workaround pitfalls in QSAR studies: a critical review. , 2009, Current medicinal chemistry.

[74]  M Paul Gleeson,et al.  Plasma protein binding affinity and its relationship to molecular structure: an in-silico analysis. , 2007, Journal of medicinal chemistry.

[75]  K. Luthman,et al.  Correlation of drug absorption with molecular surface properties. , 1996, Journal of pharmaceutical sciences.

[76]  David G. Stork,et al.  Pattern Classification , 1973 .

[77]  Jie Yang,et al.  Predicting MDCK cell permeation coefficients of organic molecules using membrane-interaction QSAR analysis , 2005, Acta Pharmacologica Sinica.

[78]  Katherine Tsaioun,et al.  ADMET for Medicinal Chemists: A Practical Guide , 2011 .

[79]  Aki T Heikkinen,et al.  Modelling of drug disposition kinetics in in vitro intestinal absorption cell models. , 2010, Basic & clinical pharmacology & toxicology.

[80]  Olivier Sperandio,et al.  In Silico ADME/Tox Predictions , 2010 .

[81]  Antanas Verikas,et al.  Mining data with random forests: A survey and results of new tests , 2011, Pattern Recognit..

[82]  John S. Delaney,et al.  ESOL: Estimating Aqueous Solubility Directly from Molecular Structure , 2004, J. Chem. Inf. Model..

[83]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[84]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[85]  Ulf Norinder,et al.  In silico categorization of in vivo intrinsic clearance using machine learning. , 2013, Molecular pharmaceutics.

[86]  Carlos A. Montanari,et al.  In Silico Prediction of Human Plasma Protein Binding Using Hologram QSAR , 2007 .

[87]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.