Prediction of hERG Liability – Using SVM Classification, Bootstrapping and Jackknifing

Drug‐induced QT prolongation can lead to life‐threatening cardiotoxicity, mostly through blockage of the potassium ion (K+) channels encoded by the human ether‐à‐go‐go‐related gene (hERG). The hERG channel is one of the most important antitargets to address in the early stages of the drug discovery process, in order to avoid more costly failures in the development phase. Using a thallium flux assay, 4,323 molecules were screened for hERG channel inhibition in a quantitative high‐throughput screening (qHTS) format. Here, we present support vector classification (SVC) models of hERG channel inhibition with an average area under the receiver operating characteristic curve (AUC‐ROC) of 0.93 for the tested compounds. Both jackknifing and bootstrapping were employed to rebalance the heavily biased training datasets, and the impact of these two under‐sampling rebalancing methods on the performance of the predictive models is discussed. Our results indicated that the rebalancing techniques did not enhance the predictive power of the resulting models; instead, adopting optimal cutoffs could restore the desirable balance between sensitivity and specificity of the binary classifiers. On an external validation set of 66 drug molecules, the SVC model exhibited an AUC‐ROC of 0.86, further demonstrating the utility of this modeling approach for predicting hERG liabilities.
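The interplay between class imbalance, AUC‐ROC, and the decision cutoff can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy labels and scores are invented, the cutoff criterion (Youden's J, sensitivity + specificity - 1) is an assumption because the abstract does not state how the optimal cutoffs were chosen, and `undersample_majority` is a generic random under‐sampler rather than the jackknifing or bootstrapping protocol used in the study. All function names are hypothetical.

```python
import random


def sensitivity_specificity(labels, scores, cutoff):
    """Return (sensitivity, specificity) when score >= cutoff predicts class 1."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= cutoff)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < cutoff)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < cutoff)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


def optimal_cutoff(labels, scores):
    """Pick the observed score that maximizes Youden's J (assumed criterion)."""
    best_cutoff, best_j = None, -1.0
    for cutoff in sorted(set(scores)):
        sens, spec = sensitivity_specificity(labels, scores, cutoff)
        j = sens + spec - 1.0
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff


def auc_roc(labels, scores):
    """AUC-ROC as the fraction of (active, inactive) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def undersample_majority(labels, rng):
    """Generic random under-sampling: keep every minority-class index and an
    equally sized random draw (without replacement) from the majority class."""
    pos_idx = [i for i, y in enumerate(labels) if y == 1]
    neg_idx = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = sorted([pos_idx, neg_idx], key=len)
    return sorted(minority + rng.sample(majority, len(minority)))


# Invented toy data: 8 inactives (0) and 2 actives (1), with classifier scores
# skewed low, as is typical for a model trained on an imbalanced set.
labels = [0] * 8 + [1] * 2
scores = [0.05, 0.10, 0.12, 0.15, 0.20, 0.22, 0.30, 0.45, 0.35, 0.40]

if __name__ == "__main__":
    print("AUC-ROC:", auc_roc(labels, scores))
    print("sens/spec at 0.5:", sensitivity_specificity(labels, scores, 0.5))
    cutoff = optimal_cutoff(labels, scores)
    print("chosen cutoff:", cutoff)
    print("sens/spec at chosen cutoff:", sensitivity_specificity(labels, scores, cutoff))
    print("balanced subset indices:", undersample_majority(labels, random.Random(0)))
```

In this toy case the ranking quality (AUC‐ROC) is a property of the scores alone and does not depend on the cutoff, whereas sensitivity and specificity change sharply when the cutoff moves from the default 0.5 to the selected value. That is the sense in which adjusting the cutoff, rather than rebalancing the training set, can restore the balance between the two.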
