An Accurate and Interpretable Bayesian Classification Model for Prediction of hERG Liability

Drug‐induced QT interval prolongation has been identified as a critical side‐effect of non‐cardiovascular therapeutic agents and has resulted in the withdrawal of many drugs from the market. As almost all cases of drug‐induced QT prolongation can be traced to the blockade of a voltage‐dependent potassium ion channel encoded by the hERG (the human ether‐à‐go‐go‐related gene), early identification of potential hERG channel blockers will decrease the risk of cardiotoxicity‐induced attritions in the later and more expensive development stage. Presented herein is a naive Bayes classifier to categorize hERG blockers into active and inactive classes, by using a universal, generic molecular descriptor system. 1 The naive Bayes classifier was built from a training set containing 1979 corporate compounds, and exhibited an ROC accuracy of 0.87. The model was validated on an external test set of 66 drugs, of which 58 were correctly classified. The cumulative probabilities reflected the confidence of prediction and were proven useful for the identification of hERG blockers. Relative performance was compared for two classifiers constructed from either an atom‐type‐based molecular descriptor or the long range functional class fingerprint descriptor FCFP_6. The combination of an atom‐typing descriptor and the naive Bayes classification technique enables the interpretation of the resulting model, which offers extra information for the design of compounds free of undesirable hERG activity.

[1]  Cavero,et al.  QT interval prolongation by non-cardiovascular drugs: issues and solutions for novel drug development. , 1999, Pharmaceutical science & technology today.

[2]  Jun Chen,et al.  A structural basis for drug-induced long QT syndrome. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[3]  F. Ashcroft,et al.  Crystal Structure of the Potassium Channel KirBac1.1 in the Closed State , 2003, Science.

[4]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[5]  Roy J. Vaz,et al.  Characterization of HERG potassium channel inhibition using CoMSiA 3D QSAR and homology modeling approaches. , 2003, Bioorganic & medicinal chemistry letters.

[6]  Hongmao Sun A naive bayes classifier for prediction of multidrug resistance reversal activity on the basis of atom typing. , 2005, Journal of medicinal chemistry.

[7]  J. Li,et al.  A two-state homology model of the hERG K+ channel: application to ligand binding. , 2005, Bioorganic & medicinal chemistry letters.

[8]  Bernd Wollnik,et al.  The antihistamine fexofenadine does not affect IKr currents in a case report of drug‐induced cardiac arrhythmia , 2002, British journal of pharmacology.

[9]  M. Cadene,et al.  X-ray structure of a voltage-dependent K+ channel , 2003, Nature.

[10]  Gisbert Schneider,et al.  A Virtual Screening Method for Prediction of the hERG Potassium Channel Liability of Compound Libraries , 2002, Chembiochem : a European journal of chemical biology.

[11]  R. MacKinnon,et al.  Structure of the cytoplasmic beta subunit-T1 assembly of voltage-dependent K+ channels. , 2000, Science.

[12]  G. Keserü Prediction of hERG potassium channel affinity by traditional and hologram qSAR methods. , 2003, Bioorganic & medicinal chemistry letters.

[13]  W. Crumb,et al.  Three-dimensional quantitative structure-activity relationship for inhibition of human ether-a-go-go-related gene potassium channel. , 2002, The Journal of pharmacology and experimental therapeutics.

[14]  Anthony E. Klon,et al.  Finding more needles in the haystack: A simple and efficient method for improving high-throughput docking results. , 2004, Journal of medicinal chemistry.

[15]  Michael C Sanguinetti,et al.  Voltage-dependent profile of human ether-a-go-go-related gene channel block is influenced by a single residue in the S6 transmembrane domain. , 2003, Molecular pharmacology.

[16]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[17]  Andreas Bender,et al.  Molecular Similarity Searching Using Atom Environments, Information-Based Feature Selection, and a Naïve Bayesian Classifier , 2004, J. Chem. Inf. Model..

[18]  B. Fermini,et al.  The impact of drug-induced QT interval prolongation on drug discovery and development , 2003, Nature Reviews Drug Discovery.

[19]  A. Cavalli,et al.  QT prolongation through hERG K+ channel blockade: Current knowledge and strategies for the early prediction during drug development , 2005, Medicinal research reviews.

[20]  Xiaoyang Xia,et al.  Classification of kinase inhibitors using a Bayesian model. , 2004, Journal of medicinal chemistry.

[21]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[22]  Youxing Jiang,et al.  Structure of the RCK Domain from the E. coli K+ Channel and Demonstration of Its Presence in the Human BK Channel , 2001, Neuron.

[23]  Hongmao Sun,et al.  A Universal Molecular Descriptor System for Prediction of LogP, LogS, LogBB, and Absorption , 2004, J. Chem. Inf. Model..

[24]  Maurizio Recanatini,et al.  Safety of Non-Antiarrhythmic Drugs that Prolong the QT Interval or Induce Torsade de Pointes , 2002, Drug safety.

[25]  Hongmao Sun Predicting ADMET Properties by Projecting onto Chemical Space?Benefits and Pitfalls , 2005 .

[26]  Gary Yellen,et al.  The inward rectification mechanism of the HERG cardiac potassium channel , 1996, Nature.

[27]  Robert Pearlstein,et al.  Understanding the structure-activity relationship of the human ether-a-go-go-related gene cardiac K+ channel. A model for bad behavior. , 2003, Journal of medicinal chemistry.

[28]  A. Camm,et al.  Relationships between preclinical cardiac electrophysiology, clinical QT interval prolongation and torsade de pointes for a broad range of drugs: evidence for a provisional safety margin in drug development. , 2003, Cardiovascular research.

[29]  A. Cavalli,et al.  Toward a pharmacophore for drugs inducing the long QT syndrome: insights from a CoMFA study of HERG K(+) channel blockers. , 2002, Journal of medicinal chemistry.

[30]  Jürgen Bajorath,et al.  Combinatorial Preferences Affect Molecular Similarity/Diversity Calculations Using Binary Fingerprints and Tanimoto Coefficients , 2000, J. Chem. Inf. Comput. Sci..