Avoiding hERG-liability in drug design via synergetic combinations of different (Q)SAR methodologies and data sources: a case study in an industrial setting

In this paper, we explore the impact of combining different in silico prediction approaches and data sources on the predictive performance of the resulting system. We use inhibition of the hERG ion channel target as the endpoint for this study as it constitutes a key safety concern in drug development and a potential cause of attrition. We will show that combining data sources can improve the relevance of the training set in regard of the target chemical space, leading to improved performance. Similarly we will demonstrate that combining multiple statistical models together, and with expert systems, can lead to positive synergistic effects when taking into account the confidence in the predictions of the merged systems. The best combinations analyzed display a good hERG predictivity. Finally, this work demonstrates the suitability of the SOHN methodology for building models in the context of receptor based endpoints like hERG inhibition when using the appropriate pharmacophoric descriptors.

[1]  Liudmila Polonchuk,et al.  Toward a New Gold Standard for Early Safety: Automated Temperature-Controlled hERG Test on the PatchLiner® , 2012, Front. Pharmacol..

[2]  Simone Brogi,et al.  Computational Tool for Fast in silico Evaluation of hERG K+ Channel Affinity , 2017, Front. Chem..

[3]  D Tzivoni,et al.  Torsade de pointes. , 1989, American heart journal.

[4]  A. Ghose,et al.  Atomic Physicochemical Parameters for Three‐Dimensional Structure‐Directed Quantitative Structure‐Activity Relationships I. Partition Coefficients as a Measure of Hydrophobicity , 1986 .

[5]  Bo-Han Su,et al.  In Silico Binary Classification QSAR Models Based on 4D-Fingerprints and MOE Descriptors for Prediction of hERG Blockage , 2010, J. Chem. Inf. Model..

[6]  A. Fowkes,et al.  A Semi‐automated Approach to Create Purposeful Mechanistic Datasets from Heterogeneous Data: Data Mining Towards the in silico Predictions for Oestrogen Receptor Modulation and Teratogenicity , 2017, Molecular informatics.

[7]  Fabian P. Steinmetz,et al.  Screening Chemicals for Receptor‐Mediated Toxicological and Pharmacological Endpoints: Using Public Data to Build Screening Tools within a KNIME Workflow , 2015, Molecular informatics.

[8]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[9]  A. Cavalli,et al.  Toward a pharmacophore for drugs inducing the long QT syndrome: insights from a CoMFA study of HERG K(+) channel blockers. , 2002, Journal of medicinal chemistry.

[10]  Robert P. Sheridan,et al.  Time-Split Cross-Validation as a Method for Estimating the Goodness of Prospective Prediction , 2013, J. Chem. Inf. Model..

[11]  Ruifeng Liu,et al.  General Purpose 2D and 3D Similarity Approach to Identify hERG Blockers , 2016, J. Chem. Inf. Model..

[12]  Stéphane Werner,et al.  JPlogP: an improved logP predictor trained using predicted data , 2018, Journal of Cheminformatics.

[13]  A. Brown,et al.  A mechanism for the proarrhythmic effects of cisapride (Propulsid): high affinity blockade of the human cardiac potassium channel HERG , 1997, FEBS letters.

[14]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[15]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[16]  M. Sanguinetti,et al.  hERG potassium channels and cardiac arrhythmia , 2006, Nature.

[17]  B. Merget,et al.  Profiling Prediction of Kinase Inhibitors: Toward the Virtual Assay. , 2017, Journal of medicinal chemistry.

[18]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[19]  D. Altman,et al.  Statistics Notes: Diagnostic tests 2: predictive values , 1994, BMJ.

[20]  Michael Houghton,et al.  A human ether-á-go-go-related (hERG) ion channel atomistic model generated by long supercomputer molecular dynamics simulations and its use in predicting drug cardiotoxicity. , 2014, Toxicology letters.

[21]  Scott Boyer,et al.  Development, interpretation and temporal evaluation of a global QSAR of hERG electrophysiology screening data , 2007, J. Comput. Aided Mol. Des..

[22]  Oliver J. Britton,et al.  Human In Silico Drug Trials Demonstrate Higher Accuracy than Animal Models in Predicting Clinical Pro-Arrhythmic Cardiotoxicity , 2017, Front. Physiol..

[23]  A. Brown,et al.  HERG, a primary human ventricular target of the nonsedating antihistamine terfenadine. , 1996, Circulation.

[24]  Dan Li,et al.  ADMET Evaluation in Drug Discovery. 16. Predicting hERG Blockers by Combining Multiple Pharmacophores and Machine Learning Approaches. , 2016, Molecular pharmaceutics.

[25]  J. Warmke,et al.  A family of potassium channel genes related to eag in Drosophila and mammals. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[26]  B. Priest,et al.  Role of hERG potassium channel assays in drug development , 2008, Channels.

[27]  Samuel J. Webb,et al.  Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge , 2014, Journal of Cheminformatics.

[28]  Keiji Ogura,et al.  Construction of an integrated database for hERG blocking small molecules , 2018, PloS one.

[29]  C Barber,et al.  Applicability domain: towards a more formal definition$ , 2016, SAR and QSAR in environmental research.

[30]  M. Cases,et al.  Value of shared preclinical safety studies – The eTOX database , 2014, Toxicology reports.

[31]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[32]  U. Tillmann,et al.  A systematic approach for evaluating the quality of experimental toxicological and ecotoxicological data. , 1997, Regulatory toxicology and pharmacology : RTP.

[33]  Yuan Zhang,et al.  Modeling of the hERG K+ Channel Blockage Using Online Chemical Database and Modeling Environment (OCHEM) , 2017, Molecular informatics.

[34]  Joachim M. Buhmann,et al.  The Balanced Accuracy and Its Posterior Distribution , 2010, 2010 20th International Conference on Pattern Recognition.

[35]  Anton J. Hopfinger,et al.  4D-Fingerprints, Universal QSAR and QSPR Descriptors , 2004, J. Chem. Inf. Model..

[36]  W. D. Kaplan,et al.  The behavior of four neurological mutants of Drosophila. , 1969, Genetics.

[37]  Paul Czodrowski,et al.  hERG Me Out , 2013, J. Chem. Inf. Model..

[38]  M. Todd,et al.  Experimentally Validated Pharmacoinformatics Approach to Predict hERG Inhibition Potential of New Chemical Entities , 2018, Front. Pharmacol..

[39]  Alex M Aronov,et al.  Predictive in silico modeling for hERG channel blockers. , 2005, Drug discovery today.