Identification of Compounds That Interfere with High‐Throughput Screening Assay Technologies

A significant challenge in high‐throughput screening (HTS) campaigns is the identification of assay technology interference compounds. A Compound Interfering with an Assay Technology (CIAT) gives false readouts in many assays. CIATs are often considered viable hits and investigated in follow‐up studies, thus impeding research and wasting resources. In this study, we developed a machine‐learning (ML) model to predict CIATs for three assay technologies. The model was trained on known CIATs and non‐CIATs (NCIATs) identified in artefact assays and described by their 2D structural descriptors. Usual methods identifying CIATs are based on statistical analysis of historical primary screening data and do not consider experimental assays identifying CIATs. Our results show successful prediction of CIATs for existing and novel compounds and provide a complementary and wider set of predicted CIATs compared to BSF, a published structure‐independent model, and to the PAINS substructural filters. Our analysis is an example of how well‐curated datasets can provide powerful predictive models despite their relatively small size.

[1]  Jayme L. Dahlin,et al.  The essential roles of chemistry in high-throughput screening triage. , 2014, Future medicinal chemistry.

[2]  Tudor I. Oprea,et al.  Badapple: promiscuity patterns from noisy evidence , 2016, Journal of Cheminformatics.

[3]  Jürgen Bajorath,et al.  Determining the Degree of Promiscuity of Extensively Assayed Compounds , 2016, PloS one.

[4]  D. Bojanic,et al.  Impact of high-throughput screening in biomedical research , 2011, Nature Reviews Drug Discovery.

[5]  J Willem M Nissink,et al.  Quantification of frequent-hitter behavior based on historical high-throughput screening data. , 2014, Future medicinal chemistry.

[6]  P. Kenny,et al.  Comment on The Ecstasy and Agony of Assay Interference Compounds , 2017, J. Chem. Inf. Model..

[7]  Heiner Koch,et al.  The target landscape of clinical kinase drugs , 2017, Science.

[8]  Yuhong Du,et al.  A time-resolved fluorescence resonance energy transfer assay for high-throughput screening of 14-3-3 protein-protein interaction inhibitors. , 2013, Assay and drug development technologies.

[9]  T. Keating,et al.  Correction for Interference by Test Samples in High-Throughput Assays , 2009, Journal of biomolecular screening.

[10]  Jürgen Bajorath,et al.  Activity profiles of analog series containing pan assay interference compounds , 2017 .

[11]  Robert Nadon,et al.  Statistical practice in high-throughput screening data analysis , 2006, Nature Biotechnology.

[12]  Jayme L. Dahlin,et al.  PAINS in the Assay: Chemical Mechanisms of Assay Interference and Promiscuous Enzymatic Inhibition Observed during a Sulfhydryl-Scavenging HTS , 2015, Journal of medicinal chemistry.

[13]  J. Baell,et al.  New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays. , 2010, Journal of medicinal chemistry.

[14]  Anton Simeonov,et al.  AlphaScreen-Based Assays: Ultra-High-Throughput Screening for Small-Molecule Inhibitors of Challenging Enzymes and Protein-Protein Interactions. , 2016, Methods in molecular biology.

[15]  Nils-Ole Friedrich,et al.  Hit Dexter: A Machine‐Learning Model for the Prediction of Frequent Hitters , 2018, ChemMedChem.

[16]  Andrew Pannifer,et al.  The importance of triaging in determining the quality of output from high-throughput screening. , 2015, Future medicinal chemistry.

[17]  Alexander Tropsha,et al.  Phantom PAINS: Problems with the Utility of Alerts for Pan-Assay INterference CompoundS , 2017, J. Chem. Inf. Model..

[18]  S. Muresan,et al.  Chemical predictive modelling to improve compound quality , 2013, Nature Reviews Drug Discovery.

[19]  Petra Schneider,et al.  Privileged Structures Revisited , 2017, Angewandte Chemie.

[20]  Johannes Kirchmair,et al.  Hit Dexter 2.0: Machine-Learning Models for the Prediction of Frequent Hitters , 2019, J. Chem. Inf. Model..

[21]  Ola Engkvist,et al.  Utility of Resazurin, Horseradish Peroxidase, and NMR Assays to Identify Redox-Related False-Positive Behavior in High-Throughput Screens. , 2018, Assay and drug development technologies.

[22]  Andrew C. Good,et al.  An Empirical Process for the Design of High-Throughput Screening Deck Filters , 2006, J. Chem. Inf. Model..

[23]  Martin Romacker,et al.  Evolving BioAssay Ontology (BAO): modularization, integration and applications , 2014, Journal of Biomedical Semantics.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  R. Hertzberg,et al.  High-throughput screening: new technology for the 21st century. , 2000, Current opinion in chemical biology.

[26]  Evan Bolton,et al.  PubChem 2019 update: improved access to chemical data , 2018, Nucleic Acids Res..

[27]  Yang Song,et al.  Development of FRET Assay into Quantitative and High-throughput Screening Technology Platforms for Protein–Protein Interactions , 2010, Annals of Biomedical Engineering.

[28]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[29]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[30]  J. Bajorath,et al.  How Promiscuous Are Pharmaceutically Relevant Compounds? A Data-Driven Assessment , 2012, The AAPS Journal.

[31]  Petra Schneider,et al.  Privilegierte Strukturen neu betrachtet , 2017 .

[32]  Michael Sattler,et al.  Luciferase Advisor: High-Accuracy Model To Flag False Positive Hits in Luciferase HTS Assays , 2018, J. Chem. Inf. Model..

[33]  Lewis R. Vidler,et al.  Investigating the Behavior of Published PAINS Alerts Using a Pharmaceutical Company Data Set , 2018, ACS medicinal chemistry letters.

[34]  J Willem M Nissink,et al.  Seven Year Itch: Pan-Assay Interference Compounds (PAINS) in 2017—Utility and Limitations , 2017, ACS chemical biology.

[35]  Alexander Böcker,et al.  HTS Promiscuity Analyses for Accelerating Decision Making , 2011, Journal of biomolecular screening.