On the Relationship between Molecular Hit Rates in High-Throughput Screening and Molecular Descriptors

High-throughput screening (HTS) is widely used in the pharmaceutical industry to identify novel chemical starting points for drug discovery projects. The current study focuses on the relationship between molecular hit rate in recent in-house HTS and four common molecular descriptors: lipophilicity (ClogP), size (heavy atom count, HEV), fraction of sp3-hybridized carbons (Fsp3), and fraction of molecular framework (fMF). The molecular hit rate is defined as the fraction of times the molecule has been assigned as active in the HTS campaigns where it has been screened. Beta-binomial statistical models were built to model the molecular hit rate as a function of these descriptors. The advantage of the beta-binomial statistical models is that the correlation between the descriptors is taken into account. Higher degree polynomial terms of the descriptors were also added into the beta-binomial statistic model to improve the model quality. The relative influence of different molecular descriptors on molecular hit rate has been estimated, taking into account that the descriptors are correlated to each other through applying beta-binomial statistical modeling. The results show that ClogP has the largest influence on the molecular hit rate, followed by Fsp3 and HEV. fMF has only a minor influence besides its correlation with the other molecular descriptors.

[1]  Stuart Barber,et al.  All of Statistics: a Concise Course in Statistical Inference , 2005 .

[2]  A. Fliri,et al.  Biological spectra analysis: Linking biological activity profiles to molecular structure. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  P. Leeson,et al.  The influence of drug-like concepts on decision-making in medicinal chemistry , 2007, Nature Reviews Drug Discovery.

[4]  J. Peters,et al.  Pharmacological Promiscuity: Dependence on Compound Properties and Target Specificity in a Set of Recent Roche Compounds , 2009, ChemMedChem.

[5]  A. Bender,et al.  Modeling Promiscuity Based on in vitro Safety Pharmacology Profiling Data , 2007, ChemMedChem.

[6]  Stephen D Pickett,et al.  The impact of aromatic ring count on compound developability: further insights by examining carbo- and hetero-aromatic and -aliphatic ring types. , 2011, Drug discovery today.

[7]  John P. Overington,et al.  Can we rationally design promiscuous drugs? , 2006, Current opinion in structural biology.

[8]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[9]  S. Muresan,et al.  Investigation of the relationship between topology and selectivity for druglike molecules. , 2010, Journal of medicinal chemistry.

[10]  Paul D. Leeson,et al.  Impact of ion class and time on oral drug molecular properties , 2011 .

[11]  T. Ritchie,et al.  The impact of aromatic ring count on compound developability--are too many aromatic rings a liability in drug design? , 2009, Drug discovery today.

[12]  C. Humblet,et al.  Escape from flatland: increasing saturation as an approach to improving clinical success. , 2009, Journal of medicinal chemistry.

[13]  Wolfgang Guba,et al.  Development of a virtual screening method for identification of "frequent hitters" in compound libraries. , 2002, Journal of medicinal chemistry.

[14]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[15]  Larry Wasserman,et al.  All of Statistics , 2004 .

[16]  Alan Stobie,et al.  Pyridyl-phenyl ether monoamine reuptake inhibitors: Impact of lipophilicity on dual SNRI pharmacology and off-target promiscuity. , 2008, Bioorganic & medicinal chemistry letters.

[17]  F. Lovering,et al.  Escape from Flatland 2: complexity and promiscuity , 2013 .

[18]  D. Bojanic,et al.  Impact of high-throughput screening in biomedical research , 2011, Nature Reviews Drug Discovery.

[19]  David J Diller,et al.  Deriving knowledge through data mining high-throughput screening data. , 2004, Journal of medicinal chemistry.

[20]  D. Horvath,et al.  G-protein-coupled receptor affinity prediction based on the use of a profiling dataset: QSAR design, synthesis, and experimental validation. , 2005, Journal of medicinal chemistry.