Design of chemical libraries with potentially bioactive molecules applying a maximum common substructure concept

Success in small molecule screening relies heavily on the preselection of compounds. Here, we present a strategy for the enrichment of chemical libraries with potentially bioactive compounds integrating the collected knowledge of medicinal chemistry. Employing a genetic algorithm, substructures typically occurring in bioactive compounds were identified using the World Drug Index. Availability of compounds containing the selected substructures was analysed in vendor libraries, and the substructure-specific sublibraries were assembled. Compounds containing reactive, undesired functional groups were omitted. Using a diversity filter for both physico-chemical properties and the substructure composition, the compounds of all the sublibraries were ranked. Accordingly, a screening collection of 16,671 compounds was selected. Diversity and chemical space coverage of the collection indicate that it is highly diverse and well-placed in the chemical space spanned by bioactive compounds. Furthermore, secondary assay-validated hits presented in this study show the practical relevance of our library design strategy.

[1]  Y. Martin,et al.  3D database searching in drug design. , 1992, Journal of medicinal chemistry.

[2]  B. E. Evans,et al.  Methods for drug discovery: development of potent, selective, orally effective cholecystokinin antagonists. , 1988, Journal of Medicinal Chemistry.

[3]  A. Schuffenhauer,et al.  Charting biologically relevant chemical space: a structural classification of natural products (SCONP). , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Daniel James,et al.  Lessons Learnt from Assembling Screening Libraries for Drug Discovery for Neglected Diseases , 2007, ChemMedChem.

[5]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[6]  Mark Johnson,et al.  Using Molecular Equivalence Numbers To Visually Explore Structural Features that Distinguish Chemical Libraries , 2002, J. Chem. Inf. Comput. Sci..

[7]  Yvonne C. Martin Computer design of potentially bioactive molecules by geometric searching with ALADDIN , 1990 .

[8]  P. Labute A widely applicable set of descriptors. , 2000, Journal of molecular graphics & modelling.

[9]  John Davies,et al.  Design of small molecule libraries for NMR screening and other applications in drug discovery. , 2002, Current topics in medicinal chemistry.

[10]  Jürgen Bajorath,et al.  Molecular Descriptors for Effective Classification of Biologically Active Compounds Based on Principal Component Analysis Identified by a Genetic Algorithm , 2000, J. Chem. Inf. Comput. Sci..

[11]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[12]  Hugo O. Villar,et al.  Comments on the design of chemical libraries for screening , 2004, Molecular Diversity.

[13]  John J Irwin,et al.  How good is your screening library? , 2006, Current opinion in chemical biology.

[14]  Christopher W Murray,et al.  Fragment-based lead discovery using X-ray crystallography. , 2005, Journal of medicinal chemistry.

[15]  Johann Gasteiger,et al.  The Determination of Maximum Common Substructures by a Genetic Algorithm: Application in Synthesis Design and for the Structural Analysis of Biological Activity , 1994 .

[16]  Edward R Zartler,et al.  Fragonomics: fragment-based drug discovery. , 2005, Current opinion in chemical biology.

[17]  Jörg Rademann,et al.  Sensitized Detection of Inhibitory Fragments and Iterative Development of Non‐Peptidic Protease Inhibitors by Dynamic Ligation Screening† , 2008, Angewandte Chemie.

[18]  Arthur A. Patchett,et al.  Chapter 26. Privileged structures — An update , 2000 .

[19]  U. Abel,et al.  Modern methods to produce natural-product libraries. , 2002, Current opinion in chemical biology.

[20]  Dora M Schnur,et al.  Are target-family-privileged substructures truly privileged? , 2006, Journal of medicinal chemistry.

[21]  Jennifer L Miller,et al.  Recent developments in focused library design: targeting gene-families. , 2006, Current topics in medicinal chemistry.