Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles

Using a diverse collection of small molecules generated from a variety of sources, we measured protein-binding activities of each individual compound against each of 100 diverse (sequence-unrelated) proteins using small-molecule microarrays. We also analyzed structural features, including complexity, of the small molecules. We found that compounds from different sources (commercial, academic, natural) have different protein-binding behaviors and that these behaviors correlate with general trends in stereochemical and shape descriptors for these compound collections. Increasing the content of sp3-hybridized and stereogenic atoms relative to compounds from commercial sources, which comprise the majority of current screening collections, improved binding selectivity and frequency. The results suggest structural features that synthetic chemists can target when synthesizing screening collections for biological discovery. Because binding proteins selectively can be a key feature of high-value probes and drugs, synthesizing compounds having features identified in this study may result in improved performance of screening collections.

[1]  P. Clemons,et al.  Distinct Biological Network Properties between the Targets of Natural Products and Disease Genes , 2010, Journal of the American Chemical Society.

[2]  P. Andersson,et al.  Highly flexible synthesis of chiral azacycles via iridium-catalyzed hydrogenation. , 2010, Journal of the American Chemical Society.

[3]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[4]  D. Dixon,et al.  Enantioselective Broensted Acid Catalyzed N-Acyliminium Cyclization Cascades. , 2010 .

[5]  C. Humblet,et al.  Escape from flatland: increasing saturation as an approach to improving clinical success. , 2009, Journal of medicinal chemistry.

[6]  D. Dixon,et al.  Enantioselective Brønsted acid-catalyzed N-acyliminium cyclization cascades. , 2009, Journal of the American Chemical Society.

[7]  Rajarshi Guha,et al.  Chemoinformatic Analysis of Combinatorial Libraries, Drugs, Natural Products, and Molecular Libraries Small Molecule Repository , 2009, J. Chem. Inf. Model..

[8]  A. Ganesan The impact of natural products upon modern drug discovery. , 2008, Current opinion in chemical biology.

[9]  Yvonne C. Martin,et al.  Application of Belief Theory to Similarity Data Fusion for Use in Analog Searching and Lead Hopping , 2008, J. Chem. Inf. Model..

[10]  Martin Serrano,et al.  Nucleic Acids Research Advance Access published October 18, 2007 ChemBank: a small-molecule screening and , 2007 .

[11]  Peter Ertl,et al.  Cheminformatics analysis of natural products: lessons from nature inspiring the design of new drugs. , 2008, Progress in drug research. Fortschritte der Arzneimittelforschung. Progres des recherches pharmaceutiques.

[12]  Paul A Clemons,et al.  A pipeline for ligand discovery using small-molecule microarrays. , 2007, Current opinion in chemical biology.

[13]  Shun Su,et al.  Discovery of chemical reactions through multidimensional screening. , 2007, Journal of the American Chemical Society.

[14]  R. Grubbs Olefin-metathesis catalysts for the preparation of molecules and materials (Nobel Lecture). , 2006, Angewandte Chemie.

[15]  Stuart L Schreiber,et al.  A robust small-molecule microarray platform for screening cell lysates. , 2006, Chemistry & biology.

[16]  Peter Ertl,et al.  Relationships between Molecular Complexity, Biological Activity, and Structural Diversity , 2006, J. Chem. Inf. Model..

[17]  D. Rogers,et al.  Using Extended-Connectivity Fingerprints with Laplacian-Modified Bayesian Analysis in High-Throughput Screening Follow-Up , 2005, Journal of biomolecular screening.

[18]  A. Hopkins,et al.  Navigating chemical space for biology and medicine , 2004, Nature.

[19]  S. Schreiber,et al.  A library of spirooxindoles based on a stereoselective three-component coupling reaction. , 2004, Journal of the American Chemical Society.

[20]  Stuart L Schreiber,et al.  Discovery of an inhibitor of a transcription factor using small molecule microarrays and diversity-oriented synthesis. , 2003, Journal of the American Chemical Society.

[21]  Miklos Feher,et al.  Property Distributions: Differences between Drugs, Natural Products, and Molecules from Combinatorial Chemistry , 2003, J. Chem. Inf. Comput. Sci..

[22]  K. Sharpless,et al.  Searching for new reactivity (Nobel lecture). , 2002, Angewandte Chemie.

[23]  Maliha S. Nash,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 2001, Technometrics.

[24]  Andrew R. Leach,et al.  Molecular Complexity and Its Impact on the Probability of Finding Leads for Drug Discovery , 2001, J. Chem. Inf. Comput. Sci..

[25]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[26]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[27]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[28]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[29]  E. Corey,et al.  The Logic of Chemical Synthesis , 1989 .

[30]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[31]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[32]  C. Hansch,et al.  Analysis of the structure-activity relationship of the sulfonamide drugs using substituent constants. , 1967, Journal of medicinal chemistry.

[33]  C. Hansch,et al.  SUBSTITUENT CONSTANTS FOR ALIPHATIC FUNCTIONS OBTAINED FROM PARTITION COEFFICIENTS. , 1965, Journal of medicinal chemistry.

[34]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .