A Quantitative Approach to the Estimation of Chemical Space from a Given Geometry by the Combination of Atomic Species

In order to estimate the size of chemical spaces, a quantitative approach was employed consisting of summing all possible combinations of atomic species generated computationally for a given three-dimensional geometry. The atomic species were comprised of heavy atoms (N, C, O, S and Cl) and various numbers of H atoms (e.g., CH3 and CH2 groups). The assignment of atomic species to a given geometry maintained consistent bond orders between adjacent atoms. Between 108 and 1019 compounds were generated from individual geometries. A slightly smaller number of compounds were obtained by applying a nonleadlikeness filter that considered the properties of the substituents. Principal component analysis using descriptors representing the molecular structures and properties revealed that the compounds generated from only one geometry largely covered the chemical space of known compounds. Moreover, the compounds with the best scores, defined as the interaction energies between the compounds and the protein, contained original ones within the entire collection of compounds generated from the same geometry. This method can be used to improve the scaffold of compound structures during drug development.

[1]  N. Cohen,et al.  The NEWLEAD program: a new method for the design of candidate structures from pharmacophoric hypotheses. , 1993, Journal of medicinal chemistry.

[2]  Akiko Itai,et al.  Automatic creation of drug candidate structures based on receptor structure. Starting point for artificial lead generation , 1991 .

[3]  Tudor I. Oprea,et al.  Chemical space navigation in lead discovery. , 2002, Current opinion in chemical biology.

[4]  C. Lipinski Drug-like properties and the causes of poor solubility and poor permeability. , 2000, Journal of pharmacological and toxicological methods.

[5]  G. Rishton Nonleadlikeness and leadlikeness in biochemical screening. , 2003, Drug discovery today.

[6]  Jürgen Bajorath,et al.  Molecular Descriptors for Effective Classification of Biologically Active Compounds Based on Principal Component Analysis Identified by a Genetic Algorithm , 2000, J. Chem. Inf. Comput. Sci..

[7]  S. Paula,et al.  Molecular determinants of thapsigargin binding by SERCA Ca2+‐ATPase: A computational docking study , 2004, Proteins.

[8]  Jürgen Bajorath,et al.  Integration of virtual and high-throughput screening , 2002, Nature Reviews Drug Discovery.

[9]  Lars Hemmingsen,et al.  Docking and scoring of metallo-β-lactamases inhibitors , 2004, J. Comput. Aided Mol. Des..

[10]  Tudor I. Oprea,et al.  Pursuing the leadlikeness concept in pharmaceutical research. , 2004, Current opinion in chemical biology.

[11]  Gang Chen,et al.  A New Rapid and Effective Chemistry Space Filter in Recognizing a Druglike Database , 2005, J. Chem. Inf. Model..

[12]  Regine Bohacek,et al.  Multiple Highly Diverse Structures Complementary to Enzyme Binding Sites: Results of Extensive Application of a de Novo Design Method Incorporating Combinatorial Growth , 1994 .

[13]  P. Kollman,et al.  An all atom force field for simulations of proteins and nucleic acids , 1986, Journal of computational chemistry.

[14]  Tudor I. Oprea Current trends in lead discovery: Are we looking for the appropriate properties? , 2002, J. Comput. Aided Mol. Des..

[15]  P Willett,et al.  Docking small-molecule ligands into active sites. , 1995, Current opinion in biotechnology.

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  Daniel A. Gschwend,et al.  Orientational sampling and rigid‐body minimization in molecular docking , 1993, Proteins.

[18]  A Itai,et al.  Confirmation of usefulness of a structure construction program based on three-dimensional receptor structure for rational lead generation. , 1993, Journal of medicinal chemistry.

[19]  Gordon M. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions , 1999, J. Chem. Inf. Comput. Sci..

[20]  Hans-Joachim Böhm,et al.  LUDI: rule-based automatic design of new substituents for enzyme inhibitor leads , 1992, J. Comput. Aided Mol. Des..

[21]  Eugene I. Shakhnovich,et al.  SMOG : DE NOVO DESIGN METHOD BASED ON SIMPLE, FAST, AND ACCURATE FREE ENERGY ESTIMATES. 2. CASE STUDIES IN MOLECULAR DESIGN , 1997 .

[22]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[23]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[24]  Tudor I. Oprea,et al.  Chemography: the Art of Navigating in Chemical Space , 2000 .

[25]  A. Hawkins,et al.  Structure-based design, synthesis, and biological evaluation of inhibitors of Mycobacterium tuberculosis type II dehydroquinase. , 2005, Journal of medicinal chemistry.

[26]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[27]  D K Gehlhaar,et al.  De novo design of enzyme inhibitors by Monte Carlo ligand generation. , 1995, Journal of medicinal chemistry.