Optimization and visualization of molecular diversity of combinatorial libraries

SummaryOne of the major goals of rational design of combinatorial libraries is to design libraries with maximum diversity to enhance the potential of finding active compounds in the initial rounds of high-throughput screening programs. We present strategies to visualize and optimize the structural diversity of sets of molecules, which can be either potential substituents to be attached at specific positions of the library scaffold, or entire molecules corresponding to enumerated libraries. The selection of highly diverse subsets of molecules from the library is based on the stochastic optimization of ‘Diversity’ functions using a single-point-mutation Monte Carlo technique. The Diversity functions are defined in terms of the distances among molecules in multidimensional property space resulting from the calculation of 2D and 3D molecular descriptors. Several Diversity functions, including an implementation of D-Optimal design, are applied to select diverse subsets and the results are compared. The diversity of the selected subsets of molecules is visualized by embedding the intermolecular distances, defined by the molecules in multidimensional property space, into a three-dimensional space.

[1]  A. Ghose,et al.  Atomic Physicochemical Parameters for Three‐Dimensional Structure‐Directed Quantitative Structure‐Activity Relationships I. Partition Coefficients as a Measure of Hydrophobicity , 1986 .

[2]  Peter Willett,et al.  Pharmacophoric pattern matching in files of 3-D chemical structures: election of interatomic distance screens , 1986 .

[3]  Peter C. Jurs,et al.  Descriptions of molecular shape applied in studies of structure/activity and structure/property relationships , 1987 .

[4]  Bradley D. Christie,et al.  Database structure and searching in MACCS-3D , 1990 .

[5]  Steven L. Teig,et al.  Chemical Function Queries for 3D Database Search , 1994, J. Chem. Inf. Comput. Sci..

[6]  I. W Nowell,et al.  Molecular Connectivity in Structure-Activity Analysis , 1986 .

[7]  M. Waldman,et al.  Peptide conformation in 3D RMS-space: Application to two peptide antagonists of endothelin , 1995 .

[8]  Louis Hodes,et al.  An Efficient Design for Chemical Structure Searching. I. The Screens , 1975, J. Chem. Inf. Comput. Sci..

[9]  L. Hall,et al.  Molecular connectivity in chemistry and drug research , 1976 .

[10]  S. Unger Molecular Connectivity in Structure–activity Analysis , 1987 .

[11]  Danail Bonchev,et al.  Information theoretic indices for characterization of chemical structures , 1983 .

[12]  Peter Willett,et al.  Similarity-searching and clustering algorithms for processing databases of two-dimensional and three-dimensional chemical structures , 1995 .

[13]  Nick A. Farmer,et al.  The CAS ONLINE search system. 1. General system design and selection, generation, and use of search screens , 1983, J. Chem. Inf. Comput. Sci..

[14]  David T. Stanton,et al.  Computer-assisted prediction of normal boiling points of furans, tetrahydrofurans, and thiophenes , 1991, J. Chem. Inf. Comput. Sci..

[15]  Louis Hodes,et al.  An Efficient Design for Chemical Structure Searching. II. The File Oraganization , 1978, J. Chem. Inf. Comput. Sci..

[16]  W. J. Studden,et al.  Theory Of Optimal Experiments , 1972 .

[17]  D C Spellmeyer,et al.  Measuring diversity: experimental design of combinatorial libraries for drug discovery. , 1995, Journal of medicinal chemistry.

[18]  S. P. Fodor,et al.  Applications of combinatorial technologies to drug discovery. 1. Background and peptide combinatorial libraries. , 1994, Journal of medicinal chemistry.

[19]  Amedeo Caflisch,et al.  Computational combinatorial chemistry for de novo ligand design: Review and assessment , 1995 .

[20]  M Levitt,et al.  Molecular dynamics of native protein. II. Analysis and nature of motion. , 1983, Journal of molecular biology.

[21]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[22]  B. Everitt,et al.  Applied Multivariate Data Analysis. , 1993 .

[23]  David Bawden,et al.  Pharmacophoric pattern matching in files of 3d chemical structures: evaluation of search performance , 1987 .

[24]  David T. Stanton,et al.  Computer‐Assisted Prediction of Normal Boiling Points of Furans, Tetrahydrofurans, and Thiophenes. , 1991 .

[25]  S. P. Fodor,et al.  Applications of combinatorial technologies to drug discovery. 2. Combinatorial organic synthesis, library screening strategies, and future directions. , 1994, Journal of medicinal chemistry.

[26]  Arup K. Ghose,et al.  Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships. 4. Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics , 1989, J. Chem. Inf. Comput. Sci..

[27]  Ekaterina Gordeeva,et al.  Traditional topological indexes vs electronic, geometrical, and combined molecular descriptors in QSAR/QSPR research , 1993, J. Chem. Inf. Comput. Sci..