Combinatorial informatics in the post-genomics era

The multitude of potential drug targets emerging from genome sequencing demands new approaches to drug discovery. A chemogenomics strategy, which involves the generation of small-molecule compounds that can be used both as tools to probe biological mechanisms and as leads for drug-property optimization, provides a highly parallel, industrialized solution. Key to the success of this strategy is an integrated suite of chemi-informatics applications that can allow the rapid and directed optimization of chemical compounds with drug-like properties using 'just-in-time' combinatorial chemical synthesis. An effective embodiment of this process requires new computational and data-mining tools that cover all aspects of library generation, compound selection and experimental design, and work effectively on a massive scale.

[1]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[2]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[3]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[4]  W. Cooley,et al.  Multivariate Data Analysis , 1972 .

[5]  R. Venkataraghavan,et al.  Atom pairs as molecular features in structure-activity studies: definition and applications , 1985, J. Chem. Inf. Comput. Sci..

[6]  Ramaswamy Nilakantan,et al.  Topological torsion: a new molecular descriptor for SAR applications. Comparison with other descriptors , 1987, J. Chem. Inf. Comput. Sci..

[7]  Robert P. Sheridan,et al.  3DSEARCH: a system for three-dimensional substructure searching , 1989, J. Chem. Inf. Comput. Sci..

[8]  F. Burden Molecular Identification Number for Substructure Searches. , 1989 .

[9]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[10]  N. W. Murrall,et al.  Conformational freedom in 3-D databases. 1. Techniques , 1990, J. Chem. Inf. Comput. Sci..

[11]  R. Sheridan,et al.  3DSEARCH: a system for three-dimensional substructure searching , 1989, J. Chem. Inf. Comput. Sci..

[12]  E. Keith Davies,et al.  Conformational Freedom in 3-D Databases , 1993 .

[13]  Marina Lasagni,et al.  New molecular descriptors for 2D and 3D structures. Theory , 1994 .

[14]  Peter Willett,et al.  Similarity Searching and Clustering of Chemical-Structure Databases Using Molecular Property Data , 1994, J. Chem. Inf. Comput. Sci..

[15]  Robert P. Sheridan,et al.  Using a Genetic Algorithm To Suggest Combinatorial Libraries , 1995, J. Chem. Inf. Comput. Sci..

[16]  D C Spellmeyer,et al.  Measuring diversity: experimental design of combinatorial libraries for drug discovery. , 1995, Journal of medicinal chemistry.

[17]  J. Gasteiger,et al.  Autocorrelation of Molecular Surface Properties for Modeling Corticosteroid Binding Globulin and Cytosolic Ah Receptor Activity by Neural Networks , 1995 .

[18]  G L Amidon,et al.  Transport approaches to the biopharmaceutical design of oral drug delivery systems: prediction of intestinal absorption. , 1996, Advanced drug delivery reviews.

[19]  Kim D. Janda,et al.  Molecular diversity and combinatorial chemistry : libraries and drug discovery , 1996 .

[20]  Yvonne C. Martin,et al.  Use of Structure-Activity Data To Compare Structure-Based Clustering Methods and Descriptors for Use in Compound Selection , 1996, J. Chem. Inf. Comput. Sci..

[21]  Robert D Clark,et al.  Neighborhood behavior: a useful concept for validation of "molecular diversity" descriptors. , 1996, Journal of medicinal chemistry.

[22]  Edward P. Jaeger,et al.  Application of Genetic Algorithms to Combinatorial Synthesis: A Computational Approach to Lead Identification and Lead Optimization†,∇ , 1996 .

[23]  Robert P. Sheridan,et al.  Chemical Similarity Using Geometric Atom Pair Descriptors , 1996, J. Chem. Inf. Comput. Sci..

[24]  Andreas Zell,et al.  Locating Biologically Active Compounds in Medium-Sized Heterogeneous Datasets by Topological Autocorrelation Vectors: Dopamine and Benzodiazepine Agonists , 1996, J. Chem. Inf. Comput. Sci..

[25]  Robert P. Sheridan,et al.  Chemical Similarity Using Physiochemical Property Descriptors , 1996, J. Chem. Inf. Comput. Sci..

[26]  Robert D Clark,et al.  Bioisosterism as a molecular diversity descriptor: steric fields of single "topomeric" conformers. , 1996, Journal of medicinal chemistry.

[27]  Klaus Gubernator,et al.  Optimization of the Biological Activity of Combinatorial Compound Libraries by a Genetic Algorithm. , 1996 .

[28]  David J. Cummins,et al.  Molecular Diversity in Chemical Databases: Comparison of Medicinal Chemistry Knowledge Bases and Databases of Commercially Available Compounds , 1996, J. Chem. Inf. Comput. Sci..

[29]  Dimitris K. Agrafiotis,et al.  Stochastic Algorithms for Maximizing Molecular Diversity , 1997, J. Chem. Inf. Comput. Sci..

[30]  P Willett,et al.  Comparison of algorithms for dissimilarity-based compound selection. , 1997, Journal of molecular graphics & modelling.

[31]  John Bradshaw,et al.  The Effectiveness of Reactant Pools for Generating Structurally-Diverse Combinatorial Libraries , 1997, J. Chem. Inf. Comput. Sci..

[32]  Yvonne C. Martin,et al.  The Information Content of 2D and 3D Structural Descriptors Relevant to Ligand-Receptor Binding , 1997, J. Chem. Inf. Comput. Sci..

[33]  Ian A. Watson,et al.  Experimental Designs for Selecting Molecules from Large Chemical Databases , 1997, J. Chem. Inf. Comput. Sci..

[35]  A. Good,et al.  New methodology for profiling combinatorial libraries and screening sets: cleaning up the design process with HARPick. , 1997, Journal of medicinal chemistry.

[36]  J. Spurlino,et al.  Serendipity meets precision: the integration of structure-based drug design and combinatorial chemistry for efficient drug discovery. , 1997, Structure.

[37]  H. Matter,et al.  Selecting optimally diverse compounds from structure databases: a validation study of two-dimensional and three-dimensional molecular descriptors. , 1997, Journal of medicinal chemistry.

[38]  James G. Nourse,et al.  Managing the Combinatorial Explosion , 1997, J. Chem. Inf. Comput. Sci..

[39]  Y. Martin,et al.  Designing combinatorial library mixtures using a genetic algorithm. , 1997, Journal of medicinal chemistry.

[40]  Ajay,et al.  Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? , 1998, Journal of medicinal chemistry.

[41]  Y. Martin,et al.  Computational methods in molecular diversity and combinatorial chemistry. , 1998, Current opinion in chemical biology.

[42]  P. Schleyer Encyclopedia of computational chemistry , 1998 .

[43]  Robert D. Clark,et al.  Virtual Compound Libraries: A New Approach to Decision Making in Molecular Discovery Research , 1998, J. Chem. Inf. Comput. Sci..

[44]  H. Kubinyi,et al.  A scoring scheme for discriminating between drugs and nondrugs. , 1998, Journal of medicinal chemistry.

[45]  Mark A. Murcko,et al.  Virtual screening : an overview , 1998 .

[46]  Sung Jin Cho,et al.  Rational Combinatorial Library Design. 1. Focus-2D: A New Approach to the Design of Targeted Combinatorial Chemical Libraries , 1998, J. Chem. Inf. Comput. Sci..

[47]  I D Kuntz,et al.  CombiDOCK: Structure-based combinatorial docking and library design , 1998, Journal of computer-aided molecular design.

[48]  Mark G. Bures,et al.  Validated Descriptors for Diversity Measurements and Optimization , 1998 .

[49]  GENERALIZED CLUSTER SIGNIFICANCE ANALYSIS AND STEPWISE CLUSTER SIGNIFICANCE ANALYSIS WITH CONDITIONAL PROBABILITIES , 1998 .

[50]  Robert S. Pearlman,et al.  Metric Validation and the Receptor-Relevant Subspace Concept , 1999, J. Chem. Inf. Comput. Sci..

[51]  S. L. Dixon,et al.  LASSOO: a generalized directed diversity approach to the design and enrichment of chemical libraries. , 1999, Journal of medicinal chemistry.

[52]  J. Mason,et al.  New 4-point pharmacophore method for molecular similarity and diversity applications: overview of the method and applications, including a novel approach to the design of combinatorial libraries containing privileged substructures. , 1999, Journal of medicinal chemistry.

[53]  Gareth Jones,et al.  Further Development of a Genetic Algorithm for Ligand Docking and Its Application to Screening Combinatorial Libraries , 1999 .

[54]  Tudor I. Oprea,et al.  The Design of Leadlike Combinatorial Libraries. , 1999, Angewandte Chemie.

[55]  A. N. Jain,et al.  IcePick: a flexible surface-based system for molecular diversity. , 1999, Journal of medicinal chemistry.

[56]  Schmid,et al.  "Scaffold-Hopping" by Topological Pharmacophore Search: A Contribution to Virtual Screening. , 1999, Angewandte Chemie.

[57]  Chris L. Waller,et al.  Development and Validation of a Novel Variable Selection Technique with Application to Multidimensional Quantitative Structure-Activity Relationship Studies , 1999, J. Chem. Inf. Comput. Sci..

[58]  Denis M. Bayada,et al.  Molecular Diversity and Representativity in Chemical Databases. , 1999 .

[59]  Dimitris K. Agrafiotis,et al.  An Efficient Implementation of Distance-Based Diversity Measures Based on k-d Trees , 1999, J. Chem. Inf. Comput. Sci..

[60]  Darren V. S. Green,et al.  Implementation of a System for Reagent Selection and Library Enumeration, Profiling, and Design , 1999, J. Chem. Inf. Comput. Sci..

[61]  Darren V. S. Green,et al.  Selecting Combinatorial Libraries to Optimize Diversity and Physical Properties. , 1999 .

[62]  S. Stanley Young,et al.  Approaches to the design of combinatorial libraries , 1999 .

[63]  A. N. Jain,et al.  Molecular hashkeys: a novel method for molecular characterization and its application for predicting important pharmaceutical properties of molecules. , 1999, Journal of medicinal chemistry.

[64]  Hans Matter,et al.  Comparing 3D Pharmacophore Triplets and 2D Fingerprints for Selecting Diverse Compound Subsets , 1999, J. Chem. Inf. Comput. Sci..

[65]  J. Wang,et al.  Toward designing drug-like libraries: a novel computational approach for prediction of drug feasibility of compounds. , 1999, Journal of combinatorial chemistry.

[66]  Osman F. Güner,et al.  Pharmacophore perception, development, and use in drug design , 2000 .

[67]  Darren V. S. Green,et al.  Where Are the GaPs? A Rational Approach to Monomer Acquisition and Selection , 2000, J. Chem. Inf. Comput. Sci..

[68]  Marvin Waldman,et al.  Evaluation of Reagent-Based and Product-Based Strategies in the Design of Combinatorial Library Subsets , 2000, J. Chem. Inf. Comput. Sci..

[69]  R P Sheridan,et al.  Designing targeted libraries with genetic algorithms. , 2000, Journal of molecular graphics & modelling.

[70]  Dimitris K. Agrafiotis,et al.  Stochastic Similarity Selections from Large Combinatorial Libraries. , 2000 .

[71]  P. Labute A widely applicable set of descriptors. , 2000, Journal of molecular graphics & modelling.

[72]  M Waldman,et al.  Novel algorithms for the optimization of molecular diversity of combinatorial libraries. , 2000, Journal of molecular graphics & modelling.

[73]  E J Martin,et al.  Oriented substituent pharmacophore PRopErtY space (OSPPREYS): a substituent-based calculation that describes combinatorial library products better than the corresponding product-based calculation. , 2000, Journal of molecular graphics & modelling.

[74]  Robert D. Brown,et al.  Combinatorial library design for diversity, cost efficiency, and drug-like character. , 2000, Journal of molecular graphics & modelling.

[75]  D K Agrafiotis,et al.  Kolmogorov-Smirnov statistic and its application in library design. , 2000, Journal of molecular graphics & modelling.

[76]  Dimitris K. Agrafiotis,et al.  Ultrafast Algorithm for Designing Focused Combinational Arrays , 2000, J. Chem. Inf. Comput. Sci..

[77]  Dimitris K. Agrafiotis,et al.  The Measurement of Molecular Diversity , 2000 .

[78]  G. Paderes,et al.  Efficient combinatorial filtering for desired molecular properties of reaction products. , 2000, Journal of molecular graphics & modelling.

[79]  Leach,et al.  The in silico world of virtual libraries. , 2000, Drug discovery today.

[80]  Eric J. Martin,et al.  Sensitivity Analysis and Other Improvements to Tailored Combinatorial Library Design , 2000, J. Chem. Inf. Comput. Sci..

[81]  G. Schneider,et al.  Virtual Screening for Bioactive Molecules , 2000 .

[82]  Robert P. Sheridan,et al.  Designing targeted libraries with genetic algorithms. , 2000, Journal of molecular graphics & modelling.

[83]  Tamar Schlick,et al.  An Efficient Projection Protocol for Chemical Databases: Singular Value Decomposition Combined with Truncated-Newton Minimization , 2000, J. Chem. Inf. Comput. Sci..

[84]  M. Wagener,et al.  Potential Drugs and Nondrugs: Prediction and Identification of Important Structural Features. , 2000 .

[85]  Jennifer L. Miller,et al.  Combinatorial Library Design: Maximizing Model-Fitting Compounds within Matrix Synthesis Constraints , 2000, J. Chem. Inf. Comput. Sci..

[86]  Tudor I. Oprea,et al.  Property distribution of drug-related chemical databases* , 2000, J. Comput. Aided Mol. Des..

[87]  Dimitris K. Agrafiotis,et al.  Nonlinear Mapping Networks , 2000, J. Chem. Inf. Comput. Sci..

[88]  David J. Livingstone,et al.  The Characterization of Chemical Structures Using Molecular Properties. A Survey , 2000, J. Chem. Inf. Comput. Sci..

[89]  Hong Li,et al.  Novel algorithms for the optimization of molecular diversity of combinatorial libraries11Color Plates for this article are on pages 533–536. , 2000 .

[90]  I. Muegge,et al.  Simple selection criteria for drug-like chemical matter. , 2001, Journal of medicinal chemistry.

[91]  Tim D. J. Perkins,et al.  Large-scale virtual screening for discovering leads in the postgenomic era , 2001, IBM Syst. J..

[92]  E. Fluder,et al.  Latent semantic structure indexing (LaSSI) for defining chemical similarity. , 2001, Journal of medicinal chemistry.

[93]  Victor S. Lobanov,et al.  High-Density Miniaturized Thermal Shift Assays as a General Strategy for Drug Discovery , 2001 .

[94]  Dimitris K. Agrafiotis,et al.  A Constant Time Algorithm for Estimating the Diversity of Large Chemical Libraries , 2001, J. Chem. Inf. Comput. Sci..

[95]  D K Agrafiotis,et al.  Combinatorial networks. , 2001, Journal of molecular graphics & modelling.

[96]  Jürgen Bajorath,et al.  Differential Shannon Entropy as a Sensitive Measure of Differences in Database Variability of Molecular Descriptors , 2001, J. Chem. Inf. Comput. Sci..

[97]  Dimitris K. Agrafiotis,et al.  Multidimensional scaling and visualization of large molecular similarity tables , 2001, J. Comput. Chem..

[98]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[99]  Dimitris K. Agrafiotis,et al.  Multidimensional scaling of combinatorial libraries without explicit enumeration , 2001, J. Comput. Chem..

[100]  D. Agrafiotis,et al.  Nonlinear mapping of massive data sets by fuzzy clustering and neural networks , 2001 .

[101]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[102]  Joseph D. Kwasnoski,et al.  High-density miniaturized thermal shift assays as a general strategy for drug discovery. , 2001, Journal of biomolecular screening.

[103]  Dimitris K. Agrafiotis,et al.  Multiobjective optimization of combinatorial libraries , 2002, J. Comput. Aided Mol. Des..

[104]  Dimitris K. Agrafiotis,et al.  A Fractal Approach for Selecting an Appropriate Bin Size for Cell-Based Diversity Estimation , 2002, J. Chem. Inf. Comput. Sci..

[105]  Robert P. Sheridan,et al.  The Most Common Chemical Replacements in Drug-Like Compounds , 2002, J. Chem. Inf. Comput. Sci..

[106]  A Novel Frequency Distribution Selection Method for Efficient Plate Layout of a Diverse Combinatorial Library. , 2002 .

[107]  Douglas J. Klein,et al.  Computing Wiener-Type Indices for Virtual Combinatorial Libraries Generated from Heteroatom-Containing Building Blocks. , 2002 .

[108]  D. Agrafiotis,et al.  Scalable methods for the construction and analysis of virtual combinatorial libraries. , 2002, Combinatorial chemistry & high throughput screening.

[109]  Dimitris K. Agrafiotis,et al.  Advances in diversity profiling and combinatorial series design , 2004, Molecular Diversity.