Prediction of 'drug-likeness'.

Recent developments in combinatorial chemistry and high-throughput screening have dramatically increased the scale on which drug discovery programs are carried out. Along with these advances has come a need for automated methods of determining which compounds from a library should be synthesized and screened. These methods range from simple counting schemes to sophisticated machine learning techniques such as neural networks. While many of these methods have performed well in validation studies, the field is still in its formative stage. This paper reviews a number of computational techniques for identifying drug-like molecules and examines challenges facing the field.

[1]  Robert Bywater,et al.  Improving the Odds in Discriminating "Drug-like" from "Non Drug-like" Compounds , 2000, J. Chem. Inf. Comput. Sci..

[2]  X. Lewell,et al.  Drug-motif-based diverse monomer selection: method and application in combinatorial chemistry. , 1997, Journal of molecular graphics & modelling.

[3]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[4]  Ajay,et al.  Designing libraries with CNS activity. , 1999, Journal of medicinal chemistry.

[5]  Dimitris K. Agrafiotis,et al.  Advances in diversity profiling and combinatorial series design , 2004, Molecular Diversity.

[6]  A. Ghose,et al.  A knowledge-based approach in designing combinatorial or medicinal chemistry libraries for drug discovery. 1. A qualitative and quantitative characterization of known drug databases. , 1999, Journal of combinatorial chemistry.

[7]  Markus Wagener,et al.  Potential Drugs and Nondrugs: Prediction and Identification of Important Structural Features , 2000, J. Chem. Inf. Comput. Sci..

[8]  S. Ekins,et al.  Pharmacophore and three-dimensional quantitative structure activity relationship methods for modeling cytochrome p450 active sites. , 2001, Drug metabolism and disposition: the biological fate of chemicals.

[9]  Robert D. Brown,et al.  Combinatorial library design for diversity, cost efficiency, and drug-like character. , 2000, Journal of molecular graphics & modelling.

[10]  Andrew R. Leach,et al.  Molecular Complexity and Its Impact on the Probability of Finding Leads for Drug Discovery , 2001, J. Chem. Inf. Comput. Sci..

[11]  I. Muegge,et al.  Simple selection criteria for drug-like chemical matter. , 2001, Journal of medicinal chemistry.

[12]  Darren V. S. Green,et al.  Implementation of a System for Reagent Selection and Library Enumeration, Profiling, and Design , 1999, J. Chem. Inf. Comput. Sci..

[13]  F. Lombardo,et al.  Computation of brain-blood partitioning of organic solutes via free energy calculations. , 1996, Journal of medicinal chemistry.

[14]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[15]  P. Andrews,et al.  Functional group contributions to drug-receptor interactions. , 1984, Journal of medicinal chemistry.

[16]  Gordon M. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions , 1999, J. Chem. Inf. Comput. Sci..

[17]  C. Lipinski Drug-like properties and the causes of poor solubility and poor permeability. , 2000, Journal of pharmacological and toxicological methods.

[18]  Robert P. Sheridan,et al.  PATTY: A Programmable Atom Typer and Language for Automatic Classification of Atoms in Molecular Databases. , 1994 .

[19]  G. Seibel,et al.  PICCOLO: a tool for combinatorial library design via multicriterion optimization. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[20]  Michael M. Hann,et al.  RECAP-Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry , 1998, J. Chem. Inf. Comput. Sci..

[21]  David J. Cummins,et al.  Molecular Diversity in Chemical Databases: Comparison of Medicinal Chemistry Knowledge Bases and Databases of Commercially Available Compounds , 1996, J. Chem. Inf. Comput. Sci..

[22]  Tudor I. Oprea,et al.  Is There a Difference between Leads and Drugs? A Historical Perspective , 2001, J. Chem. Inf. Comput. Sci..

[23]  J M Barnard,et al.  Use of Markush structure analysis techniques for descriptor generation and clustering of large combinatorial libraries. , 2000, Journal of molecular graphics & modelling.

[24]  G. Paderes,et al.  Efficient combinatorial filtering for desired molecular properties of reaction products. , 2000, Journal of molecular graphics & modelling.

[25]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[26]  H. Kubinyi,et al.  A scoring scheme for discriminating between drugs and nondrugs. , 1998, Journal of medicinal chemistry.

[27]  Jerry March,et al.  Advanced Organic Chemistry: Reactions, Mechanisms, and Structure , 1977 .

[28]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[29]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[30]  S. Hirono,et al.  Comparison of Reliability of log P Values for Drugs Calculated by Several Methods , 1994 .

[31]  P J Sinko Drug selection in early drug development: screening for acceptable pharmacokinetic properties using combined in vitro and computational approaches. , 1999, Current opinion in drug discovery & development.

[32]  G. Rishton Reactive compounds and in vitro false positives in HTS , 1997 .

[33]  Pickett,et al.  Computational methods for the prediction of 'drug-likeness' , 2000, Drug discovery today.

[34]  Ajay,et al.  Recognizing molecules with drug-like properties. , 1999, Current opinion in chemical biology.

[35]  John Bradshaw,et al.  Identification of Biological Activity Profiles Using Substructural Analysis and Genetic Algorithms , 1998, J. Chem. Inf. Comput. Sci..

[36]  William H. Press,et al.  Numerical recipes in C , 2002 .

[37]  G. Bemis,et al.  Properties of known drugs. 2. Side chains. , 1999, Journal of medicinal chemistry.

[38]  David E. Clark,et al.  Enhancing the Hit-to-Lead Properties of Lead Optimization Libraries , 2000, J. Chem. Inf. Comput. Sci..

[39]  Bruce L. Bush,et al.  PATTY: A programmable atom type and language for automatic classification of atoms in molecular databases , 1993, J. Chem. Inf. Comput. Sci..

[40]  Luhua Lai,et al.  Structural Features of Toxic Chemicals for Specific Toxicity , 1999, J. Chem. Inf. Comput. Sci..

[41]  S. Hirono,et al.  Simple Method of Calculating Octanol/Water Partition Coefficient. , 1992 .

[42]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[43]  I. Kuntz,et al.  The maximal affinity of ligands. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Mark A. Murcko,et al.  Virtual screening : an overview , 1998 .

[45]  Darren V. S. Green,et al.  Selecting Combinatorial Libraries to Optimize Diversity and Physical Properties , 1999, J. Chem. Inf. Comput. Sci..

[46]  Jun Xu,et al.  Drug-like Index: A New Approach To Measure Drug-like Compounds and Their Diversity , 2000, J. Chem. Inf. Comput. Sci..

[47]  A. Ghose,et al.  Prediction of Hydrophobic (Lipophilic) Properties of Small Organic Molecules Using Fragmental Methods: An Analysis of ALOGP and CLOGP Methods , 1998 .

[48]  Tudor I. Oprea,et al.  Property distribution of drug-related chemical databases* , 2000, J. Comput. Aided Mol. Des..

[49]  Brian Hudson,et al.  Strategic Pooling of Compounds for High-Throughput Screening , 1999, J. Chem. Inf. Comput. Sci..

[50]  Ajay,et al.  Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? , 1998, Journal of medicinal chemistry.

[51]  Sholom M. Weiss,et al.  Computer Systems That Learn , 1990 .

[52]  J. Blake,et al.  Chemoinformatics - predicting the physicochemical properties of 'drug-like' molecules. , 2000, Current opinion in biotechnology.