Molecular similarity concepts and search calculations.

The introduction of molecular similarity analysis in the early 1990s has catalyzed the development of many small-molecule-based similarity methods to mine large compound databases for novel active molecules. These efforts have profoundly influenced the field of computer-aided drug discovery and substantially widened the spectrum of available ligand-based virtual screening approaches. However, the principles underlying the computational assessment of molecular similarity are much more multifaceted and complex than it might appear at first glance. Accordingly, intrinsic features of molecular similarity analysis and its relationship to other methods are often not well understood. This chapter discusses critical aspects of molecular similarity, an understanding of which is essential for the evaluation of method development in this field. Then it describes studies designed to enhance the performance of molecular fingerprint searching, which is one of the most intuitive and widely used similarity-based methods.

[1]  Robert J. Jilek,et al.  "Lead hopping". Validation of topomer similarity as a superior predictor of similar biological activities. , 2004, Journal of medicinal chemistry.

[2]  Hugo Kubinyi,et al.  Similarity and Dissimilarity: A Medicinal Chemist’s View , 2002 .

[3]  Andreas Bender,et al.  A Discussion of Measures of Enrichment in Virtual Screening: Comparing the Information Content of Descriptors with Increasing Levels of Sophistication , 2005, J. Chem. Inf. Model..

[4]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[5]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[6]  D. Agrafiotis,et al.  Combinatorial informatics in the post-genomics era , 2002, Nature Reviews Drug Discovery.

[7]  John Bradshaw,et al.  Similarity Searching Using Reduced Graphs , 2003, J. Chem. Inf. Comput. Sci..

[8]  Jürgen Bajorath,et al.  POT-DMC: A virtual screening method for the identification of potent hits. , 2004, Journal of medicinal chemistry.

[9]  Peter Willett,et al.  Dissimilarity-Based Algorithms for Selecting Structurally Diverse Sets of Compounds , 1999, J. Comput. Biol..

[10]  L Xue,et al.  Searching for molecules with similar biological activity: analysis by fingerprint profiling. , 2000, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[11]  J. Bajorath,et al.  Mini-fingerprints for virtual screening: Design principles and generation of novel prototypes based on information theory , 2003, SAR and QSAR in environmental research.

[12]  Y. Martin Diverse viewpoints on computational aspects of molecular diversity. , 2001, Journal of combinatorial chemistry.

[13]  Jürgen Bajorath,et al.  Design and Evaluation of a Molecular Fingerprint Involving the Transformation of Property Descriptor Values into a Binary Classification Scheme , 2003, J. Chem. Inf. Comput. Sci..

[14]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[15]  Andreas Bender,et al.  Similarity Searching of Chemical Databases Using Atom Environment Descriptors (MOLPRINT 2D): Evaluation of Performance , 2004, J. Chem. Inf. Model..

[16]  John M. Barnard,et al.  Chemical Fragment Generation and Clustering Software , 1997, J. Chem. Inf. Comput. Sci..

[17]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[18]  Jürgen Bajorath,et al.  Partitioning methods for the identification of active molecules. , 2003, Current medicinal chemistry.

[19]  Jürgen Bajorath,et al.  Integration of virtual and high-throughput screening , 2002, Nature Reviews Drug Discovery.

[20]  J. Bajorath,et al.  Identification of structurally diverse growth hormone secretagogue agonists by virtual screening and structure-activity relationship analysis of 2-formylaminoacetamide derivatives. , 2004, Journal of medicinal chemistry.

[21]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[22]  Jürgen Bajorath,et al.  Chemical Descriptors with Distinct Levels of Information Content and Varying Sensitivity to Differences between Selected Compound Databases Identified by SE-DSE Analysis , 2002, J. Chem. Inf. Comput. Sci..

[23]  Jürgen Bajorath,et al.  Profile Scaling Increases the Similarity Search Performance of Molecular Fingerprints Containing Numerical Descriptors and Structural Keys , 2003, J. Chem. Inf. Comput. Sci..

[24]  Pierre Acklin,et al.  Similarity Metrics for Ligands Reflecting the Similarity of the Target Proteins , 2003, J. Chem. Inf. Comput. Sci..

[25]  P. Willett,et al.  Enhancing the effectiveness of similarity-based virtual screening using nearest-neighbor information. , 2005, Journal of medicinal chemistry.

[26]  P. Beroza,et al.  A rapid computational method for lead evolution: description and application to alpha(1)-adrenergic antagonists. , 2000, Journal of medicinal chemistry.

[27]  Yvonne C. Martin,et al.  Use of Structure-Activity Data To Compare Structure-Based Clustering Methods and Descriptors for Use in Compound Selection , 1996, J. Chem. Inf. Comput. Sci..

[28]  John M. Barnard,et al.  Substructure searching methods: Old and new , 1993, J. Chem. Inf. Comput. Sci..

[29]  A. Hopfinger,et al.  Methods for applying the quantitative structure-activity relationship paradigm. , 2004, Methods in molecular biology.

[30]  C E Berkoff,et al.  Substructural analysis. A novel approach to the problem of drug design. , 1974, Journal of medicinal chemistry.

[31]  Jürgen Bajorath,et al.  Accurate Partitioning of Compounds Belonging to Diverse Activity Classes , 2002, J. Chem. Inf. Comput. Sci..

[32]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[33]  Jürgen Bajorath,et al.  Selected Concepts and Investigations in Compound Classification, Molecular Descriptor Analysis, and Virtual Screening , 2001, J. Chem. Inf. Comput. Sci..