Molecular Similarity Concepts for Informatics Applications.

The assessment of small molecule similarity is a central task in chemoinformatics and medicinal chemistry. A variety of molecular representations and metrics are applied to computationally evaluate and quantify molecular similarity. A critically important aspect of molecular similarity analysis in chemoinformatics and pharmaceutical research is that one is typically not interested in quantifying the degree of structural or chemical similarity between compounds per se, but rather in extrapolating from molecular similarity to property similarity. In other words, one assumes that there is a correlation between calculated similarity and specific properties of small molecules including, first and foremost, biological activities. Although similarity is a priori a subjective concept, and difficult to quantify, it must computationally be assessed in a formally consistent manner. Otherwise, there is little utility of similarity calculations. Consistent treatment requires approximations to be made and the consideration of alternative computational similarity concepts, as discussed herein.

[1]  Jürgen Bajorath,et al.  Molecular similarity concepts and search calculations. , 2008, Methods in molecular biology.

[2]  Pierre Acklin,et al.  Similarity Metrics for Ligands Reflecting the Similarity of the Target Proteins , 2003, J. Chem. Inf. Comput. Sci..

[3]  Peter Willett,et al.  Dissimilarity-Based Algorithms for Selecting Structurally Diverse Sets of Compounds , 1999, J. Comput. Biol..

[4]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[5]  Veerabahu Shanmugasundaram,et al.  Molecular similarity measures. , 2011, Methods in molecular biology.

[6]  G. Maggiora,et al.  Molecular similarity in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[7]  P. Beroza,et al.  A rapid computational method for lead evolution: description and application to alpha(1)-adrenergic antagonists. , 2000, Journal of medicinal chemistry.

[8]  R. Glen,et al.  Molecular similarity: a key technique in molecular informatics. , 2004, Organic & biomolecular chemistry.

[9]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[10]  A. Tversky Features of Similarity , 1977 .

[11]  Jürgen Bajorath,et al.  Partitioning methods for the identification of active molecules. , 2003, Current medicinal chemistry.

[12]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[13]  Michael S Lajiness,et al.  Assessment of the consistency of medicinal chemists in reviewing sets of compounds. , 2004, Journal of medicinal chemistry.

[14]  Joseph S. Verducci,et al.  A Modification of the Jaccard–Tanimoto Similarity Index for Diverse Selection of Chemical Compounds Using Binary Strings , 2002, Technometrics.

[15]  Kathrin Heikamp,et al.  Large-Scale Similarity Search Profiling of ChEMBL Compound Data Sets , 2011, J. Chem. Inf. Model..

[16]  Jürgen Bajorath,et al.  Rendering Conventional Molecular Fingerprints for Virtual Screening Independent of Molecular Complexity and Size Effects , 2010, ChemMedChem.

[17]  Jürgen Bajorath,et al.  Selected Concepts and Investigations in Compound Classification, Molecular Descriptor Analysis, and Virtual Screening , 2001, J. Chem. Inf. Comput. Sci..

[18]  P. Gund Three-Dimensional Pharmacophoric Pattern Searching , 1977 .

[19]  J. A. Grant,et al.  A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction. , 2005, Journal of medicinal chemistry.

[20]  Y. Martin Diverse viewpoints on computational aspects of molecular diversity. , 2001, Journal of combinatorial chemistry.

[21]  John M. Barnard,et al.  Substructure searching methods: Old and new , 1993, J. Chem. Inf. Comput. Sci..

[22]  J. Bajorath,et al.  Scaffold hopping using two-dimensional fingerprints: true potential, black magic, or a hopeless endeavor? Guidelines for virtual screening. , 2010, Journal of medicinal chemistry.