Molecular similarity in medicinal chemistry.

Similarity is a subjective and multifaceted concept, regardless of whether compounds or any other objects are considered. Despite its intrinsically subjective nature, attempts to quantify the similarity of compounds have a long history in chemical informatics and drug discovery. Many computational methods employ similarity measures to identify new compounds for pharmaceutical research. However, chemoinformaticians and medicinal chemists typically perceive similarity in different ways. Similarity methods and numerical readouts of similarity calculations are probably among the most misunderstood computational approaches in medicinal chemistry. Herein, we evaluate different similarity concepts, highlight key aspects of molecular similarity analysis, and address some potential misunderstandings. In addition, a number of practical aspects concerning similarity calculations are discussed.

[1]  A. Tversky Features of Similarity , 1977 .

[2]  K. Sen,et al.  Molecular Similarity I , 1995 .

[3]  Robert D Clark,et al.  Neighborhood behavior: a useful concept for validation of "molecular diversity" descriptors. , 1996, Journal of medicinal chemistry.

[4]  M. Lajiness Dissimilarity-based compound selection techniques , 1996 .

[5]  Yvonne C. Martin,et al.  The Information Content of 2D and 3D Structural Descriptors Relevant to Ligand-Receptor Binding , 1997, J. Chem. Inf. Comput. Sci..

[6]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[7]  J. C. BurgesChristopher A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[8]  Darren R. Flower,et al.  On the Properties of Bit String-Based Measures of Chemical Similarity , 1998, J. Chem. Inf. Comput. Sci..

[9]  James T. Becker,et al.  Working Memory(s) , 1999, Brain and Cognition.

[10]  S. Young,et al.  Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning. , 2000 .

[11]  A. Good,et al.  3-D pharmacophores in drug discovery. , 2001, Current pharmaceutical design.

[12]  Andrew C. Good,et al.  Explicit Calculation of 3D Molecular Similarity , 2002 .

[13]  Peter Willett,et al.  Maximum common subgraph isomorphism algorithms for the matching of chemical structures , 2002, J. Comput. Aided Mol. Des..

[14]  Y. Martin,et al.  Do structurally similar molecules have similar biological activity? , 2002, Journal of medicinal chemistry.

[15]  Hugo Kubinyi,et al.  Similarity and Dissimilarity: A Medicinal Chemist’s View , 2002 .

[16]  Joseph S. Verducci,et al.  A Modification of the Jaccard–Tanimoto Similarity Index for Diverse Selection of Chemical Compounds Using Binary Strings , 2002, Technometrics.

[17]  J. Gasteiger,et al.  Neural networks as data mining tools in drug design , 2003 .

[18]  Susumu Yamanobe,et al.  Development of a Method for Evaluating Drug‐Likeness and Ease of Synthesis Using a Data Set in which Compounds Are Assigned Scores Based on Chemists′ Intuition. , 2003 .

[19]  Michael S Lajiness,et al.  Assessment of the consistency of medicinal chemists in reviewing sets of compounds. , 2004, Journal of medicinal chemistry.

[20]  G. Maggiora,et al.  Molecular similarity measures. , 2004, Methods in molecular biology.

[21]  R. Glen,et al.  Molecular similarity: a key technique in molecular informatics. , 2004, Organic & biomolecular chemistry.

[22]  J. A. Grant,et al.  A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction. , 2005, Journal of medicinal chemistry.

[23]  A. Fliri,et al.  Biological spectra analysis: Linking biological activity profiles to molecular structure. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[24]  G. Schneider,et al.  Scaffold‐Hopping Potential of Ligand‐Based Similarity Concepts , 2006, ChemMedChem.

[25]  Gerald M. Maggiora,et al.  On Outliers and Activity Cliffs-Why QSAR Often Disappoints , 2006, J. Chem. Inf. Model..

[26]  Peter Willett,et al.  Similarity-based virtual screening using 2D fingerprints. , 2006, Drug discovery today.

[27]  W. Kolanus Guanine nucleotide exchange factors of the cytohesin family and their roles in signal transduction , 2007, Immunological reviews.

[28]  Robert P Sheridan,et al.  Chemical similarity searches: when is complexity justified? , 2007, Expert opinion on drug discovery.

[29]  Jürgen Bajorath,et al.  Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. , 2007, Drug discovery today.

[30]  Jürgen Bajorath,et al.  Apparent Asymmetry in Fingerprint Similarity Searching is a Direct Consequence of Differences in Bit Densities and Molecular Size , 2007, ChemMedChem.

[31]  Jürgen Bajorath,et al.  Emerging Chemical Patterns: A New Methodology for Molecular Classification and Compound Selection. , 2007 .

[32]  Robert P. Sheridan,et al.  Comparison of Topological, Shape, and Docking Methods in Virtual Screening. , 2007 .

[33]  Valerie J Gillet,et al.  A Comparison of Field-Based Similarity Searching Methods: CatShape, FBSS, and ROCS , 2008, J. Chem. Inf. Model..

[34]  N. Cowan What are the differences between long-term, short-term, and working memory? , 2008, Progress in brain research.

[35]  Herbert Köppen Virtual screening - what does it give us? , 2009, Current opinion in drug discovery & development.

[36]  Gary Tresadern,et al.  Modeling approaches for ligand-based 3D similarity. , 2010, Future medicinal chemistry.

[37]  Andreas Bender,et al.  How similar are those molecules after all? Use two descriptors and you will have three different answers , 2010, Expert opinion on drug discovery.

[38]  J. Bajorath,et al.  Scaffold hopping using two-dimensional fingerprints: true potential, black magic, or a hopeless endeavor? Guidelines for virtual screening. , 2010, Journal of medicinal chemistry.

[39]  J. Bajorath,et al.  Global assessment of scaffold hopping potential for current pharmaceutical targets , 2010 .

[40]  Jürgen Bajorath,et al.  Advanced fingerprint methods for similarity searching: balancing molecular complexity effects. , 2010, Combinatorial chemistry & high throughput screening.

[41]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[42]  Hanna Geppert,et al.  Targeting multifunctional proteins by virtual screening: structurally diverse cytohesin inhibitors with differentiated biological functions. , 2010, ACS chemical biology.

[43]  J. Bajorath,et al.  Activity landscape representations for structure-activity relationship analysis. , 2010, Journal of medicinal chemistry.

[44]  Jürgen Bajorath,et al.  Rendering Conventional Molecular Fingerprints for Virtual Screening Independent of Molecular Complexity and Size Effects , 2010, ChemMedChem.

[45]  Jürgen Bajorath,et al.  Similarity searching , 2011 .

[46]  Valerie J. Gillet,et al.  Diversity selection algorithms , 2011 .

[47]  Peter S. Kutchukian,et al.  Rethinking molecular similarity: comparing compounds on the basis of biological activity. , 2012, ACS chemical biology.

[48]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[49]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[50]  Ulrike Hahn,et al.  Similarity-based asymmetries in perceptual matching. , 2012, Acta psychologica.

[51]  Meir Glick,et al.  Inside the Mind of a Medicinal Chemist: The Role of Human Bias in Compound Prioritization during Drug Discovery , 2012, PloS one.

[52]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[53]  José L. Medina-Franco,et al.  MOLECULAR SIMILARITY ANALYSIS , 2013 .

[54]  Jürgen Bajorath,et al.  Quantifying the Fingerprint Descriptor Dependence of Structure-Activity Relationship Information on a Large Scale , 2013, J. Chem. Inf. Model..

[55]  J. Bajorath,et al.  Compound promiscuity: what can we learn from current data? , 2013, Drug discovery today.

[56]  Peter Willett,et al.  Combination of Similarity Rankings Using Data Fusion , 2013, J. Chem. Inf. Model..

[57]  Jürgen Bajorath,et al.  Recent progress in understanding activity cliffs and their utility in medicinal chemistry. , 2014, Journal of medicinal chemistry.