SimilarityLab: Molecular Similarity for SAR Exploration and Target Prediction on the Web

Exploration of chemical space around hit, experimental, and known active compounds is an important step in the early stages of drug discovery. In academia, where access to chemical synthesis efforts is restricted in comparison to the pharma-industry, hits from primary screens are typically followed up through purchase and testing of similar compounds, before further funding is sought to begin medicinal chemistry efforts. Rapid exploration of druglike similars and structure–activity relationship profiles can be achieved through our new webservice SimilarityLab. In addition to searching for commercially available molecules similar to a query compound, SimilarityLab also enables the search of compounds with recorded activities, generating consensus counts of activities, which enables target and off-target prediction. In contrast to other online offerings utilizing the USRCAT similarity measure, SimilarityLab’s set of commercially available small molecules is consistently updated, currently containing over 12.7 million unique small molecules, and not relying on published databases which may be many years out of date. This ensures researchers have access to up-to-date chemistries and synthetic processes enabling greater diversity and access to a wider area of commercial chemical space. All source code is available in the SimilarityLab source repository.

[1]  R. M. Owen,et al.  An analysis of the attrition of drug candidates from four major pharmaceutical companies , 2015, Nature Reviews Drug Discovery.

[2]  Kun-Yi Hsin,et al.  EDULISS: a small-molecule database with data-mining and pharmacophore searching capabilities , 2010, Nucleic Acids Res..

[3]  Jürgen Bajorath,et al.  Evolving Concept of Activity Cliffs , 2019, ACS omega.

[4]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[5]  Charlotte M. Deane,et al.  Freely Available Conformer Generation Methods: How Good Are They? , 2012, J. Chem. Inf. Model..

[6]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[7]  Tom L. Blundell,et al.  USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints , 2012, Journal of Cheminformatics.

[8]  Gisbert Schneider,et al.  Scaffold‐Hopping: How Far Can You Jump? , 2006 .

[9]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[10]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[11]  Jürgen Bajorath,et al.  Representation and identification of activity cliffs , 2017, Expert opinion on drug discovery.

[12]  A. Aronov,et al.  Drug discovery effectiveness from the standpoint of therapeutic mechanisms and indications , 2017, Nature Reviews Drug Discovery.

[13]  Xi Jin,et al.  Kekule.js: An Open Source JavaScript Chemoinformatics Toolkit , 2016, J. Chem. Inf. Model..

[14]  J. Bajorath,et al.  Activity landscape representations for structure-activity relationship analysis. , 2010, Journal of medicinal chemistry.

[15]  M. Stahl,et al.  Scaffold hopping. , 2004, Drug discovery today. Technologies.

[16]  E. Perl Causalgia, pathological pain, and adrenergic receptors. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Rajarshi Guha,et al.  Structure-Activity Landscape Index: Identifying and Quantifying Activity Cliffs , 2008, J. Chem. Inf. Model..

[18]  W. Tong,et al.  Quantitative structure‐activity relationship methods: Perspectives on drug discovery and toxicology , 2003, Environmental toxicology and chemistry.

[19]  Kwong-Sak Leung,et al.  USR-VS: a web server for large-scale prospective virtual screening using ultrafast shape recognition techniques , 2016, Nucleic Acids Res..

[20]  Douglas R. Houston,et al.  UFSRAT: Ultra-Fast Shape Recognition with Atom Types –The Discovery of Novel Bioactive Small Molecular Scaffolds for FKBP12 and 11βHSD1 , 2015, PloS one.

[21]  Jean-Louis Reymond,et al.  SmilesDrawer: Parsing and Drawing SMILES-Encoded Molecular Structures Using Client-Side JavaScript , 2018, J. Chem. Inf. Model..

[22]  A. Hopfinger,et al.  Methods for applying the quantitative structure-activity relationship paradigm. , 2004, Methods in molecular biology.