Erratum to: Design of chemical space networks using a Tanimoto similarity variant based upon maximum common substructures

Chemical space networks (CSNs) have recently been introduced as an alternative to other coordinate-free and coordinate-based chemical space representations. In CSNs, nodes represent compounds and edges pairwise similarity relationships. In addition, nodes are annotated with compound property information such as biological activity. CSNs have been applied to view biologically relevant chemical space in comparison to random chemical space samples and found to display well-resolved topologies at low edge density levels. The way in which molecular similarity relationships are assessed is an important determinant of CSN topology. Previous CSN versions were based on numerical similarity functions or the assessment of substructure-based similarity. Herein, we report a new CSN design that is based upon combined numerical and substructure similarity evaluation. This has been facilitated by calculating numerical similarity values on the basis of maximum common substructures (MCSs) of compounds, leading to the introduction of MCS-based CSNs (MCS-CSNs). This CSN design combines advantages of continuous numerical similarity functions with a robust and chemically intuitive substructure-based assessment. Compared to earlier version of CSNs, MCS-CSNs are characterized by a further improved organization of local compound communities as exemplified by the delineation of drug-like subspaces in regions of biologically relevant chemical space.

[1]  G. Maggiora,et al.  Molecular similarity in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[2]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[3]  Jürgen Bajorath,et al.  MMP-Cliffs: Systematic Identification of Activity Cliffs on the Basis of Matched Molecular Pairs , 2012, J. Chem. Inf. Model..

[4]  Naoki Tanaka,et al.  Small-World Phenomena in Chemical Library Networks: Application to Fragment-Based Drug Discovery , 2009, J. Chem. Inf. Model..

[5]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[6]  Jürgen Bajorath,et al.  Chemical space networks: a powerful new paradigm for the description of chemical space , 2014, Journal of Computer-Aided Molecular Design.

[7]  Andreas Noack,et al.  Modularity clustering is force-directed layout , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  W. Guida,et al.  The art and practice of structure‐based drug design: A molecular modeling perspective , 1996, Medicinal research reviews.

[9]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[10]  Veerabahu Shanmugasundaram,et al.  Molecular similarity measures. , 2011, Methods in molecular biology.

[11]  Tudor I. Oprea,et al.  Chemography: the Art of Navigating in Chemical Space , 2000 .

[12]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[13]  Jürgen Bajorath,et al.  Design and characterization of chemical space networks for different compound data sets , 2015, Journal of Computer-Aided Molecular Design.

[14]  Peter Willett,et al.  Promoting Access to White Rose Research Papers Effectiveness of Graph-based and Fingerprint-based Similarity Measures for Virtual Screening of 2d Chemical Structure Databases , 2022 .

[15]  María Martín,et al.  The Universal Protein Resource (UniProt) in 2010 , 2010 .

[16]  Jameed Hussain,et al.  Computationally Efficient Algorithm to Identify Matched Molecular Pairs (MMPs) in Large Data Sets , 2010, J. Chem. Inf. Model..

[17]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[18]  J. Bajorath,et al.  Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices. , 2008, Journal of medicinal chemistry.

[19]  K. M. Smith,et al.  Novel software tools for chemical diversity , 1998 .

[20]  C. Dobson Chemical space and biology , 2004, Nature.

[21]  Jens Sadowski,et al.  Structure Modification in Chemical Databases , 2005 .

[22]  Michael P. Krein,et al.  Exploration of the topology of chemical spaces with network measures. , 2011, The journal of physical chemistry. A.

[23]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  C. Harris,et al.  The Design and Application of Target-Focused Compound Libraries , 2011, Combinatorial chemistry & high throughput screening.

[25]  Alexander Tropsha,et al.  Using Graph Indices for the Analysis and Comparison of Chemical Datasets , 2013, Molecular informatics.

[26]  Derek Lowe Chemical space is big. Really big , 2015 .

[27]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[28]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[29]  Nikolai S. Zefirov,et al.  Progress in visual representations of chemical space , 2015, Expert opinion on drug discovery.

[30]  Jürgen Bajorath,et al.  Recent progress in understanding activity cliffs and their utility in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[31]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[32]  Jürgen Bajorath,et al.  Comparison of bioactive chemical space networks generated using substructure- and fingerprint-based measures of molecular similarity , 2015, Journal of Computer-Aided Molecular Design.