inSARa: Intuitive and Interactive SAR Interpretation by Reduced Graphs and Hierarchical MCS-Based Network Navigation

The analysis of Structure-Activity-Relationships (SAR) of small molecules is a fundamental task in drug discovery. Although a large number of methods are already published, there is still a strong need for novel intuitive approaches. The inSARa (intuitive networks for Structure-Activity Relationships analysis) method introduced herein takes advantage of the synergistic combination of reduced graphs (RG) and the intuitive maximum common substructure (MCS) concept. The main feature of the inSARa concept is a hierarchical network structure of clearly defined substructure relationships based on common pharmacophoric features. Thus, straightforward SAR interpretation is possible by interactive network navigation. When focusing on a set of active molecules at one single target, the resulting inSARa networks are shown to be valuable for various essential tasks in SAR analysis, such as the identification of activity cliffs or "activity switches", bioisosteric exchanges, common pharmacophoric features, or "SAR hotspots".

[1]  Anne Mai Wassermann,et al.  SARANEA: A Freely Available Program To Mine Structure-Activity and Structure-Selectivity Relationship Information in Compound Data Sets , 2010, J. Chem. Inf. Model..

[2]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[3]  Jürgen Bajorath,et al.  Exploration of structure-activity relationship determinants in analogue series. , 2009, Journal of medicinal chemistry.

[4]  Dimitris K Agrafiotis,et al.  SAR maps: a new SAR visualization technique for medicinal chemists. , 2007, Journal of medicinal chemistry.

[5]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[6]  John Bradshaw,et al.  Similarity Searching Using Reduced Graphs , 2003, J. Chem. Inf. Comput. Sci..

[7]  J. Bajorath,et al.  Data structures and computational tools for the extraction of SAR information from large compound sets. , 2010, Drug discovery today.

[8]  Gavin Harper,et al.  Evolving Interpretable Structure-Activity Relationship Models. 2. Using Multiobjective Optimization To Derive Multiple Models , 2008, J. Chem. Inf. Model..

[9]  Steven L. Teig,et al.  Chemical Function Queries for 3D Database Search , 1994, J. Chem. Inf. Comput. Sci..

[10]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[11]  Peter Willett,et al.  Use of Reduced Graphs To Encode Bioisosterism for Similarity-Based Virtual Screening , 2009, J. Chem. Inf. Model..

[12]  Xi Chen,et al.  The Binding Database: data management and interface design , 2002, Bioinform..

[13]  F. Sanz,et al.  Anchor-GRIND: filling the gap between standard 3D QSAR and the GRid-INdependent descriptors. , 2005 .

[14]  Jürgen Bajorath,et al.  Large-scale SAR analysis. , 2013, Drug discovery today. Technologies.

[15]  Daniel J. Warner,et al.  Matched molecular pairs as a medicinal chemistry tool. , 2011, Journal of medicinal chemistry.

[16]  Jürgen Bajorath,et al.  Introducing the LASSO graph for compound data set representation and structure-activity relationship analysis. , 2012, Journal of medicinal chemistry.

[17]  Sung Jin Cho,et al.  Visual exploration of structure–activity relationship using maximum common framework , 2008, J. Comput. Aided Mol. Des..

[18]  Peter Willett,et al.  Maximum common subgraph isomorphism algorithms for the matching of chemical structures , 2002, J. Comput. Aided Mol. Des..

[19]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[20]  Valerie J Gillet,et al.  Reduced graphs and their applications in chemoinformatics. , 2011, Methods in molecular biology.

[21]  Dimitris K. Agrafiotis,et al.  Enhanced SAR Maps: Expanding the Data Rendering Capabilities of a Popular Medicinal Chemistry Tool , 2009, J. Chem. Inf. Model..

[22]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents. 13. Reduced graph generation , 1991, J. Chem. Inf. Comput. Sci..

[23]  Mathias Wawer,et al.  Systematic Extraction of Structure–Activity Relationship Information from Biological Screening Data , 2009, ChemMedChem.

[24]  Samiul Hasan,et al.  Network analysis has diverse roles in drug discovery. , 2012, Drug discovery today.

[25]  J. Sutherland,et al.  A comparison of methods for modeling quantitative structure-activity relationships. , 2004, Journal of medicinal chemistry.

[26]  J. Woolfrey,et al.  Design and synthesis of glycolic and mandelic acid derivatives as factor Xa inhibitors. , 2001, Bioorganic & medicinal chemistry letters.

[27]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[28]  Rajarshi Guha,et al.  Structure-Activity Landscape Index: Identifying and Quantifying Activity Cliffs , 2008, J. Chem. Inf. Model..

[29]  Andrew I Su,et al.  HierS: hierarchical scaffold clustering using topological chemical graphs. , 2005, Journal of medicinal chemistry.

[30]  Mathias Wawer,et al.  Elucidation of structure-activity relationship pathways in biological screening data. , 2009, Journal of medicinal chemistry.

[31]  Gerald M. Maggiora,et al.  On Outliers and Activity Cliffs-Why QSAR Often Disappoints , 2006, J. Chem. Inf. Model..

[32]  J. Bajorath,et al.  Identification of target family directed bioisosteric replacements , 2011 .

[33]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[34]  Jürgen Bajorath,et al.  Assessing the Confidence Level of Public Domain Compound Activity Data and the Impact of Alternative Potency Measurements on SAR Analysis , 2011, J. Chem. Inf. Model..

[35]  Michael L. Creech,et al.  Integration of biological networks and gene expression data using Cytoscape , 2007, Nature Protocols.

[36]  Mathias Wawer,et al.  Similarity-Potency Trees: A Method to Search for SAR Information in Compound Data Sets and Derive SAR Rules , 2010, J. Chem. Inf. Model..

[37]  Peter Ertl,et al.  Bioisosteric Replacement and Scaffold Hopping in Lead Generation and Optimization , 2010, Molecular informatics.

[38]  J. Bajorath,et al.  Local structural changes, global data views: graphical substructure-activity relationship trailing. , 2011, Journal of medicinal chemistry.

[39]  Matthias Rarey,et al.  Feature trees: A new molecular similarity measure based on tree matching , 1998, J. Comput. Aided Mol. Des..

[40]  Peter Willett,et al.  Representing Clusters Using a Maximum Common Edge Substructure Algorithm Applied to Reduced Graphs and Molecular Graphs , 2007, J. Chem. Inf. Model..

[41]  J. Woolfrey,et al.  Design, synthesis, and SAR of substituted acrylamides as factor Xa inhibitors. , 2002, Bioorganic & Medicinal Chemistry Letters.

[42]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[43]  Fabio Zuccotto,et al.  Pharmacophore Features Distributions in Different Classes of Compounds , 2003, J. Chem. Inf. Comput. Sci..

[44]  John Figueras,et al.  Ring Perception Using Breadth-First Search , 1996, J. Chem. Inf. Comput. Sci..

[45]  Jeff Morris,et al.  Further Development of Reduced Graphs for Identifying Bioactive Compounds , 2003, J. Chem. Inf. Comput. Sci..

[46]  G. Harper,et al.  The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data. , 2004, Journal of chemical information and computer sciences.

[47]  Mathias Wawer,et al.  Navigating structure-activity landscapes. , 2009, Drug discovery today.

[48]  Stefan Wetzel,et al.  The Scaffold Tree - Visualization of the Scaffold Universe by Hierarchical Scaffold Classification , 2007, J. Chem. Inf. Model..

[49]  Peter Willett,et al.  Scaffold Hopping Using Clique Detection Applied to Reduced Graphs , 2006, J. Chem. Inf. Model..

[50]  J. Bajorath,et al.  SAR index: quantifying the nature of structure-activity relationships. , 2007, Journal of medicinal chemistry.

[51]  Lorenz M Mayr,et al.  Novel trends in high-throughput screening. , 2009, Current opinion in pharmacology.

[52]  Gert Thijs,et al.  Pharao: pharmacophore alignment and optimization. , 2008, Journal of molecular graphics & modelling.

[53]  Jürgen Bajorath,et al.  Methods for SAR visualization , 2012 .

[54]  J. Bajorath,et al.  Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices. , 2008, Journal of medicinal chemistry.

[55]  J. Bajorath,et al.  Activity landscape representations for structure-activity relationship analysis. , 2010, Journal of medicinal chemistry.

[56]  Gavin Harper,et al.  Evolving Interpretable Structure-Activity Relationships. 1. Reduced Graph Queries , 2008, J. Chem. Inf. Model..

[57]  Ian A. Watson,et al.  ErG: 2D Pharmacophore Descriptions for Scaffold Hopping , 2006, J. Chem. Inf. Model..

[58]  Gary D Bader,et al.  Displaying chemical information on a biological network using Cytoscape. , 2011, Methods in molecular biology.

[59]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[60]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[61]  P. Lam,et al.  Amino(methyl) pyrrolidines as novel scaffolds for factor Xa inhibitors. , 2007, Bioorganic & medicinal chemistry letters.

[62]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[63]  A. Hillisch,et al.  Entering the era of non-basic p1 site groups: discovery of Xarelto (Rivaroxaban). , 2010, Current topics in medicinal chemistry.

[64]  Ramesh Hariharan,et al.  MultiMCS: A Fast Algorithm for the Maximum Common Substructure Problem on Multiple Molecules , 2011, J. Chem. Inf. Model..

[65]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[66]  Xin Chen,et al.  Performance of Similarity Measures in 2D Fragment-Based Similarity Searching: Comparison of Structural Descriptors and Similarity Coefficients , 2002, J. Chem. Inf. Comput. Sci..

[67]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..