Privileged Structural Motif Detection and Analysis Using Generative Topographic Maps

Identification of "privileged structural motifs" associated with specific target families is of particular importance for designing novel bioactive compounds. Here, we demonstrate that they can be extracted from a data distribution represented on a two-dimensional map obtained by Generative Topographic Mapping (GTM). In GTM, structurally related molecules are grouped together on the map. Zones of the map preferentially populated by target-specific compounds were delineated, which helped to capture common substructures on the basis of which these compounds were grouped together by GTM. Such privileged structural motifs were identified across three major target superfamilies including proteases, kinases, and G protein coupled receptors. Traditionally, the search for privileged structural motifs focused on scaffolds, whereas motifs were detected here without prior knowledge of compound classification in GTMs. This alternative way of navigating medicinal chemistry space further extends the classical, scaffold-centric approach. Importantly, detected motifs might also comprise fuzzy sets of similar scaffolds, pharmacophore-like patterns, or, by contrast, well-defined scaffolds with specific substituent patterns.

[1]  B. E. Evans,et al.  Methods for drug discovery: development of potent, selective, orally effective cholecystokinin antagonists. , 1988, Journal of medicinal chemistry.

[2]  D. Hawkins,et al.  Analysis of a 2(9) full factorial chemical library. , 1995, Journal of medicinal chemistry.

[3]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[4]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[5]  S. Young,et al.  Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning. , 2000 .

[6]  M. Zhang,et al.  The crystal structures of human α‐thrombin complexed with active site‐directed diamino benzo[b] thiophene derivatives: A binding mode for a structurally novel class of inhibitors , 2008, Protein science : a publication of the Protein Society.

[7]  S. Loening,et al.  The new synthetic matrix metalloproteinase inhibitor (Roche 28-2653) reduces tumor growth and prolongs survival in a prostate cancer standard rat model , 2002, Oncogene.

[8]  J. Adams Activation loop phosphorylation and catalysis in protein kinases: is there functional evidence for the autoinhibitor model? , 2003, Biochemistry.

[9]  Alison R. Gregro,et al.  Structure-based design of potent and selective cell-permeable inhibitors of human beta-secretase (BACE-1). , 2004, Journal of medicinal chemistry.

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  T. I. Richardson,et al.  Synthesis and structure-activity relationships of novel arylpiperazines as potent and selective agonists of the melanocortin subtype-4 receptor. , 2004, Journal of medicinal chemistry.

[12]  K. O'Byrne,et al.  Phase III study of matrix metalloproteinase inhibitor prinomastat in non-small-cell lung cancer. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[13]  J. R. Somoza,et al.  Design and synthesis of tri-ring P3 benzamide-containing aminonitriles as potent, selective, orally effective inhibitors of cathepsin K. , 2005, Journal of medicinal chemistry.

[14]  J. Irwin,et al.  ZINC ? A Free Database of Commercially Available Compounds for Virtual Screening. , 2005 .

[15]  W. Greenlee,et al.  Discovery of orally efficacious melanin-concentrating hormone receptor-1 antagonists as antiobesity agents. Synthesis, SAR, and biological evaluation of bicyclo[3.1.0]hexyl ureas. , 2006, Journal of medicinal chemistry.

[16]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..

[17]  C. Varaprasad,et al.  Discovery of 3-hydroxy-4-carboxyalkylamidino-5-arylamino-isothiazoles as potent MEK1 inhibitors. , 2006, Bioorganic & medicinal chemistry letters.

[18]  B. Cravatt,et al.  Metalloprotease Inhibitors GM6001 and TAPI-0 Inhibit the Obligate Intracellular Human Pathogen Chlamydia trachomatis by Targeting Peptide Deformylase of the Bacterium* , 2006, Journal of Biological Chemistry.

[19]  D. Fourches,et al.  Successful “In Silico” Design of New Efficient Uranyl Binders , 2007 .

[20]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[21]  I. Tetko,et al.  ISIDA - Platform for Virtual Screening Based on Fragment and Pharmacophoric Descriptors , 2008 .

[22]  J. Wikberg,et al.  Targeting melanocortin receptors: an approach to treat weight disorders and sexual dysfunction , 2008, Nature Reviews Drug Discovery.

[23]  W. Metzler,et al.  Involvement of DPP‐IV catalytic residues in enzyme–saxagliptin complex formation , 2008, Protein science : a publication of the Protein Society.

[24]  P. Clark,et al.  MF498 [N-{[4-(5,9-Diethoxy-6-oxo-6,8-dihydro-7H-pyrrolo[3,4-g]quinolin-7-yl)-3-methylbenzyl]sulfonyl}-2-(2-methoxyphenyl)acetamide], a Selective E Prostanoid Receptor 4 Antagonist, Relieves Joint Inflammation and Pain in Rodent Models of Rheumatoid and Osteoarthritis , 2008, Journal of Pharmacology and Experimental Therapeutics.

[25]  K. Shokat,et al.  Targeted polypharmacology: Discovery of dual inhibitors of tyrosine and phosphoinositide kinases , 2008, Nature chemical biology.

[26]  Heather B. Miller,et al.  Potent memapsin 2 (beta-secretase) inhibitors: design, synthesis, protein-ligand X-ray structure, and in vivo evaluation. , 2008, Bioorganic & medicinal chemistry letters.

[27]  Thorsten Meinl,et al.  KNIME - the Konstanz information miner: version 2.0 and beyond , 2009, SKDD.

[28]  F. Fleming,et al.  Nitrile-containing pharmaceuticals: efficacious roles of the nitrile pharmacophore. , 2010, Journal of medicinal chemistry.

[29]  D. Horvath,et al.  ISIDA Property‐Labelled Fragment Descriptors , 2010, Molecular informatics.

[30]  R. Stevens,et al.  Structure of an Agonist-Bound Human A2A Adenosine Receptor , 2011, Science.

[31]  Gilles Marcou,et al.  Local neighborhood behavior in a combinatorial library context , 2011, J. Comput. Aided Mol. Des..

[32]  J. Christensen,et al.  Structure based drug design of crizotinib (PF-02341066), a potent and selective dual inhibitor of mesenchymal-epithelial transition factor (c-MET) kinase and anaplastic lymphoma kinase (ALK). , 2011, Journal of medicinal chemistry.

[33]  Héléna A. Gaspar,et al.  Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure‐Activity Modeling and Dataset Comparison , 2012, Molecular informatics.

[34]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[35]  Alban Arrault,et al.  Generative Topographic Mapping-Based Classification Models and Their Applicability Domain: Application to the Biopharmaceutics Drug Disposition Classification System (BDDCS) , 2013, J. Chem. Inf. Model..

[36]  M. Radi,et al.  Design, synthesis, and biological evaluation of pyrazolo[3,4-d]pyrimidines active in vivo on the Bcr-Abl T315I mutant. , 2013, Journal of medicinal chemistry.

[37]  Olivier Barré,et al.  Cleavage Specificity Analysis of Six Type II Transmembrane Serine Proteases (TTSPs) Using PICS with Proteome-Derived Peptide Libraries , 2014, PloS one.

[38]  L. Juillerat-Jeanneret Dipeptidyl peptidase IV and its inhibitors: therapeutics for type 2 diabetes and what else? , 2014, Journal of medicinal chemistry.

[39]  R. Vandenbroucke,et al.  Is there new hope for therapeutic matrix metalloproteinase inhibition? , 2014, Nature Reviews Drug Discovery.

[40]  Gilles Marcou,et al.  An Evolutionary Optimizer of libsvm Models , 2014 .

[41]  Yanli Wang,et al.  PubChem BioAssay: 2014 update , 2013, Nucleic Acids Res..

[42]  Igor I. Baskin,et al.  Stargate GTM: Bridging Descriptor and Activity Spaces , 2015, J. Chem. Inf. Model..

[43]  Dragos Horvath,et al.  Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds , 2015, Journal of Computer-Aided Molecular Design.

[44]  Héléna A. Gaspar,et al.  GTM‐Based QSAR Models and Their Applicability Domains , 2015, Molecular informatics.

[45]  Andreas Holzinger,et al.  Data Mining with Decision Trees: Theory and Applications , 2015, Online Inf. Rev..

[46]  Gilles Marcou,et al.  Chemical Space Mapping and Structure-Activity Analysis of the ChEMBL Antiviral Compound Set , 2016, J. Chem. Inf. Model..