Detection of Binding Site Molecular Interaction Field Similarities

Protein binding-site similarity detection methods can be used to predict protein function and understand molecular recognition, as a tool in drug design for drug repurposing and polypharmacology, and for the prediction of the molecular determinants of drug toxicity. Here, we present IsoMIF, a method able to identify binding site molecular interaction field similarities across protein families. IsoMIF utilizes six chemical probes and the detection of subgraph isomorphisms to identify geometrically and chemically equivalent sections of protein cavity pairs. The method is validated using six distinct data sets, four of those previously used in the validation of other methods. The mean area under the receiver operator curve (AUC) obtained across data sets for IsoMIF is higher than those of other methods. Furthermore, while IsoMIF obtains consistently high AUC values across data sets, other methods perform more erratically across data sets. IsoMIF can be used to predict function from structure, to detect potential cross-reactivity or polypharmacology targets, and to help suggest bioisosteric replacements to known binding molecules. Given that IsoMIF detects spatial patterns of molecular interaction field similarities, its predictions are directly related to pharmacophores and may be readily translated into modeling decisions in structure-based drug design. IsoMIF may in principle detect similar binding sites with distinct amino acid arrangements that lead to equivalent interactions within the cavity. The source code to calculate and visualize MIFs and MIF similarities are freely available.

[1]  Timo Krotzky,et al.  Extraction of Protein Binding Pockets in Close Neighborhood of Bound Ligands Makes Comparisons Simple Due to Inherent Shape Similarity , 2014, J. Chem. Inf. Model..

[2]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[3]  Rafael Najmanovich,et al.  Side-chain rotamer changes upon ligand binding: common, crucial, correlate with entropy and rearrange hydrogen bonding , 2012, Bioinform..

[4]  S. Strittmatter,et al.  Overcoming Drug Development Bottlenecks With Repurposing: Old drugs learn new tricks , 2014, Nature Medicine.

[5]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[6]  Lei Xie,et al.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments , 2008, Proceedings of the National Academy of Sciences.

[7]  Valérie Campagna-Slater,et al.  Structural Chemistry of the Histone Methyltransferases Cofactor Binding Site , 2011, J. Chem. Inf. Model..

[8]  Gabriele Cruciani,et al.  A Common Reference Framework for Analyzing/Comparing Proteins and Ligands. Fingerprints for Ligands And Proteins (FLAP): Theory and Application , 2007, J. Chem. Inf. Model..

[9]  Jean-Philippe Vert,et al.  A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction , 2010, BMC Bioinformatics.

[10]  Janet M. Thornton,et al.  Detection of 3D atomic similarities and their use in the discrimination of small molecule protein-binding sites , 2008, ECCB.

[11]  Jie Li,et al.  PDB-wide collection of binding data: current status of the PDBbind database , 2015, Bioinform..

[12]  William L. Jorgensen,et al.  Journal of Chemical Information and Modeling , 2005, J. Chem. Inf. Model..

[13]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[14]  Janet M. Thornton,et al.  Analysis of binding site similarity, small-molecule similarity and experimental binding profiles in the human cytosolic sulfotransferase family , 2007, Bioinform..

[15]  Paul Labute,et al.  Pocket Similarity: Are α Carbons Enough? , 2010, J. Chem. Inf. Model..

[16]  G. Klebe,et al.  Unexpected nanomolar inhibition of carbonic anhydrase by COX-2-selective celecoxib: new pharmacological opportunities due to related binding site recognition. , 2004, Journal of medicinal chemistry.

[17]  James E. J. Mills,et al.  High-Throughput Virtual Screening of Proteins Using GRID Molecular Interaction Fields , 2010, J. Chem. Inf. Model..

[18]  Alexander D. MacKerell,et al.  CHARMM general force field: A force field for drug‐like molecules compatible with the CHARMM all‐atom additive biological force fields , 2009, J. Comput. Chem..

[19]  Louis-Philippe Morency,et al.  NRGsuite: a PyMOL plugin to perform docking simulations in real time using FlexAID , 2015, Bioinform..

[20]  Janet M. Thornton,et al.  ProFunc: a server for predicting protein function from 3D structure , 2005, Nucleic Acids Res..

[21]  Didier Rognan,et al.  sc-PDB: a database for identifying variations and multiplicity of 'druggable' binding sites in proteins , 2011, Bioinform..

[22]  K. S. Arun,et al.  Least-Squares Fitting of Two 3-D Point Sets , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[24]  G. Klebe,et al.  DrugScore meets CoMFA: adaptation of fields for molecular comparison (AFMoC) or how to tailor knowledge-based pair-potentials to a particular protein. , 2002, Journal of medicinal chemistry.

[25]  Markus Wagener,et al.  The Quest for Bioisosteric Replacements , 2006, J. Chem. Inf. Model..

[26]  Gabriele Cruciani,et al.  BioGPS: Navigating biological space to predict polypharmacology, off‐targeting, and selectivity , 2015, Proteins.

[27]  David A. Lee,et al.  New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures , 2012, Nucleic Acids Res..

[28]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[29]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[30]  Tina Ritschel,et al.  Pharmacophore Fingerprint-Based Approach to Binding Site Subpocket Similarity and Its Application to Bioisostere Replacement , 2012, J. Chem. Inf. Model..

[31]  J. Thornton,et al.  Shape variation in protein binding pockets and their ligands. , 2007, Journal of molecular biology.

[32]  S. Copley An evolutionary biochemist's perspective on promiscuity. , 2015, Trends in biochemical sciences.

[33]  Didier Rognan,et al.  Comparison and Druggability Prediction of Protein-Ligand Binding Sites from Pharmacophore-Annotated Cavity Shapes , 2012, J. Chem. Inf. Model..

[34]  M. Jambon,et al.  A new bioinformatic approach to detect common 3D sites in protein structures , 2003, Proteins.

[35]  R. Cramer,et al.  Validation of the general purpose tripos 5.2 force field , 1989 .

[36]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[37]  Philip E. Bourne,et al.  Achievements and challenges in structural bioinformatics and computational biophysics , 2014, Bioinform..

[38]  Russ B. Altman,et al.  Using Multiple Microenvironments to Find Similar Ligand-Binding Sites: Application to Kinase Inhibitor Binding , 2011, PLoS Comput. Biol..

[39]  J. Thornton,et al.  Conformational diversity of ligands bound to proteins. , 2006, Journal of molecular biology.

[40]  Rafael Najmanovich,et al.  Side‐chain flexibility in proteins upon ligand binding , 2000, Proteins.

[41]  Michal Brylinski,et al.  eMatchSite: Sequence Order-Independent Structure Alignments of Ligand Binding Pockets in Protein Models , 2014, PLoS Comput. Biol..

[42]  J. Richardson,et al.  Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. , 1999, Journal of molecular biology.

[43]  Eran Eyal,et al.  MutaProt: a web interface for structural analysis of point mutations , 2001, Bioinform..

[44]  Rafael Najmanovich,et al.  Protein side‐chain rearrangement in regions of point mutations , 2002, Proteins.

[45]  Jonathan A. Barker,et al.  Kinome Render: a stand-alone and web-accessible tool to annotate the human protein kinome tree , 2013, PeerJ.

[46]  E. Kellenberger,et al.  A simple and fuzzy method to align and compare druggable ligand‐binding sites , 2008, Proteins.

[47]  G. Klebe,et al.  A new method to detect related function among proteins independent of sequence and fold homology. , 2002, Journal of molecular biology.

[48]  Nathanael Weill,et al.  Alignment-Free Ultra-High-Throughput Comparison of Druggable Protein-Ligand Binding Sites , 2010, J. Chem. Inf. Model..

[49]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Identification of the Protein-Ligand Binding Network To Explain the Side Effects of CETP Inhibitors , 2009, PLoS Comput. Biol..

[50]  H. Wolfson,et al.  Recognition of Functional Sites in Protein Structures☆ , 2004, Journal of Molecular Biology.

[51]  Russ B. Altman,et al.  Knowledge-based Fragment Binding Prediction , 2014, PLoS Comput. Biol..

[52]  S. Kliewer,et al.  Identification of the nuclear receptor DAF-12 as a therapeutic target in parasitic nematodes , 2009, Proceedings of the National Academy of Sciences.

[53]  Mitchell D. Miller,et al.  Structural Biology and Crystallization Communications Structures of the First Representatives of Pfam Family Pf06938 (duf1285) Reveal a New Fold with Repeated Structural Motifs and Possible Involvement in Signal Transduction , 2022 .

[54]  Slawomir K. Grzechnik,et al.  The structure of the first representative of Pfam family PF06475 reveals a new fold with possible involvement in glycolipid metabolism , 2009, Acta crystallographica. Section F, Structural biology and crystallization communications.

[55]  Alexey G. Murzin,et al.  SCOP2 prototype: a new approach to protein structure mining , 2014, Nucleic Acids Res..

[56]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[57]  K. Kinoshita,et al.  Identification of protein biochemical functions by similarity search using the molecular surface database eF‐site , 2003, Protein science : a publication of the Protein Society.

[58]  Janet M. Thornton,et al.  Correction: Structural and Chemical Profiling of the Human Cytosolic Sulfotransferases , 2007, PLoS Biology.

[59]  P. Goodford A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. , 1985, Journal of medicinal chemistry.

[60]  Timo Krotzky,et al.  Large-Scale Mining for Similar Protein Binding Pockets: With RAPMAD Retrieval on the Fly Becomes Real , 2015, J. Chem. Inf. Model..

[61]  Rafael Najmanovich,et al.  Vibrational entropy differences between mesophile and thermophile proteins and their use in protein engineering , 2015, Protein science : a publication of the Protein Society.

[62]  Rafael Najmanovich,et al.  Detection of 3 D atomic similarities and their use in the discrimination of small molecule protein-binding sites , 2008 .

[63]  Rafael Najmanovich,et al.  A Coarse-Grained Elastic Network Atom Contact Model and Its Use in the Simulation of Protein Dynamics and the Prediction of the Effect of Mutations , 2013, bioRxiv.

[64]  Chris Sander,et al.  Dali/FSSP classification of three-dimensional protein folds , 1997, Nucleic Acids Res..

[65]  Rafael Najmanovich,et al.  IsoCleft Finder – a web-based tool for the detection and analysis of protein binding-site geometric and chemical similarities , 2013, F1000Research.

[66]  O. Dym,et al.  Sequence‐structure analysis of FAD‐containing proteins , 2001, Protein science : a publication of the Protein Society.

[67]  Dariya S. Glazer,et al.  The FEATURE framework for protein function annotation: modeling new functions, improving performance, and extending to novel applications , 2008, BMC Genomics.

[68]  J M Thornton,et al.  X-SITE: use of empirically derived atomic packing preferences to identify favourable interaction regions in the binding sites of proteins. , 1996, Journal of molecular biology.

[69]  Rafael Najmanovich,et al.  ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability , 2015, Nucleic Acids Res..

[70]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.