Hidden partners: Using cross-docking calculations to predict binding sites for proteins with multiple interactions

Protein‐protein interactions control a large range of biological processes and their identification is essential to understand the underlying biological mechanisms. To complement experimental approaches, in silico methods are available to investigate protein‐protein interactions. Cross‐docking methods, in particular, can be used to predict protein binding sites. However, proteins can interact with numerous partners and can present multiple binding sites on their surface, which may alter the binding site prediction quality. We evaluate the binding site predictions obtained using complete cross‐docking simulations of 358 proteins with 2 different scoring schemes accounting for multiple binding sites. Despite overall good binding site prediction performances, 68 cases were still associated with very low prediction quality, presenting individual area under the specificity‐sensitivity ROC curve (AUC) values below the random AUC threshold of 0.5, since cross‐docking calculations can lead to the identification of alternate protein binding sites (that are different from the reference experimental sites). For the large majority of these proteins, we show that the predicted alternate binding sites correspond to interaction sites with hidden partners, that is, partners not included in the original cross‐docking dataset. Among those new partners, we find proteins, but also nucleic acid molecules. Finally, for proteins with multiple binding sites on their surface, we investigated the structural determinants associated with the binding sites the most targeted by the docking partners.

[1]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[2]  D. Altman,et al.  Multiple significance tests: the Bonferroni method , 1995, BMJ.

[3]  Pascal Braun,et al.  History of protein–protein interactions: From egg‐white to complex networks , 2012, Proteomics.

[4]  R. Abagyan,et al.  Identification of protein-protein interaction sites from docking energy landscapes. , 2004, Journal of molecular biology.

[5]  Martin Zacharias,et al.  ATTRACT and PTools: open source programs for protein-protein docking. , 2012, Methods in molecular biology.

[6]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[7]  Z. Weng,et al.  Binding interface prediction by combining protein–protein docking results , 2014, Proteins.

[8]  Benjamin A. Shoemaker,et al.  Deciphering Protein–Protein Interactions. Part II. Computational Methods to Predict Protein and Domain Interaction Partners , 2007, PLoS Comput. Biol..

[9]  Jeffrey J. Gray,et al.  High-resolution protein-protein docking. , 2006, Current opinion in structural biology.

[10]  Desmond J. Higham,et al.  Geometric De-noising of Protein-Protein Interaction Networks , 2009, PLoS Comput. Biol..

[11]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[12]  A. Sali,et al.  The molecular sociology of the cell , 2007, Nature.

[13]  Frédéric Desprez,et al.  From Dedicated Grid to Volunteer Grid: Large Scale Execution of a Bioinformatics Application , 2009, Journal of Grid Computing.

[14]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[15]  S. Jones,et al.  Analysis of protein-protein interaction sites using surface patches. , 1997, Journal of molecular biology.

[16]  Julia M. Shifman,et al.  Multispecific recognition: mechanism, evolution, and design. , 2011, Biochemistry.

[17]  Martin Zacharias,et al.  ATTRACT: Protein–protein docking in CAPRI using a reduced protein model , 2005, Proteins.

[18]  Emmanuel D Levy,et al.  PiQSi: protein quaternary structure investigation. , 2007, Structure.

[19]  Mark N. Wass,et al.  Challenges for the prediction of macromolecular interactions. , 2011, Current opinion in structural biology.

[20]  Alexandre M. J. J. Bonvin,et al.  CPORT: A Consensus Interface Predictor and Its Performance in Prediction-Driven Docking with HADDOCK , 2011, PloS one.

[21]  R. Nussinov,et al.  Modeling Protein Assemblies in the Proteome* , 2014, Molecular & Cellular Proteomics.

[22]  Philippe Roche,et al.  2P2Idb v2: update of a structural database dedicated to orthosteric modulation of protein–protein interactions , 2016, Database J. Biol. Databases Curation.

[23]  Alessandra Carbone,et al.  Protein-Protein Interactions in a Crowded Environment: An Analysis via Cross-Docking Simulations and Evolutionary Information , 2013, PLoS Comput. Biol..

[24]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[25]  Burkhard Rost,et al.  Alternative Protein-Protein Interfaces Are Frequent Exceptions , 2012, PLoS Comput. Biol..

[26]  Hilla Peretz,et al.  Ju n 20 03 Schrödinger ’ s Cat : The rules of engagement , 2003 .

[27]  S. Sacquin-Mora Motions and mechanics: investigating conformational transitions in multi-domain proteins with coarse-grain simulations , 2014 .

[28]  Martin Zacharias,et al.  Protein–protein docking with a reduced protein model accounting for side‐chain flexibility , 2003, Protein science : a publication of the Protein Society.

[29]  Alessandra Carbone,et al.  Identification of protein interaction partners and protein-protein interaction sites. , 2008, Journal of molecular biology.

[30]  Zhiping Weng,et al.  Performance of ZDOCK and ZRANK in CAPRI rounds 13–19 , 2010, Proteins.

[31]  K Nadassy,et al.  Structural features of protein-nucleic acid recognition sites. , 1999, Biochemistry.

[32]  Alessandra Carbone,et al.  Local Geometry and Evolutionary Conservation of Protein Surfaces Reveal the Multiple Recognition Patches in Protein-Protein Interactions , 2015, PLoS Comput. Biol..

[33]  Michael Schroeder,et al.  The Many Faces of Protein–Protein Interactions: A Compendium of Interface Geometry , 2006, PLoS Comput. Biol..

[34]  Martin Zacharias,et al.  Protein–protein docking in CAPRI using ATTRACT to account for global and local flexibility , 2007, Proteins.

[35]  Solène Grosdidier,et al.  Identification of hot-spot residues in protein-protein interactions by computational docking , 2008, BMC Bioinformatics.

[36]  Z. Weng,et al.  Protein–protein docking benchmark 2.0: An update , 2005, Proteins.

[37]  Alessandra Carbone,et al.  Great interactions: How binding incorrect partners can teach us about protein recognition and function , 2016, Proteins.

[38]  S. Sacquin-Mora Bridging Enzymatic Structure Function via Mechanics: A Coarse-Grain Approach. , 2016, Methods in enzymology.

[39]  Jonathan J. Ellis,et al.  Protein–RNA interactions: Structural analysis and functional classes , 2006, Proteins.

[40]  B. Alberts The Cell as a Collection of Protein Machines: Preparing the Next Generation of Molecular Biologists , 1998, Cell.

[41]  J. Janin,et al.  Dissecting protein–RNA recognition sites , 2008, Nucleic acids research.

[42]  Student,et al.  THE PROBABLE ERROR OF A MEAN , 1908 .

[43]  O. Keskin,et al.  Predicting Protein-Protein Interactions from the Molecular to the Proteome Level. , 2016, Chemical reviews.

[44]  Marc F Lensink,et al.  Blind predictions of protein interfaces by docking calculations in CAPRI , 2010, Proteins.

[45]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[46]  S. Sacquin-Mora Fold and flexibility: what can proteins' mechanical properties tell us about their folding nucleus? , 2015, Journal of The Royal Society Interface.

[47]  Alessandra Carbone,et al.  JET2 Viewer: a database of predicted multiple, possibly overlapping, protein–protein interaction sites for PDB structures , 2017, Nucleic Acids Res..

[48]  Jinyan Li,et al.  Structural and Functional Analysis of Multi-Interface Domains , 2012, PloS one.

[49]  Juliette Martin,et al.  Arbitrary protein−protein docking targets biologically relevant interfaces , 2012, BMC biophysics.

[50]  Sarah A. Teichmann,et al.  Principles of protein-protein interactions , 2002, ECCB.

[51]  James Vlasblom,et al.  Challenges and Rewards of Interaction Proteomics * , 2009, Molecular & Cellular Proteomics.

[52]  Alessandra Carbone,et al.  Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling , 2009, PLoS Comput. Biol..

[53]  Alessandra Carbone,et al.  Protein social behavior makes a stronger signal for partner identification than surface geometry , 2016, Proteins.

[54]  Angelo D. Favia,et al.  Protein promiscuity and its implications for biotechnology , 2009, Nature Biotechnology.

[55]  Anna R Panchenko,et al.  Exploring functional roles of multibinding protein interfaces , 2009, Protein science : a publication of the Protein Society.

[56]  Dominique Douguet,et al.  DOCKGROUND resource for studying protein-protein interfaces , 2006, Bioinform..

[57]  M. Zacharias,et al.  Accounting for loop flexibility during protein–protein docking , 2005, Proteins.

[58]  Benjamin A. Shoemaker,et al.  Deciphering Protein–Protein Interactions. Part I. Experimental Techniques and Databases , 2007, PLoS Comput. Biol..

[59]  B Jayaram,et al.  Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes , 2011, Proteome Science.

[60]  J. Janin,et al.  A dissection of specific and non-specific protein-protein interfaces. , 2004, Journal of molecular biology.

[61]  Raphael A. G. Chaleil,et al.  Updates to the Integrated Protein-Protein Interaction Benchmarks: Docking Benchmark Version 5 and Affinity Benchmark Version 2. , 2015, Journal of molecular biology.

[62]  Philippe Roche,et al.  2P2Idb: a structural database dedicated to orthosteric modulation of protein–protein interactions , 2012, Nucleic Acids Res..

[63]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[64]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[65]  J. Janin,et al.  Dissecting protein–protein recognition sites , 2002, Proteins.

[66]  T. Hughes,et al.  High-definition macromolecular composition of yeast RNA-processing complexes. , 2004, Molecular cell.

[67]  A. Fornili,et al.  Protein–protein interaction networks studies and importance of 3D structure knowledge , 2013, Expert review of proteomics.

[68]  R. Bahadur,et al.  The interface of protein-protein complexes: Analysis of contacts and prediction of interactions , 2008, Cellular and Molecular Life Sciences.

[69]  A. Carbone,et al.  Hidden partners: Using cross-docking calculations to predict binding sites for proteins with multiple interactions , 2018, bioRxiv.