How to Use Not-Always-Reliable Binding Site Information in Protein-Protein Docking Prediction

In many protein-protein docking algorithms, binding site information is used to help predicting the protein complex structures. Using correct and accurate binding site information can increase protein-protein docking success rate significantly. On the other hand, using wrong binding sites information should lead to a failed prediction, or, at least decrease the success rate. Recently, various successful theoretical methods have been proposed to predict the binding sites of proteins. However, the predicted binding site information is not always reliable, sometimes wrong binding site information could be given. Hence there is a high risk to use the predicted binding site information in current docking algorithms. In this paper, a softly restricting method (SRM) is developed to solve this problem. By utilizing predicted binding site information in a proper way, the SRM algorithm is sensitive to the correct binding site information but insensitive to wrong information, which decreases the risk of using predicted binding site information. This SRM is tested on benchmark 3.0 using purely predicted binding site information. The result shows that when the predicted information is correct, SRM increases the success rate significantly; however, even if the predicted information is completely wrong, SRM only decreases success rate slightly, which indicates that the SRM is suitable for utilizing predicted binding site information.

[1]  L. Krippahl,et al.  BiGGER: A new (soft) docking algorithm for predicting protein interactions , 2000, Proteins.

[2]  D. Covell,et al.  A role for surface hydrophobicity in protein‐protein recognition , 1994, Protein science : a publication of the Protein Society.

[3]  Ying Gao,et al.  DOCKGROUND protein-protein docking decoy set , 2008, Bioinform..

[4]  O. Schueler‐Furman,et al.  Progress in protein–protein docking: Atomic resolution predictions in the CAPRI experiment using RosettaDock with an improved treatment of side‐chain flexibility , 2005, Proteins.

[5]  R. Raz,et al.  ProMate: a structure based prediction program to identify the location of protein-protein binding sites. , 2004, Journal of molecular biology.

[6]  Z. Weng,et al.  Protein–protein docking benchmark version 3.0 , 2008, Proteins.

[7]  M. Schroeder,et al.  Using protein binding site prediction to improve protein docking. , 2008, Gene.

[8]  M. Sternberg,et al.  Rapid refinement of protein interfaces incorporating solvation: application to the docking problem. , 1998, Journal of molecular biology.

[9]  Ilya A. Vakser,et al.  DECK: Distance and environment-dependent, coarse-grained, knowledge-based potentials for protein-protein docking , 2011, BMC Bioinformatics.

[10]  S. Jones,et al.  Analysis of protein-protein interaction sites using surface patches. , 1997, Journal of molecular biology.

[11]  E. Katchalski‐Katzir,et al.  Molecular surface recognition: determination of geometric fit between proteins and their ligands by correlation techniques. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  P. Bourne,et al.  Exploiting sequence and structure homologs to identify protein–protein binding sites , 2005, Proteins.

[13]  N. Ben-Tal,et al.  Residue frequencies and pairing preferences at protein–protein interfaces , 2001, Proteins.

[14]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[15]  Aleksey A. Porollo,et al.  Prediction‐based fingerprints of protein–protein interactions , 2006, Proteins.

[16]  Miriam Eisenstein,et al.  Electrostatics in protein–protein docking , 2002, Protein science : a publication of the Protein Society.

[17]  D. Goodsell,et al.  Automated prediction of ligand‐binding sites in proteins , 2007, Proteins.

[18]  L. T. Ten Eyck,et al.  Protein docking using continuum electrostatics and geometric fit. , 2001, Protein engineering.

[19]  Jian Zhang,et al.  Prediction of the interaction site on the surface of an isolated protein structure by analysis of side chain energy scores , 2004, Proteins.

[20]  Z. Weng,et al.  Integrating statistical pair potentials into protein complex prediction , 2007, Proteins.

[21]  Z. Weng,et al.  ZDOCK: An initial‐stage protein‐docking algorithm , 2003, Proteins.

[22]  C. Li,et al.  Biologically enhanced sampling geometric docking and backbone flexibility treatment with multiconformational superposition , 2005, Proteins.

[23]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[24]  R. Abagyan,et al.  Soft protein–protein docking in internal coordinates , 2002, Protein science : a publication of the Protein Society.

[25]  D. Ritchie,et al.  Protein docking using spherical polar Fourier correlations , 2000, Proteins.

[26]  M. Sternberg,et al.  Modelling protein docking using shape complementarity, electrostatics and biochemical information. , 1997, Journal of molecular biology.

[27]  Huan-Xiang Zhou,et al.  meta-PPISP: a meta web server for protein-protein interaction site prediction , 2007, Bioinform..

[28]  Joël Janin,et al.  Side‐chain rotamer transitions at protein‐protein interfaces , 2010, Proteins.

[29]  Huan‐Xiang Zhou,et al.  Prediction of protein interaction sites from sequence profile and residue neighbor list , 2001, Proteins.

[30]  Huan-Xiang Zhou,et al.  A holistic approach to protein docking , 2007, Proteins.

[31]  A. Panchenko,et al.  Prediction of functional sites by analysis of sequence and structure conservation , 2004, Protein science : a publication of the Protein Society.

[32]  Xiaoqin Zou,et al.  An iterative knowledge‐based scoring function for protein–protein recognition , 2008, Proteins.

[33]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[34]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[35]  Huan-Xiang Zhou,et al.  Prediction of interface residues in protein–protein complexes by a consensus neural network method: Test against NMR data , 2005, Proteins.

[36]  Zhiping Weng,et al.  Performance of ZDOCK and ZRANK in CAPRI rounds 13–19 , 2010, Proteins.

[37]  A. Valencia,et al.  Prediction of protein--protein interaction sites in heterocomplexes with neural networks. , 2002, European journal of biochemistry.

[38]  Ruben Abagyan,et al.  Statistical analysis and prediction of protein–protein interfaces , 2005, Proteins.

[39]  Itay Mayrose,et al.  Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues , 2002, ISMB.

[40]  Xiaoqin Zou,et al.  MDockPP: A hierarchical approach for protein‐protein docking and its application to CAPRI rounds 15–19 , 2010, Proteins.

[41]  Miriam Eisenstein,et al.  Weighted geometric docking: Incorporating external information in the rotation‐translation scan , 2003, Proteins.

[42]  Z. Weng,et al.  A novel shape complementarity scoring function for protein‐protein docking , 2003, Proteins.

[43]  I. Vakser,et al.  Evaluation of GRAMM low‐resolution docking methodology on the hemagglutinin‐antibody complex , 1997, Proteins.

[44]  Ludwig Krippahl,et al.  Modeling protein complexes with BiGGER , 2003, Proteins.

[45]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[46]  Zhiping Weng,et al.  ZDOCK and RDOCK performance in CAPRI rounds 3, 4, and 5 , 2005, Proteins.

[47]  Song Liu,et al.  Protein binding site prediction using an empirical scoring function , 2006, Nucleic acids research.

[48]  David R. Westhead,et al.  Improved prediction of protein-protein binding sites using a support vector machines approach. , 2005, Bioinformatics.

[49]  Ariel Fernández,et al.  Insufficiently dehydrated hydrogen bonds as determinants of protein interactions , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Hongyi Zhou,et al.  Stability scale and atomic solvation parameters extracted from 1023 mutation experiments , 2002, Proteins.

[51]  Zhiping Weng,et al.  ZRANK: Reranking protein docking predictions with an optimized energy function , 2007, Proteins.

[52]  Gideon Schreiber,et al.  A novel method for scoring of docked protein complexes using predicted protein-protein binding sites. , 2004, Protein engineering, design & selection : PEDS.

[53]  Lin Li,et al.  ASPDock: protein-protein docking algorithm using atomic solvation parameters model , 2010, BMC Bioinformatics.

[54]  David Baker,et al.  Protein-protein docking with backbone flexibility. , 2007, Journal of molecular biology.

[55]  S. Jones,et al.  Prediction of protein-protein interaction sites using patch analysis. , 1997, Journal of molecular biology.

[56]  Yaoqi Zhou,et al.  Docking prediction using biological information, ZDOCK sampling technique, and clustering guided by the DFIRE statistical energy function , 2005, Proteins.

[57]  A. Bonvin,et al.  WHISCY: What information does surface conservation yield? Application to data‐driven docking , 2006, Proteins.

[58]  Pan Quan PREDICTING PROTEIN-PROTEIN INTERACTION BASED ON THE SEQUENCE-SEGMENTED AMINO ACID COMPOSITION , 2009 .

[59]  M J Sternberg,et al.  Use of pair potentials across protein interfaces in screening predicted docked complexes , 1999, Proteins.

[60]  Arun K. Ramani,et al.  Protein interaction networks from yeast to human. , 2004, Current opinion in structural biology.

[61]  Sarah A. Teichmann,et al.  Principles of protein-protein interactions , 2002, ECCB.

[62]  Julie C Mitchell,et al.  Finding needles in haystacks: Reranking DOT results by using shape complementarity, cluster analysis, and biological information , 2003, Proteins.

[63]  Li Li,et al.  RDOCK: Refinement of rigid‐body protein docking predictions , 2003, Proteins.