Protein-protein binding site identification by enumerating the configurations

Background: The ability to predict protein-protein binding sites has a wide range of applications, including signal transduction studies, de novo drug design, structure identification, and comparison of functional sites. The interface in a complex involves two structurally matched protein subunits, so binding sites can be predicted by identifying structural matches at protein surfaces.

Results: We propose a method that enumerates “all” the configurations (or poses) between two proteins (the 3D coordinates of the two subunits in a complex) and evaluates each configuration by the interaction between its components, using the Atomic Contact Energy function. The enumeration is achieved efficiently by exploring a set of rigid transformations. Our approach incorporates a surface identification technique and a method for avoiding clashes between the two subunits when computing rigid transformations. Once the optimal transformations according to the Atomic Contact Energy function are identified, the corresponding binding sites are given as predictions. Our results show that this approach consistently performs better than other methods in binding site identification.

Conclusions: Our method achieved a higher success rate than other methods, with prediction quality improved in terms of both accuracy and coverage. Moreover, our method is able to predict the configurations of two binding proteins, whereas most other methods predict only the binding sites. The software package is available at http://sites.google.com/site/guofeics/dobi for non-commercial use.
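The enumerate-and-score idea described in the Results can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy point-set representation of each subunit, rotations about the z-axis only, a coarse translation grid, and a simple contact count standing in for the Atomic Contact Energy function. All function names, the clash distance (2.0) and the contact cutoff (4.5) are hypothetical choices made for illustration.

```python
import math
from itertools import product

def transform(points, theta, shift):
    """Apply a rigid transformation: rotate about z by theta, then translate."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + shift[0], s * x + c * y + shift[1], z + shift[2])
            for x, y, z in points]

def score_pose(receptor, ligand, clash=2.0, contact=4.5):
    """Toy stand-in for an energy function (NOT Atomic Contact Energy).

    Returns None if any atom pair is closer than `clash` (pose rejected),
    otherwise minus the number of pairs within `contact` (lower is better).
    """
    contacts = 0
    for a in receptor:
        for b in ligand:
            d = math.dist(a, b)
            if d < clash:
                return None
            if d <= contact:
                contacts += 1
    return -contacts

def best_pose(receptor, ligand, angles, shifts):
    """Enumerate every (angle, shift) pair and keep the lowest-scoring pose."""
    best = None
    for theta, shift in product(angles, shifts):
        moved = transform(ligand, theta, shift)
        s = score_pose(receptor, moved)
        if s is None:
            continue  # clashing pose, skip it
        if best is None or s < best[0]:
            best = (s, theta, shift, moved)
    return best

def interface(receptor, moved, contact=4.5):
    """Indices of receptor and ligand points that take part in a contact."""
    pairs = [(i, j) for i, a in enumerate(receptor)
             for j, b in enumerate(moved) if math.dist(a, b) <= contact]
    return sorted({i for i, _ in pairs}), sorted({j for _, j in pairs})

# Hypothetical toy input: two short rows of points instead of real subunits.
receptor = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (6.0, 0.0, 0.0)]
ligand = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
angles = [0.0, math.pi / 2]
shifts = [(dx, dy, 0.0) for dx in (0.0, 3.0) for dy in (0.0, 3.0, 6.0)]

best = best_pose(receptor, ligand, angles, shifts)
rec_site, lig_site = interface(receptor, best[3])
print("score:", best[0], "receptor site:", rec_site, "ligand site:", lig_site)
```

In this sketch the predicted binding site is read off the best-scoring pose, which mirrors the abstract's statement that binding sites follow from the optimal transformations. A realistic version would sample full 3D rotations, restrict the search to surface atoms, and use an actual energy function.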
