Native fold and docking pose discrimination by the same residue‐based scoring function

Structure prediction and quality assessment are crucial steps in modeling native protein conformations. Statistical potentials are widely used in related algorithms, with different parametrizations typically developed for different contexts such as folding protein monomers or docking protein complexes. Here, we describe BACH‐SixthSense, a single residue‐based statistical potential that can be successfully employed in both contexts. BACH‐SixthSense shares the same approach as BACH, a knowledge‐based potential originally developed to score monomeric protein structures. A term that penalizes steric clashes as well as the distinction between polar and apolar sidechain‐sidechain contacts are crucial novel features of BACH‐SixthSense. The performance of BACH‐SixthSense in discriminating correctly the native structure among a competing set of decoys is significantly higher than other state‐of‐the‐art scoring functions, that were specifically trained for a single context, for both monomeric proteins (QMEAN, Rosetta, RF_CB_SRS_OD, benchmarked on CASP targets) and protein dimers (IRAD, Rosetta, PIE*PISA, HADDOCK, FireDock, benchmarked on 14 CAPRI targets). The performance of BACH‐SixthSense in recognizing near‐native docking poses within CAPRI decoy sets is good as well. Proteins 2015; 83:621–630. © 2015 Wiley Periodicals, Inc.

[1]  Flavio Seno,et al.  Insight into the Structure of Amyloid Fibrils from the Analysis of Globular Proteins , 2006, PLoS Comput. Biol..

[2]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[3]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[4]  S. Fields,et al.  A novel genetic system to detect protein–protein interactions , 1989, Nature.

[5]  W. C. Still,et al.  Approximate atomic surfaces from linear combinations of pairwise overlaps (LCPO) , 1999 .

[6]  Alessandro Laio,et al.  BACHSCORE. A tool for evaluating efficiently and reliably the quality of large sets of protein structures , 2013, Comput. Phys. Commun..

[7]  Hong Liang,et al.  A method for integrative structure determination of protein-protein complexes , 2012, Bioinform..

[8]  Alessandro Laio,et al.  A simple and efficient statistical potential for scoring ensembles of protein , 2012 .

[9]  Z. Weng,et al.  Integrating atom‐based and residue‐based scoring functions for protein–protein docking , 2011, Protein science : a publication of the Protein Society.

[10]  D. V. S. Ravikant,et al.  Improving ranking of models for protein complexes with side chain modeling and atomic potentials , 2013, Proteins.

[11]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[12]  Marco Biasini,et al.  Toward the estimation of the absolute quality of individual protein structure models , 2010, Bioinform..

[13]  Ron Elber,et al.  PIE—Efficient filters and coarse grained potentials for unbound protein–protein docking , 2010, Proteins.

[14]  K. Takano ON SOLUTION OF , 1983 .

[15]  Silvio C. E. Tosatto,et al.  PASTA 2.0: an improved server for protein aggregation prediction , 2014, Nucleic Acids Res..

[16]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[17]  Salvador Ventura,et al.  Monitoring the interference of protein‐protein interactions in vivo by bimolecular fluorescence complementation: the DnaK case , 2008, Proteomics.

[18]  Yang Zhang,et al.  I-TASSER server for protein 3D structure prediction , 2008, BMC Bioinformatics.

[19]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[20]  Pascal Benkert,et al.  QMEAN: A comprehensive scoring function for model quality assessment , 2008, Proteins.

[21]  W. L. Jorgensen,et al.  The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. , 1988, Journal of the American Chemical Society.

[22]  Ioannis Ch. Paschalidis,et al.  Protein Docking by the Underestimation of Free Energy Funnels in the Space of Encounter Complexes , 2008, PLoS Comput. Biol..

[23]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[24]  Zhiping Weng,et al.  ZRANK: Reranking protein docking predictions with an optimized energy function , 2007, Proteins.

[25]  Alessandro Laio,et al.  A simple and efficient statistical potential for scoring ensembles of protein structures , 2012, Scientific Reports.

[26]  T. Schwede,et al.  QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information , 2009, BMC Structural Biology.

[27]  Tammy M. K. Cheng,et al.  pyDock: Electrostatics and desolvation for effective scoring of rigid‐body protein–protein docking , 2007, Proteins.

[28]  Anna Tramontano,et al.  Evaluation of CASP8 model quality predictions , 2009, Proteins.

[29]  Zhiping Weng,et al.  Accelerating Protein Docking in ZDOCK Using an Advanced 3D Convolution Library , 2011, PloS one.

[30]  Andrzej Kloczkowski,et al.  Combining statistical potentials with dynamics-based entropies improves selection from protein decoys and docking poses. , 2012, The journal of physical chemistry. B.

[31]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[32]  S. Wodak,et al.  Assessment of CAPRI predictions in rounds 3–5 shows progress in docking procedures , 2005, Proteins.

[33]  Ruth Nussinov,et al.  FireDock: Fast interaction refinement in molecular docking , 2007, Proteins.

[34]  Richard Bonneau,et al.  An improved protein decoy set for testing energy functions for protein structure prediction , 2003, Proteins.

[35]  Libin Cao,et al.  Protein–protein docking with binding site patch prediction and network‐based terms enhanced combinatorial scoring , 2010, Proteins.

[36]  András Fiser,et al.  New statistical potential for quality assessment of protein models and a survey of energy functions , 2010, BMC Bioinformatics.