HopDock: a probabilistic search algorithm for decoy sampling in protein-protein docking

Background

Elucidating the three-dimensional structure of a higher-order molecular assembly formed by interacting molecular units, a problem commonly known as docking, is central to unraveling the molecular basis of cellular activities. Though protein assemblies are ubiquitous in the cell, it is currently challenging to predict the native structure of a protein assembly in silico.

Methods

This work proposes HopDock, a novel search algorithm for protein-protein docking. HopDock efficiently obtains an ensemble of low-energy dimeric configurations, also known as decoys, that can be used effectively by ab-initio docking protocols. HopDock is based on the Basin Hopping (BH) framework, which perturbs the structure of a dimeric configuration and then applies an energy minimization to explicitly sample a local minimum of a chosen energy function. This process is repeated to sample consecutive energy minima in a trajectory-like fashion. HopDock employs both geometry and evolutionary conservation analysis to narrow down the interaction search space of interest, so that a diverse decoy ensemble can be obtained efficiently.

Results and conclusions

A detailed analysis and a comparative study on seventeen different dimers show that HopDock obtains a broad view of the energy surface near the native dimeric structure and samples many near-native configurations. The results show that HopDock has high sampling capability and can be employed to obtain a large and diverse ensemble of decoy configurations, which can then be refined in greater structural detail in ab-initio docking protocols.
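The perturb-then-minimize loop of the Basin Hopping framework described in the Methods can be illustrated with a minimal sketch. This is not HopDock itself: the one-dimensional `toy_energy` function, the crude `local_minimize` routine, the Metropolis-style acceptance test, and all parameter values (`jump`, `temperature`, `n_steps`) are assumptions chosen only to make the idea concrete. In a docking setting, the state would be a dimeric configuration and the energy a docking energy function.

```python
import math
import random


def toy_energy(x):
    """Hypothetical rugged 1-D energy surface (stand-in for a docking energy)."""
    return 0.1 * x * x + math.sin(3.0 * x)


def local_minimize(energy, x, step=0.1, tol=1e-6):
    """Crude derivative-free descent: step downhill, halve the step when stuck."""
    e = energy(x)
    while step > tol:
        moved = False
        for cand in (x - step, x + step):
            e_cand = energy(cand)
            if e_cand < e:
                x, e, moved = cand, e_cand, True
                break
        if not moved:
            step *= 0.5
    return x, e


def basin_hopping(energy, x0, n_steps=200, jump=1.5, temperature=0.5, seed=0):
    """Return the trajectory of accepted local minima as (x, energy) pairs."""
    rng = random.Random(seed)
    x, e = local_minimize(energy, x0)
    trajectory = [(x, e)]
    for _ in range(n_steps):
        # Perturbation: jump out of the current basin.
        x_pert = x + rng.uniform(-jump, jump)
        # Minimization: map the perturbed state to a nearby local minimum.
        x_new, e_new = local_minimize(energy, x_pert)
        # Acceptance: Metropolis-style criterion (an assumption of this sketch).
        if e_new <= e or rng.random() < math.exp(-(e_new - e) / temperature):
            x, e = x_new, e_new
            trajectory.append((x, e))
    return trajectory


trajectory = basin_hopping(toy_energy, x0=4.0)
best_x, best_e = min(trajectory, key=lambda pair: pair[1])
print(f"accepted minima: {len(trajectory)}, lowest energy found: {best_e:.4f}")
```

Every entry in `trajectory` is a local minimum of the toy surface, so the collected list plays the role of a decoy ensemble in this simplified picture. A larger `jump` or `temperature` favors diversity, while smaller values keep the search close to the current basin.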
