Combinatorial docking approach for structure prediction of large proteins and multi-molecular assemblies

Protein folding and protein binding are similar processes: in both, structural units associate combinatorially. In folding, the units are mostly relatively small building blocks or domains that are covalently linked. In multi-molecular binding, the subunits are relatively large and are held together only by non-covalent interactions. Experimentally, determining the structures of such large assemblies becomes more difficult as the size of the complex and the number of its components increase. Computationally, the prediction of the structures of multi-molecular complexes has largely not been addressed, probably owing to the combinatorial complexity of the problem; current docking algorithms mostly target the prediction of pairwise interactions. Here our goal is to predict the structures of multi-unit associations, whether the units are chain-connected, as in protein folding, or separate molecules, as in multi-molecular assemblies. We assume that the structures of the single units are known, either through experimental determination or through modeling, and we aim to assemble these units combinatorially to predict the structure of the whole. To address this problem we have developed CombDock, a combinatorial docking algorithm for the structural units assembly problem. Below, we briefly describe the algorithm and present examples of its applications to folding and to multi-molecular assemblies. To test the robustness of the algorithm, we use inaccurate models of the structural units, derived either from crystal structures of unbound molecules or from modeling of the target sequences. The algorithm predicted near-native arrangements of the input structural units in almost all of the cases, suggesting that a combinatorial approach can overcome the imperfect shape complementarity caused by the inaccuracy of the models. We further show that a combinatorial docking strategy can improve the prediction of the pairwise interactions involved in a multi-molecular assembly.
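
To make the idea of combinatorial assembly concrete, the following toy sketch may help. It is not CombDock and makes no claim about how CombDock is implemented. It assumes, purely for illustration, that each unit is a point in the plane, that each pairwise docking candidate is a relative translation with a score, and that two units clash when their centres are closer than a hypothetical `min_dist`. All names (`spanning_trees`, `place`, `assemble`, `candidates`) are invented for this example. The sketch enumerates every way of connecting all units through one candidate per chosen pair, rejects arrangements with clashes, and ranks the rest by total score.

```python
from itertools import combinations, product
from math import dist


def spanning_trees(n, edges):
    """Yield every subset of n-1 edges that connects units 0..n-1 without a cycle."""
    for tree in combinations(edges, n - 1):
        parent = list(range(n))

        def find(x):
            while parent[x] != x:
                x = parent[x]
            return x

        acyclic = True
        for a, b in tree:
            ra, rb = find(a), find(b)
            if ra == rb:          # edge would close a cycle
                acyclic = False
                break
            parent[ra] = rb
        if acyclic:               # n-1 acyclic edges on n nodes form a spanning tree
            yield tree


def place(tree, choice):
    """Fix unit 0 at the origin and propagate the chosen offsets along the tree."""
    pos = {0: (0.0, 0.0)}
    pending = list(zip(tree, choice))
    while pending:
        deferred = []
        for (a, b), (dx, dy, score) in pending:
            if a in pos and b not in pos:
                pos[b] = (pos[a][0] + dx, pos[a][1] + dy)
            elif b in pos and a not in pos:
                pos[a] = (pos[b][0] - dx, pos[b][1] - dy)
            else:                 # neither endpoint placed yet; retry next pass
                deferred.append(((a, b), (dx, dy, score)))
        pending = deferred
    return pos


def assemble(n, candidates, min_dist=1.0):
    """Return clash-free assemblies as (total_score, tree, positions), best first."""
    results = []
    for tree in spanning_trees(n, sorted(candidates)):
        # Pick one pairwise candidate for every edge of the tree.
        for choice in product(*(candidates[e] for e in tree)):
            pos = place(tree, choice)
            clash = any(dist(pos[i], pos[j]) < min_dist
                        for i, j in combinations(range(n), 2))
            if not clash:
                results.append((sum(c[2] for c in choice), tree, pos))
    results.sort(key=lambda r: r[0], reverse=True)
    return results


# Hypothetical toy input: (dx, dy, score) candidates for each docked pair.
candidates = {
    (0, 1): [(1, 0, 5.0)],
    (1, 2): [(-1, 0, 9.0), (1, 0, 4.0)],
}
results = assemble(3, candidates)
```

In this toy input the highest-scoring candidate for the pair (1, 2) would place unit 2 on top of unit 0, so it is discarded once all three units are considered together, and the lower-scoring candidate is kept. This illustrates, under the stated assumptions only, how evaluating a whole assembly can filter pairwise predictions.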
