Less Is More: Coarse-Grained Integrative Modeling of Large Biomolecular Assemblies with HADDOCK

Predicting the 3D structure of protein interactions remains a challenge in the field of computational structural biology. This is in part due to difficulties in sampling the complex energy landscape of multiple interacting flexible polypeptide chains. Coarse-graining approaches, which reduce the number of degrees of freedom of the system, help address this limitation by smoothing the energy landscape, allowing an easier identification of the global energy minimum. They also accelerate the calculations, allowing to model larger assemblies. Here, we present the implementation of the MARTINI coarse-grained force field for proteins into HADDOCK, our integrative modelling platform. Docking and refinement are performed at the coarse-grained level and the resulting models are then converted back to atomistic resolution through a distance restraints-guided morphing procedure. Our protocol, tested on the largest complexes of the protein docking benchmark 5, shows an overall ~7-fold speed increase compared to standard all-atom calculations, while maintaining a similar accuracy and yielding substantially more near-native solutions. To showcase the potential of our method, we performed simultaneous 7 body docking to model the 1:6 KaiC-KaiB complex, integrating mutagenesis and hydrogen/deuterium exchange data from mass spectrometry with symmetry restraints, and validated the resulting models against a recently published cryo-EM structure.

[1]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[2]  Susan E. Cohen,et al.  Structural basis of the day-night transition in a bacterial circadian clock , 2017, Science.

[3]  Martin Egli,et al.  CryoEM and molecular dynamics of the circadian KaiB-KaiC complex indicates that KaiB monomers interact with KaiC and block ATP binding clefts. , 2013, Journal of molecular biology.

[4]  M. Levitt,et al.  Refinement of protein conformations using a macromolecular energy minimization procedure. , 1969, Journal of molecular biology.

[5]  Alexandre M J J Bonvin,et al.  Clustering biomolecular complexes by residue contacts similarity , 2012, Proteins.

[6]  G C P van Zundert,et al.  The HADDOCK2.2 Web Server: User-Friendly Integrative Modeling of Biomolecular Complexes. , 2016, Journal of molecular biology.

[7]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[8]  Alexandre M J J Bonvin,et al.  Advances in integrative modeling of biomolecular complexes. , 2013, Methods.

[9]  J. Rodrigues,et al.  Integrative computational modeling of protein interactions , 2014, The FEBS journal.

[10]  Pierre Tufféry,et al.  The pepATTRACT web server for blind, large-scale peptide–protein docking , 2017, Nucleic Acids Res..

[11]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[12]  A. Kolinski,et al.  Coarse-Grained Protein Models and Their Applications. , 2016, Chemical reviews.

[13]  Rebecca C. Wade,et al.  Protein‐Protein Docking , 2001 .

[14]  Shoji Takada,et al.  Coarse-grained molecular simulations of large biomolecules. , 2012, Current opinion in structural biology.

[15]  R. Abagyan,et al.  Identification of protein-protein interaction sites from docking energy landscapes. , 2004, Journal of molecular biology.

[16]  Katsumi Imada,et al.  ATP‐induced hexameric ring structure of the cyanobacterial circadian clock protein KaiC , 2003, Genes to cells : devoted to molecular & cellular mechanisms.

[17]  Friedrich Förster,et al.  Structures of the cyanobacterial circadian oscillator frozen in a fully assembled state , 2017, Science.

[18]  Gert Vriend,et al.  Everyday , 2020, Oxford Research Encyclopedia of Literature.

[19]  Mateusz Kurcinski,et al.  CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site , 2015, Nucleic Acids Res..

[20]  Marc F Lensink,et al.  Docking and scoring protein interactions: CAPRI 2009 , 2010, Proteins.

[21]  A J Rader,et al.  Coarse-grained models: getting more with less. , 2010, Current opinion in pharmacology.

[22]  R. Larson,et al.  The MARTINI Coarse-Grained Force Field: Extension to Proteins. , 2008, Journal of chemical theory and computation.

[23]  A. Brunger Version 1.2 of the Crystallography and NMR system , 2007, Nature Protocols.

[24]  Gregory A Voth,et al.  Coarse-graining of multiprotein assemblies. , 2012, Current opinion in structural biology.

[25]  Helgi I. Ingólfsson,et al.  Martini Coarse-Grained Force Field: Extension to RNA. , 2015, Biophysical journal.

[26]  Martin Zacharias,et al.  Binding site prediction and improved scoring during flexible protein–protein docking with ATTRACT , 2010, Proteins.

[27]  Siewert J Marrink,et al.  Martini Coarse-Grained Force Field: Extension to Carbohydrates. , 2009, Journal of chemical theory and computation.

[28]  Raphael A. G. Chaleil,et al.  Updates to the Integrated Protein-Protein Interaction Benchmarks: Docking Benchmark Version 5 and Affinity Benchmark Version 2. , 2015, Journal of molecular biology.

[29]  Mateusz Kurcinski,et al.  Modeling of protein-peptide interactions using the CABS-dock web server for binding site search and flexible docking. , 2015, Methods.

[30]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[31]  R. Russell,et al.  Protein complexes: structure prediction challenges for the 21st century. , 2005, Current opinion in structural biology.

[32]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[33]  W. L. Jorgensen,et al.  The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. , 1988, Journal of the American Chemical Society.

[34]  A. Kolinski Protein modeling and structure prediction with a reduced representation. , 2004, Acta biochimica Polonica.

[35]  W F Drew Bennett,et al.  Improved Parameters for the Martini Coarse-Grained Protein Force Field. , 2013, Journal of chemical theory and computation.

[36]  S. Wodak,et al.  Docking and scoring protein complexes: CAPRI 3rd Edition , 2007, Proteins.

[37]  Zhiping Weng,et al.  Protein-protein docking: overview and performance analysis. , 2008, Methods in molecular biology.

[38]  C. Johnson,et al.  Expression of a gene cluster kaiABC as a circadian feedback process in cyanobacteria. , 1998, Science.

[39]  D. Tieleman,et al.  The MARTINI force field: coarse grained model for biomolecular simulations. , 2007, The journal of physical chemistry. B.

[40]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[41]  J. Janin,et al.  Computer analysis of protein-protein interaction. , 1978, Journal of molecular biology.

[42]  L. Dijkhuizen,et al.  Martini Coarse-Grained Force Field: Extension to DNA. , 2015, Journal of chemical theory and computation.

[43]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[44]  P. Aloy,et al.  Interactome3D: adding structural details to protein networks , 2013, Nature Methods.

[45]  Alexandre M. J. J. Bonvin,et al.  Insight into cyanobacterial circadian timing from structural details of the KaiB–KaiC interaction , 2014, Proceedings of the National Academy of Sciences.

[46]  Mark N. Wass,et al.  Challenges for the prediction of macromolecular interactions. , 2011, Current opinion in structural biology.

[47]  Panagiotis I. Koukos,et al.  A Membrane Protein Complex Docking Benchmark. , 2018, Journal of molecular biology.

[48]  Piotr Sliz,et al.  Collaboration gets the most out of software , 2013, eLife.

[49]  Fumio Hayashi,et al.  Hexamerization by the N-terminal domain and intersubunit phosphorylation by the C-terminal domain of cyanobacterial circadian clock protein KaiC. , 2006, Biochemical and biophysical research communications.

[50]  Isaure Chauvot de Beauchêne,et al.  A web interface for easy flexible protein-protein docking with ATTRACT. , 2015, Biophysical journal.

[51]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[52]  Gert Vriend,et al.  A series of PDB related databases for everyday needs , 2010, Nucleic Acids Res..

[53]  A. B. Reddy,et al.  Circadian Clocks in Human Red Blood Cells , 2010, Nature.