Novel sampling strategies and a coarse-grained score function for docking homomers, flexible heteromers, and oligosaccharides using Rosetta in CAPRI Rounds 37–45

CAPRI Rounds 37 through 45 introduced larger complexes, new macromolecules, and multi-stage assemblies. For these rounds, we used and expanded docking methods in Rosetta to model 23 target complexes. We successfully predicted 14 target complexes and recognized and refined near-native models generated by other groups for two further targets. Notably, for targets T110 and T136, we achieved the closest prediction of any CAPRI participant. We created several innovative approaches during these rounds. Since Round 39 (target 122), we have used the new RosettaDock 4.0, which has a revamped coarse-grained energy function and the ability to perform conformer selection during docking with hundreds of pre-generated protein backbones. Ten of the complexes had some degree of symmetry in their interactions, so we tested Rosetta SymDock, realized its shortcomings, and developed the next-generation symmetric docking protocol, SymDock2, which includes docking of multiple backbones and induced-fit refinement. Since the last CAPRI assessment, we also developed methods for modeling and designing carbohydrates in Rosetta, and we used them to successfully model oligosaccharide–protein complexes in Round 41. While the results were broadly encouraging, they also highlighted the pressing need to invest in (1) flexible docking algorithms with the ability to model loop and linker motions and in (2) new sampling and scoring methods for oligosaccharide–protein interactions.

[1]  Sameer Velankar,et al.  The challenge of modeling protein assemblies: the CASP12‐CAPRI experiment , 2018, Proteins.

[2]  Timothy Gallagher,et al.  Carbohydrate–Aromatic Interactions in Proteins , 2015, Journal of the American Chemical Society.

[3]  Tirso Pons,et al.  Towards a detailed atlas of protein-protein interactions. , 2013, Current opinion in structural biology.

[4]  D. Baker,et al.  Alternate states of proteins revealed by detailed energy landscape mapping. , 2011, Journal of molecular biology.

[5]  Jeffrey J. Gray,et al.  Analysis and Modeling of the Variable Region of Camelid Single-Domain Antibodies , 2011, The Journal of Immunology.

[6]  Iannis E. Adamopoulos,et al.  Structural Activation of Pro‐inflammatory Human Cytokine IL‐23 by Cognate IL‐23 Receptor Enables Recruitment of the Shared Receptor IL‐12R&bgr;1 , 2018, Immunity.

[7]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[8]  Andreas Möglich,et al.  Full-length structure of a sensor histidine kinase pinpoints coaxial coiled coils as signal transducers and modulators. , 2013, Structure.

[9]  Amelie Stein,et al.  Improvements to Robotics-Inspired Conformational Sampling in Rosetta , 2013, PloS one.

[10]  David Baker,et al.  High-resolution comparative modeling with RosettaCM. , 2013, Structure.

[11]  Jeong-Sun Kim,et al.  Structures of iron-dependent alcohol dehydrogenase 2 from Zymomonas mobilis ZM4 with and without NAD+ cofactor. , 2011, Journal of molecular biology.

[12]  Mark J. van Raaij,et al.  Crystal structure of raptor adenovirus 1 fibre head and role of the beta-hairpin in siadenovirus fibre head domains , 2016, Virology Journal.

[13]  Ora Schueler-Furman,et al.  Crucial Roles of Single Residues in Binding Affinity, Specificity, and Promiscuity in the Cellulosomal Cohesin-Dockerin Interface* , 2015, The Journal of Biological Chemistry.

[14]  Colin A. Smith,et al.  Backrub-like backbone simulation recapitulates natural protein conformational variability and improves mutant side-chain prediction. , 2008, Journal of molecular biology.

[15]  Brian D. Weitzner,et al.  Modeling and docking of antibody structures with Rosetta , 2017, Nature Protocols.

[16]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[17]  Matthew J Bick,et al.  How to switch off a histidine kinase: crystal structure of Geobacillus stearothermophilus KinB with the inhibitor Sda. , 2009, Journal of molecular biology.

[18]  Jeffrey J. Gray,et al.  Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. , 2008, Journal of molecular biology.

[19]  Andrej Sali,et al.  Comparative Protein Structure Modeling Using MODELLER , 2014, Current protocols in bioinformatics.

[20]  Roland L. Dunbrack,et al.  The Rosetta all-atom energy function for macromolecular modeling and design , 2017, bioRxiv.

[21]  Han Remaut,et al.  Helicobacter pylori adhesin HopQ disrupts trans dimerization in human CEACAMs , 2018, The EMBO journal.

[22]  William Sheffler,et al.  Efficient Flexible Backbone Protein-Protein Docking for Challenging Targets , 2017, bioRxiv.

[23]  Pedro Bule,et al.  Assembly of Ruminococcus flavefaciens cellulosome revealed by structures of two cohesin-dockerin complexes , 2017, Scientific Reports.

[24]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using Modeller , 2006, Current protocols in bioinformatics.

[25]  Hidde L. Ploegh,et al.  CEACAM1 regulates TIM-3-mediated tolerance and exhaustion , 2014, Nature.

[26]  Ilya A Vakser,et al.  Protein-protein docking: from interaction to interactome. , 2014, Biophysical journal.

[27]  Daiki Hattori,et al.  Structural basis for the bifunctionality of fructose-1,6-bisphosphate aldolase/phosphatase , 2011, Nature.

[28]  Christof R Hauck,et al.  Helicobacter pylori adhesin HopQ engages in a virulence-enhancing interaction with human CEACAMs , 2016, Nature Microbiology.

[29]  Martin Frank,et al.  GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. , 2006, Glycobiology.

[30]  William Sheffler,et al.  Efficient Flexible Backbone Protein-Protein Docking for Challenging Targets , 2017, bioRxiv.

[31]  Pedro Bule,et al.  Cell-surface Attachment of Bacterial Multienzyme Complexes Involves Highly Dynamic Protein-Protein Anchors* , 2015, The Journal of Biological Chemistry.

[32]  Christopher Davies,et al.  Structure of the N-terminal domain of human CEACAM1: binding target of the opacity proteins during invasion of Neisseria meningitidis and N. gonorrhoeae. , 2006, Acta crystallographica. Section D, Biological crystallography.

[33]  Rosalie Lipsh,et al.  Ultrahigh specificity in a network of computationally designed protein-interaction pairs , 2018, Nature Communications.

[34]  Krishna Praneeth Kilambi,et al.  Protein-Protein Docking with Dynamic Residue Protonation States , 2014, PLoS Comput. Biol..

[35]  Magnus Rueping,et al.  Identification and Experimental Characterization of an Extremophilic Brine Pool Alcohol Dehydrogenase from Single Amplified Genomes. , 2017, ACS chemical biology.

[36]  Vojtěch Spiwok,et al.  CH/π Interactions in Carbohydrate Recognition , 2017, Molecules.

[37]  Krishna Praneeth Kilambi,et al.  Extending RosettaDock with water, sugar, and pH for prediction of complex structures and affinities for CAPRI rounds 20–27 , 2013, Proteins.

[38]  Jeffrey J. Gray,et al.  Modeling oblong proteins and water‐mediated interfaces with RosettaDock in CAPRI rounds 28–35 , 2017, Proteins.

[39]  Shourya S. Roy Burman,et al.  Flexible backbone assembly and refinement of symmetrical homomeric complexes , 2018, bioRxiv.

[40]  Dima Kozakov,et al.  The ClusPro web server for protein–protein docking , 2017, Nature Protocols.

[41]  Alon Ben-David,et al.  Crystal structure of an inverting GH 43 1,5-alpha-L-arabinanase from Geobacillus stearothermophilus complexed with its substrate. , 2009, The Biochemical journal.

[42]  Justyna Aleksandra Wojdyla,et al.  Structure of the ultra-high-affinity colicin E2 DNase--Im2 complex. , 2012, Journal of molecular biology.

[43]  Mark J. van Raaij,et al.  Crystal Structure of the Fibre Head Domain of the Atadenovirus Snake Adenovirus 1 , 2014, PloS one.

[44]  Erwin Schneider,et al.  Crystal structures of the solute receptor GacH of Streptomyces glaucescens in complex with acarbose and an acarbose homolog: comparison with the acarbose-loaded maltose-binding protein of Salmonella typhimurium. , 2010, Journal of molecular biology.

[45]  Jared Adolf-Bryfogle,et al.  Residue‐centric modeling and design of saccharide and glycoconjugate structures , 2017, J. Comput. Chem..

[46]  Andrei L. Turinsky,et al.  Protein-protein interaction networks: the puzzling riches. , 2013, Current opinion in structural biology.

[47]  R. Woods,et al.  Vina-Carb: Improving Glycosidic Angles during Carbohydrate Docking. , 2016, Journal of chemical theory and computation.

[48]  D. Baker,et al.  RosettaRemodel: A Generalized Framework for Flexible Backbone Protein Design , 2011, PloS one.

[49]  S. Wodak,et al.  Assessment of blind predictions of protein–protein interactions: Current status of docking methods , 2003, Proteins.

[50]  Jeffrey J. Gray,et al.  SnugDock: Paratope Structural Optimization during Antibody-Antigen Docking Compensates for Errors in Antibody Homology Models , 2010, PLoS Comput. Biol..

[51]  J. Scheller,et al.  Non-Canonical Interleukin 23 Receptor Complex Assembly , 2014, The Journal of Biological Chemistry.

[52]  David Baker,et al.  Prediction of the structure of symmetrical protein assemblies , 2007, Proceedings of the National Academy of Sciences.

[53]  Jeffrey J. Gray,et al.  Flexible backbone assembly and refinement of symmetrical homomeric complexes , 2018, bioRxiv.

[54]  Mark J van Raaij,et al.  Structure of the C-terminal head domain of the fowl adenovirus type 1 long fiber. , 2007, The Journal of general virology.

[55]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[56]  Adam Round,et al.  Insc:LGN tetramers promote asymmetric divisions of mammary stem cells , 2018, Nature Communications.