High‐accuracy refinement using Rosetta in CASP13

Because proteins generally fold to their lowest free energy states, energy‐guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co‐evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT‐HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root‐mean‐square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult.

[1]  Michael Levitt,et al.  Super-resolution biomolecular crystallography with low-resolution data , 2010, Nature.

[2]  Alberto Perez,et al.  Determining protein structures by combining semireliable data with atomistic physical models by Bayesian inference , 2015, Proceedings of the National Academy of Sciences.

[3]  Chaok Seok,et al.  Refinement of unreliable local regions in template‐based protein models , 2012, Proteins.

[4]  David E. Kim,et al.  Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. , 2016, Journal of chemical theory and computation.

[5]  Chaok Seok,et al.  Simultaneous refinement of inaccurate local regions and overall structure in the CASP12 protein model refinement experiment , 2018, Proteins.

[6]  Francesco Luigi Gervasio,et al.  Assessment of the model refinement category in CASP12 , 2018, Proteins.

[7]  David Baker,et al.  Modeling Symmetric Macromolecular Structures in Rosetta3 , 2011, PloS one.

[8]  Randy J Read,et al.  Assessment of CASP7 predictions in the high accuracy template‐based modeling category , 2007, Proteins.

[9]  Georgios A. Pavlopoulos,et al.  Protein structure determination using metagenome sequence data , 2017, Science.

[10]  D. Baker,et al.  Relaxation of backbone bond geometry improves protein energy landscape modeling , 2014, Protein science : a publication of the Protein Society.

[11]  Frank DiMaio,et al.  CASP11 refinement experiments with ROSETTA , 2016, Proteins.

[12]  Vahid Mirjalili,et al.  Protein Structure Refinement through Structure Selection and Averaging from Molecular Dynamics Ensembles. , 2013, Journal of chemical theory and computation.

[13]  Gunnar F Schröder,et al.  Coupling an Ensemble of Homologues Improves Refinement of Protein Homology Models. , 2015, Journal of chemical theory and computation.

[14]  D. Baker,et al.  Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich era , 2013, Proceedings of the National Academy of Sciences.

[15]  Michael Feig,et al.  Driven to near‐experimental accuracy by refinement via molecular dynamics simulations , 2019, Proteins.

[16]  Daniel W. Kulp,et al.  Generalized Fragment Picking in Rosetta: Design, Protocols and Applications , 2011, PloS one.

[17]  Frank DiMaio,et al.  Automatic structure prediction of oligomeric assemblies using Robetta in CASP12 , 2018, Proteins.

[18]  Jacek Blazewicz,et al.  SphereGrinder - reference structure-based tool for quality assessment of protein structural models , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[19]  David Baker,et al.  The origin of consistent protein structure refinement from structural averaging. , 2015, Structure.

[20]  Debswapna Bhattacharya,et al.  refineD: improved protein structure refinement using machine learning based restrained relaxation , 2019, Bioinform..

[21]  Frank DiMaio,et al.  Protein structure prediction using Rosetta in CASP12 , 2018, Proteins.

[22]  D. Baker,et al.  Alternate states of proteins revealed by detailed energy landscape mapping. , 2011, Journal of molecular biology.

[23]  David Baker,et al.  Protein homology model refinement by large-scale energy optimization , 2018, Proceedings of the National Academy of Sciences.

[24]  Ross C. Walker,et al.  An overview of the Amber biomolecular simulation package , 2013 .