Modeling protein structures from predicted contacts with modern molecular dynamics potentials: accuracy, sensitivity, and refinement

Protein structure prediction has become increasingly popular and successful in recent years. An essential step for fragment-free, template-free methods is the generation of a final three-dimensional protein model from a set of predicted amino acid contacts that are often described by interresidue pairwise atomic distances. Here we explore the use of modern, open-source molecular dynamics (MD) engines, which have been continually developed over the last three decades with all-atom Hamiltonians to model biomolecular structure and dynamics, to generate accurate protein structures starting from a set of inferred pairwise distances. Additionally, the ability of MD empirical physical potentials to correct inaccuracies in the predicted geometries is tested. We rigorously characterize the effect of modeling parameters on results, the effect of different amounts of error in the predicted distances on the final structures, and test the ability of post-processing analysis to sort the best models out of a set of statistical replicas. We find that with exact distances and with noisy distances, the method can produce excellent structural models, and that the molecular dynamics force field seems to help correct errors in distance predictions, resisting the effects of applied noise.

[1]  Jianlin Cheng,et al.  CONFOLD: Residue‐residue contact‐guided ab initio protein folding , 2015, Proteins.

[2]  C. Simmerling,et al.  ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. , 2015, Journal of chemical theory and computation.

[3]  C. Bugg,et al.  Structure of ubiquitin refined at 1.8 A resolution. , 1987, Journal of molecular biology.

[4]  G. Schulz,et al.  Structure of the outer membrane protein A transmembrane domain , 1998, Nature Structural Biology.

[5]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[6]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[7]  J. Sacchettini,et al.  The Structural Basis of T4 Phage Lysis Control: DNA as the Signal for Lysis Inhibition. , 2020, Journal of molecular biology.

[8]  Michael Feig,et al.  Experimental accuracy in protein structure refinement via molecular dynamics simulations , 2018, Proceedings of the National Academy of Sciences.

[9]  Michael Feig,et al.  High-Accuracy Protein Structures by Combining Machine-Learning with Physics-Based Refinement , 2019, bioRxiv.

[10]  David T Jones,et al.  Prediction of interresidue contacts with DeepMetaPSICOV in CASP13 , 2019, Proteins.

[11]  Michael Feig,et al.  Driven to near‐experimental accuracy by refinement via molecular dynamics simulations , 2019, Proteins.

[12]  Demis Hassabis,et al.  Improved protein structure prediction using potentials from deep learning , 2020, Nature.

[13]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using MODELLER , 2016, Current protocols in bioinformatics.

[14]  A. Brunger Version 1.2 of the Crystallography and NMR system , 2007, Nature Protocols.

[15]  Christopher J. Williams,et al.  MolProbity: More and better reference data for improved all‐atom structure validation , 2018, Protein science : a publication of the Protein Society.

[16]  Jun Hu,et al.  ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks , 2019, Bioinform..

[17]  W. L. Jorgensen,et al.  Improved Peptide and Protein Torsional Energetics with the OPLS-AA Force Field , 2015, Journal of chemical theory and computation.

[18]  Kristie B. Hadden,et al.  2020 , 2020, Journal of Surgical Orthopaedic Advances.

[19]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[20]  B. L. de Groot,et al.  CHARMM36m: an improved force field for folded and intrinsically disordered proteins , 2016, Nature Methods.

[21]  Debswapna Bhattacharya,et al.  SPECS: Integration of side-chain orientation and global distance-based measures for improved evaluation of protein structural models , 2020, PloS one.

[22]  Alberto Perez,et al.  Determining protein structures by combining semireliable data with atomistic physical models by Bayesian inference , 2015, Proceedings of the National Academy of Sciences.

[23]  A. Valencia,et al.  Emerging methods in protein co-evolution , 2013, Nature Reviews Genetics.

[24]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[25]  Jianyi Yang,et al.  Improved protein structure prediction using predicted interresidue orientations , 2020, Proceedings of the National Academy of Sciences.

[26]  D. Eisenberg,et al.  A challenging case for protein crystal structure determination: the mating pheromone Er-1 from Euplotes raikovi. , 1996, Acta crystallographica. Section D, Biological crystallography.

[27]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.