Resolution-by-proxy: a simple measure for assessing and comparing the overall quality of NMR protein structures

In protein X-ray crystallography, resolution is widely used as an indicator of structural quality. The diffraction resolution of a protein crystal correlates well with the number of X-ray observables used in structure generation and, therefore, with protein coordinate errors. In protein NMR, there is no parameter equivalent to X-ray resolution. Instead, “resolution” is often used as a synonym for NMR model quality, and it is typically inferred from ensemble precision, torsion angle normality and the number of distance restraints per residue. The lack of a common technique for assessing the resolution of X-ray and NMR structures complicates the comparison of structures solved by the two methods. This problem is sometimes approached by calculating an “equivalent resolution” from structure quality metrics. However, existing protocols do not offer a comprehensive assessment of protein structure because they calculate equivalent resolution from a relatively small number (<5) of protein parameters. Here, we report the development of a protocol that calculates equivalent resolution from 25 measurable protein features. This new method performs better (correlation coefficient of 0.92, mean absolute error of 0.28 Å) than existing predictors of equivalent resolution. Because the method uses coordinate data as a proxy for X-ray diffraction data, we call this measure “Resolution-by-Proxy” or ResProx. We demonstrate that ResProx can be used to identify under-restrained, poorly refined or inaccurate NMR structures, and that it can discover structural defects that other equivalent resolution methods cannot detect. The ResProx web server is available at http://www.resprox.ca.
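The two performance figures quoted above (a correlation coefficient and a mean absolute error in Å) compare predicted equivalent resolution with experimentally observed X-ray resolution. The sketch below shows how such figures are commonly computed, assuming the correlation coefficient is Pearson's r. The function names and the resolution values are hypothetical illustrations, not ResProx code or ResProx data.

```python
from math import sqrt


def pearson_r(observed, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_o * var_p)


def mean_absolute_error(observed, predicted):
    """Mean absolute difference, in the same units as the inputs (here, Å)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("need two non-empty sequences of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


# Hypothetical values in Å, made up purely for illustration.
observed_resolution = [1.2, 1.8, 2.0, 2.5, 3.0]
predicted_resolution = [1.4, 1.7, 2.3, 2.4, 3.2]

r = pearson_r(observed_resolution, predicted_resolution)
mae = mean_absolute_error(observed_resolution, predicted_resolution)
print(f"r = {r:.2f}, MAE = {mae:.2f} Å")
```

A high correlation alone does not guarantee small errors, since a predictor can be well correlated yet systematically offset. The mean absolute error reported alongside it adds that information.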
