Assessing precision and accuracy of protein structures derived from NMR data

One of the most challenging problems facing the field of biomolecular NMR spectroscopy is the lack of generally accepted, uniform criteria for defining the precision and accuracy of structures derived from NMR data. Although widely cited, metrics such as residual constraint violations, constraints per residue, and convergence across an ensemble of conformers [root-mean-square deviations (RMSDs)] provide necessary, but not sufficient, criteria for defining a good-quality solution NMR structure. Most importantly, none of these measures have standard conventions by which they are computed; each measure is assessed somewhat differently in different laboratories, using subjective criteria which may not be possible to reproduce in another laboratory. Owing to the lack of widely accepted algorithms and conventions for making these assessments, the same structure analyzed by different groups will often yield different structure quality assessment statistics. Two articles in this issue of Proteins demonstrate the pressing need for establishing community-wide standards for defining the accuracy and precision of NMR-derived protein structures. In one article, Snyder and Montelione present a method for determining internally well-defined “core atom set(s)” based on distance variance matrices, and for calculating a “joint RMSD” quantifying the precision of an ensemble of NMR-derived structures. In the other article, Nederveen et al. recalculate over 500 NMR-derived structures from archived constraint files, and demonstrate the value of standardized methods of structure calculation and assessment. They show that not only can improved methods of structure calculation provide more accurate results, but also that the original published structures often are reported with an inappropriately high estimation of precision. Both of these articles address key aspects of the challenge for assessing the precision and accuracy of protein structures derived from NMR data. This editorial addresses the general issues of estimating precision and accuracy of protein NMR structures, and presents recent progress in the field, with the aim of stimulating discussion, and eventually resolution, of these challenges by the scientific community.

[1]  Alexander Grishaev,et al.  CLOUDS, a protocol for deriving a molecular proton density via NMR , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Estimation of proton configurations from NOESY spectra , 2004 .

[3]  W. V. van Gunsteren,et al.  Time-averaged nuclear Overhauser effect distance restraints applied to tendamistat. , 1990, Journal of molecular biology.

[4]  T F Havel,et al.  The solution structure of eglin c based on measurements of many NOEs and coupling constants and its comparison with X‐ray structures , 1992, Protein science : a publication of the Protein Society.

[5]  D. Richardson,et al.  Exploring steric constraints on protein mutations using MAGE/PROBE , 2000, Protein science : a publication of the Protein Society.

[6]  M. Y. Lobanov,et al.  Comparison of X‐ray and NMR structures: Is there a systematic difference in residue contacts between X‐ray‐ and NMR‐resolved protein structures? , 2005, Proteins.

[7]  I. Gelfand,et al.  Geometric invariant core for the V(L) and V(H) domains of immunoglobulin molecules. , 1998, Protein engineering.

[8]  A. Brünger,et al.  Conformational variability of solution nuclear magnetic resonance structures. , 1995, Journal of molecular biology.

[9]  J. Prestegard,et al.  Residual dipolar couplings in structure determination of biomolecules. , 2004, Chemical reviews.

[10]  Mark Gerstein,et al.  Using a measure of structural variation to define a core for the globins , 1995, Comput. Appl. Biosci..

[11]  J. Taylor An Introduction to Error Analysis , 1982 .

[12]  H Frauenfelder,et al.  Variations on a theme by Debye and Waller: From simple crystals to proteins , 1997, Proteins.

[13]  W. Gronwald,et al.  RFAC, a program for automated NMR R-factor estimation , 2000, Journal of biomolecular NMR.

[14]  A M Gronenborn,et al.  Structures of larger proteins in solution: three- and four-dimensional heteronuclear NMR spectroscopy. , 1991, Science.

[15]  M. Zalis,et al.  Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. , 1999, Journal of molecular biology.

[16]  Robert Powers,et al.  Protein NMR recall, precision, and F-measure scores (RPF scores): structure quality assessment measures based on information retrieval statistics. , 2005, Journal of the American Chemical Society.

[17]  G. Montelione,et al.  Simulated annealing with restrained molecular dynamics using CONGEN: Energy refinement of the NMR solution structures of epidermal and type‐α transforming growth factors , 1996, Protein science : a publication of the Protein Society.

[18]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[19]  Ad Bax,et al.  An empirical backbone-backbone hydrogen-bonding potential in proteins and its applications to NMR structure refinement and validation. , 2004, Journal of the American Chemical Society.

[20]  Gert Vriend,et al.  Quantitative evaluation of experimental NMR restraints. , 2003, Journal of the American Chemical Society.

[21]  A. Bax Weak alignment offers new NMR opportunities to study protein structure and dynamics , 2003, Protein science : a publication of the Protein Society.

[22]  J. Thornton,et al.  AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR , 1996, Journal of biomolecular NMR.

[23]  A. Gronenborn,et al.  Structures of larger proteins, protein‐ligand and protein‐DNA complexes by multidimensional heteronuclear NMR , 1994, Progress in biophysics and molecular biology.

[24]  L. Kelley,et al.  An automated approach for defining core atoms and domains in an ensemble of NMR-derived protein structures. , 1997, Protein engineering.

[25]  Gert Vriend,et al.  The precision of NMR structure ensembles revisited , 2003, Journal of biomolecular NMR.

[26]  Alexander Grishaev,et al.  Protein structure elucidation from NMR proton densities , 2002, Proceedings of the National Academy of Sciences of the United States of America.