Quality assessment of NMR structures: a statistical survey.

A statistical analysis is reported of experimental data and coordinates of a set of 97 NMR structures deposited in the PDB. The aim is to assess the quality of these structures in relation to the amount of experimental information. Experimental restraints were analysed using the program AQUA. Many nomenclature inconsistencies between deposited restraint and coordinate files were observed. The experimental restraint files were found to contain a high proportion of redundant restraints. Procedures for analysing and correcting the inconsistencies and restraint counts are described. The analysis of NOE restraint violations (using AQUA) and of a wide variety of geometrical quality indicators (using PROCHECK-NMR and WHAT IF) provides a reference for other NMR structure determinations. The extent of NOE violations is anti-correlated with the quality of the Ramachandran map. The precision as measured by the circular variance of backbone dihedral angles, does increase with the amount of experimental data, as expected, but is sometimes overestimated. Bond lengths, bond angles and planarity of groups can deviate considerably from ideal values. Outliers appear to cluster per laboratory, indicating that the results depend on particulars of refinement protocols and/or software. We have identified a problem of atom overlap in a number of refined structures.We recommend adhering to the standard nomenclature as put forward by an IUPAC Task Group, to ensure consistency between restraints and coordinates, and to omit redundant restraints from the deposition. The results obtained from this analysis and the AQUA program are available through the World Wide Web.

[1]  A. Brünger,et al.  Torsion angle dynamics: Reduced variable conformational sampling enhances crystallographic structure refinement , 1994, Proteins.

[2]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[3]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[4]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[5]  J. Thornton,et al.  Conformational analysis of protein structures derived from NMR data , 1993, Proteins.

[6]  J. Thornton,et al.  Stereochemical quality of protein structure coordinates , 1992, Proteins.

[7]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculations , 1988, FEBS letters.

[8]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[9]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[10]  R. Kaptein,et al.  Solution structure of the HU protein from Bacillus stearothermophilus. , 1995, Journal of molecular biology.

[11]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[12]  Janet M. Thornton,et al.  Knowledge-based validation of protein structure coordinates derived by X-ray crystallography and NMR spectroscopy , 1994 .

[13]  Three-dimensional structure of porcine C5adesArg from 1H nuclear magnetic resonance data. , 1990 .

[14]  O. Jardetzky,et al.  An assessment of the precision and accuracy of protein structures determined by NMR. Dependence on distance errors. , 1994, Journal of molecular biology.

[15]  Chris Sander,et al.  Objectively judging the quality of a protein structure from a Ramachandran plot , 1997, Comput. Appl. Biosci..

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  S. Wodak,et al.  Deviations from standard atomic volumes as a quality measure for protein crystal structures. , 1996, Journal of molecular biology.

[18]  I. C. O. B. Nomenclature IUPAC-IUB Commission on Biochemical Nomenclature. Abbreviations and symbols for the description of the conformation of polypeptide chains. Tentative rules (1969). , 1970, Biochemistry.

[19]  Fritz Eckstein,et al.  Nucleic acids and molecular biology , 1987 .

[20]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[21]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[22]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[23]  Timothy F. Havel An evaluation of computational strategies for use in the determination of protein structure from distance constraints obtained by nuclear magnetic resonance. , 1991, Progress in biophysics and molecular biology.

[24]  G J Kleywegt,et al.  Phi/psi-chology: Ramachandran revisited. , 1996, Structure.

[25]  G. N. Ramachandran,et al.  Stereochemistry of polypeptide chain configurations. , 1963, Journal of molecular biology.

[26]  R. Kaptein,et al.  Solution structure of porcine pancreatic procolipase as determined from 1H homonuclear two-dimensional and three-dimensional NMR. , 1994, European journal of biochemistry.

[27]  J. Thornton,et al.  AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR , 1996, Journal of biomolecular NMR.

[28]  T F Havel,et al.  The solution structure of eglin c based on measurements of many NOEs and coupling constants and its comparison with X‐ray structures , 1992, Protein science : a publication of the Protein Society.

[29]  Chris Sander,et al.  Who checks the checkers? Four validation tools applied to eight atomic resolution structures. EU 3-D Validation Network. , 1998, Journal of molecular biology.

[30]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[31]  G M Clore,et al.  Exploring the limits of precision and accuracy of protein structures determined by nuclear magnetic resonance spectroscopy. , 1993, Journal of molecular biology.

[32]  J. Rullmann,et al.  Solution structure of the immunodominant region of protein G of bovine respiratory syncytial virus. , 1996, Biochemistry.

[33]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[34]  C. Sander,et al.  Quality control of protein models : directional atomic contact analysis , 1993 .

[35]  K. Constantine,et al.  Refined solution structure and ligand-binding properties of PDC-109 domain b. A collagen-binding type II domain. , 1992, Journal of molecular biology.