RECOORD: A recalculated coordinate database of 500+ proteins from the PDB using restraints from the BioMagResBank

State‐of‐the‐art methods based on CNS and CYANA were used to recalculate the nuclear magnetic resonance (NMR) solution structures of 500+ proteins for which coordinates and NMR restraints are available from the Protein Data Bank. Curated restraints were obtained from the BioMagResBank FRED database. Although the original NMR structures were determined by various methods, they all were recalculated by CNS and CYANA and refined subsequently by restrained molecular dynamics (CNS) in a hydrated environment. We present an extensive analysis of the results, in terms of various quality indicators generated by PROCHECK and WHAT_CHECK. On average, the quality indicators for packing and Ramachandran appearance moved one standard deviation closer to the mean of the reference database. The structural quality of the recalculated structures is discussed in relation to various parameters, including number of restraints per residue, NOE completeness and positional root mean square deviation (RMSD). Correlations between pairs of these quality indicators were generally low; for example, there is a weak correlation between the number of restraints per residue and the Ramachandran appearance according to WHAT_CHECK (r = 0.31). The set of recalculated coordinates constitutes a unified database of protein structures in which potential user‐ and software‐dependent biases have been kept as small as possible. The database can be used by the structural biology community for further development of calculation protocols, validation tools, structure‐based statistical approaches and modeling. The RECOORD database of recalculated structures is publicly available from http://www.ebi.ac.uk/msd/recoord. Proteins 2005. © 2005 Wiley‐Liss, Inc.

[1]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[2]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[3]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[4]  Alexandre M J J Bonvin,et al.  BioMagResBank databases DOCR and FRED containing converted and filtered sets of experimental NMR restraints and coordinates from over 500 protein PDB structures , 2005, Journal of biomolecular NMR.

[5]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[6]  Roman A Laskowski,et al.  Structural quality assurance. , 2003, Methods of biochemical analysis.

[7]  Alexandre M J J Bonvin,et al.  DRESS: a database of REfined solution NMR structures , 2004, Proteins.

[8]  Peter Güntert,et al.  Automated NMR protein structure calculation , 2003 .

[9]  Gert Vriend,et al.  The precision of NMR structure ensembles revisited , 2003, Journal of biomolecular NMR.

[10]  Gert Vriend,et al.  Concepts and Tools for NMR Restraint Analysis and Validation , 2004 .

[11]  J. Rullmann,et al.  Quality assessment of NMR structures: a statistical survey. , 1998, Journal of molecular biology.

[12]  Francine Berman,et al.  Overview of the Book: Grid Computing – Making the Global Infrastructure a Reality , 2003 .

[13]  P E Wright,et al.  Recommendations for the presentation of NMR structures of proteins and nucleic acids. IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. , 1998, Journal of biomolecular NMR.

[14]  Michael Nilges,et al.  A simple method for delineating well‐defined and variable regions in protein structures determined from interproton distance data , 1987 .

[15]  T. N. Bhat,et al.  The CCPN project: an interim report on a data model for the NMR community , 2002, Nature Structural Biology.

[16]  Jun Zhu,et al.  BioMagResBank database with sets of experimental NMR constraints corresponding to the structures of over 1400 biomolecules deposited in the Protein Data Bank , 2003, Journal of biomolecular NMR.

[17]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[18]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[19]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  C. W. Hilbers,et al.  Improving the quality of protein structures derived by NMR spectroscopy** , 2002, Journal of biomolecular NMR.

[22]  Gert Vriend,et al.  Quantitative evaluation of experimental NMR restraints. , 2003, Journal of the American Chemical Society.

[23]  Miron Livny,et al.  Condor and the Grid , 2003 .

[24]  R Kaptein,et al.  Completeness of NOEs in protein structure: a statistical analysis of NMR. , 1999, Journal of biomolecular NMR.

[25]  A. Brünger,et al.  Torsion angle dynamics: Reduced variable conformational sampling enhances crystallographic structure refinement , 1994, Proteins.

[26]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[27]  M Nilges,et al.  A calculation strategy for the structure determination of symmetric demers by 1H NMR , 1993, Proteins.

[28]  Ton Rullmann,et al.  Completeness of NOEs in protein structures: A statistical analysis of NMR data , 1999 .

[29]  K. Wüthrich,et al.  Recommendations for the presentation of NMR structures of proteins and nucleic acids – IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy , 1998, European journal of biochemistry.

[30]  Timothy F. Havel An evaluation of computational strategies for use in the determination of protein structure from distance constraints obtained by nuclear magnetic resonance. , 1991, Progress in biophysics and molecular biology.

[31]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[32]  W F van Gunsteren,et al.  A protein structure from nuclear magnetic resonance data. lac repressor headpiece. , 1985, Journal of molecular biology.

[33]  Francine Berman,et al.  Grid Computing: Making the Global Infrastructure a Reality , 2003 .

[34]  P. Güntert Structure calculation of biological macromolecules from NMR data , 1998, Quarterly Reviews of Biophysics.

[35]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[36]  Michael Nilges,et al.  ARIA: automated NOE assignment and NMR structure calculation , 2003, Bioinform..