Systematic Comparison of Crystal and NMR Protein Structures Deposited in the Protein Data Bank

Nearly all the macromolecular three-dimensional structures deposited in Protein Data Bank were determined by either crystallographic (X-ray) or Nuclear Magnetic Resonance (NMR) spectroscopic methods. This paper reports a systematic comparison of the crystallographic and NMR results deposited in the files of the Protein Data Bank, in order to find out to which extent these information can be aggregated in bioinformatics. A non-redundant data set containing 109 NMR – X-ray structure pairs of nearly identical proteins was derived from the Protein Data Bank. A series of comparisons were performed by focusing the attention towards both global features and local details. It was observed that: (1) the RMDS values between NMR and crystal structures range from about 1.5 Å to about 2.5 Å; (2) the correlation between conformational deviations and residue type reveals that hydrophobic amino acids are more similar in crystal and NMR structures than hydrophilic amino acids; (3) the correlation between solvent accessibility of the residues and their conformational variability in solid state and in solution is relatively modest (correlation coefficient = 0.462); (4) beta strands on average match better between NMR and crystal structures than helices and loops; (5) conformational differences between loops are independent of crystal packing interactions in the solid state; (6) very seldom, side chains buried in the protein interior are observed to adopt different orientations in the solid state and in solution.

[1]  M. Philippopoulos,et al.  Exploring the dynamic information content of a protein NMR structure: Comparison of a molecular dynamics simulation with the NMR and X‐ray structures of Escherichia coli ribonuclease HI , 1999, Proteins.

[2]  Axel T. Brunger,et al.  X-ray crystallography and NMR reveal complementary views of structure and dynamics. , 1997 .

[3]  Iris Antes,et al.  Molecular dynamics simulations of photoactive yellow protein (PYP) in three states of its photocycle: a comparison with X-ray and NMR data and analysis of the effects of Glu46 deprotonation and mutation , 2002, European Biophysics Journal.

[4]  T. Gibson,et al.  Protein disorder prediction: implications for structural proteomics. , 2003, Structure.

[5]  J. Kelly,et al.  NMR solution structure of the isolated Apo Pin1 WW domain: comparison to the x-ray crystal structures of Pin1. , 2002, Biopolymers.

[6]  A. Li,et al.  Investigation of the solution structure of chymotrypsin inhibitor 2 using molecular dynamics: comparison to x-ray crystallographic and NMR data. , 1995, Protein engineering.

[7]  D. Marion,et al.  Comparison of solution and crystal structures of maize nonspecific lipid transfer protein: A model for a potential in vivo lipid carrier protein , 1998, Proteins.

[8]  M. DePristo,et al.  Heterogeneity and inaccuracy in protein structures solved by X-ray crystallography. , 2004, Structure.

[9]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[10]  V. Rybin,et al.  Crystal structure of human filamin C domain 23 and small angle scattering model for filamin C 23-24 dimer. , 2007, Journal of molecular biology.

[11]  J. Ponder,et al.  Binding of retinol induces changes in rat cellular retinol-binding protein II conformation and backbone dynamics. , 2000, Journal of molecular biology.

[12]  R. Kaptein,et al.  Joint refinement as a tool for thorough comparison between NMR and X-ray data and structures of HU protein , 2001, Journal of biomolecular NMR.

[13]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[14]  T F Havel,et al.  The solution structure of eglin c based on measurements of many NOEs and coupling constants and its comparison with X‐ray structures , 1992, Protein science : a publication of the Protein Society.

[15]  C. W. Hilbers,et al.  Refined solution structure of the Tyr41-->His mutant of the M13 gene V protein. A comparison with the crystal structure. , 1995, European journal of biochemistry.

[16]  M. DePristo,et al.  Relation between native ensembles and experimental structures of proteins. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[17]  J. Doye,et al.  Inhibition of protein crystallization by evolutionary negative design , 2004, Physical biology.

[18]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[19]  R. Huber,et al.  Comparison of the NMR solution structure with the X-ray crystal structure of the activation domain from procarboxypeptidase B , 1992, Journal of biomolecular NMR.

[20]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[21]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[22]  F. Abildgaard,et al.  NMR of hydrogen bonding in cold‐shock protein A and an analysis of the influence of crystallographic resolution on comparisons of hydrogen bond lengths , 2001, Protein science : a publication of the Protein Society.

[23]  K. Wüthrich,et al.  Comparison of the NMR solution structure and the x-ray crystal structure of rat metallothionein-2. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[24]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[25]  A. Ortiz,et al.  1H and 15N NMR assignment and solution structure of the SH3 domain of spectrin: Comparison of unrefined and refined structure sets with the crystal structure , 1997, Journal of biomolecular NMR.

[26]  W. Sundquist,et al.  Comparison of the NMR and X‐ray structures of the HIV‐1 matrix protein: Evidence for conformational changes during viral assembly , 1996, Protein science : a publication of the Protein Society.

[27]  Gregory A Petsko,et al.  Why protein R‐factors are so large: A self‐consistent analysis , 2002, Proteins.

[28]  R. Huber,et al.  Conformational variability of chicken cystatin. Comparison of structures determined by X-ray diffraction and NMR spectroscopy. , 1993, Journal of molecular biology.

[29]  Oliviero Carugo,et al.  Protein—protein crystal‐packing contacts , 1997, Protein science : a publication of the Protein Society.

[30]  G. Moore,et al.  Determination of the structure of oxidised Desulfovibrio africanus ferredoxin I by 1H NMR spectroscopy and comparison of its solution structure with its crystal structure. , 1998, Journal of molecular biology.

[31]  O. Carugo,et al.  Unique Features of the sodC-encoded Superoxide Dismutase from Mycobacterium tuberculosis, a Fully Functional Copper-containing Enzyme Lacking Zinc in the Active Site* , 2004, Journal of Biological Chemistry.

[32]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[33]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[34]  On the interrelationship between atomic displacement parameters (ADPs) and coordinates in protein structures. , 2007, Acta crystallographica. Section D, Biological crystallography.

[35]  S. Pongor,et al.  A normalized root‐mean‐spuare distance for comparing protein three‐dimensional structures , 2001, Protein science : a publication of the Protein Society.

[36]  D. Lipman,et al.  Rapid and sensitive protein similarity searches. , 1985, Science.

[37]  T. Ueda,et al.  Fluctuations in free or substrate-complexed lysozyme and a mutant of it detected on x-ray crystallography and comparison with those detected on NMR. , 2002, Journal of biochemistry.

[38]  Ernest Fraenkel,et al.  Comparison of X-ray and NMR structures for the Antennapedia homeodomain–DNA complex , 1998, Nature Structural &Molecular Biology.

[39]  Michael Nilges,et al.  Refined Solution Structure of the Tyr41-]His Mutant of the M13 Gene-V Protein - a Comparison with the Crystal-Structure , 1995 .

[40]  Identification of new, well-populated amino-acid sidechain rotamers involving hydroxyl-hydrogen atoms and sulfhydryl-hydrogen atoms , 2008, BMC Structural Biology.

[41]  András Fiser,et al.  Effects of amino acid composition, finite size of proteins, and sparse statistics on distance‐dependent statistical pair potentials , 2007, Proteins.

[42]  Adam Godzik,et al.  Clustering of highly homologous sequences to reduce the size of large protein databases , 2001, Bioinform..

[43]  A. D. McLachlan,et al.  Rapid comparison of protein structures , 1982 .

[44]  M. Y. Lobanov,et al.  Comparison of X‐ray and NMR structures: Is there a systematic difference in residue contacts between X‐ray‐ and NMR‐resolved protein structures? , 2005, Proteins.

[45]  Oliviero Carugo,et al.  How root-mean-square distance (r.m.s.d.) values depend on the resolution of protein structures that are compared , 2003 .

[46]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[47]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[48]  I. Bahar,et al.  Structure‐based analysis of protein dynamics: Comparison of theoretical results for hen lysozyme with X‐ray diffraction and NMR relaxation data , 1999, Proteins.

[49]  G L Gilliland,et al.  Combining experimental information from crystal and solution studies: joint X-ray and NMR refinement. , 1992, Science.

[50]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[51]  M. Stern,et al.  Backbone dynamics and solution structure refinement of the 15N-labeled human oncogenic protein p13MTCP1: Comparison with X-ray data , 2000, Journal of biomolecular NMR.

[52]  R Nussinov,et al.  A set of van der Waals and coulombic radii of protein atoms for molecular and solvent‐accessible surface calculation, packing evaluation, and docking , 1998, Proteins.

[53]  P. Ortiz de Montellano,et al.  Solution 1H NMR of the Active Site of Substrate-bound, Cyanide-inhibited Human Heme Oxygenase , 2001, The Journal of Biological Chemistry.

[54]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[55]  Manfred J. Sippl,et al.  Thirty years of environmental health research--and growing. , 1996, Nucleic Acids Res..

[56]  M Wilmanns,et al.  Molecular replacement with NMR models using distance-derived pseudo B factors. , 1996, Acta crystallographica. Section D, Biological crystallography.

[57]  L. Banaszak,et al.  Crystal structures of holo and apo-cellular retinol-binding protein II. , 1993, Journal of molecular biology.

[58]  George N Phillips,et al.  Ensemble refinement of protein crystal structures: validation and application. , 2007, Structure.

[59]  I. Polikarpov,et al.  The X-ray structure of a recombinant major urinary protein at 1.75 A resolution. A comparative study of X-ray and NMR-derived structures. , 2001, Acta crystallographica. Section D, Biological crystallography.