A community resource of experimental data for NMR / X‐ray crystal structure pairs

We have developed an online NMR / X‐ray Structure Pair Data Repository. The NIGMS Protein Structure Initiative (PSI) has provided many valuable reagents, 3D structures, and technologies for structural biology. The Northeast Structural Genomics Consortium was one of several PSI centers. NESG used both X‐ray crystallography and NMR spectroscopy for protein structure determination. A key goal of the PSI was to provide experimental structures for at least one representative of each of hundreds of targeted protein domain families. In some cases, structures for identical (or nearly identical) constructs were determined by both NMR and X‐ray crystallography. NMR spectroscopy and X‐ray diffraction data for 41 of these “NMR / X‐ray” structure pairs determined using conventional triple‐resonance NMR methods with extensive sidechain resonance assignments have been organized in an online NMR / X‐ray Structure Pair Data Repository. In addition, several NMR data sets for perdeuterated, methyl‐protonated protein samples are included in this repository. As an example of the utility of this repository, these data were used to revisit questions about the precision and accuracy of protein NMR structures first outlined by Levy and coworkers several years ago (Andrec et al., Proteins 2007;69:449–465). These results demonstrate that the agreement between NMR and X‐ray crystal structures is improved using modern methods of protein NMR spectroscopy. The NMR / X‐ray Structure Pair Data Repository will provide a valuable resource for new computational NMR methods development.

[1]  H N Moseley,et al.  Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. , 2001, Methods in enzymology.

[2]  K. Hwang,et al.  Structural and kinetic analysis of an MsrA–MsrB fusion protein from Streptococcus pneumoniae , 2009, Molecular microbiology.

[3]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[4]  R. Brüschweiler,et al.  Quantitative molecular ensemble interpretation of NMR dipolar couplings without restraints. , 2007, Journal of the American Chemical Society.

[5]  Jing Chen,et al.  Structure-Guided Functional Characterization of Enediyne Self-Sacrifice Resistance Proteins, CalU16 and CalU19 , 2014, ACS chemical biology.

[6]  Christopher Clapham,et al.  The Concise Oxford Dictionary of Mathematics , 1990 .

[7]  D. Blow,et al.  An Automated System for Micro-Batch Protein Crystallization and Screening , 1990 .

[8]  Arash Bahrami,et al.  Probabilistic Interaction Network of Evidence Algorithm and its Application to Complete Labeling of Peak Lists from Protein NMR Spectroscopy , 2009, PLoS Comput. Biol..

[9]  G. Montelione,et al.  High-level production of uniformly 15N-and 13C-enriched fusion proteins in Escherichia coli , 1996 .

[10]  Mark Gerstein,et al.  Robotic cloning and Protein Production Platform of the Northeast Structural Genomics Consortium. , 2005, Methods in enzymology.

[11]  Jinfeng Liu,et al.  Novel leverage of structural genomics , 2007, Nature Biotechnology.

[12]  Randy J. Read,et al.  Phaser crystallographic software , 2007, Journal of applied crystallography.

[13]  R. Brüschweiler,et al.  Validation of Molecular Dynamics Simulations of Biomolecules Using NMR Spin Relaxation as Benchmarks:  Application to the AMBER99SB Force Field. , 2007, Journal of chemical theory and computation.

[14]  Oliver F. Lange,et al.  NMR Structure Determination for Larger Proteins Using Backbone-Only Data , 2010, Science.

[15]  Gaohua Liu,et al.  Preparation of protein samples for NMR structure, function, and small-molecule screening studies. , 2011, Methods in enzymology.

[16]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[17]  I. Kurnaz Protein Production and Purification , 2015 .

[18]  Robert Powers,et al.  A topology‐constrained distance network algorithm for protein structure determination from NOESY data , 2005, Proteins.

[19]  Peter Güntert,et al.  Objective identification of residue ranges for the superposition of protein structures , 2011, BMC Bioinformatics.

[20]  Randy J Read,et al.  Electronic Reprint Biological Crystallography Phenix: Building New Software for Automated Crystallographic Structure Determination Biological Crystallography Phenix: Building New Software for Automated Crystallographic Structure Determination , 2022 .

[21]  G. Montelione,et al.  Recommendations of the wwPDB NMR Validation Task Force. , 2013, Structure.

[22]  Charles D Schwieters,et al.  The Xplor-NIH NMR molecular structure determination package. , 2003, Journal of magnetic resonance.

[23]  Gaetano T Montelione,et al.  The high-throughput protein sample production platform of the Northeast Structural Genomics Consortium. , 2010, Journal of structural biology.

[24]  S. Cusack,et al.  Crystallization and preliminary X‐ray analysis of the 9 kDa protein of the mouse signal recognition particle and the selenomethionyl‐SRP9 , 1996, FEBS letters.

[25]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[26]  Peter Güntert,et al.  Increased reliability of nuclear magnetic resonance protein structures by consensus structure bundles. , 2015, Structure.

[27]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[28]  David Baker,et al.  Accurate protein structure modeling using sparse NMR data and homologous structure information , 2012, Proceedings of the National Academy of Sciences.

[29]  W. Hendrickson Determination of macromolecular structures from anomalous diffraction of synchrotron radiation. , 1991, Science.

[30]  Miron Livny,et al.  BioMagResBank , 2007, Nucleic Acids Res..

[31]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[32]  Sebastian Hiller,et al.  References and Notes Supporting Online Material Materials and Methods Figures S1 to S5 Table S1 References Solution Structure of the Integral Human Membrane Protein Vdac-1 in Detergent Micelles , 2022 .

[33]  Kate A. Stafford,et al.  Interpreting protein structural dynamics from NMR chemical shifts. , 2012, Journal of the American Chemical Society.

[34]  G. Montelione,et al.  Contributions to the NIH-NIGMS Protein Structure Initiative from the PSI Production Centers. , 2008, Structure.

[35]  Gaohua Liu,et al.  NMR data collection and analysis protocol for high-throughput protein structure determination. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[36]  G. Montelione,et al.  Three‐dimensional structure of the weakly associated protein homodimer SeR13 using RDCs and paramagnetic surface mapping , 2010, Protein science : a publication of the Protein Society.

[37]  Joseph R Luft,et al.  A deliberate approach to screening for initial crystallization conditions of biological macromolecules. , 2003, Journal of structural biology.

[38]  George M Sheldrick,et al.  Substructure solution with SHELXD. , 2002, Acta crystallographica. Section D, Biological crystallography.

[39]  Antonio Rosato,et al.  RPF: a quality assessment tool for protein NMR structures , 2012, Nucleic Acids Res..

[40]  David Baker,et al.  Protein NMR Structures Refined with Rosetta Have Higher Accuracy Relative to Corresponding X-ray Crystal Structures , 2014, Journal of the American Chemical Society.

[41]  Gaetano T Montelione,et al.  Clustering algorithms for identifying core atom sets and for assessing the precision of protein structure ensembles , 2005, Proteins.

[42]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[43]  Thomas C Terwilliger,et al.  SOLVE and RESOLVE: automated structure solution and density modification. , 2003, Methods in enzymology.

[44]  J. Prestegard,et al.  Residual dipolar couplings in structure determination of biomolecules. , 2004, Chemical reviews.

[45]  F. Studier,et al.  Use of bacteriophage T7 RNA polymerase to direct selective high-level expression of cloned genes. , 1986, Journal of molecular biology.

[46]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[47]  G. Montelione,et al.  A banner year for membranes , 1999, Nature Structural Biology.

[48]  Stereospecific nuclear magnetic resonance assignments of the methyl groups of valine and leucine in the DNA-binding domain of the 434 repressor by biosynthetically directed fractional 13C labeling. , 1995, Biochemistry.

[49]  L. Kay,et al.  Global folds of proteins with low densities of NOEs using residual dipolar couplings: application to the 370-residue maltodextrin-binding protein. , 2000, Journal of molecular biology.

[50]  A. Palmer Enzyme Dynamics from NMR Spectroscopy , 2015, Accounts of chemical research.

[51]  T F Havel,et al.  The solution structure of eglin c based on measurements of many NOEs and coupling constants and its comparison with X‐ray structures , 1992, Protein science : a publication of the Protein Society.

[52]  R. Brüschweiler,et al.  Certification of Molecular Dynamics Trajectories with NMR Chemical Shifts , 2010 .

[53]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[54]  Oliver F. Lange,et al.  Determination of solution structures of proteins up to 40 kDa using CS-Rosetta with sparse NMR data from deuterated samples , 2012, Proceedings of the National Academy of Sciences.

[55]  G. Montelione,et al.  Assignment validation software suite for the evaluation and presentation of protein resonance assignment data , 2004, Journal of biomolecular NMR.

[56]  F A Quiocho,et al.  Refined 1.8-A structure reveals the mode of binding of beta-cyclodextrin to the maltodextrin binding protein. , 1993, Biochemistry.

[57]  James M Aramini,et al.  PDBStat: a universal restraint converter and restraint analysis software package for protein NMR , 2013, Journal of biomolecular NMR.

[58]  Gaetano T. Montelione,et al.  The Protein Structure Initiative: achievements and visions for the future , 2012, F1000 biology reports.

[59]  Kevin Cowtan,et al.  research papers Acta Crystallographica Section D Biological , 2005 .

[60]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[61]  Gaetano T Montelione,et al.  The expanded FindCore method for identification of a core atom set for assessment of protein structure prediction , 2014, Proteins.

[62]  Gaetano T. Montelione,et al.  A microscale protein NMR sample screening pipeline , 2009, Journal of biomolecular NMR.

[63]  David A. Lee,et al.  PSI-2: structural genomics to cover protein domain family space. , 2009, Structure.

[64]  Oliver F. Lange,et al.  Determination of the Structures of Symmetric Protein Oligomers from NMR Chemical Shifts and Residual Dipolar Couplings , 2011, Journal of the American Chemical Society.

[65]  Wing-Yiu Choy,et al.  Solution NMR-derived global fold of a monomeric 82-kDa enzyme. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[66]  Michael Nilges,et al.  ISD: a software package for Bayesian NMR structure calculation , 2008, Bioinform..

[67]  Michael Andrec,et al.  A large data set comparison of protein structures determined by crystallography and NMR: Statistical test for structural differences and the effect of crystal packing , 2007, Proteins.

[68]  L. Kay,et al.  Global folds of highly deuterated, methyl-protonated proteins by multidimensional NMR. , 1997, Biochemistry.

[69]  Michael Nilges,et al.  Materials and Methods Som Text Figs. S1 to S6 References Movies S1 to S5 Inferential Structure Determination , 2022 .

[70]  T Pawson,et al.  Selective methyl group protonation of perdeuterated proteins. , 1996, Journal of molecular biology.

[71]  Gaetano T Montelione,et al.  Assessing precision and accuracy of protein structures derived from NMR data , 2005, Proteins.

[72]  G. Montelione,et al.  Solution NMR structure of yeast succinate dehydrogenase flavinylation factor Sdh5 reveals a putative Sdh1 binding site. , 2012, Biochemistry.

[73]  Charles A Laughton,et al.  COCO: A simple tool to enrich the representation of conformational variability in NMR structures , 2009, Proteins.

[74]  Gaetano T Montelione,et al.  Automated protein fold determination using a minimal NMR constraint strategy , 2003, Protein science : a publication of the Protein Society.

[75]  Binchen Mao,et al.  Improved technologies now routinely provide protein NMR structures useful for molecular replacement. , 2011, Structure.

[76]  Gaetano T Montelione,et al.  Automated analysis of protein NMR assignments and structures. , 2004, Chemical reviews.

[77]  Z. Otwinowski,et al.  Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[78]  A. Bax,et al.  TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts , 2009, Journal of biomolecular NMR.

[79]  Henry van den Bedem,et al.  Integrated description of protein dynamics from room-temperature X-ray crystallography and NMR , 2014, Proceedings of the National Academy of Sciences.

[80]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[81]  Robert Powers,et al.  Protein NMR recall, precision, and F-measure scores (RPF scores): structure quality assessment measures based on information retrieval statistics. , 2005, Journal of the American Chemical Society.

[82]  Gaetano T Montelione,et al.  Evaluating protein structures determined by structural genomics consortia , 2006, Proteins.

[83]  K. Gunsalus,et al.  Protein production and purification , 2008, Nature Methods.

[84]  Gert Vriend,et al.  The precision of NMR structure ensembles revisited , 2003, Journal of biomolecular NMR.

[85]  David Baker,et al.  A hybrid NMR/SAXS‐based approach for discriminating oligomeric protein interfaces using Rosetta , 2015, Proteins.

[86]  Robert Powers,et al.  An integrated platform for automated analysis of protein NMR structures. , 2005, Methods in enzymology.