Automated protein structure calculation from NMR data

Current software is almost at the stage to permit completely automatic structure determination of small proteins of <15 kDa, from NMR spectra to structure validation with minimal user interaction. This goal is welcome, as it makes structure calculation more objective and therefore more easily validated, without any loss in the quality of the structures generated. Moreover, it releases expert spectroscopists to carry out research that cannot be automated. It should not take much further effort to extend automation to ca 20 kDa. However, there are technological barriers to further automation, of which the biggest are identified as: routines for peak picking; adoption and sharing of a common framework for structure calculation, including the assembly of an automated and trusted package for structure validation; and sample preparation, particularly for larger proteins. These barriers should be the main target for development of methodology for protein structure determination, particularly by structural genomics consortia.

[1]  Udo Heinemann,et al.  Structural genomics in Europe: Slow start, strong finish? , 2000, Nature Structural Biology.

[2]  A. Grishaev,et al.  ABACUS, a direct method for protein NMR structure computation via assembly of fragments , 2005, Proteins.

[3]  M Nilges,et al.  NMR in the SPINE Structural Proteomics project. , 2006, Acta crystallographica. Section D, Biological crystallography.

[4]  Jason D. Gans,et al.  APART: Automated Preprocessing for NMR Assignments with Reduced Tedium , 2005, Bioinform..

[5]  Mark Gerstein,et al.  Structural proteomics of an archaeon , 2000, Nature Structural Biology.

[6]  David Cyranoski,et al.  'Big science' protein project under fire , 2006, Nature.

[7]  A. Edwards,et al.  Structural proteomics: toward high-throughput structural biology as a tool in functional genomics. , 2003, Accounts of chemical research.

[8]  Thomas C. Terwilliger,et al.  Structural genomics in North America , 2000, Nature Structural Biology.

[9]  Peter Güntert,et al.  Automated structure determination of proteins with the SAIL-FLYA NMR method , 2007, Nature Protocols.

[10]  Fast high-resolution protein structure determination by using unassigned NMR data. , 2007, Angewandte Chemie.

[11]  Michael Nilges,et al.  ARIA: automated NOE assignment and NMR structure calculation , 2003, Bioinform..

[12]  Chris Bailey-Kellogg,et al.  An efficient randomized algorithm for contact-based NMR backbone resonance assignment , 2006, Bioinform..

[13]  A. Altieri,et al.  Automation of NMR structure determination of proteins. , 2004, Current opinion in structural biology.

[14]  Chris Bailey-Kellogg,et al.  Contact replacement for NMR resonance assignment , 2008, ISMB.

[15]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[16]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[17]  Francesco Fiorito,et al.  Automated amino acid side-chain NMR assignment of proteins using 13C- and 15N-resolved 3D [1H,1H]-NOESY , 2008, Journal of biomolecular NMR.

[18]  D. Wishart,et al.  An NMR approach to structural proteomics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Thomas Szyperski,et al.  G-matrix Fourier transform NMR spectroscopy for complete protein resonance assignment. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[20]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[21]  Kurt Wüthrich,et al.  Sequence-specific resonance assignment of soluble nonglobular proteins by 7D APSY-NMR spectroscopy. , 2007, Journal of the American Chemical Society.

[22]  Peter Güntert,et al.  KUJIRA, a package of integrated modules for systematic and interactive analysis of NMR data directed to high-throughput NMR structure studies , 2007, Journal of biomolecular NMR.

[23]  Cheryl H. Arrowsmith,et al.  Structural Proteomics: Toward High‐Throughput Structural Biology as a Tool in Functional Genomics , 2003 .

[24]  Gert Vriend,et al.  Validation of protein structures derived by NMR spectroscopy , 2004 .

[25]  Miron Livny,et al.  RECOORD: A recalculated coordinate database of 500+ proteins from the PDB using restraints from the BioMagResBank , 2005, Proteins.

[26]  Cheryl H Arrowsmith,et al.  Solution NMR in structural genomics. , 2006, Current opinion in structural biology.

[27]  F. Allain,et al.  Improved segmental isotope labeling methods for the NMR study of multidomain or large proteins: application to the RRMs of Npl3p and hnRNP L. , 2008, Journal of molecular biology.

[28]  J. Sussman,et al.  RIKEN aids international structural genomics efforts , 2007, Nature.

[29]  M. Nilges,et al.  Refinement of protein structures in explicit solvent , 2003, Proteins.

[30]  David Baker,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improving NMR protein structure quality by Rosetta refinement: A molecular , 2022 .

[31]  Charles D Schwieters,et al.  Completely automated, highly error-tolerant macromolecular structure determination from multidimensional nuclear overhauser enhancement spectra and chemical shift assignments. , 2004, Journal of the American Chemical Society.

[32]  Gaohua Liu,et al.  NMR data collection and analysis protocol for high-throughput protein structure determination. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Oliver F. Lange,et al.  Consistent blind protein structure generation from NMR chemical shift data , 2008, Proceedings of the National Academy of Sciences.

[34]  Charles D Schwieters,et al.  The Xplor-NIH NMR molecular structure determination package. , 2003, Journal of magnetic resonance.

[35]  Peter Güntert,et al.  Automated NMR protein structure calculation , 2003 .

[36]  Thomas Szyperski,et al.  A generalized approach to automated NMR peak list editing: application to reduced dimensionality triple resonance spectra. , 2004, Journal of magnetic resonance.

[37]  John D. Westbrook,et al.  TargetDB: a target registration database for structural genomics projects , 2004, Bioinform..

[38]  Michael Nilges,et al.  Quantitative study of the effects of chemical shift tolerances and rates of SA cooling on structure calculation from automatically assigned NOE data. , 2005, Journal of magnetic resonance.

[39]  Yutaka Kuroda,et al.  Structural genomics projects in Japan , 2000, Nature Structural Biology.

[40]  Alexander Grishaev,et al.  CLOUDS, a protocol for deriving a molecular proton density via NMR , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[42]  Robert Powers,et al.  An integrated platform for automated analysis of protein NMR structures. , 2005, Methods in enzymology.

[43]  Michael Levitt,et al.  Growth of novel protein structural data , 2007, Proceedings of the National Academy of Sciences.

[44]  Masasuke Yoshida,et al.  Conformational change of H+-ATPase beta monomer revealed on segmental isotope labeling NMR spectroscopy. , 2004, Journal of the American Chemical Society.

[45]  M Nilges,et al.  Calculation of protein structures with ambiguous distance restraints. Automated assignment of ambiguous NOE crosspeaks and disulphide connectivities. , 1995, Journal of molecular biology.

[46]  Brian D Sykes,et al.  Smartnotebook: A semi-automated approach to protein sequential NMR resonance assignments , 2003, Journal of biomolecular NMR.

[47]  K. Gunsalus,et al.  Protein production and purification , 2008, Nature Methods.

[48]  Gert Vriend,et al.  The precision of NMR structure ensembles revisited , 2003, Journal of biomolecular NMR.

[49]  Bruce A. Johnson,et al.  NMR View: A computer program for the visualization and analysis of NMR data , 1994, Journal of biomolecular NMR.

[50]  G. Montelione,et al.  Assignment validation software suite for the evaluation and presentation of protein resonance assignment data , 2004, Journal of biomolecular NMR.

[51]  Michael Nilges,et al.  Structural bioinformatics ARIA 2 : Automated NOE assignment and data integration in NMR structure calculation , 2007 .

[52]  M. Nilges,et al.  Influence of chemical shift tolerances on NMR structure calculations using ARIA protocols for assigning NOE data , 2005, Journal of biomolecular NMR.

[53]  Peter Güntert,et al.  Automated structure determination from NMR spectra , 2009, European Biophysics Journal.

[54]  Alexandre M J J Bonvin,et al.  DRESS: a database of REfined solution NMR structures , 2004, Proteins.

[55]  P. Güntert,et al.  Fully automated structure determinations of the Fes SH2 domain using different sets of NMR spectra , 2006, Magnetic resonance in chemistry : MRC.

[56]  Stephen K. Burley,et al.  An overview of structural genomics , 2000, Nature Structural Biology.

[57]  Peter Güntert,et al.  Automated protein structure determination from NMR spectra. , 2006, Journal of the American Chemical Society.

[58]  Martin Billeter,et al.  High-throughput analysis of protein NMR spectra , 2005 .

[59]  D. Baker,et al.  Rapid protein fold determination using unassigned NMR data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Michael Nilges,et al.  Structural Biology by NMR: Structure, Dynamics, and Interactions , 2008, PLoS Comput. Biol..

[61]  Y. Matsuo,et al.  Structural genomics projects in Japan. , 2000, Progress in biophysics and molecular biology.

[62]  Peter Güntert,et al.  Influence of the completeness of chemical shift assignments on NMR structures obtained with automated NOE assignment , 2004, Journal of Structural and Functional Genomics.

[63]  Martin Billeter,et al.  Fully automated sequence-specific resonance assignments of hetero- nuclear protein spectra , 2003, Journal of biomolecular NMR.

[64]  Gaetano T Montelione,et al.  Assessing precision and accuracy of protein structures derived from NMR data , 2005, Proteins.

[65]  Steven E Brenner,et al.  The Impact of Structural Genomics: Expectations and Outcomes , 2005, Science.

[66]  G. Marius Clore,et al.  Automated error-tolerant macromolecular structure determination from multidimensional nuclear Overhauser enhancement spectra and chemical shift assignments: improved robustness and performance of the PASD algorithm , 2008, Journal of biomolecular NMR.

[67]  Gaetano T Montelione,et al.  Evaluating protein structures determined by structural genomics consortia , 2006, Proteins.

[68]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[69]  Michael Andrec,et al.  A large data set comparison of protein structures determined by crystallography and NMR: Statistical test for structural differences and the effect of crystal packing , 2007, Proteins.

[70]  Gaetano T Montelione,et al.  Automated analysis of protein NMR assignments and structures. , 2004, Chemical reviews.

[71]  K Wüthrich,et al.  The program XEASY for computer-supported NMR spectral analysis of biological macromolecules , 1995, Journal of biomolecular NMR.

[72]  Ray Freeman,et al.  Projection-reconstruction technique for speeding up multidimensional NMR spectroscopy. , 2004, Journal of the American Chemical Society.

[73]  Wolfgang Jahnke,et al.  Perspectives of biomolecular NMR in drug discovery: the blessing and curse of versatility , 2007, Journal of biomolecular NMR.

[74]  H N Moseley,et al.  Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. , 2001, Methods in enzymology.

[75]  Kurt Wüthrich,et al.  GARANT‐a general algorithm for resonance assignment of multidimensional nuclear magnetic resonance spectra , 1997 .

[76]  Kurt Wüthrich,et al.  Solution NMR structure determination of proteins revisited , 2008, Journal of biomolecular NMR.

[77]  Michael Nilges,et al.  Ambiguous NOEs and automated NOE assignment , 1998 .

[78]  Timothy F. Havel,et al.  Solution conformation of proteinase inhibitor IIA from bull seminal plasma by 1H nuclear magnetic resonance and distance geometry. , 1985, Journal of molecular biology.

[79]  J. Lukin,et al.  MONTE: An automated Monte Carlo based approach to nuclear magnetic resonance assignment of proteins , 2003, Journal of biomolecular NMR.

[80]  Thomas Szyperski,et al.  G-matrix Fourier transform NOESY-based protocol for high-quality protein structure determination. , 2005, Journal of the American Chemical Society.

[81]  Wayne Boucher,et al.  The CCPN data model for NMR spectroscopy: Development of a software pipeline , 2005, Proteins.

[82]  J. Skilling,et al.  Exponential sampling, an alternative method for sampling in two-dimensional NMR experiments , 1987 .

[83]  W. Gronwald,et al.  Automated structure determination of proteins by NMR spectroscopy , 2004 .

[84]  Ming Luo,et al.  The Southeast Collaboratory for Structural Genomics: a high-throughput gene to structure factory. , 2003, Accounts of chemical research.