Automated amino acid side-chain NMR assignment of proteins using 13C- and 15N-resolved 3D [1H,1H]-NOESY

ASCAN is a new algorithm for automatic sequence-specific NMR assignment of amino acid side-chains in proteins, which uses as input the primary structure of the protein, chemical shift lists of 1HN, 15N, 13Cα, 13Cβ and possibly 1Hα from the previous polypeptide backbone assignment, and one or several 3D 13C- or 15N-resolved [1H,1H]-NOESY spectra. ASCAN has also been laid out for the use of TOCSY-type data sets as supplementary input. The program assigns new resonances based on comparison of the NMR signals expected from the chemical structure with the experimentally observed NOESY peak patterns. The core parts of the algorithm are a procedure for generating expected peak positions, which is based on variable combinations of assigned and unassigned resonances that arise for the different amino acid types during the assignment procedure, and a corresponding set of acceptance criteria for assignments based on the NMR experiments used. Expected patterns of NOESY cross peaks involving unassigned resonances are generated using the list of previously assigned resonances, and tentative chemical shift values for the unassigned signals taken from the BMRB statistics for globular proteins. Use of this approach with the 101-amino acid residue protein FimD(25–125) resulted in 84% of the hydrogen atoms and their covalently bound heavy atoms being assigned with a correctness rate of 90%. Use of these side-chain assignments as input for automated NOE assignment and structure calculation with the ATNOS/CANDID/DYANA program suite yielded structure bundles of comparable quality, in terms of precision and accuracy of the atomic coordinates, as those of a reference structure determined with interactive assignment procedures. A rationale for the high quality of the ASCAN-based structure determination results from an analysis of the distribution of the assigned side chains, which revealed near-complete assignments in the core of the protein, with most of the incompletely assigned residues located at or near the protein surface.

[1]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[2]  Brian D Sykes,et al.  Smartnotebook: A semi-automated approach to protein sequential NMR resonance assignments , 2003, Journal of biomolecular NMR.

[3]  K Wüthrich,et al.  Pseudo-structures for the 20 common amino acids for use in studies of protein conformations by measurements of intramolecular proton-proton distance constraints with nuclear magnetic resonance. , 1983, Journal of molecular biology.

[4]  Robert Powers,et al.  A common sense approach to peak picking in two-, three-, and four-dimensional spectra using automatic computer analysis of contour diagrams , 1991 .

[5]  M. Billeter,et al.  Automated peak picking and peak integration in macromolecular NMR spectra using AUTOPSY. , 1998, Journal of magnetic resonance.

[6]  R. Glockshuber,et al.  Structural basis of chaperone–subunit complex recognition by the type 1 pilus assembly platform FimD , 2005, The EMBO journal.

[7]  W. M. Westler,et al.  A relational database for sequence-specific protein NMR data , 1991, Journal of biomolecular NMR.

[8]  K. Wüthrich,et al.  Solution structures of the putative anti‐σ‐factor antagonist TM1442 from Thermotoga maritima in the free and phosphorylated states , 2006, Magnetic resonance in chemistry : MRC.

[9]  Peter Güntert,et al.  Automated NMR protein structure calculation , 2003 .

[10]  Gaetano T Montelione,et al.  Automated analysis of protein NMR assignments and structures. , 2004, Chemical reviews.

[11]  H N Moseley,et al.  Automated analysis of NMR assignments and structures for proteins. , 1999, Current opinion in structural biology.

[12]  Gerhard Wagner,et al.  IBIS – A tool for automated sequential assignment of protein spectra from triple resonance experiments , 2003, Journal of Biomolecular NMR.

[13]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[14]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[15]  H N Moseley,et al.  Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. , 2001, Methods in enzymology.

[16]  Kurt Wüthrich,et al.  GARANT‐a general algorithm for resonance assignment of multidimensional nuclear magnetic resonance spectra , 1997 .

[17]  Robert F. Boyko,et al.  CAMRA: Chemical shift based computer aided protein NMR assignments , 1998, Journal of biomolecular NMR.

[18]  R A Goldstein,et al.  Protein heteronuclear NMR assignments using mean-field simulated annealing. , 1997, Journal of magnetic resonance.

[19]  A. Altieri,et al.  Automation of NMR structure determination of proteins. , 2004, Current opinion in structural biology.

[20]  P. J. Kraulis,et al.  ANSIG: A program for the assignment of protein 1H 2D NMR spectra by interactive computer graphics , 1989, Journal of Magnetic Resonance (1969).

[21]  W. Gronwald,et al.  Automated structure determination of proteins by NMR spectroscopy , 2004 .

[22]  K. Wüthrich,et al.  Assignments for the Bombyx mori pheromone-binding protein fragment BmPBP(1-128) at pH 6.5. , 2005, Journal of Biomolecular NMR.

[23]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[24]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[25]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[26]  K. Wüthrich,et al.  Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS , 2002, Journal of biomolecular NMR.

[27]  Thomas Szyperski,et al.  A generalized approach to automated NMR peak list editing: application to reduced dimensionality triple resonance spectra. , 2004, Journal of magnetic resonance.

[28]  K Wüthrich,et al.  Automated sequence-specific NMR assignment of homologous proteins using the program GARANT , 1996, Journal of biomolecular NMR.

[29]  Brian E Coggins,et al.  PACES: Protein sequential assignment by computer-assisted exhaustive search , 2003, Journal of biomolecular NMR.

[30]  M. Billeter,et al.  MUNIN: A new approach to multi-dimensional NMR spectra interpretation , 2001, Journal of biomolecular NMR.

[31]  Martin Billeter,et al.  Fully automated sequence-specific resonance assignments of hetero- nuclear protein spectra , 2003, Journal of biomolecular NMR.

[32]  H. Atreya,et al.  A tracked approach for automated NMR assignments in proteins (TATAPRO) , 2000, Journal of biomolecular NMR.

[33]  Arash Bahrami,et al.  High-resolution iterative frequency identification for NMR as a general strategy for multidimensional data collection. , 2005, Journal of the American Chemical Society.

[34]  Arash Bahrami,et al.  Probabilistic Identification of Spin Systems and their Assignments including Coil–Helix Inference as Output (PISTACHIO) , 2005, Journal of biomolecular NMR.