CASA: An Efficient Automated Assignment of Protein Mainchain NMR Data Using an Ordered Tree Search Algorithm

Rapid analysis of protein structure, interaction, and dynamics requires fast, automated assignment of 3D protein backbone triple-resonance NMR spectra. We introduce CASA, a new depth-first ordered tree search method for automated assignment that uses hand-edited peak-pick lists from a flexible number of triple-resonance experiments. The program was tested on 13 simulated peak lists for proteins of up to 723 residues, as well as on experimental data for four proteins. Under reasonable tolerances, it generated assignments that correspond to those reported in the literature within a few minutes of CPU time. The program was also tested on proteins analyzed by other methods, with both simulated and experimental peak lists, and it generated good assignments in all relevant cases. Its robustness was further tested under various conditions.
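The abstract does not spell out how the tree search works, so the following is only a minimal sketch of a generic depth-first ordered search over spin systems, not CASA's actual algorithm. It assumes each spin system carries its own C-alpha shift and that of the preceding residue, that sequential links are accepted when the two agree within a tolerance, and that candidates are tried best match first. The names `SpinSystem`, `candidates`, `extend`, and `longest_chain`, the shift values, and the 0.4 ppm tolerance are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass(frozen=True)
class SpinSystem:
    name: str
    ca: float       # C-alpha shift of this residue (ppm)
    ca_prev: float  # C-alpha shift of the preceding residue (ppm)


def candidates(current: SpinSystem, pool: List[SpinSystem],
               used: Set[int], tol: float) -> List[int]:
    """Indices of unused spin systems that could follow `current`,
    ordered by how closely their ca_prev matches current.ca."""
    hits = []
    for i, s in enumerate(pool):
        diff = abs(s.ca_prev - current.ca)
        if i not in used and diff <= tol:
            hits.append((diff, i))
    return [i for _, i in sorted(hits)]


def extend(chain: List[int], pool: List[SpinSystem],
           used: Set[int], tol: float) -> List[int]:
    """Depth-first search for the longest chain continuing `chain`."""
    best = list(chain)
    for i in candidates(pool[chain[-1]], pool, used, tol):
        used.add(i)                       # descend into this branch
        trial = extend(chain + [i], pool, used, tol)
        used.discard(i)                   # backtrack
        if len(trial) > len(best):
            best = trial
        if len(best) == len(pool):        # every spin system is linked
            break
    return best


def longest_chain(pool: List[SpinSystem], start: int,
                  tol: float = 0.4) -> List[str]:
    """Names of the longest sequential chain beginning at pool[start]."""
    path = extend([start], pool, {start}, tol)
    return [pool[i].name for i in path]


# Hypothetical data: D is a decoy that also matches A within tolerance.
pool = [
    SpinSystem("A", ca=55.0, ca_prev=0.0),
    SpinSystem("B", ca=60.1, ca_prev=55.05),
    SpinSystem("C", ca=52.3, ca_prev=60.0),
    SpinSystem("D", ca=58.0, ca_prev=55.3),
]
print(longest_chain(pool, start=0))
```

Ordering the candidates means the most plausible branch is explored first, so in favorable cases the search reaches a full chain before visiting the decoy branches; a real program would combine several nuclei and map chains onto the sequence, which this sketch omits.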
