Automated 1H and 13C chemical shift prediction using the BioMagResBank

A computer program has been developed to accurately and automatically predict the 1H and 13C chemical shifts of unassigned proteins on the basis of sequence homology. The program (called SHIFTY) uses standard sequence alignment techniques to compare the sequence of an unassigned protein against the BioMagResBank – a public database containing sequences and NMR chemical shifts of nearly 200 assigned proteins [Seavey et al. (1991) J. Biomol. NMR, 1, 217–236]. From this initial sequence alignment, the program uses a simple set of rules to directly assign or transfer a complete set of 1H or 13C chemical shifts (from the previously assigned homologues) to the unassigned protein. This ‘homologous assignment’ protocol takes advantage of the simple fact that homologous proteins tend to share both structural similarity and chemical shift similarity. SHIFTY has been extensively tested on more than 25 medium-sized proteins. Under favorable circumstances, this program can predict the 1H or 13C chemical shifts of proteins with an accuracy far exceeding any other method published to date. With the expo- nential growth in the number of assigned proteins appearing in the literature (now at a rate of more than 150 per year), we believe that SHIFTY may have widespread utility in assigning individual members in families of related proteins, an endeavor that accounts for a growing portion of the protein NMR work being done today.

[1]  F. Richards,et al.  Relationship between nuclear magnetic resonance chemical shift and protein secondary structure. , 1991, Journal of molecular biology.

[2]  E. Oldfield,et al.  Secondary and tertiary structural effects on protein NMR chemical shifts: an ab initio approach. , 1993, Science.

[3]  John P. Overington,et al.  From comparisons of protein sequences and structures to protein modelling and design. , 1990, Trends in biochemical sciences.

[4]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[5]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its new supplement TREMBL , 1996, Nucleic Acids Res..

[6]  J. Neira,et al.  Peptide group chemical shift computation , 1992 .

[7]  D. Case,et al.  A new analysis of proton chemical shifts in proteins , 1991 .

[8]  Jeffrey C. Hoch,et al.  Computational Aspects of the Study of Biological Macromolecules by Nuclear Magnetic Resonance Spectroscopy , 1991, NATO ASI Series.

[9]  W R Taylor,et al.  Fast structure alignment for protein databank searching , 1992, Proteins.

[10]  J S Sack,et al.  Structure of a recombinant calmodulin from Drosophila melanogaster refined at 2.2-A resolution. , 1992, The Journal of biological chemistry.

[11]  A Tsugita,et al.  The PIR-International Protein Sequence Database. , 1996, Nucleic acids research.

[12]  M. O. Dayhoff,et al.  Establishing homologies in protein sequences. , 1983, Methods in enzymology.

[13]  K. Wüthrich,et al.  Ring current effects in the conformation dependent NMR chemical shifts of aliphatic protons in the basic pancreatic trypsin inhibitor. , 1979, Biochimica et biophysica acta.

[14]  M. Williamson,et al.  A method for the calculation of protein α-CH chemical shifts , 1992 .

[15]  David S. Wishart,et al.  ORB, a homology-based program for the prediction of protein NMR chemical shifts , 1997 .

[16]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[17]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[18]  K Wüthrich,et al.  Automated sequence-specific NMR assignment of homologous proteins using the program GARANT , 1996, Journal of biomolecular NMR.

[19]  R. Dwek,et al.  Comparisons of ring-current shifts calculated from the crystal structure of egg white lysozyme of hen with the proton nuclear magnetic resonance spectrum of lysozyme in solution. , 1980, Biochemistry.

[20]  M. James,et al.  Refined crystal structure of troponin C from turkey skeletal muscle at 2.0 A resolution. , 1988, Journal of molecular biology.

[21]  L. Kay,et al.  A novel approach for sequential assignment of 1H, 13C, and 15N spectra of proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin. , 1990, Biochemistry.