Graph-theoretical assignment of secondary structure in multidimensional protein NMR spectra: Application to the lac repressor headpiece

SummaryA novel procedure is presented for the automatic identification of secondary structures in proteins from their corresponding NOE data. The method uses a branch of mathematics known as graph theory to identify prescribed NOE connectivity patterns characteristic of the regular secondary structures. Resonance assignment is achieved by connecting these patterns of secondary structure together, thereby matching the connected spin systems to specific segments of the protein sequence. The method known as SERENDIPITY refers to a set of routines developed in a modular fashion, where each program has one or several well-defined tasks. NOE templates for several secondary structure motifs have been developed and the method has been successfully applied to data obtained from NOESY-type spectra. The present report describes the application of the SERENDIPITY protocol to a 3D NOESY-HMQC spectrum of the 15N-labelled lac repressor headpiece protein. The application demonstrates that, under favourable conditions, fully automated identification of secondary structures and semi-automated assignment are feasible.

[1]  Jun Xu,et al.  CPA: Constrained partitioning algorithm for initial assignment of protein proton resonances from MQF-COSY , 1993, J. Chem. Inf. Comput. Sci..

[2]  G. Kleywegt,et al.  Computer-Assisted Assignment of Homonuclear 3D NMR Spectra of Proteins. Application to Pike Parvalbumin III , 1993 .

[3]  H. T. Lau Algorithms on graphs , 1990 .

[4]  Hans Robert Kalbitzer,et al.  Distribution of chemical shifts in 1H nuclear magnetic resonance spectra of proteins , 1988 .

[5]  K. Wüthrich,et al.  Sequence-specific resonance assignments in the 1H nuclear-magnetic-resonance spectrum of the lac repressor DNA-binding domain 1-51 from Escherichia coli by two-dimensional spectroscopy. , 1983, European journal of biochemistry.

[6]  W F van Gunsteren,et al.  A protein structure from nuclear magnetic resonance data. lac repressor headpiece. , 1985, Journal of molecular biology.

[7]  D. Wishart,et al.  The 13C Chemical-Shift Index: A simple method for the identification of protein secondary structure using 13C chemical-shift data , 1994, Journal of biomolecular NMR.

[8]  K. Wüthrich,et al.  Carbon‐13 NMR chemical shifts of the common amino acid residues measured in aqueous solutions of the linear tetrapeptides H‐Gly‐Gly‐ X‐L‐ Ala‐OH , 1978 .

[9]  Georg E. Schulz,et al.  Principles of Protein Structure , 1979 .

[10]  Irwin D. Kuntz,et al.  A program for semi-automatic sequential resonance assignments in protein 1H nuclear magnetic resonance spectra , 1988 .

[11]  Ad Bax,et al.  Multidimensional nuclear magnetic resonance methods for protein studies , 1994 .

[12]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[13]  Dennis H. Rouvray,et al.  Computational chemical graph theory , 1990 .

[14]  W F van Gunsteren,et al.  Combined procedure of distance geometry and restrained molecular dynamics techniques for protein structure determination from nuclear magnetic resonance data: Application to the DNA binding domain of lac repressor from Escherichia coli , 1988, Proteins.

[15]  Narsingh Deo,et al.  Graph Theory with Applications to Engineering and Computer Science , 1975, Networks.

[16]  P. Kraulis,et al.  Three-dimensional NMR spectroscopy of a protein in solution , 1988, Nature.

[17]  Frank Harary,et al.  Graph Theory , 2016 .

[18]  K. Wüthrich,et al.  Secondary structure of the lac repressor DNA-binding domain by two-dimensional 1H nuclear magnetic resonance in solution. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Peter Willett,et al.  Upperbound procedures for the identification of similar three-dimensional chemical structures , 1989, J. Comput. Aided Mol. Des..

[20]  A. Gronenborn,et al.  Multidimensional heteronuclear nuclear magnetic resonance of proteins. , 1994, Methods in enzymology.

[21]  Peter Willett,et al.  Identification of .beta.-sheet motifs, of .psi.-loops, and of patterns of amino acid residues in three-dimensional protein structures using a subgraph-isomorphism algorithm , 1994, J. Chem. Inf. Comput. Sci..

[22]  Nicos Christofides,et al.  Combinatorial optimization , 1979 .

[23]  A. Wand,et al.  Refinement of the main chain directed assignment strategy for the analysis of 1H NMR spectra of proteins. , 1991, Biophysical journal.

[24]  L. Kay,et al.  Overcoming the overlap problem in the assignment of 1H NMR spectra of larger proteins by use of three-dimensional heteronuclear 1H-15N Hartmann-Hahn-multiple quantum coherence and nuclear Overhauser-multiple quantum coherence spectroscopy: application to interleukin 1 beta. , 1989, Biochemistry.

[25]  V. P. Chuprina,et al.  Structure of the complex of lac repressor headpiece and an 11 base-pair half-operator determined by nuclear magnetic resonance spectroscopy and restrained molecular dynamics. , 1994, Journal of Molecular Biology.

[26]  Xiaoyu Liu,et al.  Computer-assisted graph-theoretical construction of 13C NMR signal and intensity patterns , 1990 .

[27]  P Willett,et al.  Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm. , 1993, Journal of molecular biology.

[28]  K Wüthrich,et al.  Sequential resonance assignments in protein 1H nuclear magnetic resonance spectra. Computation of sterically allowed proton-proton distances and statistical analysis of proton-proton distances in single crystal protein conformations. , 1982, Journal of molecular biology.

[29]  J. J. McGregor,et al.  Backtrack search algorithms and the maximal common subgraph problem , 1982, Softw. Pract. Exp..

[30]  S. Straus,et al.  Use of fuzzy mathematics for complete automated assignment of peptide 1H 2D NMR spectra. , 1994, Journal of Magnetic Resonance - Series B.

[31]  S. Harrison,et al.  A structural taxonomy of DNA-binding domains , 1991, Nature.

[32]  Stephen W. Fesik,et al.  A computer-based protocol for semiautomated assignments and 3D structure determination of proteins , 1994, Journal of biomolecular NMR.

[33]  L. Kay,et al.  A novel approach for sequential assignment of proton, carbon-13, and nitrogen-15 spectra of larger proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin , 1990 .

[34]  L. Kay,et al.  A novel approach for sequential assignment of 1H, 13C, and 15N spectra of proteins: heteronuclear triple-resonance three-dimensional NMR spectroscopy. Application to calmodulin. , 1990, Biochemistry.

[35]  P. J. Kraulis,et al.  ANSIG: A program for the assignment of protein 1H 2D NMR spectra by interactive computer graphics , 1989, Journal of Magnetic Resonance (1969).

[36]  K Wüthrich,et al.  Polypeptide secondary structure determination by nuclear magnetic resonance observation of short proton-proton distances. , 1984, Journal of molecular biology.

[37]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[38]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[39]  Geoffrey Bodenhausen,et al.  Topological classification of fragments of coupling networks and multiplet patterns in two-dimensional NMR spectra , 1988 .

[40]  S. Fesik,et al.  Heteronuclear three-dimensional NMR spectroscopy of the inflammatory protein C5a. , 1989, Biochemistry.

[41]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[42]  Rolf Boelens,et al.  Restrained Molecular Dynamics Procedure for Protein Tertiary Structure Determination from NMR Data: A Lac Repressor Headpiece Structure Based on Information on J‐coupling and from Presence and Absence of NOE's , 1986 .

[43]  Jun Xu,et al.  Automated extraction of spin coupling topologies from 2D NMR correlation spectra for protein proton resonance assignment , 1993, Journal of chemical information and computer sciences.

[44]  G. Marius Clore,et al.  Computer-aided sequential assignment of protein 1H NMR spectra , 1988 .

[45]  Harry G. Barrow,et al.  Subgraph Isomorphism, Matching Relational Structures and Maximal Cliques , 1976, Inf. Process. Lett..

[46]  H Oschkinat,et al.  Assignment of protein nmr spectra in the light of homonuclear 3D spectroscopy: An automatable procedure based on 3D TOCSY‐TOCSY and 3D TOCSY‐NOESY , 1991, Biopolymers.

[47]  D. M. Schneider,et al.  Implementation of the main chain directed assignment strategy. Computer assisted approach. , 1991, Biophysical journal.

[48]  Irwin D. Kuntz,et al.  Programs for computer-assisted sequential assignment of proteins , 1989 .

[49]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[50]  Ad Bax,et al.  Methodological advances in protein NMR , 1993 .

[51]  R. Brennan DNA recognition by the helix-turn-helix motif , 1992 .

[52]  F. Richards,et al.  Relationship between nuclear magnetic resonance chemical shift and protein secondary structure. , 1991, Journal of molecular biology.