Determination of three‐dimensional protein structures from nuclear magnetic resonance data using fragments of known structures

A method to build a three‐dimensional protein model from nuclear magnetic resonance (NMR) data using fragments from a data base of crystallographically determined protein structures is presented. The interproton distances derived from the nuclear Overhauser effect (NOE) data are compared to the precalculated distances in the known protein structures. An efficient search algorithm is used, which arranges the distancs in matrices akin to a Cα diagonal distance plot, and compares the NOE distance matrices for short sequential zones of the protein to the data base matrices. After cluster analysis of the fragments found in this way, the structure is built by aligning fragments in overlapping zones. The sequentially long‐range NOEs cannot be used in the initial fragments search but are vital to discriminate between several possible combinations of different groups of fragments. The method has been tested on one simulated NOE data set derived from a crystal structure and one experimental NMR data set. The method produces models that have good local structure, but may contain larger global errors. These models can be used as the starting point for further refinement, e.g., by restrained molecular dynamics or interactive graphics.

[1]  K Wüthrich,et al.  Polypeptide fold in the two metal clusters of metallothionein-2 by nuclear magnetic resonance in solution. , 1986, Journal of molecular biology.

[2]  Kurt Wüthrich,et al.  Secondary structure of the α-amylase polypeptide inhibitor Tendamistat from Streptomyces tendae determined in solution by 1H nuclear magnetic resonance , 1985 .

[3]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[4]  K Wüthrich,et al.  Polypeptide secondary structure determination by nuclear magnetic resonance observation of short proton-proton distances. , 1984, Journal of molecular biology.

[5]  K. Wüthrich,et al.  Secondary structure of the lac repressor DNA-binding domain by two-dimensional 1H nuclear magnetic resonance in solution. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[6]  W F van Gunsteren,et al.  A protein structure from nuclear magnetic resonance data. lac repressor headpiece. , 1985, Journal of molecular biology.

[7]  Wayne A. Hendrickson,et al.  Structure of the hydrophobic protein crambin determined directly from the anomalous scattering of sulphur , 1981, Nature.

[8]  G. Clore,et al.  Nuclear magnetic resonance study of the globular domain of chicken histone H5: resonance assignment and secondary structure. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Anders Liljas,et al.  Recognition of structural domains in globular proteins , 1974 .

[10]  M Karplus,et al.  Application of molecular dynamics with interproton distance restraints to three-dimensional protein structure determination. A model study of crambin. , 1986, Journal of molecular biology.

[11]  Kurt Wüthrich,et al.  Systematic application of two-dimensional 1H nuclear-magnetic-resonance techniques for studies of proteins. 2. Combined use of correlated spectroscopy and nuclear Overhauser spectroscopy for sequential assignments of backbone resonances and elucidation of polypeptide secondary structures. , 1981, European journal of biochemistry.

[12]  H. Scheraga,et al.  Energy parameters in polypeptides. 9. Updating of geometrical parameters, nonbonded interactions, and hydrogen bond interactions for the naturally occurring amino acids , 1983 .

[13]  Kurt Wüthrich,et al.  Interactive program for investigation of protein structures based on 1H NMR experiments , 1985 .

[14]  H. Scheraga,et al.  Energy parameters in polypeptides. VII. Geometric parameters, partial atomic charges, nonbonded interactions, hydrogen bond interactions, and intrinsic torsional potentials for the naturally occurring amino acids , 1975 .

[15]  K Wüthrich,et al.  Conformation of glucagon in a lipid-water interphase by 1H nuclear magnetic resonance. , 1983, Journal of molecular biology.

[16]  M. Levitt,et al.  Conformation of amino acid side-chains in proteins. , 1978, Journal of molecular biology.

[17]  M Karplus,et al.  Three-dimensional structure of proteins determined by molecular dynamics with interproton distance restraints: application to crambin. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[18]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[19]  K Wüthrich,et al.  Pseudo-structures for the 20 common amino acids for use in studies of protein conformations by measurements of intramolecular proton-proton distance constraints with nuclear magnetic resonance. , 1983, Journal of molecular biology.

[20]  K Wüthrich,et al.  Nuclear magnetic resonance identification of "half-turn" and 3(10)-helix secondary structure in rabbit liver metallothionein-2. , 1986, Journal of molecular biology.

[21]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[22]  M. Billeter,et al.  Calibration of the angular dependence of the amide proton-Cα proton coupling constants, 3JHNα, in a globular protein: Use of 3JHNα for identification of helical secondary structure , 1984 .

[23]  K Wüthrich,et al.  Studies by 1H nuclear magnetic resonance and distance geometry of the solution conformation of the alpha-amylase inhibitor tendamistat. , 1986, Journal of molecular biology.

[24]  P. Kraulis,et al.  Solving Solution Structural Problems by Combining 2-D NMR Data with Known Substructures from a Protein Database , 1987 .

[25]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[26]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[27]  Timothy F. Havel,et al.  Solution conformation of proteinase inhibitor IIA from bull seminal plasma by 1H nuclear magnetic resonance and distance geometry. , 1985, Journal of molecular biology.

[28]  M. Billeter,et al.  Spatial arrangement of the three α helices in the solution conformation of E. coli lac represser DNA‐binding domain , 1984, FEBS letters.

[29]  K Wüthrich,et al.  Sequential resonance assignments as a basis for determination of spatial protein structures by high resolution proton nuclear magnetic resonance. , 1982, Journal of molecular biology.

[30]  R. Kaptein,et al.  Stereospecific assignments of 1H‐nmr methyl lines and conformation of valyl residues in the lac repressor headpiece , 1985 .

[31]  V. Maiorov,et al.  NMR solution spatial structure of ‘short’ scorpion insectotoxin I5A , 1984 .

[32]  Timothy F. Havel,et al.  An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. , 1985, Journal of molecular biology.