Limited sampling of conformational space by the distance geometry algorithm: implications for structures generated from NMR data.

Calculations with a metric matrix distance geometry algorithm were performed that show that the standard implementation of the algorithm generally samples a very limited region of conformational space. This problem is most severe when only a small amount of distance information is used as input for the algorithm. Control calculations were performed on linear peptides, disulfide-linked peptides, and a double-stranded DNA decamer where only distances defining the covalent structures of the molecules (as well as the hydrogen bonds for the base pairs in the DNA) were included as input. Since the distance geometry algorithm is commonly used to generate structures of biopolymers from distance data obtained from NMR experiments, simulations were performed on the small globular protein basic pancreatic trypsin inhibitor (BPTI) that mimic calculations performed with actual NMR data. The results on BPTI and on the control peptides indicate that the standard implementation of the algorithm has two main problems: first, that it generates extended structures; second, that it has a tendency to consistently produce similar structures instead of sampling all structures consistent with the input distance information. These results also show that use of a simple root-mean-square deviation for evaluating the quality of the structures generated from NMR data may not be generally appropriate. The main sources of these problems are identified, and our results indicate that the problems are not a fundamental property of the distance geometry algorithm but arise from the implementations presently used to generate structures from NMR data. Several possible methods for alleviating these problems are discussed.

[1]  B. Reid Sequence-specific assignments and their use in NMR studies of DNA structure , 1987, Quarterly Reviews of Biophysics.

[2]  Timothy F. Havel,et al.  An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. , 1985, Journal of molecular biology.

[3]  Gordon M. Crippen,et al.  Note rapid calculation of coordinates from distance matrices , 1978 .

[4]  K. Wüthrich Protein structure determination in solution by nuclear magnetic resonance spectroscopy. , 1989, Science.

[5]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by dynamical simulated annealing from a random array of atoms Circumventing problems associated with folding , 1988, FEBS letters.

[6]  S. C. Nyburg Some uses of a best molecular fit routine , 1974 .

[7]  D. Patel,et al.  Nuclear magnetic resonance and distance geometry studies of DNA structures in solution. , 1987, Annual review of biophysics and biophysical chemistry.

[8]  A. Gronenborn,et al.  The polypeptide fold of the globular domain of histone H5 in solution. A study using nuclear magnetic resonance, distance geometry and restrained molecular dynamics. , 1987, The EMBO journal.

[9]  J. Deisenhofer,et al.  Crystallographic refinement of the structure of bovine pancreatic trypsin inhibitor at l.5 Å resolution , 1975 .

[10]  W F van Gunsteren,et al.  Combined procedure of distance geometry and restrained molecular dynamics techniques for protein structure determination from nuclear magnetic resonance data: Application to the DNA binding domain of lac repressor from Escherichia coli , 1988, Proteins.

[11]  A. Gronenborn,et al.  The conformations of hirudin in solution: a study using nuclear magnetic resonance, distance geometry and restrained molecular dynamics , 1987, The EMBO journal.

[12]  B. Reid,et al.  Three-dimensional structure of the wild-type lac Pribnow promoter DNA in solution. Two-dimensional nuclear magnetic resonance studies and distance geometry calculations. , 1988, Journal of molecular biology.

[13]  M Karplus,et al.  Three-dimensional structure of proteins determined by molecular dynamics with interproton distance restraints: application to crambin. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1983, Bulletin of Mathematical Biology.

[15]  E. Olejniczak,et al.  Determining the structure of a glycopeptide-Ac2-Lys-D-Ala-D-Ala complex using NMR parameters and molecular modeling , 1986 .

[16]  Timothy F. Havel,et al.  Protein structures in solution by nuclear magnetic resonance and distance geometry. The polypeptide fold of the basic pancreatic trypsin inhibitor determined using two different algorithms, DISGEO and DISMAN. , 1987, Journal of molecular biology.

[17]  W F van Gunsteren,et al.  A protein structure from nuclear magnetic resonance data. lac repressor headpiece. , 1985, Journal of molecular biology.

[18]  W. Braun,et al.  Distance geometry and related methods for protein structure determination from NMR data , 1987, Quarterly Reviews of Biophysics.

[19]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculations , 1988, FEBS letters.

[20]  R. Levy,et al.  Determination of Protein Structures in Solution Using Nmr Data and Impact , 1988 .

[21]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[22]  Irwin D. Kuntz,et al.  Effects of distance constraints on macromolecular conformation. II. Simulation of experimental results and theoretical predictions , 1979 .

[23]  V. Bloomfield Biophysical Chemistry , 2020, Definitions.

[24]  A. Pardi,et al.  Determination of DNA structures by NMR and distance geometry techniques: a computer simulation. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[25]  J. Prestegard,et al.  NMR-pseudoenergy approach to the solution structure of acyl carrier protein. , 1987, Biochemistry.

[26]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[27]  B. Reid,et al.  Three-dimensional structure of a DNA hairpin in solution: two-dimensional NMR studies and distance geometry calculations on d(CGCGTTTTCGCG). , 1986, Biochemistry.