Paper information - Assembly of protein structure from sparse experimental data: An efficient Monte Carlo model

A new, efficient method for the assembly of protein tertiary structure from known, loosely encoded secondary structure restraints and sparse information about exact side chain contacts is proposed and evaluated. The method builds on a new, very simple reduced model of protein structure and dynamics, in which the protein is described as a lattice chain connecting side chain centers of mass rather than Cαs. The model has implicit built-in multibody correlations that simulate short- and long-range packing preferences, hydrogen bonding cooperativity, and a mean force potential describing hydrophobic interactions. Due to the simplicity of the protein representation and of the model force field, the Monte Carlo algorithm is at least an order of magnitude faster than previously published Monte Carlo algorithms for structure assembly. Compared with existing algorithms, the new method requires fewer tertiary restraints for successful fold assembly: on average, one for every seven residues, as compared to one for every four residues. For example, for smaller proteins such as the B domain of protein G, the resulting structures have a coordinate root mean square deviation (cRMSD) of about 3 Å from the experimental structure; for myoglobin, structures with a backbone cRMSD of 4.3 Å are produced; and for a 247-residue TIM barrel, the cRMSD of the resulting folds is about 6 Å. As would be expected, increasing the number of tertiary restraints improves the accuracy of the assembled structures. The reliability and robustness of the new method should enable its routine application in model building protocols based on various (very sparse) experimentally derived structural restraints. Proteins 32:475–494, 1998. © 1998 Wiley-Liss, Inc.
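To make the restraint densities quoted in the abstract concrete, the following minimal sketch converts "one restraint per k residues" into an approximate restraint count for the 247-residue TIM barrel. The function name `restraints_needed` and the round-to-nearest convention are illustrative assumptions, not taken from the paper; the abstract reports averages, so the counts actually used in the study may differ.

```python
def restraints_needed(n_residues: int, residues_per_restraint: int) -> int:
    """Approximate number of tertiary restraints at a given average density.

    Hypothetical helper: rounds n_residues / residues_per_restraint to the
    nearest integer. The rounding convention is an assumption.
    """
    if n_residues <= 0 or residues_per_restraint <= 0:
        raise ValueError("both arguments must be positive")
    return round(n_residues / residues_per_restraint)


TIM_BARREL_LENGTH = 247  # residue count quoted in the abstract

# Average densities quoted in the abstract: one restraint per seven residues
# for the new method, one per four residues for the earlier algorithms.
new_method = restraints_needed(TIM_BARREL_LENGTH, 7)
earlier_methods = restraints_needed(TIM_BARREL_LENGTH, 4)

print(new_method, earlier_methods)  # prints "35 62"
```

At these average densities the new method needs 4/7 as many restraints as the earlier ones, that is, roughly 43% fewer, independent of chain length.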

J. Skolnick | A. Kolinski

[1] H. Scheraga, et al. Experimental and theoretical aspects of protein folding. 1975, Advances in protein chemistry.

[2] G. J. Williams, et al. The Protein Data Bank: a computer-based archival file for macromolecular structures. 1978, Archives of biochemistry and biophysics.

[3] P. Gennes. Scaling Concepts in Polymer Physics. 1979.

[4] J. Richardson, et al. The anatomy and taxonomy of protein structure. 1981, Advances in protein chemistry.

[5] R. Doolittle, et al. A simple method for displaying the hydropathic character of a protein. 1982, Journal of molecular biology.

[6] W. Kabsch, et al. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. 1983, Biopolymers.

[7] N. Go, et al. Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. 1985, Journal of molecular biology.

[8] Timothy F. Havel, et al. An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. 1985, Journal of molecular biology.

[9] A. D. McLachlan, et al. Solvation energy in protein folding and binding. 1986, Nature.

[10] W. V. van Gunsteren, et al. Protein structures from NMR. 1988, Biochemistry.

[11] J. Szulmajster. Protein folding. 1988, Bioscience reports.