Predicting the three-dimensional structure of a protein given only its amino acid sequence is a long-standing goal in computational chemistry. In the thermodynamic approach, one needs a potential function of conformation that resembles the free energy of the real protein to the extent that the global minimum of the potential is attained by the native conformation and no other. In practice, this has never been achieved with certainty because even with greatly simplified representations of the polypeptide chain, there are an astronomical number of local minima to examine. If one chooses instead a protein representation with only a large but manageable number of discrete conformations, then the global preference of the potential for the native can be directly verified. Representing a protein as a walk on a two-dimensional square lattice makes it easy to see that simple functions of the interresidue contacts are sufficient to globally favor a given "native" conformation, as long as it is a compact, globular structure. Explicit representation of the solvent is not required. Another more realistic way to confine the conformational search to a finite set is to draw alternative conformations from fragments of larger proteins having known crystal structure. Then it is possible to construct a simple function of interresidue contacts in three dimensions such that only 8 proteins are required to determine the adjustable parameters, and the native conformations of 37 other proteins are correctly preferred over all alternative conformations. The deduced function favors short-range backbone-backbone contacts regardless of residue type and long-range hydrophobic associations. Interactions over long distances, such as electrostatics, are not required.
[1]
C. Anfinsen.
Principles that govern the folding of protein chains.
,
1973,
Science.
[2]
G. Crippen.
Global optimization and polypeptide conformation
,
1975
.
[3]
G J Williams,et al.
The Protein Data Bank: a computer-based archival file for macromolecular structures.
,
1977,
Journal of molecular biology.
[4]
N. Go.
Theoretical studies of protein folding.
,
1983,
Annual review of biophysics and bioengineering.
[5]
R. Bruccoleri,et al.
Criteria that discriminate between native proteins and incorrectly folded models
,
1988,
Proteins.
[6]
K. Dill,et al.
A lattice statistical mechanics model of the conformational and sequence spaces of proteins
,
1989
.
[7]
D. Covell,et al.
Conformations of folded proteins in restricted spaces.
,
1990,
Biochemistry.
[8]
M. Sippl.
Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins.
,
1990,
Journal of molecular biology.
[9]
G M Crippen,et al.
A 1.8 Å resolution potential function for protein folding
,
1990,
Biopolymers.
[10]
P. Seetharamulu,et al.
A potential function for protein folding
,
1991
.