Algorithmic approach to quantifying the hydrophobic force contribution in protein folding

Though the electrostatic, ionic, van der Waals, Lennard-Jones, hydrogen bonding, and other forces play an important role in the energy function minimized at a protein's native state, it is widely believed that the hydrophobic force is the dominant term in protein folding. Here we attempt to quantify the extent to which the hydrophobic force determines the positions of the backbone alpha-carbon atoms in PDB data, by applying Monte-Carlo and genetic algorithms to determine the predicted conformation with minimum energy, where only the hydrophobic force is considered (i.e. Dill's HP-model, and refinements using Woese's polar requirement). This is done by computing the root mean square deviation between the normalized distance matrix D = (di,j) (di,j is normalized Euclidean distance between residues ri and rj) for PDB data with that obtained from the output of our algorithms. Our program was run on the database of ancient conserved regions drawn from GenBank 101 generously supplied by W. Gilbert's lab, as well as medium-sized proteins (E. Coli RecA, 2reb, Erythrocruorin, 1eca, and Actinidin 2act). The root mean square deviation (RMSD) between distance matrices derived from the PDB data and from our program output is quite small, and by comparison with RMSD between PDB data and random coils, allows a quantification of the hydrophobic force contribution. A preliminary version of this paper appeared at GCB'99 (http:¿bibiserv.techfak.uni-bielefeld.de/gcb9 9/).

[1]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[2]  M. Karplus,et al.  Kinetics of protein folding. A lattice model study of the requirements for folding to the native state. , 1994, Journal of molecular biology.

[3]  M. Whitlow,et al.  An empirical examination of potential energy minimization using the well-determined structure of the protein crambin , 1986 .

[4]  S. Govindarajan,et al.  Why are some protein structures so common ? ( tertiary structure y protein evolution y lattice models y fitness landscapes y spin glasses ) , 1998 .

[5]  C. Woese,et al.  On the fundamental nature and evolution of the genetic code. , 1966, Cold Spring Harbor symposia on quantitative biology.

[6]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[7]  J. Skolnick,et al.  Prediction of quaternary structure of coiled coils. Application to mutants of the GCN4 leucine zipper. , 1995, Journal of molecular biology.

[8]  A Kolinski,et al.  Prediction of the quaternary structure of coiled coils: GCN4 leucine zipper and its mutants. , 1996, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[9]  M. Teeter,et al.  Water-protein interactions: theory and experiment. , 1991, Annual review of biophysics and biophysical chemistry.

[10]  N. J. A. Sloane,et al.  Sphere Packings, Lattices and Groups , 1987, Grundlehren der mathematischen Wissenschaften.

[11]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[12]  E. Bornberg-Bauer,et al.  How are model protein structures distributed in sequence space? , 1997, Biophysical journal.

[13]  Richard A. Goldstein,et al.  Searching for foldable protein structures using optimized energy functions , 1995 .

[14]  N. Madras,et al.  THE SELF-AVOIDING WALK , 2006 .

[15]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[16]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[17]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[18]  Ken A. Dill,et al.  Conformations and Forces in Protein Folding , 1991 .

[19]  Thomas C. Hales Sphere packings, I , 1997, Discret. Comput. Geom..

[20]  Richard A. Goldstein,et al.  Why are some protein structures so common? (tertiary structureyprotein evolutionylattice modelsyfitness landscapesyspin glasses) , 1996 .

[21]  S J de Souza,et al.  Toward a resolution of the introns early/late debate: only phase zero introns are correlated with the structure of ancient proteins. , 1998, Proceedings of the National Academy of Sciences of the United States of America.