Convex Global Underestimation for Molecular Structure Prediction

Key problems in computational biology, including protein and RNA folding and drug docking, involve conformational searching. Current search methods — Monte Carlo, Molecular Dynamics, Simulated Annealing, and Genetic Algorithms — are too slow for protein folding by many orders of magnitude. They get stuck in kinetic traps. We describe a global optimization method, the CGU method, which appears to be very promising. We know the method always finds the same conformation from 100 different starting points, indicating that it finds the unique global minimum for the many different sequences we have tried. We know the CGU doesn’t get stuck in kinetic traps because the search time is independent of the shapes of the landscapes (amino acid sequence and composition). We know that the method is much faster than a standard Simulated Annealing algorithm that we have tested: the SA method doesn’t find global minima for chains longer than 10 residues, and the performance advantage of the CGU method increases with chain length. And computational results show that the computer time scales with n 4 where n is the number of degrees of freedom, and we consistently reach the global minimum of the model energy function for PPT, a 36-amino acid peptide (n = 72), in less than 3 hours on a 32 processor Cray T3E.

[1]  C. DeLisi,et al.  Necessary conditions for avoiding incorrect polypeptide folds in conformational search by energy minimization , 1993, Biopolymers.

[2]  Daniel R. Ripoll,et al.  A parallel Monte Carlo search algorithm for the conformational analysis of proteins , 1990, Proceedings SUPERCOMPUTING '90.

[3]  Laxmikant V. Kalé,et al.  NAMD: a Parallel, Object-Oriented Molecular Dynamics Program , 1996, Int. J. High Perform. Comput. Appl..

[4]  D G Covell Folding protein alpha-carbon chains into compact forms by Monte Carlo methods. , 1992, Proteins.

[5]  H. Scheraga,et al.  Monte Carlo-minimization approach to the multiple-minima problem in protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[6]  A. Wallqvist,et al.  A simplified amino acid potential for use in structure predictions of proteins , 1994, Proteins.

[7]  E. O'Toole,et al.  Monte Carlo simulation of folding transitions of simple model proteins using a chain growth algorithm , 1992 .

[8]  H. Scheraga,et al.  Application of the diffusion equation method for global optimization to oligopeptides , 1992 .

[9]  K. Dill,et al.  A simple protein folding algorithm using a binary code and secondary structure constraints. , 1995, Protein engineering.

[10]  M J Sippl,et al.  Assembly of polypeptide and protein backbone conformations from low energy ensembles of short fragments: Development of strategies and construction of models for myoglobin, lysozyme, and thymosin β4 , 1992, Protein science : a publication of the Protein Society.

[11]  J. Onuchic,et al.  Protein folding funnels: a kinetic approach to the sequence-structure relationship. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  I. Kuntz,et al.  Calculation of protein tertiary structure. , 1976, Journal of molecular biology.

[13]  Stillinger Role of potential-energy scaling in the low-temperature relaxation behavior of amorphous materials. , 1985, Physical review. B, Condensed matter.

[14]  M. Struthers,et al.  Design of a Monomeric 23-Residue Polypeptide with Defined Tertiary Structure , 1996, Science.

[15]  D. Covell,et al.  Lattice model simulations of polypeptide chain folding. , 1994, Journal of molecular biology.

[16]  R. Rajagopalan,et al.  A lattice model for solid-state sintering simple particle arrays , 1995 .

[17]  J. Onuchic,et al.  Folding kinetics of proteinlike heteropolymers , 1994, cond-mat/9404001.

[18]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[19]  D. Covell Folding protein α‐carbon chains into compact forms by monte carlo methods , 1992 .

[20]  K. Dill,et al.  A fast conformational search strategy for finding low energy structures of model proteins , 1996, Protein science : a publication of the Protein Society.

[21]  H. Scheraga,et al.  Performance of the diffusion equation method in searches for optimum structures of clusters of Lennard-Jones atoms , 1991 .

[22]  J. Ben Rosen,et al.  Protein structure prediction and potential energy landscape analysis using continuous global minimization , 1997, RECOMB '97.

[23]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[24]  Ken A. Dill,et al.  Molecular Structure Prediction by Global Optimization , 1997 .

[25]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[26]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[27]  J. Ben Rosen,et al.  Protein Structure and Energy Landscape Dependence on Sequence Using a Continuous Energy Function , 1997, J. Comput. Biol..

[28]  J. Kostrowicki,et al.  Diffusion equation method of global minimization: Performance for standard test functions , 1991 .

[29]  L. Ingber Very fast simulated re-annealing , 1989 .

[30]  Eugene I. Shakhnovich,et al.  Enumeration of all compact conformations of copolymers with random sequence of links , 1990 .

[31]  A. Kolinski,et al.  Simulations of the Folding of a Globular Protein , 1990, Science.

[32]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[33]  J. Onuchic,et al.  Navigating the folding routes , 1995, Science.

[34]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[35]  S. Doniach,et al.  A computer model to dynamically simulate protein folding: Studies with crambin , 1989, Proteins.

[36]  S. Sun,et al.  Reduced representation model of protein structure prediction: Statistical potential and genetic algorithms , 1993, Protein science : a publication of the Protein Society.

[37]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[38]  K. M. Crenell Molecular dynamics on parallel computers , 1991 .

[39]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[40]  C. Brooks,et al.  First-principles calculation of the folding free energy of a three-helix bundle protein. , 1995, Science.

[41]  Karplus,et al.  Protein folding bottlenecks: A lattice Monte Carlo simulation. , 1991, Physical review letters.

[42]  B Honig,et al.  An algorithm to generate low-resolution protein tertiary structures from knowledge of secondary structure. , 1994, Proceedings of the National Academy of Sciences of the United States of America.