Constraint Techniques for Solving the Protein Structure Prediction Problem

The protein structure prediction problem is one of the most (if not the most) important problem in computational biology. This problem consists of finding the conformation of a protein (i.e., a sequence of amino-acids) with minimal energy. Because of the complexity of this problem, simplified models like Dill's HP-lattice model [12] have become a major tool for investigating general properties of protein folding. Even for this simplified model, the structure prediction problem has been shown to be NP-complete [3, 5]. We describe a constraint formulation of the HP-model structure prediction problem, present the basic constraints and search strategy. We then introduce a novel, general technique for excluding geometrical symmetries in constraint programming. To our knowledge, this is the first general and declarative technique for excluding symmetries in constraint programming that can be added to an existing implementation. Finally, we describe a new lower bound on the energy of an HP-protein. Both techniques yield an efficient pruning of the search tree.

[1]  M Karplus,et al.  The folding mechanism of larger model proteins: role of native structure. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[2]  E I Shakhnovich,et al.  Computer simulations of prebiotic evolution. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[3]  Erich Bornberg-Bauer,et al.  Chain growth algorithms for HP-type lattice proteins , 1997, RECOMB '97.

[4]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[5]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[6]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[7]  K. Dill,et al.  Cooperativity in protein-folding kinetics. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M. Karplus,et al.  Kinetics of protein folding , 1995, Nature.

[9]  William E. Hart,et al.  Fast Protein Folding in the Hydrophobic-Hydrophillic Model within Three-Eights of Optimal , 1996, J. Comput. Biol..

[10]  J Moult,et al.  Local interactions dominate folding in a simple protein model. , 1996, Journal of molecular biology.

[11]  E I Shakhnovich,et al.  Impact of local and non-local interactions on thermodynamics and kinetics of protein folding. , 1995, Journal of molecular biology.

[12]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[13]  Gert Smolka The Oz Programming Model , 1996 .

[14]  J. Skolnick,et al.  Combined multiple sequence reduced protein model approach to predict the tertiary structure of small proteins. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[15]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[16]  R A Goldstein,et al.  The foldability landscape of model proteins , 1997, Biopolymers.

[17]  M Levitt,et al.  From structure to sequence and back again. , 1996, Journal of molecular biology.

[18]  K Yue,et al.  Forces of tertiary structural organization in globular proteins. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[19]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .