Protein Structure Prediction on the Face Centered Cubic Lattice by Local Search

Ab initio protein structure prediction is an important problem for which several algorithms have been developed. Algorithms differ by how they represent 3D protein conformations (on-lattice, off-lattice, coarse-grain or fine-grain model), by the energy model they consider, and whether they are heuristic or exact algorithms. This paper presents a local search algorithm to find the native state for the Hydrophobic-Polar (HP) model on the Face Centered Cubic (FCC) lattice; i.e. a self-avoiding walk on the FCC lattice with maximum number of H-H contacts. The algorithm relies on a randomized, structured initialization, a novel fitness function to guide the search, and efficient data structures to obtain self-avoiding walks. Experimental results on benchmark instances show the efficiency and excellent performance of our algorithm, and illustrate the biological pertinence of the FCC lattice.

[1]  Rolf Backofen,et al.  Excluding Symmetries in Constraint-Based Search , 1999, CP.

[2]  Rolf Backofen,et al.  Algorithmic approach to quantifying the hydrophobic force contribution in protein folding , 1999, German Conference on Bioinformatics.

[3]  Jacek Blazewicz,et al.  Application of tabu search strategy for finding low energy structure of protein , 2005, Artif. Intell. Medicine.

[4]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[5]  Alistair Sinclair,et al.  Algorithms for Random Generation and Counting: A Markov Chain Approach , 1993, Progress in Theoretical Computer Science.

[6]  K Yue,et al.  Folding proteins with a simple energy function and extensive conformational searching , 1996, Protein science : a publication of the Protein Society.

[7]  M. Karplus,et al.  Kinetics of protein folding. A lattice model study of the requirements for folding to the native state. , 1994, Journal of molecular biology.

[8]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[9]  K Yue,et al.  Constraint-based assembly of tertiary protein structures from secondary structure elements. , 2000, Protein science : a publication of the Protein Society.

[10]  E I Shakhnovich,et al.  Computer simulations of prebiotic evolution. , 1997, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[11]  Rolf Backofen,et al.  Excluding Symmetries in Constraint-Based Search , 1999, Constraints.

[12]  Rolf Backofen,et al.  A Constraint-Based Approach to Fast and Exact Structure Prediction in Three-Dimensional Protein Models , 2006, Constraints.

[13]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[14]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[15]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[16]  James M. Crawford,et al.  Symmetry-Breaking Predicates for Search Problems , 1996, KR.

[17]  Wing Hung Wong,et al.  A study of density of states and ground states in hydrophobic-hydrophilic protein folding models by equi-energy sampling. , 2006, The Journal of chemical physics.

[18]  N. Madras,et al.  THE SELF-AVOIDING WALK , 2006 .

[19]  Peter Clote Protein Folding, the Levinthal Paradox and Rapidly Mixing Markov Chains , 1999, ICALP.

[20]  William E. Hart,et al.  Fast Protein Folding in the Hydrophobic-Hydrophillic Model within Three-Eights of Optimal , 1996, J. Comput. Biol..

[21]  E I Shakhnovich,et al.  A test of lattice protein folding algorithms. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[22]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[23]  C. Woese,et al.  On the fundamental nature and evolution of the genetic code. , 1966, Cold Spring Harbor symposia on quantitative biology.

[24]  Sebastian Will Exact, constraint-based structure prediction in simple protein models , 2005 .

[25]  P. Schuster,et al.  RNA folding at elementary step resolution. , 1999, RNA.

[26]  Songde Ma,et al.  Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms , 2003 .

[27]  Lydia Tapia,et al.  Kinetics analysis methods for approximate folding landscapes , 2007, ISMB/ECCB.

[28]  D. Covell,et al.  Conformations of folded proteins in restricted spaces. , 1990, Biochemistry.

[29]  Jun S. Liu,et al.  Biopolymer structure simulation and optimization via fragment regrowth Monte Carlo. , 2007, The Journal of chemical physics.

[30]  Barry Cipra,et al.  Packing Challenge Mastered At Last , 1998, Science.