Rotation crossover and K-site move mutation for evolutionary protein folding in 3D FCC HP model (preliminary version)

In this paper we present a new evolutionary algorithm for the protein folding problem. We study the problem in the 3D FCC HP model which has been widely used in previous research. Our focus is to develop evolutionary algorithms (EA) which are robust, easy to operate and can handle various energy functions. We propose lattice rotation for crossover and K-site move for mutation, which form the key components of our evolutionary algorithms. Experiment shows that our algorithms are able to find minimum-energy conformations for many sequences whose optimal conformations are not found in previous EA-based algorithms. Furthermore, our idea can be easily integrated into Monte Carlo and Tabu searches as approaches for local searches.

[1]  Hsiao-Ping Hsu,et al.  Growth-based optimization algorithm for lattice heteropolymers. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  M. Levitt,et al.  The complexity and accuracy of discrete state models of protein structure. , 1995, Journal of molecular biology.

[3]  Jeffrey Kovac,et al.  Effect of bead movement rules on the relaxation of cubic lattice models of polymer chains , 1983 .

[4]  Zhihong He,et al.  Protein folding simulations of 2D HP model by the genetic algorithm based on optimal secondary structures , 2010, Comput. Biol. Chem..

[5]  William E. Hart,et al.  Lattice and off-lattice side chain models of protein folding (extended abstract): linear time structure prediction better than 86% of optimal , 1997, RECOMB '97.

[6]  Holger H. Hoos,et al.  A replica exchange Monte Carlo algorithm for protein folding in the HP model , 2007, BMC Bioinformatics.

[7]  Jyh-Jong Tsay,et al.  Ab initio protein structure prediction based on memetic algorithm and 3D FCC lattice model , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[8]  Alessandro Dal Palù,et al.  Constraint Logic Programming approach to protein structure prediction , 2004, BMC Bioinformatics.

[9]  Andrew Lewis,et al.  DFS-generated pathways in GA crossover for protein structure prediction , 2010, Neurocomputing.

[10]  Rolf Backofen,et al.  A Constraint-Based Approach to Fast and Exact Structure Prediction in Three-Dimensional Protein Models , 2006, Constraints.

[11]  Holger H. Hoos,et al.  An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem , 2005, BMC Bioinformatics.

[12]  Abdul Sattar,et al.  Protein folding prediction in 3D FCC HP lattice model using genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[13]  R L Jernigan,et al.  Ideal architecture of residue packing and its observation in protein structures , 1997, Protein science : a publication of the Protein Society.

[14]  Rolf Backofen,et al.  CPSP-web-tools: a server for 3D lattice protein studies , 2009, Bioinform..

[15]  William E. Hart,et al.  Lattice and Off-Lattice Side Chain Models of Protein Folding: Linear Time Structure Prediction Better than 86% of Optimal , 1997, J. Comput. Biol..

[16]  Rolf Backofen,et al.  CPSP-tools – Exact and complete algorithms for high-throughput 3D lattice protein studies , 2008, BMC Bioinformatics.

[17]  Walter H. Stockmayer,et al.  Monte Carlo Calculations on the Dynamics of Polymers in Dilute Solution , 1962 .

[18]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[19]  Sun-Yuan Hsieh,et al.  A New Branch and Bound Method for the Protein Folding Problem Under the 2D-HP Model , 2011, IEEE Transactions on NanoBioscience.

[20]  Sue Whitesides,et al.  A complete and effective move set for simplified protein folding , 2003, RECOMB '03.

[21]  Hans-Joachim Böckenhauer,et al.  A Local Move Set for Protein Folding in Triangular Lattice Models , 2008, WABI.

[22]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[23]  Kathleen Steinhöfel,et al.  Protein Folding Simulation by Two-Stage Optimization , 2009 .

[24]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[25]  Andrew Lewis,et al.  DFS Based Partial Pathways in GA for Protein Structure Prediction , 2008, PRIB.

[26]  Pascal Van Hentenryck,et al.  On Lattice Protein Structure Prediction Revisited , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[27]  Cheng-Jian Lin,et al.  An effective hybrid of hill climbing and genetic algorithm for 2D triangular protein structure prediction , 2011, Proteome Science.

[28]  Andrew Lewis,et al.  Twin Removal in Genetic Algorithms for Protein Structure Prediction Using Low-Resolution Model , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[29]  P. Schuster,et al.  Discrete Models of Biopolymers , 2007 .