Evolutionary Algorithms for the Protein Folding Problem: A Review and Current Trends

Proteins are complex macromolecules that perform vital functions in all living beings. They are composed of a chain of amino acids. The biological function of a protein is determined by the way it is folded into a specific tri-dimensional structure, known as native conformation. Understanding how proteins fold is of great importance to Biology, Biochemistry and Medicine. Considering the full analytic atomic model of a protein, it is still not possible to determine the exact tri-dimensional structure of real-world proteins, even with the most powerful computational resources. To reduce the computational complexity of the analytic model, many simplified models have been proposed. Even the simplest one, the bi-dimensional Hydrophobic-Polar (2D-HP) model (see Sect. 12.2.2), was proved to be intractable due to its NP-completeness. The current approach for studying the structure of proteins is the use of heuristic methods that, however, do not guarantee the optimal solution. Evolutionary computation techniques have been proved to be efficient for many engineering and computer science problems. This is also the case of unveiling the structure of proteins using simple lattice models.

[1]  E. Shakhnovich,et al.  Engineering of stable and fast-folding sequences of model proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[2]  B. Honig Protein folding: from the levinthal paradox to structure prediction. , 1999, Journal of molecular biology.

[3]  Giancarlo Mauri,et al.  Approximation algorithms for protein folding prediction , 1999, SODA '99.

[4]  Erik D. Goodman,et al.  A Standard GA Approach to Native Protein Conformation Prediction , 1995 .

[5]  Heitor Silvério Lopes,et al.  An Enhanced Genetic Algorithm for Protein Structure Prediction Using the 2D Hydrophobic-Polar Model , 2005, Artificial Evolution.

[6]  William E. Hart,et al.  Protein structure prediction with evolutionary algorithms , 1999 .

[7]  Madhu Chetty,et al.  A Guided Genetic Algorithm for Protein Folding Prediction Using 3D Hydrophobic-Hydrophilic Model , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[8]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[9]  Christian N. S. Pedersen,et al.  Protein Folding in the 2D HP Model , 1999 .

[10]  William E. Hart,et al.  Lattice and Off-Lattice Side Chain Models of Protein Folding: Linear Time Structure Prediction Better than 86% of Optimal , 1997, J. Comput. Biol..

[11]  Julian F. Miller,et al.  Genetic and Evolutionary Computation — GECCO 2003 , 2003, Lecture Notes in Computer Science.

[12]  W. Wong,et al.  Evolutionary Monte Carlo for protein folding simulations , 2001 .

[13]  Gary B. Lamont,et al.  Protein structure prediction by applying an evolutionary algorithm , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[14]  Heitor Silvério Lopes,et al.  A Differential Evolution Approach for Protein Folding , 2006, 2006 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology.

[15]  Giancarlo Mauri,et al.  Application of Evolutionary Algorithms to Protein Folding Prediction , 1997, Artificial Evolution.

[16]  William E. Hart,et al.  Fast Protein Folding in the Hydrophobic-Hydrophillic Model within Three-Eights of Optimal , 1996, J. Comput. Biol..

[17]  Aviezri S. Fraenkel,et al.  Complexity of protein folding , 1993 .

[18]  Lorna J. Smith,et al.  Understanding protein folding via free-energy surfaces from theory and experiment. , 2000, Trends in biochemical sciences.

[19]  Joe Marks,et al.  Computational Complexity, Protein Structure Prediction, and the Levinthal Paradox , 1994 .

[20]  Alantha Newman A new algorithm for protein folding in the HP model , 2002, SODA '02.

[21]  Vijay Chandru,et al.  The algorithmics of folding proteins on lattices , 2003, Discret. Appl. Math..

[22]  Albert Y. Zomaya,et al.  Parallel Ant Colony Optimization for 3D Protein Structure Prediction using the HP Lattice Model , 2005, IPDPS.

[23]  N. Wingreen,et al.  NATURE OF DRIVING FORCE FOR PROTEIN FOLDING : A RESULT FROM ANALYZING THE STATISTICAL POTENTIAL , 1995, cond-mat/9512111.

[24]  Burak Erman,et al.  Minimum Energy Configurations of the 2-Dimensional HP-Model of Proteins by Self-Organizing Networks , 2002, J. Comput. Biol..

[25]  Helio J. C. Barbosa,et al.  Investigation of the three-dimensional lattice HP protein folding model using a genetic algorithm , 2004 .

[26]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[27]  Vincenzo Cutello,et al.  A Class of Pareto Archived Evolution Strategy Algorithms Using Immune Inspired Operators for Ab-Initio Protein Structure Prediction , 2005, EvoWorkshops.

[28]  Edmund K. Burke,et al.  Multimeme Algorithms for Protein Structure Prediction , 2002, PPSN.

[29]  M Karplus,et al.  The fundamentals of protein folding: bringing together theory and experiment. , 1999, Current opinion in structural biology.

[30]  Weiguo Liu,et al.  Mapping of Genetic Algorithms for Protein Folding onto Computational Grids , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[31]  Uri Zwick,et al.  Spatial codes and the hardness of string folding problems , 1998, SODA '98.

[32]  Ron Unger,et al.  On the applicability of genetic algorithms to protein folding , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[33]  H Chen,et al.  Secondary-structure-favored hydrophobic-polar lattice model of protein folding. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  P Argos,et al.  Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions. , 1996, Journal of molecular biology.

[35]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[36]  Holger H. Hoos,et al.  An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem , 2005, BMC Bioinformatics.

[37]  Steffen Schulze-Kremer,et al.  Parameterizing genetic algorithms for protein folding simulation , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[38]  Vincenzo Cutello,et al.  An Immune Algorithm for Protein Structure Prediction on Lattice Models , 2007, IEEE Transactions on Evolutionary Computation.

[39]  Binhai Zhu,et al.  Protein Folding on the Hexagonal Lattice in the Hp Model , 2005, J. Bioinform. Comput. Biol..

[40]  Heitor Silvério Lopes,et al.  Reconfigurable Computing for Accelerating Protein Folding Simulations , 2007, ARC.

[41]  B Honig,et al.  Adding backbone to protein folding: why proteins are polypeptides. , 1996, Folding & design.

[42]  Juan Julián Merelo Guervós,et al.  Parallel Problem Solving from Nature — PPSN VII , 2002, Lecture Notes in Computer Science.

[43]  Heitor Silvério Lopes,et al.  A Hybrid Genetic Algorithm for the Protein Folding Problem Using the 2D-HP Lattice Model , 2008 .

[44]  Andrew G. Glen,et al.  APPL , 2001 .

[45]  Yong Duan,et al.  Computational protein folding: From lattice to all-atom , 2001, IBM Syst. J..

[46]  Lam Thu Bui,et al.  Success in Evolutionary Computation , 2008 .

[47]  D. Osguthorpe Ab initio protein folding. , 2000, Current opinion in structural biology.

[48]  Volker Heun,et al.  Approximate protein folding in the HP side chain model on extended cubic lattices , 1999, Discret. Appl. Math..

[49]  Garrison W. Greenwood,et al.  A survey of recent work on evolutionary approaches to the protein folding problem , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[50]  Jiaxing Cheng,et al.  A Novel Genetic Algorithm for HP Model Protein Folding , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[51]  J Moult,et al.  Molecular dynamics study of the structure and dynamics of a protein molecule in a crystalline ionic environment, Streptomyces griseus protease A. , 1990, Biochemistry.

[52]  Chao Tang,et al.  Simple models of the protein folding problem , 1999, cond-mat/9912450.

[53]  Graham Kendall,et al.  Advanced Population Diversity Measures in Genetic Programming , 2002, PPSN.

[54]  Jiaxing Cheng,et al.  Protein 3D HP model folding simulation based on ACO , 2006, Sixth International Conference on Intelligent Systems Design and Applications.

[55]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[56]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[57]  J T Ngo,et al.  Computational complexity of a problem in molecular structure prediction. , 1992, Protein engineering.

[58]  William E. Hart,et al.  On the Intractability of Protein Folding with a Finite Alphabet of Amino Acids , 1999, Algorithmica.

[59]  J Moult,et al.  Protein folding simulations with genetic algorithms and a detailed molecular description. , 1997, Journal of molecular biology.

[60]  K A Dill,et al.  Local and nonlocal interactions in globular proteins and mechanisms of alcohol denaturation , 1993, Protein science : a publication of the Protein Society.

[61]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[62]  M R Lee,et al.  State of the art in studying protein folding and protein structure prediction using molecular dynamics methods. , 2001, Journal of molecular graphics & modelling.

[63]  Z. Luthey-Schulten,et al.  Ab initio protein structure prediction. , 2002, Current opinion in structural biology.

[64]  Luonan Chen,et al.  Unique Optimal Foldings of Proteins on a Triangular Lattice , 2005, Applied bioinformatics.

[65]  Wen-Qi Huang,et al.  A Branch and Bound Algorithm for the Protein Folding Problem in the HP Lattice Model , 2005, Genomics, proteomics & bioinformatics.

[66]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[67]  J. Onuchic,et al.  Folding kinetics of proteinlike heteropolymers , 1994, cond-mat/9404001.

[68]  T. Dandekar,et al.  Improving genetic algorithms for protein folding simulations by systematic crossover. , 1999, Bio Systems.

[69]  Erich Bornberg-Bauer,et al.  Chain growth algorithms for HP-type lattice proteins , 1997, RECOMB '97.

[70]  Eugene Santos,et al.  Reducing the computational load of energy evaluations for protein folding , 2004, Proceedings. Fourth IEEE Symposium on Bioinformatics and Bioengineering.

[71]  David W. Corne,et al.  Use of a novel Hill-climbing genetic algorithm in protein folding simulations , 2003, Comput. Biol. Chem..

[72]  Songde Ma,et al.  Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms , 2003 .

[73]  Rolf Backofen,et al.  Application of constraint programming techniques for structure prediction of lattice proteins with extended alphabets , 1999, Bioinform..

[74]  Carolina P. de Almeida,et al.  A Hybrid Immune-Based System for the Protein Folding Problem , 2007, EvoCOP.

[75]  Jens Gottlieb,et al.  Evolutionary Computation in Combinatorial Optimization , 2006, Lecture Notes in Computer Science.

[76]  Roy L. Johnston,et al.  Development and optimisation of a novel genetic algorithm for studying model protein folding , 2004 .

[77]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[78]  Lei-Han Tang,et al.  Designability and cooperative folding in a four-letter hydrophobic-polar model of proteins. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[79]  Rolf Drechsler,et al.  Applications of Evolutionary Computing, EvoWorkshops 2008: EvoCOMNET, EvoFIN, EvoHOT, EvoIASP, EvoMUSART, EvoNUM, EvoSTOC, and EvoTransLog, Naples, Italy, March 26-28, 2008. Proceedings , 2008, EvoWorkshops.