Protein structure prediction with the 3D-HP side-chain model using a master–slave parallel genetic algorithm

This work presents a master-slave parallel genetic algorithm for the protein folding problem, using the 3D-HP side-chain model (3D-HP-SC). This model is sparsely studied in the literature, although more expressive than other lattice models. The fitness function proposed includes information not only about the free-energy of the conformation, but also compactness of the side-chains. Since there is no benchmark available to date for this model, a set of 15 sequences was used, based on a simpler model. Results show that the parallel GA achieved a good level of efficiency and obtained biologically coherent results, suggesting the adequacy of the methodology. Future work will include new biologically-inspired genetic operators and more experiments to create new benchmarks.

[1]  Heitor Silvério Lopes Evolutionary Algorithms for the Protein Folding Problem: A Review and Current Trends , 2008, Computational Intelligence in Biomedicine and Bioinformatics.

[2]  Heitor Silvério Lopes,et al.  Self-Adapting Evolutionary Parameters: Encoding Aspects for Combinatorial Optimization Problems , 2005, EvoCOP.

[3]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[4]  Robert B. Ross,et al.  Using MPI-2: Advanced Features of the Message Passing Interface , 2003, CLUSTER.

[5]  Heitor Silvério Lopes,et al.  A Hybrid Genetic Algorithm for the Protein Folding Problem Using the 2D-HP Lattice Model , 2008 .

[6]  Zbigniew Michalewicz,et al.  Parameter Setting in Evolutionary Algorithms , 2007, Studies in Computational Intelligence.

[7]  Mai Suan Li,et al.  Folding in lattice models with side chains , 2002, cond-mat/0211348.

[8]  Heitor Silvério Lopes,et al.  Particle Swarm Optimization for the Multidimensional Knapsack Problem , 2007, ICANNGA.

[9]  Lam Thu Bui,et al.  Success in Evolutionary Computation , 2008 .

[10]  Heitor Silvério Lopes,et al.  An Enhanced Genetic Algorithm for Protein Structure Prediction Using the 2D Hydrophobic-Polar Model , 2005, Artificial Evolution.

[11]  William E. Hart,et al.  Protein structure prediction with evolutionary algorithms , 1999 .

[12]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[13]  Jiaxing Cheng,et al.  A Novel Genetic Algorithm for HP Model Protein Folding , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[14]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[15]  Heitor Silvério Lopes,et al.  A parallel genetic algorithm for protein folding prediction using the 3D-HP Side Chain model , 2009, 2009 IEEE Congress on Evolutionary Computation.

[16]  Aboul Ella Hassanien,et al.  Computational Intelligence in Biomedicine and Bioinformatics, Current Trends and Applications , 2008, Computational Intelligence in Biomedicine and Bioinformatics.

[17]  Erick Cantú-Paz,et al.  Efficient and Accurate Parallel Genetic Algorithms , 2000, Genetic Algorithms and Evolutionary Computation.

[18]  Helio J. C. Barbosa,et al.  Investigation of the three-dimensional lattice HP protein folding model using a genetic algorithm , 2004 .

[19]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[20]  Ron Unger,et al.  Genetic Algorithm for 3D Protein Folding Simulations , 1993, ICGA.

[21]  Heitor Silvério Lopes,et al.  Reconfigurable Computing for Accelerating Protein Folding Simulations , 2007, ARC.

[22]  William E. Hart,et al.  On the Intractability of Protein Folding with a Finite Alphabet of Amino Acids , 1999, Algorithmica.

[23]  Rolf Apweiler,et al.  UniProt archive , 2004, Bioinform..

[24]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[25]  Joseph G. Pigeon,et al.  Statistics for Experimenters: Design, Innovation and Discovery , 2006, Technometrics.

[26]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[27]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[28]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..