Mixing Energy Models in Genetic Algorithms for On-Lattice Protein Structure Prediction

Protein structure prediction (PSP) is computationally a very challenging problem. The challenge largely comes from the fact that the energy function that needs to be minimised in order to obtain the native structure of a given protein is not clearly known. A high resolution 20 × 20 energy model could better capture the behaviour of the actual energy function than a low resolution energy model such as hydrophobic polar. However, the fine grained details of the high resolution interaction energy matrix are often not very informative for guiding the search. In contrast, a low resolution energy model could effectively bias the search towards certain promising directions. In this paper, we develop a genetic algorithm that mainly uses a high resolution energy model for protein structure evaluation but uses a low resolution HP energy model in focussing the search towards exploring structures that have hydrophobic cores. We experimentally show that this mixing of energy models leads to significant lower energy structures compared to the state-of-the-art results.

[1]  Abdul Sattar,et al.  Spiral search: a hydrophobic-core directed local search for simplified PSP on 3D FCC lattice , 2013, BMC Bioinformatics.

[2]  Kathleen Steinhöfel,et al.  Protein Folding Simulation by Two-Stage Optimization , 2009 .

[3]  Abdul Sattar,et al.  A New Genetic Algorithm for Simplified Protein Structure Prediction , 2012, Australasian Conference on Artificial Intelligence.

[4]  Abdul Sattar,et al.  The road not taken: retreat and diverge in local search for simplified protein structure prediction , 2013, BMC Bioinformatics.

[5]  Alessandro Dal Palù,et al.  Exploring Protein Fragment Assembly Using CLP , 2011, IJCAI.

[6]  T. Hales The Kepler conjecture , 1998, math/9811078.

[7]  Rabiah Ahmad,et al.  Communications in Computer and Information Science , 2010 .

[8]  Z. Luthey-Schulten,et al.  Ab initio protein structure prediction. , 2002, Current opinion in structural biology.

[9]  Ron Unger,et al.  Genetic Algorithm for 3D Protein Folding Simulations , 1993, ICGA.

[10]  Vincenzo Cutello,et al.  An Immune Algorithm for Protein Structure Prediction on Lattice Models , 2007, IEEE Transactions on Evolutionary Computation.

[11]  Ivan Kondov,et al.  Protein structure prediction using particle swarm optimization and a distributed parallel approach , 2011, BADS '11.

[12]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[13]  Pascal Van Hentenryck,et al.  Protein Structure Prediction on the Face Centered Cubic Lattice by Local Search , 2008, AAAI.

[14]  Sue Whitesides,et al.  A complete and effective move set for simplified protein folding , 2003, RECOMB '03.

[15]  El-Ghazali Talbi,et al.  A grid-based genetic algorithm combined with an adaptive simulated annealing for protein structure prediction , 2008, Soft Comput..

[16]  Hans-Joachim Böckenhauer,et al.  A Local Move Set for Protein Folding in Triangular Lattice Models , 2008, WABI.

[17]  Andrew Lewis,et al.  Twin Removal in Genetic Algorithms for Protein Structure Prediction Using Low-Resolution Model , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  Abdul Sattar,et al.  Mixed Heuristic Local Search for Protein Structure Prediction , 2013, AAAI.

[19]  Christian Blum,et al.  Ant colony optimization: Introduction and recent trends , 2005 .

[20]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[21]  C. Levinthal Are there pathways for protein folding , 1968 .

[22]  Kathleen Steinhöfel,et al.  Population-based local search for protein folding simulation in the MJ energy model and cubic lattices , 2009, Comput. Biol. Chem..

[23]  Abdul Sattar,et al.  Random-walk: a stagnation recovery technique for simplified protein structure prediction , 2012, BCB '12.

[24]  Abdul Sattar,et al.  Protein folding prediction in 3D FCC HP lattice model using genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[25]  Abdul Sattar,et al.  An efficient encoding for simplified protein structure prediction using genetic algorithms , 2013, 2013 IEEE Congress on Evolutionary Computation.

[26]  Joe Marks,et al.  Human-guided tabu search , 2002, AAAI/IAAI.

[27]  Rolf Backofen,et al.  CPSP-tools – Exact and complete algorithms for high-throughput 3D lattice protein studies , 2008, BMC Bioinformatics.

[28]  Federico Fogolari,et al.  Amino acid empirical contact energy definitions for fold recognition in the space of contact maps , 2003, BMC Bioinformatics.

[29]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[30]  R Samudrala,et al.  Ab initio construction of protein tertiary structures using a hierarchical approach. , 2000, Journal of molecular biology.

[31]  Kathleen Steinhöfel,et al.  A hybrid approach to protein folding problem integrating constraint programming with local search , 2010, BMC Bioinformatics.

[32]  Alessandro Dal Palù,et al.  Heuristics, optimizations, and parallelism for protein structure prediction in CLP(FD) , 2005, PPDP '05.

[33]  Abdul Sattar,et al.  A local search embedded genetic algorithm for simplified protein structure prediction , 2013, 2013 IEEE Congress on Evolutionary Computation.

[34]  C. Dobson Protein folding and misfolding , 2003, Nature.

[35]  Alessandro Dal Palù,et al.  A constraint solver for discrete lattices, its parallelization, and application to protein structure prediction , 2007, Softw. Pract. Exp..

[36]  Abdul Sattar,et al.  Memory-based local search for simplified protein structure prediction , 2012, BCB.

[37]  Abdul Sattar,et al.  Collaborative Parallel Local Search for Simplified Protein Structure Prediction , 2013, 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications.

[38]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[39]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[40]  Fernando Niño,et al.  A novel ab-initio genetic-based approach for protein folding prediction , 2007, GECCO '07.

[41]  S. Colowick,et al.  Methods in Enzymology , Vol , 1966 .

[42]  Holger H. Hoos,et al.  A replica exchange Monte Carlo algorithm for protein folding in the HP model , 2007, BMC Bioinformatics.

[43]  Alessandro Dal Palù,et al.  Constraint Logic Programming approach to protein structure prediction , 2004, BMC Bioinformatics.

[44]  Richard Bonneau,et al.  Ab initio protein structure prediction: progress and prospects. , 2001, Annual review of biophysics and biomolecular structure.

[45]  Nashat Mansour,et al.  Protein structure prediction in the 3D HP model , 2009, 2009 IEEE/ACS International Conference on Computer Systems and Applications.

[46]  So Much More to Know … , 2005, Science.

[47]  Pascal Van Hentenryck,et al.  On Lattice Protein Structure Prediction Revisited , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[48]  Adam Smith Protein misfolding , 2003, Nature.