Amino Acids Pattern-Biased Spiral Search for Protein Structure Prediction

Proteins are essentially sequences of amino acids. They adopt specific folded 3-dimensional structures to perform specific tasks. The formation of 3-dimensional structures is largely guided by the constituent amino acids. Therefore, the positional presence of amino acids in a sequence might play important roles during the protein folding process. In this paper, we present a new heuristic derived from the positional patterns of amino acids in a sequence. With the help of a biased tabu tenure, we apply this heuristic within a spiral search algorithm. The spiral search is an efficient algorithm to develop hydrophobic core in a protein structure pulling hydrophobic amino acids towards the core centre in a spiral fashion. On a set of standard benchmark proteins, we experimentally show that applying our new heuristic improves the performance of a spiral search algorithm consistently.

[1]  Ron Unger,et al.  Genetic Algorithm for 3D Protein Folding Simulations , 1993, ICGA.

[2]  Vincenzo Cutello,et al.  An Immune Algorithm for Protein Structure Prediction on Lattice Models , 2007, IEEE Transactions on Evolutionary Computation.

[3]  Abdul Sattar,et al.  The road not taken: retreat and diverge in local search for simplified protein structure prediction , 2013, BMC Bioinformatics.

[4]  C. Levinthal Are there pathways for protein folding , 1968 .

[5]  Abdul Sattar,et al.  Memory-based local search for simplified protein structure prediction , 2012, BCB.

[6]  El-Ghazali Talbi,et al.  A grid-based genetic algorithm combined with an adaptive simulated annealing for protein structure prediction , 2008, Soft Comput..

[7]  Pascal Van Hentenryck,et al.  Protein Structure Prediction on the Face Centered Cubic Lattice by Local Search , 2008, AAAI.

[8]  Sue Whitesides,et al.  A complete and effective move set for simplified protein folding , 2003, RECOMB '03.

[9]  Joe Marks,et al.  Human-guided tabu search , 2002, AAAI/IAAI.

[10]  Hans-Joachim Böckenhauer,et al.  A Local Move Set for Protein Folding in Triangular Lattice Models , 2008, WABI.

[11]  Pascal Van Hentenryck,et al.  On Lattice Protein Structure Prediction Revisited , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[13]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[14]  Abdul Sattar,et al.  Spiral search: a hydrophobic-core directed local search for simplified PSP on 3D FCC lattice , 2013, BMC Bioinformatics.

[15]  Rolf Backofen,et al.  CPSP-web-tools: a server for 3D lattice protein studies , 2009, Bioinform..

[16]  Abdul Sattar,et al.  Random-walk: a stagnation recovery technique for simplified protein structure prediction , 2012, BCB '12.

[17]  Kathleen Steinhöfel,et al.  Protein Folding Simulation by Two-Stage Optimization , 2009 .

[18]  T. Hales The Kepler conjecture , 1998, math/9811078.

[19]  Andrew Lewis,et al.  DFS-generated pathways in GA crossover for protein structure prediction , 2010, Neurocomputing.

[20]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[21]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[22]  Andrew Lewis,et al.  Twin Removal in Genetic Algorithms for Protein Structure Prediction Using Low-Resolution Model , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[23]  Christian Blum,et al.  Ant colony optimization: Introduction and recent trends , 2005 .

[24]  Rita Casadio,et al.  Algorithms in Bioinformatics, 5th International Workshop, WABI 2005, Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, WABI.

[25]  Kathleen Steinhöfel,et al.  A hybrid approach to protein folding problem integrating constraint programming with local search , 2010, BMC Bioinformatics.

[26]  Yong Liu,et al.  Computational Intelligence and Intelligent Systems , 2011 .

[27]  Rolf Backofen,et al.  A Constraint-Based Approach to Fast and Exact Structure Prediction in Three-Dimensional Protein Models , 2006, Constraints.

[28]  Rolf Backofen,et al.  CPSP-tools – Exact and complete algorithms for high-throughput 3D lattice protein studies , 2008, BMC Bioinformatics.

[29]  Federico Fogolari,et al.  Amino acid empirical contact energy definitions for fold recognition in the space of contact maps , 2003, BMC Bioinformatics.

[30]  Songde Ma,et al.  Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms , 2003 .

[31]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[32]  Abdul Sattar,et al.  Protein folding prediction in 3D FCC HP lattice model using genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[33]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[34]  Erik D. Goodman,et al.  A Standard GA Approach to Native Protein Conformation Prediction , 1995 .

[35]  Holger H. Hoos,et al.  A replica exchange Monte Carlo algorithm for protein folding in the HP model , 2007, BMC Bioinformatics.