DFS Based Partial Pathways in GA for Protein Structure Prediction

Nondeterministic conformational search techniques, such as Genetic Algorithms (GAs) are promising for solving protein structure prediction (PSP) problem. The crossover operator of a GA can underpin the formation of potential conformations by exchanging and sharing potential sub-conformations, which is promising for solving PSP. However, the usual nature of an optimum PSP conformation being compact can produce many invalid conformations (by having non-self-avoiding-walk) using crossover. While a crossover-based converging conformation suffers from limited pathways, combining it with depth-first search (DFS) can partially reveal potential pathways. DFS generates random conformations increasingly quickly with increasing length of the protein sequences compared to random-move-only-based conformation generation. Random conformations are frequently applied for maintaining diversity as well as for initialization in many GA variations.

[1]  W. Wong,et al.  Evolutionary Monte Carlo for protein folding simulations , 2001 .

[2]  L Toma,et al.  Folding simulation of protein models on the structure‐based cubo‐octahedral lattice with the Contact Interactions algorithm , 1999, Protein science : a publication of the Protein Society.

[3]  Erich Bornberg-Bauer,et al.  Recombinatoric exploration of novel folded structures: A heteropolymer-based model of protein evolutionary landscapes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Tamjidul Hoque,et al.  Significance of Hybrid Evolutionary Computation for Ab Initio Protein Folding Prediction , 2007 .

[5]  O. Schueler‐Furman,et al.  Progress in Modeling of Protein Structures and Interactions , 2005, Science.

[6]  O. Takahashi,et al.  Protein Folding by a Hierarchical Genetic Algorithm , 1999 .

[7]  Randy L. Haupt,et al.  Practical Genetic Algorithms , 1998 .

[8]  Jim Smith,et al.  Study of fitness landscapes for the HP model of protein structure prediction , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[9]  Mihalis Yannakakis,et al.  On the complexity of protein folding (extended abstract) , 1998, STOC '98.

[10]  Rolf Backofen,et al.  Algorithmic approach to quantifying the hydrophobic force contribution in protein folding , 1999, German Conference on Bioinformatics.

[11]  Jooyoung Lee,et al.  New optimization method for conformational energy calculations on polypeptides: Conformational space annealing , 1997, J. Comput. Chem..

[12]  P. Grassberger,et al.  Testing a new Monte Carlo algorithm for protein folding , 1997, Proteins.

[13]  Andrzej Kolinski,et al.  Computational studies of protein folding , 2001, Comput. Sci. Eng..

[14]  Anthony J. Guttmann,et al.  Self-avoiding walks in constrained and random geometries: Series studies , 2005 .

[15]  Jens Meiler,et al.  Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation , 2003, Proteins.

[16]  Madhu Chetty,et al.  Non-Isomorphic Coding in Lattice Model and its Impact for Protein Folding Prediction Using Genetic Algorithm , 2006, 2006 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology.

[17]  Madhu Chetty,et al.  Generalized Schemata Theorem Incorporating Twin Removal for Protein Structure Prediction , 2007, PRIB.

[18]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[19]  Holger H. Hoos,et al.  An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem , 2005, BMC Bioinformatics.

[20]  Keri Schreiner News: Distributed Projects Tackle Protein Mystery , 2001, IEEE Distributed Syst. Online.

[21]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[22]  Richard Bonneau,et al.  Rosetta in CASP4: Progress in ab initio protein structure prediction , 2001, Proteins.

[23]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[24]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[25]  S. Toma,et al.  Contact interactions method: A new algorithm for protein folding simulations , 1996, Protein science : a publication of the Protein Society.

[26]  Abdul Sattar,et al.  Protein folding prediction in 3D FCC HP lattice model using genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[27]  Madhu Chetty,et al.  A Hybrid Genetic Algorithm for 2D FCC Hydrophobic-Hydrophilic Lattice Model to Predict Protein Folding , 2006, Australian Conference on Artificial Intelligence.

[28]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[29]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[30]  Michael Bachmann,et al.  Exact enumeration of three-dimensional lattice proteins , 2005, Comput. Phys. Commun..

[31]  K. Lin,et al.  Universal amplitude ratios for three-dimensional self-avoiding walks , 2002 .

[32]  Songde Ma,et al.  Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms , 2003 .

[33]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[34]  Madhu Chetty,et al.  A new guided genetic algorithm for 2D hydrophobic-hydrophilic model to predict protein folding , 2005, 2005 IEEE Congress on Evolutionary Computation.

[35]  S Banu Ozkan,et al.  The protein folding problem: when will it be solved? , 2007, Current opinion in structural biology.

[36]  V. K. Koumousis,et al.  A saw-tooth genetic algorithm combining the effects of variable population size and reinitialization to enhance performance , 2006, IEEE Transactions on Evolutionary Computation.

[37]  H A Scheraga,et al.  A comparison of the CHARMM, AMBER and ECEPP potentials for peptides. I. Conformational predictions for the tandemly repeated peptide (Asn-Ala-Asn-Pro)9. , 1989, Journal of biomolecular structure & dynamics.

[38]  Konstantinos G. Margaritis,et al.  An Experimental Study of Benchmarking Functions for Genetic Algorithms , 2002, Int. J. Comput. Math..

[39]  V S Pande,et al.  Folding pathway of a lattice model for proteins. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Sue Whitesides,et al.  A complete and effective move set for simplified protein folding , 2003, RECOMB '03.

[41]  K. Dill,et al.  The ultimate speed limit to protein folding is conformational searching. , 2007, Journal of the American Chemical Society.

[42]  Abdul Sattar,et al.  AI 2006: Advances in Artificial Intelligence, 19th Australian Joint Conference on Artificial Intelligence, Hobart, Australia, December 4-8, 2006, Proceedings , 2006, Australian Conference on Artificial Intelligence.

[43]  Ron Unger,et al.  On the applicability of genetic algorithms to protein folding , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[44]  Tamjidul Hoque,et al.  Fast computation of the fitness function for protein folding prediction in a 2D hydrophilic-hydrophobic model , 2005 .

[45]  David Corne,et al.  An Introduction to Bioinformatics for Computer Scientists , 2003 .

[46]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[47]  Kei Yura,et al.  [Structural bioinformatics]. , 2009, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[48]  D. Baker,et al.  A surprising simplicity to protein folding , 2000, Nature.

[49]  H. Scheraga,et al.  A comparison of the CHARMM, AMBER and ECEPP potentials for peptides. II. Phi-psi maps for N-acetyl alanine N'-methyl amide: comparisons, contrasts and simple experimental tests. , 1989, Journal of biomolecular structure & dynamics.

[50]  Hisao Ishibuchi,et al.  Hybrid Evolutionary Algorithms , 2007 .

[51]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules J. Am. Chem. Soc. 1995, 117, 5179−5197 , 1996 .

[52]  Ajay K. Royyuru,et al.  Blue Gene: A vision for protein science using a petaflop supercomputer , 2001, IBM Syst. J..

[53]  C. Levinthal Are there pathways for protein folding , 1968 .

[54]  Thomas Dandekar,et al.  Refined Genetic Algorithm Simulations to Model Proteins , 1999 .

[55]  Madhu Chetty,et al.  A Guided Genetic Algorithm for Protein Folding Prediction Using 3D Hydrophobic-Hydrophilic Model , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[56]  Rolf Backofen,et al.  A Constraint-Based Approach to Fast and Exact Structure Prediction in Three-Dimensional Protein Models , 2006, Constraints.

[57]  Ulrich H. E. Hansmann,et al.  Protein folding in silico: an overview , 2003, Comput. Sci. Eng..

[58]  Tom Creighton Importance of Protein Folding , 2008 .

[59]  D. Baker,et al.  Prediction and design of macromolecular structures and interactions , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[60]  Erich Bornberg-Bauer,et al.  Chain growth algorithms for HP-type lattice proteins , 1997, RECOMB '97.

[61]  Julian Lee Conformational space annealing and a lattice model protein , 2004 .

[62]  Clifford Stein,et al.  Introduction to Algorithms, 2nd edition. , 2001 .

[63]  Gerard T. Barkema,et al.  Exploring high-dimensional energy landscapes , 1999, Comput. Sci. Eng..

[64]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[65]  Ron Unger,et al.  Genetic Algorithm for 3D Protein Folding Simulations , 1993, ICGA.

[66]  Vincenzo Cutello,et al.  An Immune Algorithm for Protein Structure Prediction on Lattice Models , 2007, IEEE Transactions on Evolutionary Computation.

[67]  David Baker,et al.  Ab initio methods. , 2003, Methods of biochemical analysis.

[68]  R Samudrala,et al.  Ab initio construction of protein tertiary structures using a hierarchical approach. , 2000, Journal of molecular biology.