Toward Effective Polypeptide Structure Prediction with Parallel Fast Messy Genetic Algorithms

Publisher Summary The protein folding problem (PFP) involves the determination of the 3D structures of proteins given only their amino acid sequences. One reason for the significant interest in general and efficient techniques to solve this type of problem is that such knowledge would facilitate understanding of the tremendous amount of genetic information produced by the Human Genome Project. This chapter refers to efforts aimed at identifying in vivo structures of naturally occurring proteins or structures of arbitrary polypeptides in arbitrary environments. In particular, it looks at the general problem of identifying three-dimensional structures of arbitrary polypeptides in arbitrary environments, and focuses on a particular kind of evolutionary algorithm (the messy genetic algorithm) for use on this task. It concentrates on improved effectiveness through the exploitation of domain constraints (i.e., dihedral-angle constraints inspired by the Ramachandran plot) and the exploitation of prior secondary structure analysis. The work described is performed on high-grade parallel and heterogeneous computing resources.

[1]  Kalyanmoy Deb,et al.  Don't Worry, Be Messy , 1991, ICGA.

[2]  D. T. Jones Protein structure prediction in the postgenomic era. , 2000, Current opinion in structural biology.

[3]  Laurence D. Merkle Generalization and Parallelization of Messy Genetic Algorithms and Communication in Parallel Genetic Algorithms. , 1992 .

[4]  Hans Hagen,et al.  Scientific Visualization: Overviews, Methodologies, and Techniques , 1997 .

[5]  Kalyanmoy Deb,et al.  Genetic Algorithms, Noise, and the Sizing of Populations , 1992, Complex Syst..

[6]  Melanie Mitchell,et al.  Relative Building-Block Fitness and the Building Block Hypothesis , 1992, FOGA.

[7]  Soumya S. Patnaik,et al.  New smart materials: molecular simulation of nonlinear optical chromophore-containing polypeptides and liquid crystalline siloxanes , 1993, Smart Structures.

[8]  Gary B. Lamont,et al.  Load Balancing Search Algorithms on a Heterogeneous Cluster of PCs , 2001, PPSC.

[9]  J Moult,et al.  From fold to function. , 2000, Current opinion in structural biology.

[10]  George H. Gates,et al.  Predicting Protein Structure Using Parallel Genetic Algorithms. , 1994 .

[11]  Thomas Lengauer,et al.  Algorithmic research problems in molecular bioinformatics , 1993, [1993] The 2nd Israel Symposium on Theory and Computing Systems.

[12]  J. Nowick,et al.  Designed molecules that fold to mimic protein secondary structures. , 1999, Current opinion in chemical biology.

[13]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[14]  Charles E. Kaiser Refined Genetic Algorithms for Polypeptide Structure Prediction. , 1996 .

[15]  Giancarlo Mauri,et al.  Application of Evolutionary Algorithms to Protein Folding Prediction , 1997, Artificial Evolution.

[16]  Richard Kenneth Thompson Fitness landscapes investigated , 1995 .

[17]  C P Ponting,et al.  Protein fold irregularities that hinder sequence analysis. , 1998, Current opinion in structural biology.

[18]  Larry J. Eshelman,et al.  Preventing Premature Convergence in Genetic Algorithms by Preventing Incest , 1991, ICGA.

[19]  Kalyanmoy Deb,et al.  Messy Genetic Algorithms: Motivation, Analysis, and First Results , 1989, Complex Syst..

[20]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[21]  Gary B. Lamont,et al.  Hybrid genetic algorithms for polypeptide energy minimization , 1996, SAC '96.

[22]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[23]  Kenneth M. Merz,et al.  The application of the genetic algorithm to the minimization of potential energy functions , 1993, J. Glob. Optim..