Abstract.Determination of the native state of a protein from its amino acid sequence is the goal of protein folding simulations, with potential applications in gene therapy and drug design. Location of the global minimum structure for a given sequence, however, is a difficult optimisation problem. In this paper, we describe the development and application of a genetic algorithm (GA) to find the lowest-energy conformations for the 2D HP lattice bead protein model. Optimisation of the parameters of our “standard” GA program reveals that the GA is most successful (at finding the lowest-energy conformations) for high rates of mating and mutation and relatively high elitism. We have also introduced a number of new genetic operators: a duplicate predator—which maintains population diversity by eliminating duplicate structures; brood selection—where two “parent” structures undergo crossover and give rise to a brood of (not just two) offspring; and a Monte Carlo based local search algorithm—to explore the neighbourhood of all members of the population. It is shown that these operators lead to significant improvements in the success and efficiency of the GA, both compared with our standard GA and with previously published GA studies for benchmark HP sequences with up to 50 beads.
[1]
Georg E. Schulz,et al.
Principles of Protein Structure
,
1979
.
[2]
David E. Goldberg,et al.
Genetic Algorithms in Search Optimization and Machine Learning
,
1988
.
[3]
Roy L. Johnston,et al.
Applications of Evolutionary Computation in Chemistry
,
2004
.
[4]
C. Tanford.
Macromolecules
,
1994,
Nature.
[5]
J. Valverde.
Molecular Modelling: Principles and Applications
,
2001
.
[6]
John H. Holland,et al.
Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence
,
1992
.
[7]
David E. Clark,et al.
Evolutionary Algorithms in Molecular Design
,
1999
.
[8]
J. K. Kinnear,et al.
Advances in Genetic Programming
,
1994
.