Modelling the dynamics of genetic algorithms using statistical mechanics

A formalism for modelling the dynamics of Genetic Algorithms (GAs) using methods from statistical mechanics, originally due to Prügel-Bennett and Shapiro, is reviewed, generalized and improved upon. This formalism can be used to predict the averaged trajectory of macroscopic statistics describing the GA's population. These macroscopics are chosen to average well between runs, so that fluctuations from mean behaviour can often be neglected. Where necessary, non-trivial terms are determined by assuming maximum entropy with constraints on known macroscopics. Problems of realistic size are described in compact form and finite population effects are included, often proving to be of fundamental importance. The macroscopics used here are cumulants of an appropriate quantity within the population and the mean correlation (Hamming distance) within the population. Including the correlation as an explicit macroscopic provides a significant improvement over the original formulation. The formalism is applied to a number of simple optimization problems in order to determine its predictive power and to gain insight into GA dynamics. The problems most amenable to analysis come from the class where alleles within the genotype contribute additively to the phenotype. This class can be treated with some generality, including problems with inhomogeneous contributions from each site, non-linear or noisy fitness measures, simple diploid representations and temporally varying fitness. The results can also be applied to a simple learning problem, generalization in a binary perceptron, and a limit is identified in which the optimal training batch size can be determined for this problem. The theory is compared to averaged results from a real GA in each case, showing excellent agreement when the maximum entropy principle holds. Some situations where this approximation breaks down are identified.
In order to fully test the formalism, an attempt is made on the strongly NP-hard problem of storing random patterns in a binary perceptron. Here, the relationship between the genotype and phenotype (training error) is strongly non-linear. Mutation is modelled under the assumption that perceptron configurations are typical of perceptrons with a given training error. Unfortunately, this assumption does not provide a good approximation in general. It is conjectured that perceptron configurations would have to be constrained by other statistics in order to accurately model mutation for this problem. Issues arising from this study are discussed in conclusion and some possible areas of further research are outlined.
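
As an illustration of the macroscopics named above, the following minimal Python sketch measures the first three cumulants of fitness and the mean correlation within a population. It is not taken from the work itself and rests on several assumptions: alleles are coded as ±1, fitness is the plain sum of alleles (the simplest additive case), cumulants are estimated from uncorrected population moments, and the function names `cumulants`, `mean_correlation` and `additive_fitness` are hypothetical. With ±1 alleles, the correlation q between two strings of length L is related to their Hamming distance d by q = 1 - 2d/L.

```python
import random


def cumulants(values):
    """First three cumulants from plain population moments.

    Assumption: no finite-sample bias correction is applied, so the
    second and third cumulants are simply central moments.
    """
    n = len(values)
    k1 = sum(values) / n
    k2 = sum((v - k1) ** 2 for v in values) / n
    k3 = sum((v - k1) ** 3 for v in values) / n
    return k1, k2, k3


def mean_correlation(population):
    """Mean overlap between distinct pairs of +/-1 strings.

    For each site, the sum over ordered pairs a != b of s_a * s_b equals
    (sum_a s_a)^2 - P, which avoids an explicit loop over pairs.
    """
    p = len(population)
    length = len(population[0])
    total = 0.0
    for site in range(length):
        m = sum(member[site] for member in population)
        total += (m * m - p) / (p * (p - 1))
    return total / length


def additive_fitness(member):
    """Assumed additive fitness: each allele contributes equally."""
    return sum(member)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs agree
    population = [[rng.choice((-1, 1)) for _ in range(32)] for _ in range(20)]
    fitnesses = [additive_fitness(member) for member in population]
    print("cumulants:", cumulants(fitnesses))
    print("mean correlation:", mean_correlation(population))
```

Tracking these few numbers from one generation to the next, instead of the full population, is what keeps such a description compact for problems of realistic size.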
