Crossover-Based Tree Distance in Genetic Programming

In evolutionary algorithms, distance metrics between solutions are often useful for many aspects of guiding and understanding the search process. A good distance measure should reflect the capability of the search: if two solutions are found to be close in distance, or similarity, they should also be close in the search algorithm sense, i.e., the variation operator used to traverse the search space should easily transform one of them into the other. This paper explores such a distance for genetic programming syntax trees. Distance measures are discussed, defined and empirically investigated. The value of such measures is then validated in the context of analysis (fitness-distance correlation is analyzed during population evolution) as well as guiding search (results are improved using our measure in a fitness sharing algorithm) and diversity (new insights are obtained as compared with standard measures).

[1]  Leonardo Vanneschi,et al.  Operator-Based Distance for Genetic Programming: Subtree Crossover Distance , 2005, EuroGP.

[2]  Jason M. Daida,et al.  What Makes a Problem GP-Hard? Validating a Hypothesis of Structural Causes , 2003, GECCO.

[3]  Tim Jones Evolutionary Algorithms, Fitness Landscapes and Search , 1995 .

[4]  Leonardo Vanneschi,et al.  Fitness Distance Correlation in Structural Mutation Genetic Programming , 2003, EuroGP.

[5]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[6]  Una-May O'Reilly,et al.  Program Search with a Hierarchical Variable Lenght Representation: Genetic Programming, Simulated Annealing and Hill Climbing , 1994, PPSN.

[7]  Leonardo Vanneschi,et al.  Theory and practice for efficient genetic programming , 2004 .

[8]  Graham Kendall,et al.  Problem Difficulty and Code Growth in Genetic Programming , 2004, Genetic Programming and Evolvable Machines.

[9]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[10]  Steven M. Gustafson An analysis of diversity in genetic programming , 2004 .

[11]  D. Goldberg How Fitness Structure Affects Subsolution Acquisition in Genetic Programming , 1998 .

[12]  Leonardo Vanneschi,et al.  A Study of Fitness Distance Correlation as a Difficulty Measure in Genetic Programming , 2005, Evolutionary Computation.

[13]  Anikó Ekárt,et al.  Maintaining the Diversity of Genetic Programs , 2002, EuroGP.

[14]  Giancarlo Mauri,et al.  Using Subtree Crossover Distance to Investigate Genetic Programming Dynamics , 2006, EuroGP.

[15]  Victor J. Rayward-Smith,et al.  Fitness Distance Correlation and Ridge Functions , 1998, PPSN.

[16]  Nguyen Xuan Hoai,et al.  Representation and structural difficulty in genetic programming , 2006, IEEE Transactions on Evolutionary Computation.

[17]  Mark Wineberg,et al.  Distance between Populations , 2003, GECCO.

[18]  Terry Jones,et al.  Fitness Distance Correlation as a Measure of Problem Difficulty for Genetic Algorithms , 1995, ICGA.

[19]  Kalyanmoy Deb,et al.  Analyzing Deception in Trap Functions , 1992, FOGA.

[20]  N. Hopper,et al.  Analysis of genetic diversity through population history , 1999 .

[21]  Riccardo Poli,et al.  General Schema Theory for Genetic Programming with Subtree-Swapping Crossover: Part I , 2003, Evolutionary Computation.

[22]  Wolfgang Banzhaf,et al.  Explicit Control of Diversity and Effective Variation Distance in Linear Genetic Programming , 2002, EuroGP.

[23]  Peter J. Angeline,et al.  The Royal Tree Problem, a Benchmark for Single and Multiple Population Genetic Programming , 1996 .

[24]  Riccardo Poli,et al.  Topological crossover for the permutation representation , 2005, GECCO '05.

[25]  Graham Kendall,et al.  Diversity in genetic programming: an analysis of measures and correlation with fitness , 2004, IEEE Transactions on Evolutionary Computation.

[26]  Alberto Moraglio Geometric Unification of Evolutionary Algorithms , 2006 .

[27]  Lee Altenberg,et al.  Fitness Distance Correlation Analysis: An Instructive Counterexample , 1997, ICGA.