Empirical Analysis of GP Tree-Fragments

Researchers have attempted to explain the power of Genetic Programming (GP) search using various notions of schema. However empirical studies of schemas have been limited due to their vast numbers in typical populations. This paper addresses the problem of analyzing schemas represented by tree-fragments. It describes a new efficient way of representing the huge sets of fragments in a population of GP programs and presents an algorithm to find all fragments using this efficient representation. Using this algorithm, the paper presents an initial analysis of fragments in populations of up to 300 programs, each up to seven nodes deep. The analysis demonstrates a surprisingly large variation in the numbers of fragments through evolution and a non-monotonic rise in the most useful fragments. With his method, empirical investigation of the GP building block hypothesis and schema theory in realistic sized GP systems becomes possible.

[1]  William B. Langdon,et al.  Repeated Patterns in Tree Genetic Programming , 2005, EuroGP.

[2]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[3]  P.A. Whigham,et al.  A Schema Theorem for context-free grammars , 1995, Proceedings of 1995 IEEE International Conference on Evolutionary Computation.

[4]  David E. Goldberg,et al.  BUILDING-BLOCK SUPPLY IN GENETIC PROGRAMMING , 2003 .

[5]  Riccardo Poli,et al.  Genetic Programming Bloat with Dynamic Fitness , 1998, EuroGP.

[6]  Jason M. Daida,et al.  Analysis of single-node (building) blocks in genetic programming , 1999 .

[7]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[8]  Lee Altenberg,et al.  The Schema Theorem and Price's Theorem , 1994, FOGA.

[9]  John R. Koza,et al.  Genetic programming 1997 : proceedings of the Second Annual Conference, July 13-16, 1997, Stanford University , 1997 .

[10]  Riccardo Poli,et al.  Genetic and Evolutionary Computation – GECCO 2004 , 2004, Lecture Notes in Computer Science.

[11]  J. K. Kinnear,et al.  Advances in Genetic Programming , 1994 .

[12]  Hammad Majeed,et al.  A new approach to evaluate GP schema in context , 2005, GECCO '05.

[13]  Una-May O'Reilly,et al.  The Troubling Aspects of a Building Block Hypothesis for Genetic Programming , 1994, FOGA.

[14]  Dana H. Ballard,et al.  Rooted-tree schemata in genetic programming , 1999 .

[15]  Terence Soule,et al.  Genetic Programming: Theory and Practice , 2003 .

[16]  ProgrammingJustinian P. RoscaComputer Analysis of Complexity Drift in Genetic , 1997 .

[17]  Malcolm I. Heywood,et al.  Context-Based Repeated Sequences in Linear Genetic Programming , 2005, EuroGP.

[18]  Charles E. Taylor Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. Complex Adaptive Systems.John H. Holland , 1994 .

[19]  Riccardo Poli,et al.  An Experimental Analysis of Schema Creation, Propagation and Disruption in Genetic Programming , 1997, ICGA.

[20]  Riccardo Poli,et al.  Schema Theory for Genetic Programming with One-Point Crossover and Point Mutation , 1997, Evolutionary Computation.

[21]  Conor Ryan,et al.  A Competitive Building Block Hypothesis , 2004, GECCO.

[22]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.