On the search space of genetic programming and its relation to nature's search space

The size of the search space has been analyzed for genetic programming and genetic algorithms. It is highly unlikely to find any single individual in this huge search space. However, genetic programming with variable length structures differs from standard genetic algorithms where fixed size bit strings are used in that usually many different individuals show the same pheno-typical behavior due to introns. Therefore, finding any given behavior is not as difficult as the size of the search space suggests. A quantitative analysis is presented for the number of individuals that code for the identity function. The identity function is important in the analysis of the search space because it can be used to construct individuals showing the same behavior as any given individual. Finally, an analogy is drawn to nature's sequence space which suggests possible directions for future research. The representation should be chosen such that all possible behaviors are reachable within a comparatively small number of steps from any given behavior and the individuals coding for any given behavior should be distributed randomly in the search space. In addition, long paths of neutral mutations should lead to individuals which code for the same behavior.

[1]  Alfred V. Aho,et al.  Compilers: Principles, Techniques, and Tools , 1986, Addison-Wesley series in computer science / World student series edition.

[2]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[3]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[4]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[5]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[6]  Peter Schuster,et al.  Extended Molecular Evolutionary Biology: Artificial Life Bridging the Gap Between Chemistry and Biology , 1993, Artificial Life.

[7]  Una-May O'Reilly,et al.  Genetic Programming II: Automatic Discovery of Reusable Programs. , 1994, Artificial Life.

[8]  Wolfgang Banzhaf,et al.  Genotype-Phenotype-Mapping and Neutral Variation - A Case Study in Genetic Programming , 1994, PPSN.

[9]  Arantxa Etxeverria The Origins of Order , 1993 .

[10]  W. Banzhaf,et al.  Genetic programming using genotype-phenotype mapping from linear genomes into linear phenotypes , 1996 .

[11]  Riccardo Poli,et al.  Fitness Causes Bloat: Mutation , 1997, EuroGP.

[12]  David E. Goldberg,et al.  Where Does the Good Stuff Go, and Why? How Contextual Semantics Influences Program Structure in Simple Genetic Programming , 1998, EuroGP.

[13]  Riccardo Poli,et al.  Genetic Programming Bloat with Dynamic Fitness , 1998, EuroGP.

[14]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .