Comparison of tree and graph encodings as function of problem complexity

In this paper, we analyze two general-purpose encoding types, trees and graphs systematically, focusing on trends over increasingly complex problems. Tree and graph encodings are similar in application but offer distinct advantages and disadvantages in genetic programming. We describe two implementations and discuss their evolvability. We then compare performance using symbolic regression on hundreds of random nonlinear target functions of both 1-dimensional and 8-dimensional cases. Results show the graph encoding has less bias for bloating solutions but is slower to converge and deleterious crossovers are more frequent. The graph encoding however is found to have computational benefits, suggesting it to be an advantageous trade-off between regression performance and computational effort.

[1]  J. David Schaffer,et al.  Representation and Hidden Bias: Gray vs. Binary Coding for Genetic Algorithms , 1988, ML.

[2]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[3]  Mark J. Willis,et al.  Using a tree structured genetic algorithm to perform symbolic regression , 1995 .

[4]  Leo Breiman,et al.  Bias, Variance , And Arcing Classifiers , 1996 .

[5]  David H. Wolpert,et al.  On Bias Plus Variance , 1997, Neural Computation.

[6]  Helio J. C. Barbosa,et al.  Symbolic regression via genetic programming , 2000, Proceedings. Vol.1. Sixth Brazilian Symposium on Neural Networks.

[7]  Pedro M. Domingos A Unifeid Bias-Variance Decomposition and its Applications , 2000, ICML.

[8]  Pedro M. Domingos A Unifeid Bias-Variance Decomposition and its Applications , 2000, ICML.

[9]  Franz Rothlauf,et al.  Representations for genetic and evolutionary algorithms , 2002, Studies in Fuzziness and Soft Computing.

[10]  Nguyen Xuan Hoai,et al.  Solving the symbolic regression problem with tree-adjunct grammar guided genetic programming: the comparative results , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[11]  J. Duffy,et al.  Using Symbolic Regression to Infer Strategies from Experimental Data , 2002 .

[12]  Katya Rodríguez-Vázquez,et al.  Symbolic Regression Problems by Genetic Programming with Multi-branches , 2004, MICAI.

[13]  Tomasz Arciszewski,et al.  Evolutionary computation and structural design: A survey of the state-of-the-art , 2005 .

[14]  Hod Lipson,et al.  Principles of modularity, regularity, and hierarchy for scalable systems , 2007 .

[15]  Hod Lipson,et al.  Coevolution of Fitness Predictors , 2008, IEEE Transactions on Evolutionary Computation.