A New Method for Simplifying Algebraic Expressions in Genetic Programming Called Equivalent Decision Simplification

Symbolic Regression is one of the most important applications of Genetic Programming, but these applications suffer from one of the key issues in Genetic Programming, namely bloat --- the uncontrolled growth of ineffective code segments, which do not contribute to the value of the function evolved, but complicate the evolutionary proces, and at minimum greatly increase the cost of evaluation. For a variety of reasons, reliable techniques to remove bloat are highly desirable --- to simplify the solutions generated at the end of runs, so that there is some chance of understanding them, to permit systematic study of the evolution of the effective core of the genotype, or even to perform simplification of expressions during the course of a run. This paper introduces an alternative approach, Equivalent Decision Simplification, in which subtrees are evaluated over the set of regression points; if the subtrees evaluate to the same values as known simple subtrees, they are replaced. The effectiveness of the proposed method is confirmed by computer simulation taking simple Symbolic Regression problems as examples.

[1]  Nguyen Xuan Hoai,et al.  Using compression to understand the distribution of building blocks in genetic programming populations , 2007, 2007 IEEE Congress on Evolutionary Computation.

[2]  Nguyen Xuan Hoai,et al.  Analysing the Regularity of Genomes Using Compression and Expression Simplification , 2007, EuroGP.

[3]  Mengjie Zhang,et al.  Online Program Simplification in Genetic Programming , 2006, SEAL.

[4]  Graham Kendall,et al.  Problem Difficulty and Code Growth in Genetic Programming , 2004, Genetic Programming and Evolvable Machines.

[5]  John R. Koza,et al.  Genetic Programming IV: Routine Human-Competitive Machine Intelligence , 2003 .

[6]  Graham Kendall,et al.  A Survey And Analysis Of Diversity Measures In Genetic Programming , 2002, GECCO.

[7]  Terence Soule,et al.  Code growth in genetic programming , 1996 .

[8]  Peter Nordin,et al.  Complexity Compression and Evolution , 1995, ICGA.

[9]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part II , 1964, Inf. Control..

[10]  B. L. Welch THE SIGNIFICANCE OF THE DIFFERENCE BETWEEN TWO MEANS WHEN THE POPULATION VARIANCES ARE UNEQUAL , 1938 .

[11]  N. Hoai,et al.  Finding Trigonometric Identities with Tree Adjunct Grammar Guided Genetic Programming , 2004 .

[12]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[13]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.