Genetic programming with incremental data inheritance

A data-driven method for accelerating genetic programming is presented. This method, called incremental data inheritance or IDI for short, evolves programs using program-specific subsets of given data which also evolve incrementally as generation goes on. The concept of data evolution in IDI is contrasted to conventional genetic programming in which all the given training data are used repeatedly. IDI is also distinguished from the previous subset selection methods in that each program in IDI evolves its own data set of incremental size rather than a common data set of fixed or arbitrary size for the whole population. The method has been applied to time series prediction. Compared to the conventional methods, IDI significantly reduced the evolution speed of genetic programming without loss of the generalization accuracy of evolved programs. We also provide a theoretical foundation of the IDI method from the Bayesian inference point of view.

[1]  Byoung-Tak Zhang,et al.  Accelerated Learning by Active Example Selection , 1994, Int. J. Neural Syst..

[2]  Jason M. Daida,et al.  Computer-assisted design of image classification algorithms: dynamic and static fitness evaluations in a scaffolded genetic programming environment , 1996 .

[3]  B. Zhang Self-development learning: Constructing optimal size neural networks via in-cremental data selection , 1993 .

[4]  Peter Ross,et al.  Small Populations over Many Generations can beat Large Populations over Few Generations in Genetic P , 1997 .

[5]  Neal B. Abraham,et al.  Lorenz-like chaos in NH3-FIR lasers , 1995 .

[6]  Peter J. Angeline,et al.  Competitive Environments Evolve Better Solutions for Complex Tasks , 1993, ICGA.

[7]  Peter Ross,et al.  Dynamic Training Subset Selection for Supervised Learning in Genetic Programming , 1994, PPSN.

[8]  Eric V. Siegel Competitively evolving decision trees against fixed training cases for natural language processing , 1994 .

[9]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[10]  Byoung-Tak Zhang,et al.  Neural networks that teach themselves through genetic discovery of novel examples , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[11]  Byoung-Tak Zhang A Bayesian framework for evolutionary computation , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[12]  J. Rissanen Stochastic Complexity and Modeling , 1986 .

[13]  Byoung-Tak Zhang,et al.  Evolutionary Induction of Sparse Neural Trees , 1997, Evolutionary Computation.

[14]  Butong Zhang,et al.  Focused incremental learning for improved generalization with reduced training sets , 1991 .

[15]  Astro Teller,et al.  Automatically Choosing the Number of Fitness Cases: The Rational Allocation of Trials , 1997 .

[16]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[17]  Wolfgang Banzhaf,et al.  Genetic Programming: An Introduction , 1997 .

[18]  Terence Soule,et al.  Code growth in genetic programming , 1996 .