Regularization approach to inductive genetic programming

This paper presents an approach to regularization of inductive genetic programming tuned for learning polynomials. The objective is to achieve optimal evolutionary performance when searching high-order multivariate polynomials represented as tree structures. We show how to improve the genetic programming of polynomials by balancing its statistical bias with its variance. Bias reduction is achieved by employing a set of basis polynomials in the tree nodes for better agreement with the examples. Since this often leads to over-fitting, such tendencies are counteracted by decreasing the variance through regularization of the fitness function. We demonstrate that this balance facilitates the search as well as enables discovery of parsimonious, accurate, and predictive polynomials. The experimental results given show that this regularization approach outperforms traditional genetic programming on benchmark data mining and practical time-series prediction tasks.

[1]  R. Savit,et al.  Dynamics of genetic programming and chaotic time series prediction , 1996 .

[2]  Hitoshi Iba,et al.  Inductive genetic programming of polynomial learning networks , 2000, 2000 IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks. Proceedings of the First IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks (Cat. No.00.

[3]  Byoung-Tak Zhang,et al.  Evolutionary Induction of Sparse Neural Trees , 1997, Evolutionary Computation.

[4]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[5]  L. Glass,et al.  Oscillation and chaos in physiological control systems. , 1977, Science.

[6]  Terry Jones,et al.  Fitness Distance Correlation as a Measure of Problem Difficulty for Genetic Algorithms , 1995, ICGA.

[7]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[8]  Richard P. Lippmann,et al.  A Comparative Study of the Practical Characteristics of Neural Network and Conventional Pattern Classifiers , 1990, NIPS 1990.

[9]  Peter J. Angeline,et al.  Evolving predictors for chaotic time series , 1998, Defense, Security, and Sensing.

[10]  Kalyanmoy Deb,et al.  Messy Genetic Algorithms: Motivation, Analysis, and First Results , 1989, Complex Syst..

[11]  Casimir A. Kulikowski,et al.  Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning and Expert Systems , 1990 .

[12]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[13]  Peter Nordin,et al.  Programmatic compression of images and sound , 1996 .

[14]  Bernard Manderick,et al.  The Genetic Algorithm and the Structure of the Fitness Landscape , 1991, ICGA.

[15]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[16]  Nikolay I. Nikolaev,et al.  Concepts of Inductive Genetic Programming , 1998, EuroGP.

[17]  Hitoshi Iba,et al.  System identification approach to genetic programming , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[18]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[19]  Dennis Gabor,et al.  A universal nonlinear filter, predictor and simulator which optimizes itself by a learning process , 1961 .

[20]  João Gama,et al.  Oblique Linear Tree , 1997, IDA.

[21]  G. Wahba Spline models for observational data , 1990 .

[22]  D. Rumelhart,et al.  Predicting sunspots and exchange rates with connectionist networks , 1991 .

[23]  Andreas S. Weigend,et al.  Time Series Prediction: Forecasting the Future and Understanding the Past , 1994 .

[24]  A. G. Ivakhnenko,et al.  Polynomial Theory of Complex Systems , 1971, IEEE Trans. Syst. Man Cybern..

[25]  F. Lemke,et al.  GMDH algorithms for complex systems modelling , 1998 .

[26]  Hillol Kargupta,et al.  System Identification with Evolving Polynomial Networks , 1991, ICGA.

[27]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[28]  Hitoshi Iba,et al.  Extending genetic programming with recombinative guidance , 1996 .

[29]  Geum Yong Lee Genetic recursive regression for modeling and forecasting real-world chaotic time series , 1999 .

[30]  陳樹衡,et al.  Option Pricing with Genetic Programming , 1998 .

[31]  A. Barron,et al.  Discussion: Multivariate Adaptive Regression Splines , 1991 .

[32]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[33]  Byoung-Tak Zhang,et al.  Balancing Accuracy and Parsimony in Genetic Programming , 1995, Evolutionary Computation.

[34]  John R. Koza,et al.  GENETIC PROGRAMMING FOR ECONOMIC MODELING , 1994 .

[35]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[36]  Wim Hordijk,et al.  A Measure of Landscapes , 1996, Evolutionary Computation.

[37]  Terence D. Sanger,et al.  A Tree-Structured Algorithm for Reducing Computation in Networks with Separable Basis Functions , 1991, Neural Computation.