Elite bases regression: A real-time algorithm for symbolic regression

Symbolic regression is an important but challenging research topic in data mining. It can detect the underlying mathematical models. Genetic programming (GP) is one of the most popular methods for symbolic regression. However, its convergence speed might be too slow for large scale problems with a large number of variables. This drawback has become a bottleneck in practical applications. In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed. EBR generates a set of candidate basis functions coded with parse-matrix in specific mapping rules. Meanwhile, a certain number of elite bases are preserved and updated iteratively according to the correlation coefficients with respect to the target model. The regression model is then spanned by the elite bases. A comparative study between EBR and a recent proposed machine learning method for symbolic regression, Fast Function eXtraction (FFX), are conducted. Numerical results indicate that EBR can solve symbolic regression problems more effectively.

[1]  Klaus Mueller,et al.  Evaluation and Design of Filters Using a Taylor Series Expansion , 1997, IEEE Trans. Vis. Comput. Graph..

[2]  Maurice Herman,et al.  Fourier series expansion of the transfer equation in the atmosphere-ocean system , 1989 .

[3]  Georges G. E. Gielen,et al.  Stochastic degradation modeling and simulation for analog integrated circuits in nanometer CMOS , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[4]  Conor Ryan,et al.  Grammatical Evolution , 2001, Genetic Programming Series.

[5]  Godfrey A. Walters,et al.  Symbolic and numerical regression: experiments and applications , 2003, Inf. Sci..

[6]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[7]  Georges G. E. Gielen,et al.  Hierarchical analog circuit reliability analysis using multivariate nonlinear regression and active learning sample selection , 2012, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[8]  Trent McConaghy,et al.  FFX: Fast, Scalable, Deterministic Symbolic Regression Technology , 2011 .

[9]  Reinhard Wilhelm,et al.  Functional Programming Languages , 2010 .

[10]  Josh C. Bongard,et al.  Improving genetic programming based symbolic regression using deterministic machine learning , 2013, 2013 IEEE Congress on Evolutionary Computation.

[11]  Conor Ryan,et al.  Grammatical evolution , 2007, GECCO '07.

[12]  Chee Kiong Soh,et al.  Force identification of dynamic systems using genetic programming , 2005 .

[13]  Anthony Worm,et al.  Prioritized Grammar Enumeration: A novel method for symbolic regression , 2016 .

[14]  Peter J. Barclay,et al.  Functional languages on linear chromosomes , 1996 .

[15]  Yajie Wang,et al.  The Recent Developments and Comparative Analysis of Neural Network and Evolutionary Algorithms for Solving Symbolic Regression , 2015, ICIC.

[16]  Dervis Karaboga,et al.  Artificial bee colony programming for symbolic regression , 2012, Inf. Sci..

[17]  Elsayed M. Saad,et al.  Multi-objective symbolic regression using long-term artificial neural network memory (LTANN-MEM) and neural symbolization algorithm (NSA) , 2018, Neural Computing and Applications.

[18]  Shaoliang Zhang,et al.  Adaptive space transformation: An invariant based method for predicting aerodynamic coefficients of hypersonic vehicles , 2015, Eng. Appl. Artif. Intell..

[19]  Ramana V. Grandhi,et al.  Improved Distributed Hypercube Sampling , 2002 .

[20]  Shaoliang Zhang,et al.  Parse-matrix evolution for symbolic regression , 2012, Eng. Appl. Artif. Intell..

[21]  Cândida Ferreira Gene Expression Programming in Problem Solving , 2002 .

[22]  Dennis K. J. Lin,et al.  A construction method for orthogonal Latin hypercube designs , 2006 .

[23]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[24]  Alina Patelli,et al.  Elite Based Multiobjective Genetic Programming in Nonlinear Systems Identification , 2010 .