Learning noise

In this paper we propose a genetic programming approach to learning stochastic models with unsymmetrical noise distributions. Most learning algorithms try to learn from noisy data by modeling the maximum likelihood output or least squared error, assuming that noise effects average out. While this process works well for data with symmetrical noise distributions (such as Gaussian observation noise), many real-life sources of noise are not symmetrically distributed, thus this approach does not hold. We suggest improved learning can be obtained by including noise sources explicitly in the model as a stochastic element. A stochastic element is a random sub-process or latent variable of a hidden system that can propagate nonlinear noise to the observable outputs. Stochastic elements can skew and distort output features making regression of analytical models particularly difficult and error minimizing approaches inhibiting. We introduce a new method to infer the analytical model of a system by decomposing non-uniform noise observed at the outputs into uniform stochastic elements appearing symbolically inside the system. Results demonstrate the ability to regress exact analytical models where stochastic elements are embedded inside nonlinear and polynomial hidden systems.

[1]  Hod Lipson,et al.  Coevolution of Fitness Predictors , 2008, IEEE Transactions on Evolutionary Computation.

[2]  Sandia Report,et al.  Condence Region Estimation Techniques for Nonlinear Regression: Three Case Studies , 2005 .

[3]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[4]  T. K. Leen Learning Local Error Bars for Nonlinear Regression , 1995 .

[5]  Jack P. C. Kleijnen,et al.  White Noise Assumptions Revisited: Regression Models and Statistical Designs for Simulation Practice , 2006 .

[6]  R. M. Roberts,et al.  Confidence region estimation techniques for nonlinear regression in groundwater flow: Three case studies , 2007 .

[7]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[8]  J. Theiler Detecting nonlinear structure in time series , 1991 .

[9]  W M Schaffer,et al.  Effects of noise on some dynamical models in ecology , 1986, Journal of mathematical biology.

[10]  Geoffrey E. Hinton,et al.  Evaluation of Gaussian processes and other methods for non-linear regression , 1997 .

[11]  Dirk V. Arnold,et al.  Evolution strategies in noisy environments- a survey of existing work , 2001 .

[12]  Xavier Llorà,et al.  Bounding the Effect of Noise in Multiobjective Learning Classifier Systems , 2003, Evolutionary Computation.

[13]  J. Ben Atkinson,et al.  Modeling and Analysis of Stochastic Systems , 1996 .

[14]  Helio J. C. Barbosa,et al.  Symbolic regression via genetic programming , 2000, Proceedings. Vol.1. Sixth Brazilian Symposium on Neural Networks.