A Gaussian mixture framework for incremental nonparametric regression with topology learning neural networks

Incremental learning is important for memory critical systems, especially when the growth of information technology has pushed the memory and storage costs to limits. Despite the great amount of effort researching into incremental classification paradigms and algorithms, the regression is given far less attention. In this paper, an incremental regression framework that is able to model the linear and nonlinear relationships between response variables and explanatory variables is proposed. A three layer feed-forward neural network structure is devised where the weights of the hidden layer are trained by topology learning neural networks. A Gaussian mixture weighted integrator is used to synthesize from the output of the hidden layer to give smoothed predictions. Two hidden layer parameters learning strategies whether by Growing Neural Gas (GNG) or the single layered Self-Organizing Incremental Neural Network (SOINN) are explored. The GNG strategy is more robust and flexible, and single layered SOINN strategy is less sensitive to parameter settings. Experiments are carried out on an artificial dataset and 6 UCI datasets. The artificial dataset experiments show that the proposed method is able to give predictions more smoothed than K-nearest-neighbor (KNN) and the regression tree. Comparing to the parametric method Support Vector Regression (SVR), the proposed method has significant advantage when learning on data with multi-models. Incremental methods including Passive and Aggressive regression, Online Sequential Extreme Learning Machine, Self-Organizing Maps and Incremental K-means are compared with the proposed method on the UCI datasets, and the results show that the proposed method outperforms them on most datasets.

[1]  Jean-Marie Monnez,et al.  A fast and recursive algorithm for clustering large datasets with k-medians , 2011, Comput. Stat. Data Anal..

[2]  Hujun Yin,et al.  Bayesian self-organising map for Gaussian mixtures , 2001 .

[3]  D. Sculley,et al.  Web-scale k-means clustering , 2010, WWW '10.

[4]  I-Cheng Yeh,et al.  Modeling of strength of high-performance concrete using artificial neural networks , 1998 .

[5]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[6]  Giorgio Metta,et al.  Incremental learning of robot dynamics using random features , 2011, 2011 IEEE International Conference on Robotics and Automation.

[7]  Jun-Ming Xu,et al.  OASIS: Online Active Semi-Supervised Learning , 2011, AAAI.

[8]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[9]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[10]  Kweku-Muata Osei-Bryson,et al.  Post-pruning in decision tree induction using multiple performance measures , 2007, Comput. Oper. Res..

[11]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[12]  Xiong Xiao,et al.  Density Estimation Method Based on Self-organizing Incremental Neural Networks and Error Estimation , 2013, ICONIP.

[13]  Diego Vidaurre,et al.  SOMwise regression: a new clusterwise regression method , 2011, Neural Computing and Applications.

[14]  M. Blair,et al.  Video Game Telemetry as a Critical Tool in the Study of Complex Skill Learning , 2013, PloS one.

[15]  Pınar Tüfekci,et al.  Prediction of full load electrical power output of a base load operated combined cycle power plant using machine learning methods , 2014 .

[16]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[17]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[18]  Haibo He,et al.  SOMKE: Kernel Density Estimation Over Data Streams by Sequences of Self-Organizing Maps , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Shen Furao,et al.  An enhanced self-organizing incremental neural network for online unsupervised learning , 2007, Neural Networks.

[20]  Hui Yu,et al.  An incremental online semi-supervised active learning algorithm based on self-organizing incremental neural network , 2011, Neural Computing and Applications.

[21]  Yuan Lan,et al.  Ensemble of online sequential extreme learning machine , 2009, Neurocomputing.

[22]  Shen Furao,et al.  An incremental network for on-line unsupervised classification and topology learning , 2006, Neural Networks.

[23]  Christian Sohler,et al.  StreamKM++: A clustering algorithm for data streams , 2010, JEAL.

[24]  Emilio Del-Moral-Hernandez,et al.  A SOM combined with KNN for classification task , 2011, The 2011 International Joint Conference on Neural Networks.

[25]  Wolfgang Rosenstiel,et al.  Online SVR Training by Solving the Primal Optimization Problem , 2009, 2009 IEEE International Workshop on Machine Learning for Signal Processing.

[26]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[27]  B. Silverman Density estimation for statistics and data analysis , 1986 .