Uncertainty evaluation and model selection of extreme learning machine based on Riemannian metric

Considering the uncertainty of hidden neurons, choosing significant hidden nodes, called as model selection, has played an important role in the applications of extreme learning machines(ELMs). How to define and measure this uncertainty is a key issue of model selection for ELM. From the information geometry point of view, this paper presents a new model selection method of ELM for regression problems based on Riemannian metric. First, this paper proves theoretically that the uncertainty can be characterized by a form of Riemannian metric. As a result, a new uncertainty evaluation of ELM is proposed through averaging the Riemannian metric of all hidden neurons. Finally, the hidden nodes are added to the network one by one, and at each step, a multi-objective optimization algorithm is used to select optimal input weights by minimizing this uncertainty evaluation and the norm of output weight simultaneously in order to obtain better generalization performance. Experiments on five UCI regression data sets and cylindrical shell vibration data set are conducted, demonstrating that the proposed method can generally obtain lower generalization error than the original ELM, evolutionary ELM, ELM with model selection, and multi-dimensional support vector machine. Moreover, the proposed algorithm generally needs less hidden neurons and computational time than the traditional approaches, which is very favorable in engineering applications.

[1]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[2]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[3]  Ponnuthurai Nagaratnam Suganthan,et al.  Comprehensive learning particle swarm optimizer for solving multiobjective optimization problems: Research Articles , 2006 .

[4]  Q. M. Jonathan Wu,et al.  Human action recognition using extreme learning machine based on visual vocabularies , 2010, Neurocomputing.

[5]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[6]  Hongming Zhou,et al.  Optimization method based extreme learning machine for classification , 2010, Neurocomputing.

[7]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[8]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Jing J. Liang,et al.  Comprehensive learning particle swarm optimizer for solving multiobjective optimization problems , 2006, Int. J. Intell. Syst..

[10]  Guangbin Huang,et al.  A Fast Construction Algorithm for Feedforward Neural Networks , 2012 .

[11]  Yuan Lan,et al.  Random search enhancement of error minimized extreme learning machine , 2010, ESANN.

[12]  Q. M. Jonathan Wu,et al.  Application of Wave Atoms Decomposition and Extreme Learning Machine for Fingerprint Classification , 2010, ICIAR.

[13]  Robert K. L. Gay,et al.  Error Minimized Extreme Learning Machine With Growth of Hidden Nodes and Incremental Learning , 2009, IEEE Transactions on Neural Networks.

[14]  A. Kai Qin,et al.  Evolutionary extreme learning machine , 2005, Pattern Recognit..

[15]  F. L. Chen,et al.  Sales forecasting system based on Gray extreme learning machine with Taguchi method in retail industry , 2011, Expert Syst. Appl..

[16]  Yuan Lan,et al.  Two-stage extreme learning machine for regression , 2010, Neurocomputing.

[17]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[18]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[19]  Wentao Mao,et al.  A Novel Machine Learning Based Method of Combined Dynamic Environment Prediction , 2013 .

[21]  Fernando Pérez-Cruz,et al.  SVM multiregression for nonlinear channel estimation in multiple-input multiple-output systems , 2004, IEEE Transactions on Signal Processing.

[22]  Christopher J. C. Burges,et al.  Geometry and invariance in kernel based methods , 1999 .

[23]  Guang-Bin Huang,et al.  Upper bounds on the number of hidden neurons in feedforward networks with arbitrary bounded nonlinear activation functions , 1998, IEEE Trans. Neural Networks.

[24]  Shuzhi Sam Ge,et al.  Fast Construction of Single-Hidden-Layer Feedforward Networks , 2012, Handbook of Natural Computing.

[25]  Wentao Mao,et al.  Model selection of extreme learning machine based on multi-objective optimization , 2011, Neural Computing and Applications.