Incremental Extreme Learning Machine Based on Cascade Neural Networks

This paper extends extreme learning machine (ELM) for multi-layer cascade neural networks. We reformulate the cascade neural networks as a linear-in-the-parameters model, and propose a novel constructive training algorithm motivated by the efficient incremental ELM. The orthogonal least squares (OLS) is introduced to derive a new criterion for evaluating candidate hidden units, which avoids the computation of Moore-Penrose generalized inverse in the training process. Moreover, the calculation of output weights can be greatly simplified. Besides its efficiency, we show that the proposed evaluation function can effectively identify optimal candidate unit which leads to maximum error (sum of squared errors, SSE) reduction of the network. As a result, the proposed algorithm tends to yield smaller network with better generalization performance compared to traditional ELM. The effectiveness of the proposed algorithm on classification and regression problems is demonstrated by experimental results on several real-world datasets.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  Bimal K. Bose,et al.  Neural Network Applications in Power Electronics and Motor Drives—An Introduction and Perspective , 2007, IEEE Transactions on Industrial Electronics.

[3]  Lei Chen,et al.  Enhanced random search based incremental extreme learning machine , 2008, Neurocomputing.

[4]  Jooyoung Park,et al.  Universal Approximation Using Radial-Basis-Function Networks , 1991, Neural Computation.

[5]  M. Chtourou,et al.  MLP neural network based face recognition system using constructive training algorithm , 2012, 2012 International Conference on Multimedia Computing and Systems.

[6]  Okyay Kaynak,et al.  Computing Gradient Vector and Jacobian Matrix in Arbitrarily Connected Neural Networks , 2008, IEEE Transactions on Industrial Electronics.

[7]  Hossein Sameti,et al.  Robust speech recognition using MLP neural network in log-spectral domain , 2009, 2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[8]  Guang-Bin Huang,et al.  Convex incremental extreme learning machine , 2007, Neurocomputing.

[9]  Teresa Orlowska-Kowalska,et al.  Adaptive Sliding-Mode Neuro-Fuzzy Control of the Two-Mass Induction Motor Drive Without Mechanical Sensors , 2010, IEEE Transactions on Industrial Electronics.

[10]  Okyay Kaynak,et al.  Oil well diagnosis by sensing terminal characteristics of the induction motor , 2000, IEEE Trans. Ind. Electron..

[11]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[12]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[13]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[14]  B.M. Wilamowski,et al.  Neural Networks and Fuzzy Systems for Nonlinear Applications , 2007, 2007 11th International Conference on Intelligent Engineering Systems.

[15]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[16]  Bogdan M. Wilamowski,et al.  Compensation of Nonlinearities Using Neural Networks Implemented on Inexpensive Microcontrollers , 2011, IEEE Transactions on Industrial Electronics.

[17]  P. J. Werbos,et al.  Backpropagation: past and future , 1988, IEEE 1988 International Conference on Neural Networks.

[18]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[19]  B.M. Wilamowski,et al.  Neural network architectures and learning algorithms , 2009, IEEE Industrial Electronics Magazine.

[20]  Hao Yu,et al.  Improved Computation for Levenberg–Marquardt Training , 2010, IEEE Transactions on Neural Networks.

[21]  K. Lang,et al.  Learning to tell two spirals apart , 1988 .

[22]  Marcian N. Cirstea,et al.  Direct Neural-Network Hardware-Implementation Algorithm , 2010, IEEE Transactions on Industrial Electronics.

[23]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[24]  Cheng Wu,et al.  Orthogonal Least Squares Algorithm for Training Cascade Neural Networks , 2012, IEEE Transactions on Circuits and Systems I: Regular Papers.

[25]  Hao Yu,et al.  Selection of Proper Neural Network Sizes and Architectures—A Comparative Study , 2012, IEEE Transactions on Industrial Informatics.

[26]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.