Robust extreme learning machine

The output weights computing of extreme learning machine (ELM) encounters two problems, the computational and outlier robustness problems. The computational problem occurs when the hidden layer output matrix is a not full column rank matrix or an ill-conditioned matrix because of randomly generated input weights and biases. An existing solution to this problem is Singular Value Decomposition (SVD) method. However, the training speed is still affected by the large complexity of SVD when computing the Moore-Penrose (MP) pseudo inverse. The outlier robustness problem may occur when the training data set contaminated with outliers then the accuracy rate of ELM is extremely affected. This paper proposes the Extended Complete Orthogonal Decomposition (ECOD) method to solve the computational problem in ELM weights computing via ECODLS algorithm. And the paper also proposes the other three algorithms, i.e. the iteratively reweighted least squares (IRWLS-ELM), ELM based on the multivariate least-trimmed squares (MLTS-ELM), and ELM based on the one-step reweighted MLTS (RMLTS-ELM) to solve the outlier robustness problem. However, they also encounter the computational problem. Therefore, the ECOD via ECODLS algorithm is also used successfully in the three proposed algorithms. The experiments of regression problems were conducted on both toy and real-world data sets. The outlier types are one-sided and two-sided outliers. Each experiment was randomly contaminated with outliers, of one type only, with 10%, 20%, 30%, 40%, and 50% of the total training data size. Meta-metrics evaluation was used to measure the outlier robustness of the proposed algorithms compared to the existing algorithms, i.e. the minimax probability machine regression (MPMR) and the ordinary ELM. The experimental results showed that ECOD can effectively replace SVD. The ECOD is robust to the not full column rank or the ill-conditional problem. The speed of the ELM training using ECOD is also faster than the ordinary training algorithm. Moreover, the meta-metrics measure showed that the proposed algorithms are less affected by the increasing number of outliers than the existing algorithms.

[1]  Feilong Cao,et al.  Optimization approximation solution for regression problem based on extreme learning machine , 2011, Neurocomputing.

[2]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[3]  Amit Agarwal,et al.  A new machine learning paradigm for terrain reconstruction , 2006, IEEE Geoscience and Remote Sensing Letters.

[4]  Stephen A. Vavasis,et al.  Complete Orthogonal Decomposition for Weighted Least Squares , 1995 .

[5]  Colin R. Goodall,et al.  13 Computation using the QR decomposition , 1993, Computational Statistics.

[6]  David S. Watkins,et al.  Fundamentals of matrix computations , 1991 .

[7]  Min Han,et al.  Partial Lanczos extreme learning machine for single-output regression problems , 2009, Neurocomputing.

[8]  D. Ruppert,et al.  A Note on Computing Robust Regression Estimates via Iteratively Reweighted Least Squares , 1988 .

[9]  Yong Yu,et al.  Sales forecasting using extreme learning machine with applications in fashion retailing , 2008, Decis. Support Syst..

[10]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[11]  Lei Chen,et al.  Enhanced random search based incremental extreme learning machine , 2008, Neurocomputing.

[12]  José Julio Espina Agulló,et al.  The multivariate least-trimmed squares estimator , 2008 .

[13]  Pierre Courrieu,et al.  Fast Computation of Moore-Penrose Inverse Matrices , 2008, ArXiv.

[14]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[15]  Ali Selamat,et al.  Modeling Permeability Prediction Using Extreme Learning Machines , 2010, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation.

[16]  P. Rousseeuw,et al.  A fast algorithm for the minimum covariance determinant estimator , 1999 .

[17]  Sunil S. Bhagwat,et al.  Prediction of Melting Points of Organic Compounds Using Extreme Learning Machines , 2008 .

[18]  William Dumouchel,et al.  Integrating a robust option into a multiple regression computing environment , 1992 .

[19]  Nicholas J. Higham,et al.  Fast Polar Decomposition of an Arbitrary Matrix , 1990, SIAM J. Sci. Comput..

[20]  Ferial El-Hawary,et al.  NOx emission modelling using the iteratively reweighted least-square procedures , 1995 .

[21]  V. Yohai,et al.  Robust Statistics: Theory and Methods , 2006 .

[22]  Gregory Z. Grudic,et al.  A Formulation for Minimax Probability Machine Regression , 2002, NIPS.

[23]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[24]  M. El-Hawary,et al.  Load forecasting via suboptimal seasonal autoregressive models and iteratively reweighted least squares estimation , 1993 .

[25]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[26]  Feilong Cao,et al.  A study on effectiveness of extreme learning machine , 2011, Neurocomputing.

[27]  Xizhao Wang,et al.  Upper integral network with extreme learning mechanism , 2011, Neurocomputing.

[28]  Qinghua Zheng,et al.  Regularized Extreme Learning Machine , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[29]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[30]  J. Navarro-Pedreño Numerical Methods for Least Squares Problems , 1996 .

[31]  Yonggwan Won,et al.  Weighted Least Squares Scheme for Reducing Effects of Outliers in Regression based on Extreme Learning Machine , 2008, J. Digit. Content Technol. its Appl..

[32]  Nicholas J. Higham,et al.  The Matrix Computation Toolbox for MATLAB (Version 1.0) , 2002 .

[33]  Christophe Croux,et al.  Robust Estimation of the Vector Autoregressive Model by a Least Trimmed Squares Procedure , 2008 .

[34]  Jafar A. Khan,et al.  Robust Linear Model Selection Based on Least Angle Regression , 2007 .

[35]  Gene H. Golub,et al.  Matrix computations , 1983 .

[36]  Antonia Stefani,et al.  Meta-metric Evaluation of E-Commerce-related Metrics , 2009, SQM@CSMR.