An ELM-based model with sparse-weighting strategy for sequential data imbalance problem

In many practical engineering applications, online sequential data imbalance problems are universally found. Many traditional machine learning methods are hard to improve the classification accuracy effectively while solving these problems. To get fast and efficient classification, a new online sequential extreme learning machine algorithm with sparse-weighting strategy is proposed to increase the accuracy of minority class while reducing the accuracy loss of majority class as much as possible. The main idea is integrating a new sparse-weighting strategy into the present data-based strategy for sequential data imbalance problem. In offline stage, a two phase balanced strategies is introduced to obtain the valuable virtual sample set. In online stage, a dynamic weighting strategy is proposed to assign the corresponding weight for each sequential sample by means of the change of sensitivity and specificity in order to maintain the optimal network structure. Experimental results on two kinds of imbalanced datasets, UCI datasets and the real-world air pollutant forecasting dataset, show that the proposed method has higher prediction accuracy and better numerical stability compared with ELM, OS-ELM, meta-cognitive OS-ELM and weighted OS-ELM.

[1]  Nan Liu,et al.  Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift , 2015, Neurocomputing.

[2]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[3]  Liu Yuxi New oversampling algorithm DB_SMOTE , 2014 .

[4]  Gerald Schaefer,et al.  Cost-sensitive decision tree ensembles for effective imbalanced classification , 2014, Appl. Soft Comput..

[5]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[6]  Amaury Lendasse,et al.  OP-ELM: Optimally Pruned Extreme Learning Machine , 2010, IEEE Transactions on Neural Networks.

[7]  Chi-Man Vong,et al.  Predicting minority class for suspended particulate matters level by extreme learning machine , 2014, Neurocomputing.

[8]  Witold Pedrycz,et al.  A Study on Relationship Between Generalization Abilities and Fuzziness of Base Classifiers in Ensemble Learning , 2015, IEEE Transactions on Fuzzy Systems.

[9]  Jun Miao,et al.  Extreme Support Vector Regression , 2014 .

[10]  Wang Xi-zhao,et al.  Architecture selection for networks trained with extreme learning machine using localized generalization error model , 2013 .

[11]  Yi Yang,et al.  A new distance-based total uncertainty measure in the theory of belief functions , 2016, Knowl. Based Syst..

[12]  Zhiping Lin,et al.  Weighted Online Sequential Extreme Learning Machine for Class Imbalance Learning , 2013, Neural Processing Letters.

[13]  Xian-Da Zhang,et al.  Matrix Analysis and Applications , 2017 .

[14]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[15]  Jun Miao,et al.  One-Class Classification with Extreme Learning Machine , 2015 .

[16]  Robert K. L. Gay,et al.  Error Minimized Extreme Learning Machine With Growth of Hidden Nodes and Incremental Learning , 2009, IEEE Transactions on Neural Networks.

[17]  Xu Zhou,et al.  Effective algorithms of the Moore-Penrose inverse matrices for extreme learning machine , 2015, Intell. Data Anal..

[18]  José Sergio Ruiz Castilla,et al.  PSO-Based Method for SVM Classification on Skewed Data-Sets , 2015, ICIC.

[19]  Jun Miao,et al.  Robust regression with extreme support vectors , 2014, Pattern Recognit. Lett..

[20]  Xizhao Wang,et al.  Fuzziness based sample categorization for classifier performance improvement , 2015, J. Intell. Fuzzy Syst..

[21]  Dianhui Wang,et al.  Evolutionary extreme learning machine ensembles with size control , 2013, Neurocomputing.

[22]  Yue-Shi Lee,et al.  Cluster-based under-sampling approaches for imbalanced data distributions , 2009, Expert Syst. Appl..

[23]  Xizhao Wang,et al.  Learning from big data with uncertainty - editorial , 2015, J. Intell. Fuzzy Syst..