SVM optimization algorithm based on dynamic clustering and ensemble learning for large scale dataset

This paper studies on the predicted regression model of support vector machines (SVM). Aiming at the shortage that with the amount of samples grows, training time increases rapidly as well, we propose an optimization algorithm to optimize it for large scale dataset. The optimization algorithm is based on ensemble learning and dynamic clustering. Firstly, we use dynamic cluster method to generate different types of sub training set based on fuzzy granular. Then we construct SVM sub-learners. Afterwards we synthesize outputs of each sub-learner by using the strategy of mean squared error. Simulation experimental results demonstrate that the optimization algorithm can increase training speed obviously, and keep the original accuracy compared to traditional SVM.

[1]  Klaus Tochtermann,et al.  The Long Way towards Workplace-Integrated Learning , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[2]  Qin Hua,et al.  A Novel Method for Training Large Scale E-Business SVM Models in a Grid Computing Environment , 2010, 2010 International Conference on E-Business and E-Government.

[3]  Zheng Dan,et al.  Selection algorithm for K-means initial clustering center , 2012 .

[4]  Heesung Kwon,et al.  Sparse Kernel-Based Ensemble Learning With Fully Optimized Kernel Parameters for Hyperspectral Classification Problems , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Mohamed Cheriet,et al.  Model selection for the LS-SVM. Application to handwriting recognition , 2009, Pattern Recognit..

[6]  Heesung Kwon,et al.  A full diagonal bandwidth gaussian kernel SVM based ensemble learning for hyperspectral chemical plume detection , 2010, 2010 IEEE International Geoscience and Remote Sensing Symposium.

[7]  Jing Yang,et al.  A parallel SVM training algorithm on large-scale classification problems , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[8]  Shao Ying Zhu,et al.  Weighted fuzzy classification with integrated learning method for medical diagnosis , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[9]  Jianhong Xie,et al.  Kernel optimization of LS-SVM based on damage detection for smart structures , 2009, 2009 2nd IEEE International Conference on Computer Science and Information Technology.

[10]  Kai Li,et al.  Study of selective ensemble learning method and its diversity based on decision tree and neural network , 2010, 2010 Chinese Control and Decision Conference.

[11]  Jason Cong,et al.  Synthesis Algorithm for Application-Specific Homogeneous Processor Networks , 2009, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[12]  Geoffrey I. Webb,et al.  Multistrategy ensemble learning: reducing error by combining ensemble learning techniques , 2004, IEEE Transactions on Knowledge and Data Engineering.

[13]  Lei Wang,et al.  A study of AdaBoost with SVM based weak learners , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..