Constructing an ensemble learning model by using Euclidean distance

The support vector machine (SVM) has a good generalization performance, but the classification result of the SVM in some real problems is often unsatisfied. Because SVM is sensitive to the noisy data and it may not be effective under the high level of noise. To improve the performance of SVM in the noisy environment, we propose an ensemble learning model to address the noise problem in this work. First, we employ the noise-tolerant probabilistic Support Vector Machine. Then a Naïve Bayesian classifier is established in the model. Finally the decision of the two classifiers is appropriately combined to give the final decision. We use the Euclidean distance to complete the integration based on a probabilistic interpretation. The ensemble learning model is evaluated on an artificial dataset for a classification task. Compared with single classifier, the ensemble learning model exhibits good performance in the noisy environment.

[1]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[2]  Jianqiang Yang,et al.  Exploiting Uncertain Data in Support Vector Classification , 2007, KES.

[3]  H. P. Huang,et al.  Fuzzy Support Vector Machines for Pattern Recognition and Data Mining , 2002 .

[4]  Jinbo Bi,et al.  Support Vector Classification with Input Data Uncertainty , 2004, NIPS.

[5]  Han-Xiong Li,et al.  Probabilistic support vector machines for classification of noise affected data , 2013, Inf. Sci..

[6]  Xuegong Zhang,et al.  Using class-center vectors to build support vector machines , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[7]  Zhi Liu,et al.  A probabilistic fuzzy logic system for modeling and control , 2005, IEEE Transactions on Fuzzy Systems.

[8]  Bernhard Schölkopf,et al.  Support Vector Machines as Probabilistic Models , 2011, ICML.

[9]  J. Ben Atkinson,et al.  Modeling and Analysis of Stochastic Systems , 1996 .

[10]  Gert Cauwenberghs,et al.  Gini Support Vector Machine: Quadratic Entropy Based Robust Multi-Class Probability Regression , 2007, J. Mach. Learn. Res..

[11]  Kristin P. Bennett,et al.  Support vector machines: hype or hallelujah? , 2000, SKDD.

[12]  Ji Zhu,et al.  Kernel Logistic Regression and the Import Vector Machine , 2001, NIPS.

[13]  Nikolaos Avouris,et al.  Machine Learning algorithms : a study on noise sensitivity , 2003 .

[14]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[15]  Zhi Liu,et al.  A Probabilistic Neural-Fuzzy Learning System for Stochastic Modeling , 2008, IEEE Transactions on Fuzzy Systems.

[16]  Junbin Gao,et al.  A Probabilistic Framework for SVM Regression and Error Bar Estimation , 2002, Machine Learning.

[17]  Peter Sollich,et al.  Bayesian Methods for Support Vector Machines: Evidence and Predictive Class Probabilities , 2002, Machine Learning.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .