Probabilistic support vector machines for classification of noise affected data

The support vector machines (SVMs) have gained visibility and been thoroughly studied in the machine learning community. However, the performance of these machines is sensitive to noisy data and the machine may not be effective when the level of noise is high. Since the noise makes the separating margin of SVM to be a stochastic variable, a probabilistic support vector machine (PSVM) is proposed to capture the probabilistic information of the separating margin and formulate the decision function within such a noisy environment. First, all data are clustered, upon which different subsets are formed by PCA-based sampling; then, a distributed SVM system is constructed to estimate the separating margin for each subset. Next, a quadratic optimization problem is being solved with the use of probabilistic information extracted from separating margins to determine the decision function. Using the weighted average of probability of cluster centers, the confidence of the decision can be estimated. An artificial dataset and four real-life datasets from a UCI machine learning database are used to demonstrate the effectiveness of the proposed probabilistic SVM.

[1]  Johan A. K. Suykens,et al.  Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction , 2004, Bioinform..

[2]  James T. Kwok,et al.  The evidence framework applied to support vector machines , 2000, IEEE Trans. Neural Networks Learn. Syst..

[3]  Xizhao Wang,et al.  Induction of multiple fuzzy decision trees based on rough set technique , 2008, Inf. Sci..

[4]  Xuegong Zhang,et al.  Using class-center vectors to build support vector machines , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[5]  Jinbo Bi,et al.  Support Vector Classification with Input Data Uncertainty , 2004, NIPS.

[6]  Gérard Bloch,et al.  Support vector regression from simulation data and few experimental samples , 2008, Inf. Sci..

[7]  Han-Xiong Li,et al.  Comparative study of fluid dispensing modeling [Abstracts of Forthcoming Manuscripts] , 2003 .

[8]  Zhi Liu,et al.  A Probabilistic Wavelet System for Stochastic and Incomplete Data-Based Modeling , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[10]  C. L. Philip Chen,et al.  Adaptive least squares support vector machines filter for hand tremor canceling in microsurgery , 2011, Int. J. Mach. Learn. Cybern..

[11]  Yanqing Zhang,et al.  Support vector machines with genetic fuzzy feature transformation for biomedical data classification , 2007, Inf. Sci..

[12]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[13]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Fernando José Von Zuben,et al.  Hybridizing mixtures of experts with support vector machines: Investigation into nonlinear dynamic systems identification , 2007, Inf. Sci..

[15]  Ji Zhu,et al.  Kernel Logistic Regression and the Import Vector Machine , 2001, NIPS.

[16]  Han-Xiong Li,et al.  A probabilistic support vector machine for uncertain data , 2009, 2009 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications.

[17]  Johan A. K. Suykens,et al.  Optimal control by least squares support vector machines , 2001, Neural Networks.

[18]  H. P. Huang,et al.  Fuzzy Support Vector Machines for Pattern Recognition and Data Mining , 2002 .

[19]  Zhi Liu,et al.  A Probabilistic Neural-Fuzzy Learning System for Stochastic Modeling , 2008, IEEE Transactions on Fuzzy Systems.

[20]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[21]  Zhi Liu,et al.  A probabilistic fuzzy logic system for modeling and control , 2005, IEEE Transactions on Fuzzy Systems.

[22]  Xi-Zhao Wang,et al.  Improving Generalization of Fuzzy IF--THEN Rules by Maximizing Fuzzy Entropy , 2009, IEEE Transactions on Fuzzy Systems.

[23]  Kin Keung Lai,et al.  A new fuzzy support vector machine to evaluate credit risk , 2005, IEEE Transactions on Fuzzy Systems.

[24]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[25]  Isabelle Guyon,et al.  Discovering Informative Patterns and Data Cleaning , 1996, Advances in Knowledge Discovery and Data Mining.

[26]  Congxin Wu,et al.  Separating theorem of samples in Banach space for support vector machine learning , 2011, Int. J. Mach. Learn. Cybern..

[27]  Xizhao Wang,et al.  Fast Fuzzy Multicategory SVM Based on Support Vector Domain Description , 2008, Int. J. Pattern Recognit. Artif. Intell..

[28]  Jianping Li,et al.  Evolution strategies based adaptive Lp LS-SVM , 2011, Inf. Sci..

[29]  Peter Sollich,et al.  Bayesian Methods for Support Vector Machines: Evidence and Predictive Class Probabilities , 2002, Machine Learning.

[30]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[31]  Matthias Baumgarten,et al.  Optimal model selection for posture recognition in home-based healthcare , 2011, Int. J. Mach. Learn. Cybern..

[32]  Han-Xiong Li,et al.  Model-based integration of control and supervision for one kind of curing process , 2004, IEEE Transactions on Electronics Packaging Manufacturing.

[33]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[34]  David G. Stork,et al.  Pattern Classification , 1973 .

[35]  Weiyi Liu,et al.  Constructing probabilistic graphical model from predicate formulas for fusing logical and probabilistic knowledge , 2011, Inf. Sci..

[36]  Madan Gopal,et al.  Knowledge based Least Squares Twin support vector machines , 2010, Inf. Sci..

[37]  Boualem Boashash,et al.  The bootstrap and its application in signal processing , 1998, IEEE Signal Process. Mag..

[38]  Richard Weber,et al.  Simultaneous feature selection and classification using kernel-penalized support vector machines , 2011, Inf. Sci..

[39]  Zhenya He,et al.  Solving a kind of nonlinear programming problems via analog neural networks , 1998, Neurocomputing.

[40]  Han-Xiong Li,et al.  A probabilistic SVM based decision system for pain diagnosis , 2011, Expert Syst. Appl..

[41]  Gert Cauwenberghs,et al.  Sparse Probability Regression by Label Partitioning , 2003, COLT.

[42]  Junbin Gao,et al.  A Probabilistic Framework for SVM Regression and Error Bar Estimation , 2002, Machine Learning.

[43]  C. L. Philip Chen,et al.  An incremental adaptive implementation of functional-link processing for function approximation, time-series prediction, and system identification , 1998, Neurocomputing.

[44]  Gert Cauwenberghs,et al.  Gini Support Vector Machine: Quadratic Entropy Based Robust Multi-Class Probability Regression , 2007, J. Mach. Learn. Res..

[45]  Tony Jebara,et al.  Probability Product Kernels , 2004, J. Mach. Learn. Res..

[46]  Jianqiang Yang,et al.  Exploiting Uncertain Data in Support Vector Classification , 2007, KES.

[47]  Su-Yun Huang,et al.  Reduced Support Vector Machines: A Statistical Theory , 2007, IEEE Transactions on Neural Networks.