A novel support vector regression for data set with outliers

Support vector machine is sensitive to the outliers.A novel support vector regression together with fuzzification theory, inconsistency matrix and neighbors match operator is presented.The objective of this novel support vector regression is to increase the generalization ability for data set with outliers. Support vector machine (SVM) is sensitive to the outliers, which reduces its generalization ability. This paper presents a novel support vector regression (SVR) together with fuzzification theory, inconsistency matrix and neighbors match operator to address this critical issue. Fuzzification method is exploited to assign similarities on the input space and on the output response to each pair of training samples respectively. The inconsistency matrix is used to calculate the weights of input variables, followed by searching outliers through a novel neighborhood matching algorithm and then eliminating them. Finally, the processed data is sent to the original SVR, and the prediction results are acquired. A simulation example and three real-world applications demonstrate the proposed method for data set with outliers.

[1]  C. K. Kwong,et al.  Case-based reasoning approach to concurrent design of low power transformers , 2002 .

[2]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD 2000.

[3]  Xuegong Zhang,et al.  Using class-center vectors to build support vector machines , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[4]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[5]  Ming-Chi Lee,et al.  Using support vector machine with a hybrid feature selection method to the stock trend prediction , 2009, Expert Syst. Appl..

[6]  Tzung-Pei Hong,et al.  A fuzzy AprioriTid mining algorithm with reduced computational time , 2004, Appl. Soft Comput..

[7]  Qian Liu,et al.  Weighted least squares support vector machine local region method for nonlinear time series prediction , 2010, Appl. Soft Comput..

[8]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[9]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[10]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[11]  Mahesh Panchal,et al.  A Review on Support Vector Machine for Data Classification , 2012 .

[12]  Baoqing Li Distance-Based Selection of Potential Support Vectors by Kernel Matrix , 2004, ISNN.

[13]  Shun-Feng Su,et al.  Robust support vector regression networks for function approximation with outliers , 2002, IEEE Trans. Neural Networks.

[14]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[15]  F. Tay,et al.  Application of support vector machines in financial time series forecasting , 2001 .

[16]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[17]  Zne-Jung Lee,et al.  Hybrid robust support vector machines for regression with outliers , 2011, Appl. Soft Comput..

[18]  Wei-Chiang Hong,et al.  SVR with hybrid chaotic genetic algorithms for tourism demand forecasting , 2011, Appl. Soft Comput..

[19]  Mariette Awad,et al.  Dynamic classification for video stream using support vector machine , 2008, Appl. Soft Comput..

[20]  Xiaowei Yang,et al.  Robust least squares support vector machine based on recursive outlier elimination , 2010, Soft Comput..

[21]  Clara Pizzuti,et al.  Fast Outlier Detection in High Dimensional Spaces , 2002, PKDD.

[22]  Yanqing Zhang,et al.  Support vector machines with genetic fuzzy feature transformation for biomedical data classification , 2007, Inf. Sci..

[23]  Wen-der Yu,et al.  Hybridization of CBR and numeric soft computing techniques for mining of scarce construction databases , 2006 .

[24]  Peter Williams,et al.  The generalization error of the symmetric and scaled support vector machines , 2001, IEEE Trans. Neural Networks.

[25]  Denis Hamad,et al.  Bounded Influence Support Vector Regression for Robust Single-Model Estimation , 2009, IEEE Transactions on Neural Networks.

[26]  Cungen Cao,et al.  Some issues about outlier detection in rough set theory , 2009, Expert Syst. Appl..

[27]  Su Wu,et al.  A framework for intelligent reliability centered maintenance analysis , 2008, Reliab. Eng. Syst. Saf..

[28]  Yuanyuan Wang,et al.  A rough margin based support vector machine , 2008, Inf. Sci..

[29]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[30]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[31]  Theodore Johnson,et al.  Fast Computation of 2-Dimensional Depth Contours , 1998, KDD.

[32]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[33]  Dinggang Shen,et al.  An adaptive error penalization method for training an efficient and generalized SVM , 2006, Pattern Recognit..

[34]  Engin Avci Selecting of the optimal feature subset and kernel parameters in digital modulation classification by using hybrid genetic algorithm-support vector machines: HGASVM , 2009, Expert Syst. Appl..

[35]  Johan A. K. Suykens,et al.  Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.