A local mixture based SVM for an efficient supervised binary classification

Despite support vector machines' (SVM) robustness and optimality, SVM do not scale well computationally. Suffering from slow training convergence on large datasets, SVM online testing time can be suboptimal because SVM write the classifier hyper-plane model as a sum of support vectors that could total as much as half the datasets. Motivated to speed up SVM real time testing by reducing the number of SV, we introduce in this paper a novel local mixture based SVM (LMSVM) approach that exploits the increased separability provided by the kernel trick, while introducing a onetime computational expense. LMSVM applies kernel k-means clustering to the data in kernel space before pruning unwanted clusters based on a mixture measure for label heterogeneity. LMSVM's computational complexity and classification accuracy on four databases from UCI show promising results and motivate follow on research.

[1]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[2]  N. V. Vinodchandran,et al.  SVM-based generalized multiple-instance learning via approximate box counting , 2004, ICML.

[3]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[4]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[5]  Lukasz A. Kurgan,et al.  Knowledge discovery approach to automated cardiac SPECT diagnosis , 2001, Artif. Intell. Medicine.

[6]  Chen Lin,et al.  A New Method to Construct Reduced Vector Sets for Simplifying Support Vector Machines , 2006, 2006 IEEE International Conference on Engineering of Intelligent Systems.

[7]  Hichem Frigui,et al.  Fuzzy clustering with multiple kernels in feature space , 2012, 2012 IEEE International Conference on Fuzzy Systems.

[8]  S. Maouche,et al.  Self-adaptive kernel machine: online clustering in RKHS , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[9]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[10]  Krzysztof J. Cios,et al.  CLIP3: Cover learning using integer programming , 1997 .

[11]  Bo Kong,et al.  Reduced Support Vector Machine Based on Margin Vectors , 2010, 2010 International Conference on Computational Intelligence and Software Engineering.

[12]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[13]  Tom Downs,et al.  Exact Simplification of Support Vector Solutions , 2002, J. Mach. Learn. Res..

[14]  Guo-Sheng Hu,et al.  A support vector reduction method for accelerating calculation , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[15]  Hou Shu,et al.  Research Classification of Printing Fault Based on RSVM , 2008, 2008 3rd International Conference on Innovative Computing Information and Control.

[16]  Xindong Wu,et al.  Support vector machines based on K-means clustering for real-time business intelligence systems , 2005, Int. J. Bus. Intell. Data Min..

[17]  Thomas G. Dietterich,et al.  A Comparison of Dynamic Reposing and Tangent Distance for Drug Activity Prediction , 1993, NIPS.

[18]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[19]  Ning Chen,et al.  Extended Kernel Self-Organizing Map Clustering Algorithm , 2009, 2009 Fifth International Conference on Natural Computation.

[20]  Ming Li,et al.  Speaker Identification Based on Multi-reduced SVM , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[21]  Li Zhang,et al.  Support vectors pre-extracting for support vector machine based on K nearest neighbour method , 2008, 2008 International Conference on Information and Automation.

[22]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[23]  Krzysztof J. Cios,et al.  Hybrid inductive machine learning: an overview of CLIP algorithms , 2002 .

[24]  Zheng Nanning,et al.  Unsupervised clustering based reduced support vector machines , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[25]  Wlodzislaw Duch,et al.  Optimization of logical rules derived by neural procedures , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[26]  Yuh-Jye Lee,et al.  RSVM: Reduced Support Vector Machines , 2001, SDM.

[27]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[28]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[29]  Zhi-Hua Zhou Multi-Instance Learning : A Survey , 2004 .

[30]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[31]  Juan C. Cockburn,et al.  Reduced support vector machines applied to real-time face tracking , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[32]  Hu Ya-ting A Kernel Based Fuzzy Clustering Algorithm , 2008 .

[33]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[34]  Tu Bao Ho,et al.  A bottom-up method for simplifying support vector solutions , 2006, IEEE Transactions on Neural Networks.

[35]  He Guo-guang,et al.  Real-Time Intelligent Recognition of Chaos in Traffic Flow Using Reduced Support Vector Machine , 2007, 2007 International Conference on Wireless Communications, Networking and Mobile Computing.

[36]  Harris Drucker,et al.  Comparison of learning algorithms for handwritten digit recognition , 1995 .

[37]  Yong Wang,et al.  Modeling for Optimal Probability Prediction , 2002, ICML.

[38]  Samy Bengio,et al.  Online adaptive policies for ensemble classifiers , 2005, Neurocomputing.