Study on Multi-class Text Classification Based on Improved SVM

Traditional SVM multi-class classification methods such as 1-a-r, 1-a-1, and DAG-SVM are popular learning techniques, but they often have the problems that the input text instances are nonlinear separable or their training/testing process is time consuming. In this chapter, we propose an improved SVM multi-class classification algorithm: first, the nonlinear separable texts in input space are mapped into a high dimensional feature space (Hilbert space) by using Mercer kernel function for obtaining linear separable texts; then in the Hilbert space, we construct SVM multi-class classifiers with binary tree to recognize testing instances. Our experiments demonstrate that for some web text classification problems, the proposed method can effectively solve the nonlinear separable problem existing in input text space, saves training/testing time efficiently, and improves the text classification accuracy.

[1]  Yong Luo,et al.  Pitch-density-based features and an SVM binary tree approach for multi-class audio classification in broadcast news , 2011, Multimedia Systems.

[2]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[3]  Shi Bing Fast kNN Text Classification Algorithm Based on Area Division , 2012 .

[4]  Djeffal Abdelhamid,et al.  A fast multi-class SVM learning method for huge databases , 2014 .

[5]  Zhiquan Wang,et al.  Recognition of human activities using SVM multi-class classifier , 2010, Pattern Recognit. Lett..

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Wang Ming-chun,et al.  Rough set text classification rule extraction based on CHI value , 2005 .

[8]  Dino Isa,et al.  An enhanced Support Vector Machine classification framework by using Euclidean distance function for text document categorization , 2011, Applied Intelligence.

[9]  Zhang Jian-wei Research on Feature Preextraction Method in Text Classification , 2011 .

[10]  Shi Hongbo,et al.  Bagging Bayes Text Classification Based on Map Reduce , 2012 .

[11]  Y Duan Application of SVM in Text Categorization , 2012 .

[12]  Liu Yong Research on classification of large-scale text on GPU platform , 2012 .

[13]  Zhu Yongli Parameter Optimization for Support Vector Machine and Its Application to Fault Diagnosis of Power Transformers , 2012 .

[14]  Zhu Yun-xia Text classification algorithm research based on clustering and neural network , 2012 .

[15]  Zhixiang Chen,et al.  The Covering Number for Some Mercer Kernel Hilbert Spaces on the Unit Sphere , 2011 .