Research of Web-Page Multi-Class Classification System Based on Support Vector Machine

A Chinese web-page classification algorithm based on SVM including the important aspects of text preprocessing, feature selection and multiple-Classification algorithm. In this paper, based on the analyses of features of Web documents, this paper does research the approach of classification in Support Vector Machine (SVM) and select of Kernel function. Furthermore, a web-page classification model and algorithm that is based on Binary Tree SVM is presented. The experiments show that it not only reduces the size of train set, but also has very high training efficiency. Its precision and recall are better.