Research and Implementation of Automatic Text Categorization System Based on VSM
This paper first surveys several key techniques in text categorization, and then presents the architecture of an implemented automatic text categorization system based on the vector space model (VSM). It focuses on the implementation algorithms, which determine the feature-selection dimension of the vectors using a test set during training and provide an "Average" matching-threshold adjustment method. As a result, the system outperforms traditional classification algorithms in precision and speed. Finally, the evaluation and test results are presented.
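To make the general idea concrete, the following is a minimal sketch of VSM-based categorization with a per-category matching threshold. It is not the system described here: the abstract does not specify the "Average" adjustment rule, so the threshold below (the midpoint between the mean in-class and mean out-of-class similarity to each category centroid) is an assumption chosen only for illustration. The function names `tf_vector`, `cosine`, `train`, and `classify` are hypothetical, raw term frequency stands in for whatever term weighting the system uses, and the test-set-driven choice of feature dimension is omitted.

```python
import math
from collections import Counter


def tf_vector(text):
    """Raw term-frequency vector for a lowercased, whitespace-tokenized text."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * v.get(t, 0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def train(labeled_docs):
    """Build {category: (centroid, threshold)} from (text, category) pairs."""
    vectors = [(tf_vector(text), cat) for text, cat in labeled_docs]
    model = {}
    for cat in {c for _, c in vectors}:
        # Category centroid: summed term frequencies of its training documents.
        centroid = Counter()
        for vec, c in vectors:
            if c == cat:
                centroid.update(vec)
        inside = [cosine(v, centroid) for v, c in vectors if c == cat]
        outside = [cosine(v, centroid) for v, c in vectors if c != cat]
        mean_in = sum(inside) / len(inside)
        mean_out = sum(outside) / len(outside) if outside else 0.0
        # ASSUMPTION: midpoint threshold, not the paper's "Average" method.
        model[cat] = (centroid, (mean_in + mean_out) / 2)
    return model


def classify(model, text):
    """Return the best-matching category, or None if it misses its threshold."""
    vec = tf_vector(text)
    scores = {cat: cosine(vec, centroid) for cat, (centroid, _) in model.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= model[best][1] else None
```

In this sketch a document is assigned to the category whose centroid it is most similar to, but only when that similarity reaches the category's threshold; otherwise it is left uncategorized. A tuned threshold of this kind is one plausible way to trade precision against coverage.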