KNN text categorization algorithm based on LSA dimensionality reduction

To address the problem of automatic document classification, an improved k-nearest-neighbor (KNN) algorithm based on latent semantic analysis (LSA) dimensionality reduction is proposed. Using LSA to reduce the dimensionality of the text feature matrix improves both the efficiency of the KNN algorithm and the precision of the classifier. Experimental results show that the improved KNN algorithm performs well.
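
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a document-term count matrix, a truncated SVD as the LSA step, cosine similarity as the distance measure, and a majority vote among the nearest neighbors. The function names (lsa_fit, lsa_project, knn_predict), the six-term vocabulary, and the toy data are hypothetical and chosen for illustration.

```python
from collections import Counter

import numpy as np


def lsa_fit(X, k):
    """Return a (terms x k) projection matrix from a truncated SVD of X.

    X is a documents x terms matrix; k is the number of latent dimensions.
    """
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return Vt[:k].T


def lsa_project(X, V_k):
    """Map documents from term space into the k-dimensional latent space."""
    return X @ V_k


def _unit_rows(M):
    """Scale each row to unit length, leaving all-zero rows unchanged."""
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    return M / np.where(norms == 0, 1.0, norms)


def knn_predict(train_Z, train_y, query_Z, n_neighbors=3):
    """Label each query by majority vote of its most cosine-similar neighbors."""
    sims = _unit_rows(query_Z) @ _unit_rows(train_Z).T
    predictions = []
    for row in sims:
        nearest = np.argsort(-row)[:n_neighbors]
        votes = Counter(train_y[i] for i in nearest)
        predictions.append(votes.most_common(1)[0][0])
    return predictions


# Hypothetical toy corpus. Columns: ball, goal, team, stock, market, price.
train_X = np.array([
    [2, 1, 0, 0, 0, 0],
    [1, 0, 2, 0, 0, 0],
    [0, 2, 1, 0, 0, 0],
    [0, 0, 0, 2, 1, 0],
    [0, 0, 0, 1, 0, 2],
    [0, 0, 0, 0, 2, 1],
], dtype=float)
train_y = ["sports", "sports", "sports", "finance", "finance", "finance"]

query_X = np.array([
    [1, 1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
], dtype=float)

V_k = lsa_fit(train_X, k=2)            # learn the latent space on training data
train_Z = lsa_project(train_X, V_k)    # 6 x 2 instead of 6 x 6
query_Z = lsa_project(query_X, V_k)    # queries use the same projection
predictions = knn_predict(train_Z, train_y, query_Z, n_neighbors=3)
```

In this sketch the neighbor search runs over k-dimensional vectors rather than full term vectors, which is where the efficiency gain described in the abstract would come from. On a real corpus the count matrix would typically be weighted (for example with TF-IDF) and sparse, and k would be tuned experimentally.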