AN OVERVIEW OF TEXT CATEGORIZATION TECHNIQUES

Text categorization is one of the important techniques in textual data mining.This survey introduces general solutions to every step of the categorization process including document modeling,feature selection,dimensionality reduction,classification scheme selection.All classification algorithms mentioned are divided into several categories and are evaluated qualitatively and quantitatively by different measures.At the end,the paper presents some existing problems and future developments in text categorization field.