An approach of multi-hierarchy text classification
暂无分享,去创建一个
Improves on the classical formula of calculating the term weight in the vector space model. Furthermore, an approach to multi-hierarchy text classification based on the vector space model is proposed. In this approach, all classes are organized as a tree according to some given hierarchical relations, and all the training documents in a class are combined into a class-document. In order to construct the class models, only the class-documents attached to the same node of the same layer are compared. When classifying the documents, one matching process is hierarchically performed from the root node to the leaf nodes until a corresponding subclass is found. The experiment and real systems indicate that the approach is of high classification precision and recall.
[1] Li Xiao. THE CONCEPT-REASONING NETWORK AND ITS APPLICATION IN TEXT CLASSIFICATION , 2000 .
[2] Lu Song. An Improved Approach to Weighting Terms in Text , 2000 .
[3] Yiming Yang,et al. An example-based mapping method for text categorization and retrieval , 1994, TOIS.
[4] Yiming Yang,et al. Expert network: effective and efficient learning from human decisions in text categorization and retrieval , 1994, SIGIR '94.