As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive. Automatic classification of text can provide this information at low cost, but the classifiers themselves must be built with expensive human effort, or trained from texts which have themselves been manually classified. In this paper we will discuss a procedure of classifying text using the concept of association rule of data mining. Association rule mining technique has been used to derive feature set from pre-classified text documents. Naive Bayes classifier is then used on derived features for final classification.
[1]
Philip J. Hayes,et al.
CONSTRUE/TIS: A System for Content-Based Indexing of a Database of News Stories
,
1990,
IAAI.
[2]
W. Bruce Croft,et al.
Term clustering of syntactic phrases
,
1989,
SIGIR '90.
[3]
Thomas G. Dietterich.
What is machine learning?
,
2020,
Archives of Disease in Childhood.
[4]
Jiawei Han,et al.
Data Mining: Concepts and Techniques
,
2000
.