A Comparative Study on Different Types of Approaches to Text Categorization

three ways unsupervised, supervised and semi supervised methods. Text categorization refers to the process of assign a category or some categories among predefined ones to each document, automatically. This paper presents a comparative study on different types of approaches to text categorization.

[1]  Yoram Singer,et al.  Context-sensitive learning methods for text categorization , 1996, SIGIR '96.

[2]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[3]  William W. Cohen,et al.  Context-sensitive learning methods for text categorization , 1999, TOIS.

[4]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[5]  Vincent Tam,et al.  A Comparative Study of Centroid-Based, Neighborhood-Based and Statistical Approaches for Effective Document Categorization , 2002, ICPR.

[6]  Jinwoo Park,et al.  Improving text categorization using the importance of sentences , 2004, Inf. Process. Manag..

[7]  Moustafa Ghanem,et al.  A novel refinement approach for text categorization , 2005, CIKM '05.

[8]  Liang Su,et al.  Dictionary-based text categorization of chemical web pages , 2006, Inf. Process. Manag..

[9]  Hisham Al-Mubaid,et al.  A New Text Categorization Technique Using Distributional Clustering and Learning Logic , 2006, IEEE Transactions on Knowledge and Data Engineering.

[10]  Jung-Hsien Chiang,et al.  Hierarchically SVM classification based on support vector clustering method and its application to document categorization , 2007, Expert Syst. Appl..

[11]  Taeho Jo SITE Categorizer ) : Neural Network for Text Categorization , 2007 .

[12]  Dino Isa,et al.  Text Document Preprocessing with the Bayes Formula for Classification Using the Support Vector Machine , 2008, IEEE Transactions on Knowledge and Data Engineering.

[13]  Dino Isa,et al.  Using the self organizing map for clustering of text documents , 2009, Expert Syst. Appl..

[14]  Jian Su,et al.  Supervised and Traditional Term Weighting Methods for Automatic Text Categorization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Hongyun Zhang,et al.  Rough set based hybrid algorithm for text classification , 2009, Expert Syst. Appl..

[16]  Elias Oliveira,et al.  Multi-Label Text Categorization Using a Probabilistic Neural Network , 2009 .

[17]  S. Ramasundaram,et al.  Text Categorization by Backpropagation Network , 2010 .

[18]  D. S. Guru,et al.  Representation and Classification of Text Documents: A Brief Review , 2010 .