Recent Trends in Text Classification Techniques

Text Mining is the discovery of valuable, yet hidden, information from the text document. Text classification (Also called Text Categorization) is one of the important research issues in the field of text mining. With the dramatic increase in the amount of content available in digital forms gives rise to a problem to manage this online textual data. As a result, it has become a necessary to classify/categorize large texts (documents) into specific classes. Text Classification assigns a text document to one of a set of predefined classes. This paper covers different text classification techniques and also includes Classifier Architecture and Text Classification Applications. Terms Text Mining, Text Classification, Applications, Classifier Architecture, Classification Techniques.

[1]  Nicolás Marín,et al.  Association rule evaluation for classification purposes , 2005 .

[2]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[3]  Gurpreet Singh Lehal,et al.  A Survey of Text Mining Techniques and Applications , 2009 .

[4]  Houkuan Huang,et al.  Feature selection for text classification with Naïve Bayes , 2009, Expert Syst. Appl..

[5]  Wanli Ma,et al.  A New Term Ranking Method based on Relation Extraction and Graph Model for Text Classification , 2011, ACSC.

[6]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[7]  Malik Yousef,et al.  One-class document classification via Neural Networks , 2007, Neurocomputing.

[8]  Dorothea Blostein,et al.  A survey of document image classification: problem statement, classifier architecture and performance evaluation , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[9]  Yiming Yang,et al.  Text categorization , 2008, Scholarpedia.

[10]  Frans Coenen,et al.  Text classification using graph mining-based feature extraction , 2010 .

[11]  Xijin Tang,et al.  Text classification based on multi-word with support vector machine , 2008, Knowl. Based Syst..

[12]  D. S. Guru,et al.  Representation and Classification of Text Documents: A Brief Review , 2010 .

[13]  Chen Wang,et al.  ICL at NTCIR-7: An Improved KNN Algorithm for Text Categorization , 2008, NTCIR.

[14]  Verayuth Lertnattee,et al.  Class normalization in centroid-based text categorization , 2006, Inf. Sci..

[15]  Songbo Tan,et al.  An improved centroid classifier for text categorization , 2008, Expert Syst. Appl..

[16]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.

[17]  Machdel C. Matthee,et al.  Differentiating between data-mining and text-mining terminology , 2004 .

[18]  Christoph Goller,et al.  Automatic Document Classification - A thorough Evaluation of various Methods , 2000, ISI.

[19]  M. H. Shenassa Classification based on Predictive Association Rules , 2006 .

[20]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[21]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[22]  Naveen Aggarwal,et al.  CLASSIFICATION TECHNIQUES ANALYSIS , 2010 .

[23]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[24]  Richard Simon,et al.  The maximum likelihood neural network as a statistical classification model , 1995 .

[25]  Ali Selamat,et al.  Web page feature selection and classification using neural networks , 2004, Inf. Sci..

[26]  Stellan Ohlsson,et al.  Learning Tutorial Rules Using Classification Based On Associations , 2007, AIED.

[27]  Xuemin Lin,et al.  Term Graph Model for Text Classification , 2005, ADMA.