A survey on classification techniques for text mining

The development of the World Wide Web it is no longer feasible for a user can understand all the data coming from classify into categories. The expansion of information and power automatic classification of data and textual data gains increasingly and give high performance. In this paper five important text classifications are described Naive Bayesian, K-Nearest Neighbour, Support Vector Machine, Decision Tree and Regression. Which are categorized the text data into pre define class. The target of the paper is to study different classification techniques and finding the classification accuracy for different datasets. An efficient and effective text documents into mutually exclusive categories.

[1]  Cynthia Rudin,et al.  The Bayesian Case Model: A Generative Approach for Case-Based Reasoning and Prototype Classification , 2014, NIPS.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  S. Venkatesan,et al.  Performance comparison of various information retrieval models used in search engines , 2012, 2012 International Conference on Communication, Information & Computing Technology (ICCICT).

[4]  Cynthia Rudin,et al.  Methods and Models for Interpretable Linear Classification , 2014, ArXiv.

[5]  Yong-Soo Kim,et al.  Comparison of the decision tree, artificial neural network, and linear regression methods based on the number and types of independent variables and sample size , 2008, Expert Syst. Appl..

[6]  Mansi Goyal,et al.  An Efficient Malicious Email Detection Using Multi Naive Bayes Classifier , 2015 .

[7]  V. Sharmila,et al.  Fuzzy Based Co-Cluster Algorithm for Text Clustering , 2015 .

[8]  Gaurav Sharma,et al.  A survey on text mining techniques , 2014 .

[9]  Katherine A. Heller,et al.  Growing a list , 2013, Data Mining and Knowledge Discovery.

[10]  Zhi-Hua Zhou,et al.  Distributional features for text categorization , 2006 .

[11]  Gurpreet Singh Lehal,et al.  A Survey of Text Mining Techniques and Applications , 2009 .

[12]  M. Padmavathi,et al.  Clustering Sentence-Level Text Using a Fuzzy Back- Propagation Clustering Algorithm , 2015 .

[13]  S. Srinivasan,et al.  A Survey of Text Mining : Retrieval , Extraction and Indexing Techniques , 2012 .

[14]  Shalini Puri,et al.  A technical study and analysis on fuzzy similarity based models for text classification , 2012, ArXiv.

[15]  C. R. Barde,et al.  Sentence Level Clustering using Fuzzy Relational Algorithm , 2015 .

[16]  Alex Alves Freitas,et al.  Comprehensible classification models: a position paper , 2014, SKDD.

[17]  Cynthia Rudin,et al.  Learning theory analysis for association rules and sequential event prediction , 2013, J. Mach. Learn. Res..

[18]  E. Chang,et al.  A survey in traditional information retrieval models , 2008, 2008 2nd IEEE International Conference on Digital Ecosystems and Technologies.

[19]  Cynthia Rudin,et al.  Sequential Event Prediction with Association Rules , 2011, COLT.

[20]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[21]  D. S. Guru,et al.  Texture Features and KNN in Classification of Flower Images , 2010 .