A Fuzzy Similarity Based Concept Mining Model for Text Classification

Text Classification is a challenging and a red hot field in the current scenario and has great importance in text categorization applications. A lot of research work has been done in this field but there is a need to categorize a collection of text documents into mutually exclusive categories by extracting the concepts or features using supervised learning paradigm and different classification algorithms. In this paper, a new Fuzzy Similarity Based Concept Mining Model (FSCMM) is proposed to classify a set of text documents into pre - defined Category Groups (CG) by providing them training and preparing on the sentence, document and integrated corpora levels along with feature reduction, ambiguity removal on each level to achieve high system performance. Fuzzy Feature Category Similarity Analyzer (FFCSA) is used to analyze each extracted feature of Integrated Corpora Feature Vector (ICFV) with the corresponding categories or classes. This model uses Support Vector Machine Classifier (SVMC) to classify correctly the training data patterns into two groups; i. e., + 1 and - 1, thereby producing accurate and correct results. The proposed model works efficiently and effectively with great performance and high - accuracy results.

[1]  Xiuqi Li,et al.  Web document classification based on fuzzy association , 2002, Proceedings 26th Annual International Computer Software and Applications.

[2]  Kevin Kok Wai Wong,et al.  Exploring the use of fuzzy signature for text mining , 2010, International Conference on Fuzzy Systems.

[3]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[4]  Fakhri Karray,et al.  An Efficient Concept-Based Mining Model for Enhancing Text Clustering , 2010, IEEE Transactions on Knowledge and Data Engineering.

[5]  Nor Azan Mat Zin,et al.  Classifying modality learning styles based on Production-Fuzzy Rules , 2011, 2011 International Conference on Pattern Analysis and Intelligence Robotics.

[6]  Ana Cristina Bicharra Garcia,et al.  An Analysis of Constructed Categories for Textual Classification Using Fuzzy Similarity and Agglomerative Hierarchical Methods , 2007, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System.

[7]  Seyed Mostafa Fakhrahmad,et al.  Efficient Fuzzy Rule Generation: A New Approach Using Data Mining Principles and Rule Weighting , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[8]  Wei Chen,et al.  Fuzzy ontology generation model using fuzzy clustering for learning evaluation , 2009, 2009 IEEE International Conference on Granular Computing.

[9]  Pat Langley,et al.  Artificial Intelligence and Intelligent Systems , 2006 .

[10]  Chen Li,et al.  Advances in Research of Fuzzy C-Means Clustering Algorithm , 2011, 2011 International Conference on Network Computing and Information Security.

[11]  Ji Hyea Han,et al.  Data Mining : Concepts and Techniques 2 nd Edition Solution Manual , 2005 .

[12]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[13]  Mauridhi Hery Purnomo,et al.  Facial Emotional Expressions of Li fe-like Character Based on Text Classifier and Fuzzy Logic , 2011 .

[14]  Vincenzo Loia,et al.  Concept mining of semantic web services by means of extended Fuzzy Formal Concept Analysis (FFCA) , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[15]  P. Harini,et al.  A Fuzzy Self-Constructing Feature Clustering Algorithm for Text Classification , 2012 .