A New Spam Short Message Classification

This paper proposes an approach of dual-filtering messages. First the combination of KNN classification algorithm and rough set separates spam messages from messages. To avoid lowering precision for reduction, it needs to use KNN classification algorithm to re-filter some messages. This method not only improves the speed of classification but also retains high accuracy based on rough set of KNN classification algorithm.

[1]  Wei Zheng,et al.  Filtering Short Message Spam of Group Sending Using CAPTCHA , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[2]  Henrik Boström,et al.  Classification of Microarrays with kNN: Comparison of Dimensionality Reduction Methods , 2007, IDEAL.

[3]  Xu Cong-fu A Large-Scale Online Spam Short Message Filtering System , 2008 .

[4]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[5]  Chun Guan,et al.  Emotional Agent Based on Rough Set , 2007, 2007 International Conference on Computational Intelligence and Security Workshops (CISW 2007).

[6]  Xu Xin,et al.  Advances in Machine Learning Based Text Categorization , 2006 .