Graph-Based Spam Image Detection for Mobile Phone Spam Image Filtering

Spam images in mobile phones have increasingly appeared these days. As the spam filtering systems become more sophisticated, spams are being more intelligent. Although detection of email-spams has been quite successful, there have not been effective solutions for detecting mobile phone spams yet, especially, spam images. In addition to the expensive image processing time, insufficient spam image data in mobile phones makes it challenging to train a general model. To address this issue, the authors propose a graph-based approach that utilizes graph structure in abundant e-mail spam dataset. The authors employ different clustering algorithms to find a subset of e-mail spam images similar to phone spam images. Furthermore, the performance behavior with respect to different image descriptors of Pyramid Histogram of Visual Words PHOW and RGB histogram is extensively investigated. The authors' results highlight that the proposed idea is fairly meaningful in increasing training data size, thus effectively improving image spam detection performance.

[1]  Malcolm Slaney,et al.  Image classification using the web graph , 2010, ACM Multimedia.

[2]  James A. Herson,et al.  Image analysis for efficient categorization of image-based spam e-mail , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[3]  Fabio Roli,et al.  A survey and experimental evaluation of image spam filtering techniques , 2011, Pattern Recognit. Lett..

[4]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[5]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[6]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[7]  Ming Yang,et al.  Image spam hunter , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Fabio Roli,et al.  Spam Filtering Based On The Analysis Of Text Information Embedded Into Images , 2006, J. Mach. Learn. Res..

[10]  Zhiguang Qin,et al.  Graph-Based Semi-supervised Feature Selection with Application to Automatic Spam Image Identification , 2011 .

[11]  M. Soranamageswari,et al.  Statistical Feature Extraction for Classification of Image Spam Using Artificial Neural Networks , 2010, 2010 Second International Conference on Machine Learning and Computing.

[12]  Charles Elkan,et al.  Using the Triangle Inequality to Accelerate k-Means , 2003, ICML.

[13]  Walmir M. Caminhas,et al.  A review of machine learning approaches to Spam filtering , 2009, Expert Syst. Appl..

[14]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Tu Minh Phuong,et al.  An Efficient Method for Filtering Image-Based Spam , 2007, 2007 IEEE International Conference on Research, Innovation and Vision for the Future.

[16]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[17]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[18]  Andrew Zisserman,et al.  Efficient Additive Kernels via Explicit Feature Maps , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Basheer Al-Duwairi,et al.  Texture Analysis-Based Image Spam Filtering , 2011, 2011 International Conference for Internet Technology and Secured Transactions.

[20]  Chien-Chung Chan,et al.  Application of Learning Algorithms to Image Spam Evolution , 2013 .

[21]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.