SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks

We have entered the era of social media networks represented by Facebook, Twitter, YouTube and Flickr. Internet users now spend more time on social networks than search engines. Business entities or public figures set up social networking pages to enhance direct interactions with online users. Social media systems heavily depend on users for content contribution and sharing. Information is spread across social networks quickly and effectively. However, at the same time social media networks become susceptible to different types of unwanted and malicious spammer or hacker actions. There is a crucial need in the society and industry for security solution in social media. In this demo, we propose SocialSpamGuard, a scalable and online social media spam detection system based on data mining for social network security. We employ our GAD clustering algorithm for large scale clustering and integrate it with the designed active learning algorithm to deal with the scalability and real-time detection challenges.

[1]  Sangkyum Kim,et al.  A general framework for efficient clustering of large datasets based on activity detection , 2011, Stat. Anal. Data Min..

[2]  Chien-Chung Chan,et al.  Mining pharmaceutical spam from Twitter , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[3]  P. Oscar Boykin,et al.  Collaborative Spam Filtering Using E-Mail Networks , 2006, Computer.

[4]  Virgílio A. F. Almeida,et al.  Identifying video spammers in online social networks , 2008, AIRWeb '08.

[5]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[6]  Calton Pu,et al.  Social Honeypots: Making Friends With A Spammer Near You , 2008, CEAS.

[7]  Gang Wang,et al.  iRIN: image retrieval in image-rich information networks , 2010, WWW '10.

[8]  Hermann Ney,et al.  Features for image retrieval: an experimental comparison , 2008, Information Retrieval.

[9]  Kyumin Lee,et al.  Uncovering social spammers: social honeypots + machine learning , 2010, SIGIR.

[10]  Calton Pu,et al.  A Discriminative Classifier Learning Approach to Image Modeling and Spam Image Identification , 2007, CEAS.

[11]  Ciro Cattuto,et al.  Social spam detection , 2009, AIRWeb '09.

[12]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[13]  Calton Pu,et al.  Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically , 2006, CEAS.

[14]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[15]  Katsuyuki Yamazaki,et al.  Density-based spam detector , 2004, IEICE Trans. Inf. Syst..