SMSAD: a framework for spam message and spam account detection

Short message communication media, such as mobile and microblogging social networks, have become attractive platforms for spammers to disseminate unsolicited contents. However, the traditional content-based methods for spam detection degraded in performance due to many factors. For instance, unlike the contents posted on social networks like Facebook and Renren, SMS and microblogging messages have limited size with the presence of many domain specific words, such as idioms and abbreviations. In addition, microblogging messages are very unstructured and noisy. These distinguished characteristics posed challenges to existing email spam detection models for effective spam identification in short message communication media. The state-of-the-art solutions for social spam accounts detection have faced different evasion tactics in the hands of intelligent spammers. In this paper, a unified framework is proposed for both spam message and spam account detection tasks. We utilized four datasets in this study, two of which are from SMS spam message domain and the remaining two from Twitter microblog. To identify a minimal number of features for spam account detection on Twitter, this paper studied bio-inspired evolutionary search method. Using evolutionary search algorithm, a compact model for spam account detection is proposed, which is incorporated in the machine learning phase of the unified framework. The results of the various experiments conducted indicate that the proposed framework is promising for detecting both spam message and spam account with a minimal number of features.

[1]  AbdulMalik S. Al-Salman,et al.  TSD: Detecting Sybil Accounts in Twitter , 2014, 2014 13th International Conference on Machine Learning and Applications.

[2]  Zhiwu Lu,et al.  Community Based Spammer Detection in Social Networks , 2015, WAIM.

[3]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.

[4]  Sushil Jajodia,et al.  Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg? , 2012, IEEE Transactions on Dependable and Secure Computing.

[5]  Dawn Xiaodong Song,et al.  Suspended accounts in retrospect: an analysis of twitter spam , 2011, IMC '11.

[6]  Nor Badrul Anuar,et al.  Malicious accounts: Dark of the social networks , 2017, J. Netw. Comput. Appl..

[7]  Jong Kim,et al.  Early filtering of ephemeral malicious accounts on Twitter , 2014, Comput. Commun..

[8]  Haining Wang,et al.  Detecting Social Spam Campaigns on Twitter , 2012, ACNS.

[9]  Wei Hu,et al.  Twitter spammer detection using data stream clustering , 2014, Inf. Sci..

[10]  Huan Liu,et al.  Social Spammer Detection in Microblogging , 2013, IJCAI.

[11]  Krishna P. Gummadi,et al.  Understanding and combating link farming in the twitter social network , 2012, WWW.

[12]  Mohd Zalisham Jali,et al.  A Perception Model of Spam Risk Assessment Inspired by Danger Theory of Artificial Immune Systems , 2015 .

[13]  Yi Zhang,et al.  Discover millions of fake followers in Weibo , 2016, Social Network Analysis and Mining.

[14]  Chia-Mei Chen,et al.  Feature set identification for detecting suspicious URLs using Bayesian classification in social networks , 2014, Inf. Sci..

[15]  El-Sayed M. El-Alfy,et al.  Spam filtering framework for multimodal mobile communication based on dendritic cell algorithm , 2016, Future Gener. Comput. Syst..

[16]  David G. Schwartz,et al.  Social network analysis of web links to eliminate false positives in collaborative anti-spam systems , 2011, J. Netw. Comput. Appl..

[17]  Vangelis Metsis,et al.  Spam Filtering with Naive Bayes - Which Naive Bayes? , 2006, CEAS.

[18]  Tiago A. Almeida,et al.  Towards SMS Spam Filtering: Results under a New Dataset , 2013 .

[19]  Jun Ho Huh,et al.  Hybrid spam filtering for mobile communication , 2009, Comput. Secur..

[20]  V. Paxson,et al.  The Underground on 140 Characters or Less ∗ , 2010 .

[21]  Juan Martínez-Romo,et al.  Detecting malicious tweets in trending topics using a statistical analysis of language , 2013, Expert Syst. Appl..

[22]  Albert Y. Zomaya,et al.  Segregating Spammers and Unsolicited Bloggers from Genuine Experts on Twitter , 2018, IEEE Transactions on Dependable and Secure Computing.

[23]  Vimala Balakrishnan,et al.  Improving document relevancy using integrated language modeling techniques , 2016 .

[24]  Nor Badrul Anuar,et al.  The rise of "malware": Bibliometric analysis of malware study , 2016, J. Netw. Comput. Appl..

[25]  Muhammad Abulaish,et al.  An MCL-Based Approach for Spam Profile Detection in Online Social Networks , 2012, 2012 IEEE 11th International Conference on Trust, Security and Privacy in Computing and Communications.

[26]  H. Manurung An evolutionary algorithm approach to poetry generation , 2004 .

[27]  Wei Wang,et al.  Application of Bayesian Method to Spam SMS Filtering , 2009, 2009 International Conference on Information Engineering and Computer Science.

[28]  Onder Coban,et al.  SMS spam filtering based on text classification and expert system , 2015, 2015 23nd Signal Processing and Communications Applications Conference (SIU).

[29]  Gordon V. Cormack,et al.  Spam filtering for short messages , 2007, CIKM '07.

[30]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2011, IEEE Transactions on Information Forensics and Security.

[31]  Kasturi Dewi Varathan,et al.  Cybercrime detection in online communications: The experimental case of cyberbullying detection in the Twitter network , 2016, Comput. Hum. Behav..

[32]  Patrick P. K. Chan,et al.  Spam filtering for short messages in adversarial environment , 2015, Neurocomputing.

[33]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[34]  Ponnurangam Kumaraguru,et al.  PhishAri : Automatic Realtime Phishing Detection on Twitter Anupama Aggarwal , 2012 .

[35]  Kyumin Lee,et al.  Uncovering social spammers: social honeypots + machine learning , 2010, SIGIR.

[36]  Bin Wu,et al.  SDHM: A hybrid model for spammer detection in Weibo , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[37]  Zheyi Chen,et al.  Detecting spammers on social networks , 2015, Neurocomputing.

[38]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[39]  Gianluca Stringhini,et al.  Towards Detecting Compromised Accounts on Social Networks , 2015, IEEE Transactions on Dependable and Secure Computing.

[40]  Xiutian Cui,et al.  Identifying Suspended Accounts In Twitter , 2016 .