SDHM: A hybrid model for spammer detection in Weibo

As the microblogging service (such as Weibo) is becoming popular, spam becomes a serious problem of affecting the credibility and readability of Online Social Networks. Most existing studies took use of a set of features to identify spam, but without the consideration of the overlap and dependency among different features. In this study, we investigate the problem of spam detection by analyzing real spam dataset collections of Weibo and propose a novel hybrid model of spammer detection, called SDHM, which utilizing significant features, i.e. user behavior information, online social network attributes and text content characteristics, in an organic way. Experiments on real Weibo dataset demonstrate the power of the proposed hybrid model and the promising performance.

[1]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[2]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2011, IEEE Transactions on Information Forensics and Security.

[3]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[4]  Danah Boyd,et al.  Detecting Spam in a Twitter Network , 2009, First Monday.

[5]  Brian D. Davison,et al.  Identifying link farm spam pages , 2005, WWW '05.

[6]  Lin Liu,et al.  Detecting Spam in Chinese Microblogs - A Study on Sina Weibo , 2012, 2012 Eighth International Conference on Computational Intelligence and Security.

[7]  Xiaokang Yang,et al.  Analysis and identification of spamming behaviors in Sina Weibo microblog , 2013, SNAKDD '13.

[8]  Konstantin Beznosov,et al.  The socialbot network: when bots socialize for fame and money , 2011, ACSAC '11.

[9]  Wei Hu,et al.  Twitter spammer detection using data stream clustering , 2014, Inf. Sci..

[10]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[11]  Srinivasan Venkatesh,et al.  Battling the Internet water army: Detection of hidden paid posters , 2011, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[12]  Yanlei Wu,et al.  2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2014, Beijing, China, August 17-20, 2014 , 2014, ASONAM.

[13]  Dawn Xiaodong Song,et al.  Suspended accounts in retrospect: an analysis of twitter spam , 2011, IMC '11.

[14]  P. Oscar Boykin,et al.  Leveraging social networks to fight spam , 2005, Computer.

[15]  Vicenç Gómez,et al.  Statistical analysis of the social network and discussion threads in slashdot , 2008, WWW.

[16]  Muhammad Abulaish,et al.  A generic statistical approach for spam detection in Online Social Networks , 2013, Comput. Commun..

[17]  Aoying Zhou,et al.  Duplicate Detection for Identifying Social Spam in Microblogs , 2013, 2013 IEEE International Congress on Big Data.

[18]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, CCS '10.

[19]  Chao Yang,et al.  CATS: Characterizing automation of Twitter spammers , 2013, 2013 Fifth International Conference on Communication Systems and Networks (COMSNETS).

[20]  Anna Cinzia Squicciarini,et al.  Online abusive users analytics through visualization , 2014, WWW '14 Companion.