Unveil the Spams in Weibo

In online social network(OSN), spams refer to the messages deliberately propagated by spammers. Annoyed advertisements, illegal contents, malware and phishing links can all be spread by spams. Some detection approaches have been proposed in previous works. However, on the one hand, some of the assumptions they rely on are still unproven. On the other hand, most of the previous works focus on famous sites such as Twitter and Facebook. The effectiveness of these approaches on other networks is still unknown. In this paper, we study the problem of spam in Weibo, Chinas leading micro-blog service. With a dataset of 375, 430 posts and 2, 370 users crawled from Weibo, we conducted a deep analysis in spammers and spammed posts. The contributions of our work are as follows. First, we work on the unproven assumption of regarding spammers as botnet users, which is relied by many previous works. We find that nowadays spammers perform more like regular users, which indicates that some previous methods are not effective any more. Second, we investigate the burst properties in spammed posts and legitimate posts. We find that there're some useful features can be extracted to separate them. Finally, we provide some helpful features that are suitable for spam detection in Weibo.

[1]  Thomas Mandl,et al.  Comparing Chinese and German blogs , 2009, HT '09.

[2]  Yiqun Liu,et al.  User behavior oriented web spam detection , 2008, WWW.

[3]  Michael Kaminsky,et al.  SybilGuard: Defending Against Sybil Attacks via Social Networks , 2008, IEEE/ACM Transactions on Networking.

[4]  Nick Feamster,et al.  Observing common spam in Twitter and email , 2012, Internet Measurement Conference.

[5]  Anirban Dasgupta,et al.  Enhanced email spam filtering through combining similarity graphs , 2011, WSDM '11.

[6]  Padmini Srinivasan,et al.  Spam detection in online classified advertisements , 2011, WebQuality '11.

[7]  Steven Myers,et al.  Prevalence and mitigation of forum spamming , 2011, 2011 Proceedings IEEE INFOCOM.

[8]  Yong Yu,et al.  A comparative study of users' microblogging behavior on sina weibo and twitter , 2012, UMAP.

[9]  Dawn Xiaodong Song,et al.  Design and Evaluation of a Real-Time URL Spam Filtering Service , 2011, 2011 IEEE Symposium on Security and Privacy.

[10]  Wang Hui,et al.  Measurement of Microblogging Network , 2012 .

[11]  Yiqun Liu,et al.  Identifying web spam with user behavior analysis , 2008, AIRWeb '08.

[12]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, CCS '10.

[13]  Virgílio A. F. Almeida,et al.  Identifying video spammers in online social networks , 2008, AIRWeb '08.

[14]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.

[15]  Dawn Xiaodong Song,et al.  Suspended accounts in retrospect: an analysis of twitter spam , 2011, IMC '11.

[16]  Hao Chen,et al.  Spam double-funnel: connecting web spammers with advertisers , 2007, WWW '07.

[17]  Calton Pu,et al.  Study of Static Classification of Social Spam Profiles in MySpace , 2010, ICWSM.

[18]  Anand Mahendran,et al.  Comment Spam Classification in Blogs through Comment Analysis and Comment-Blog Post Relationships , 2012, CICLing.