Paper Information - Sina-Weibo Spammer Detection with GBDT

Sina-Weibo Spammer Detection with GBDT

In China, Sina-Weibo's rising popularity as a microblogging website has inevitably attracted the attention of spammers. Spammers use a myriad of techniques to evade security mechanisms and post spam messages, which are either unwelcome advertisements or lures that trick victims into clicking malicious URLs embedded in spam tweets. The extensive application of machine learning in social media mining, together with Sina-Weibo's own development, suggests many new ideas for spammer detection. In this paper, we first present a comprehensive analysis aimed at new features specific to Sina-Weibo rather than to other social media, and we then design a new feature set to detect spammers. We crawl a large amount of Sina-Weibo data from the Internet and train a classifier with the gradient boosting decision tree (GBDT) algorithm. Our experiments show that the newly designed features are much more effective than some existing detectors, and that the GBDT classifier also achieves a significant improvement in both accuracy and false-positive (FP) rate.
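The pipeline described in the abstract (features, a GBDT classifier, then accuracy and FP-rate) can be illustrated with a minimal sketch. This is not the paper's implementation: the two features (`urls_per_post`, `follower_ratio`), the toy data, and all function names are hypothetical, and the booster is a simplified variant that fits depth-1 trees (stumps) to the gradient of the logistic loss and uses the mean residual as the leaf value.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Find the one-feature threshold split with least squared error on residuals."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, thr, lm, rm)
    if best is None:  # every feature is constant: fall back to a constant leaf
        mean = sum(residuals) / len(residuals)
        return (0, float("inf"), mean, mean)
    return best[1:]

def fit_gbdt(X, y, n_rounds=20, learning_rate=0.5):
    """Boost stumps on the negative gradient of the logistic loss (y - p)."""
    p = sum(y) / len(y)            # assumes both classes are present
    base = math.log(p / (1 - p))   # initial score: log-odds of the spam class
    scores = [base] * len(X)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, thr, lv, rv = fit_stump(X, residuals)
        stumps.append((j, thr, lv, rv))
        scores = [s + learning_rate * (lv if row[j] <= thr else rv)
                  for row, s in zip(X, scores)]
    return base, stumps, learning_rate

def predict(model, row):
    """Return 1 (spammer) if the boosted score maps to probability >= 0.5."""
    base, stumps, lr = model
    score = base + sum(lr * (lv if row[j] <= thr else rv) for j, thr, lv, rv in stumps)
    return 1 if sigmoid(score) >= 0.5 else 0

def accuracy_and_fp_rate(y_true, y_pred):
    """Accuracy = (TP + TN) / N; FP-rate = FP / (FP + TN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    accuracy = (tp + tn) / len(y_true)
    fp_rate = fp / (fp + tn) if (fp + tn) else 0.0
    return accuracy, fp_rate

# Hypothetical toy data: [urls_per_post, follower_ratio]; label 1 = spammer.
X_train = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.1], [0.85, 0.3],
           [0.1, 1.5], [0.2, 2.0], [0.05, 0.9], [0.3, 1.2]]
y_train = [1, 1, 1, 1, 0, 0, 0, 0]

model = fit_gbdt(X_train, y_train)
train_pred = [predict(model, row) for row in X_train]
train_accuracy, train_fp_rate = accuracy_and_fp_rate(y_train, train_pred)
```

The FP-rate is reported alongside accuracy because a detector that flags legitimate accounts as spammers is costly even when its overall accuracy is high. In practice a library implementation with deeper trees and held-out evaluation data would replace this toy booster.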
