Detecting Spam and Promoting Campaigns in Twitter

Twitter has become a target platform for both promoters and spammers to disseminate their messages, which are more harmful than traditional spamming methods, such as email spamming. Recently, large amounts of campaigns that contain lots of spam or promotion accounts have emerged in Twitter. The campaigns cooperatively post unwanted information, and thus they can infect more normal users than individual spam or promotion accounts. Organizing or participating in campaigns has become the main technique to spread spam or promotion information in Twitter. Since traditional solutions focus on checking individual accounts or messages, efficient techniques for detecting spam and promotion campaigns in Twitter are urgently needed. In this article, we propose a framework to detect both spam and promotion campaigns. Our framework consists of three steps: the first step links accounts who post URLs for similar purposes; the second step extracts candidate campaigns that may be for spam or promotion purposes; and the third step classifies the candidate campaigns into normal, spam, and promotion groups. The key point of the framework is how to measure the similarity between accounts' purposes of posting URLs. We present two measure methods based on Shannon information theory: the first one uses the URLs posted by the users, and the second one considers both URLs and timestamps. Experimental results demonstrate that the proposed methods can extract the majority of the candidate campaigns correctly, and detect promotion and spam campaigns with high precision and recall.

[1]  Sreenivas Gollapudi,et al.  Ranking mechanisms in twitter-like forums , 2010, WSDM '10.

[2]  Hiroyuki Hisamatsu,et al.  Method for Countering Social Bookmarking Pollution Using User Similarities , 2010, NDT.

[3]  Huan Liu,et al.  Social Spammer Detection in Microblogging , 2013, IJCAI.

[4]  Krishna P. Gummadi,et al.  Understanding and combating link farming in the twitter social network , 2012, WWW.

[5]  Kyumin Lee,et al.  Content-driven detection of campaigns in social media , 2011, CIKM '11.

[6]  W. Bruce Croft,et al.  User oriented tweet ranking: a filtering approach to microblogs , 2011, CIKM '11.

[7]  Fabrício Benevenuto,et al.  Reverse engineering socialbot infiltration strategies in Twitter , 2014, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[8]  James Caverlee,et al.  Ranking Comments on the Social Web , 2009, 2009 International Conference on Computational Science and Engineering.

[9]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.

[10]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[11]  Ciro Cattuto,et al.  Social spam detection , 2009, AIRWeb '09.

[12]  Anthony K. H. Tung,et al.  CSV: visualizing and mining cohesive subgraphs , 2008, SIGMOD Conference.

[13]  Calton Pu,et al.  A social-spam detection framework , 2011, CEAS '11.

[14]  Alex Hai Wang,et al.  Don't follow me: Spam detection in Twitter , 2010, 2010 International Conference on Security and Cryptography (SECRYPT).

[15]  Kyumin Lee,et al.  Campaign extraction from social media , 2013, ACM Trans. Intell. Syst. Technol..

[16]  Jiebo Luo,et al.  SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks , 2011, Proc. VLDB Endow..

[17]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, CCS '10.

[18]  Songqing Chen,et al.  UNIK: unsupervised social network spam detection , 2013, CIKM.

[19]  Geoff Hulten,et al.  Spamming botnets: signatures and characteristics , 2008, SIGCOMM '08.

[20]  Kyumin Lee,et al.  Uncovering social spammers: social honeypots + machine learning , 2010, SIGIR.

[21]  Virgílio A. F. Almeida,et al.  Detecting Spammers and Content Promoters in Online Video Social Networks , 2009, IEEE INFOCOM Workshops 2009.

[22]  Qiang Yang,et al.  Discovering Spammers in Social Networks , 2012, AAAI.

[23]  Chao Yang,et al.  Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers , 2011, IEEE Transactions on Information Forensics and Security.

[24]  Guofei Gu,et al.  Analyzing spammers' social networks for fun and profit: a case study of cyber criminal ecosystem on twitter , 2012, WWW.

[25]  Christoph Meinel,et al.  Telling experts from spammers: expertise ranking in folksonomies , 2009, SIGIR.

[26]  Haining Wang,et al.  Detecting Social Spam Campaigns on Twitter , 2012, ACNS.

[27]  Hiroyuki Kitagawa,et al.  TURank: Twitter User Ranking Based on User-Tweet Graph Analysis , 2010, WISE.

[28]  Harry Shum,et al.  An Empirical Study on Learning to Rank of Tweets , 2010, COLING.

[29]  Chong Long,et al.  Uncovering collusive spammers in Chinese review websites , 2013, CIKM.

[30]  Andreas Hotho,et al.  The anti-social tagger: detecting spam in social bookmarking systems , 2008, AIRWeb '08.

[31]  Vasileios Kandylas,et al.  The utility of tweeted URLs for web search , 2010, WWW '10.

[32]  Huan Liu,et al.  Online Social Spammer Detection , 2014, AAAI.

[33]  Aixin Sun,et al.  HSpam14: A Collection of 14 Million Tweets for Hashtag-Oriented Spam Research , 2015, SIGIR.

[34]  ZhangXianchao,et al.  Detecting Spam and Promoting Campaigns in Twitter , 2016 .

[35]  A.P.J. van den Bosch,et al.  Using language modeling for spam detection in social reference manager websites , 2009 .

[36]  Pang-Ning Tan,et al.  A co-classification framework for detecting web spam and spammers in social media web sites , 2009, CIKM.

[37]  Xianchao Zhang,et al.  Detecting Spam and Promoting Campaigns in the Twitter Social Network , 2012, 2012 IEEE 12th International Conference on Data Mining.

[38]  Georgia Koutrika,et al.  Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges , 2007, IEEE Internet Computing.

[39]  Calton Pu,et al.  Study of Trend-Stuffing on Twitter through Text Classification , 2010 .