Network Analysis of Recurring YouTube Spam Campaigns

As the popularity of content sharing websites such as YouTube and Flickr has increased, they have become targets for spam, phishing and the distribution of malware. On YouTube, the facility for users to post comments can be used by spam campaigns to direct unsuspecting users to bogus e-commerce websites. In this paper, we demonstrate how such campaigns can be tracked over time using network motif profiling, i.e. by tracking counts of indicative network motifs. By considering all motifs of up to five nodes, we identify discriminating motifs that reveal two distinctly different spam campaign strategies. One of these strategies uses a small number of spam user accounts to comment on a large number of videos, whereas a larger number of accounts is used with the other. We present an evaluation that uses motif profiling to track two active campaigns matching these strategies, and identify some of the associated user accounts.

[1]  Peter Donnelly,et al.  Superfamilies of Evolved and Designed Networks , 2004 .

[2]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, IMC '10.

[3]  S. Shen-Orr,et al.  Superfamilies of Evolved and Designed Networks , 2004, Science.

[4]  Ashish Sureka Mining User Comment Activity for Detecting Forum Spammers in YouTube , 2011, ArXiv.

[5]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[6]  Richard M. Karp,et al.  Efficient Randomized Pattern-Matching Algorithms , 1987, IBM J. Res. Dev..

[7]  Gilad Mishne,et al.  Blocking Blog Spam with Language Model Disagreement , 2005, AIRWeb.

[8]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[9]  Fabrício Benevenuto,et al.  Phi.sh/$oCiaL: the phishing landscape through short URLs , 2011, CEAS '11.

[10]  Virgílio A. F. Almeida,et al.  Identifying video spammers in online social networks , 2008, AIRWeb '08.

[11]  P. Oscar Boykin,et al.  Leveraging social networks to fight spam , 2005, Computer.

[12]  Padraig Cunningham,et al.  Characterizing Wikipedia pages using edit network motif profiles , 2011, SMUC '11.

[13]  Jafar Adibi,et al.  Characterizing Network Motifs to Identify Spam Comments , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[14]  Errin W. Fulp,et al.  Using Network Motifs to Identify Application Protocols , 2009, GLOBECOM 2009 - 2009 IEEE Global Telecommunications Conference.

[15]  Luca Becchetti,et al.  Efficient semi-streaming algorithms for local triangle counting in massive graphs , 2008, KDD.

[16]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[17]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[18]  Geoff Hulten,et al.  Spamming botnets: signatures and characteristics , 2008, SIGCOMM '08.

[19]  Virgílio A. F. Almeida,et al.  Understanding video interactions in youtube , 2008, ACM Multimedia.

[20]  Virgílio A. F. Almeida,et al.  Detecting Spammers and Content Promoters in Online Video Social Networks , 2009, IEEE INFOCOM Workshops 2009.

[21]  Sebastian Wernicke,et al.  FANMOD: a tool for fast network motif detection , 2006, Bioinform..

[22]  K. Fernow New York , 1896, American Potato Journal.

[23]  John C. Paolillo Structure and Network in the YouTube Core , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[24]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[25]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.