A Mapping Study to Investigate Spam Detection on Social Networks

Social networks such as Facebook, Twitter and SinaWeibo have become increasingly important for reaching millions of user globally. Consequently, spammers are increasing using such networks for propagating spam. Existing research on filtering techniques such as collaborative filters and behavioral analysis filters are able to significantly reduce spam. In recent years, online social networks have become the most important medium of communication among individual and organization to interact. Unfortunately, driven by the desire to communicate, fraudster or spammers have produced deceptive spam or unsolicited commercial email(UCE). The fraudsters’ or spammer activities mislead potential users and victims reshaping their individual life and general communication on social network platform. The aim of this study is to understand, classify and analyze existing research in spam detection on social networks, focusing on approaches and elements that are used to evaluate the general framework of spam detection and its architectural framework from the users perspective, service provider and security analyst ‘s point of view. This paper presents a systematic mapping study of several spam detection techniques and approaches on social networks that were proposed to measure to evaluate the general framework of spam detection on social networks. We found 17 proposals that could be applied to evaluate spam detection on social networks, while 14 proposals could be applied to evaluate the users, service providers and practitioners. Various elements of spam detection on social networks that were measured are reviewed and discussed. Only a few of the proposed spam detection on social networks are soundly defined. The quality assessment of the primary studies detected many limitations and suggested guidelines for possibilities for improving and increasing the acceptance of spam detection on social networks. However, it remains a challenge to characterize and evaluate a spam detection and framework on social networks quantitatively. For this fact, much effort must be made to achieve a better spam detection approach in the future that will be devoid of problem anomaly detection, fault detection, malware detection and intrusion detection General Terms Spam detection, Security, Mapping study,Spam detection metrics.

[1]  Steven Myers,et al.  The Nuts and Bolts of a Forum Spam Automator , 2011, LEET.

[2]  Andreas Hotho,et al.  The anti-social tagger: detecting spam in social bookmarking systems , 2008, AIRWeb '08.

[3]  Farida Ridzuan,et al.  Key Parameters in Identifying Cost of Spam 2.0 , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[4]  David Carmel,et al.  The connectivity sonar: detecting site functionality by structural patterns , 2003, HYPERTEXT '03.

[5]  Anestis Gkanogiannis,et al.  A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems , 2008 .

[6]  Alex Talevski,et al.  Behaviour-Based Web Spambot Detection by Utilising Action Time and Action Frequency , 2010, ICCSA.

[7]  Gerhard Paass,et al.  Improved Phishing Detection using Model-Based Features , 2008, CEAS.

[8]  Wei Hu,et al.  Twitter spammer detection using data stream clustering , 2014, Inf. Sci..

[9]  Ashish Sureka Mining User Comment Activity for Detecting Forum Spammers in YouTube , 2011, ArXiv.

[10]  Georgia Koutrika,et al.  Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges , 2007, IEEE Internet Computing.

[11]  Calton Pu,et al.  Study of Trend-Stuffing on Twitter through Text Classification , 2010 .

[12]  Naomie Salim,et al.  Detection of review spam: A survey , 2015, Expert Syst. Appl..

[13]  Leyla Bilge,et al.  All your contacts are belong to us: automated identity theft attacks on social networks , 2009, WWW '09.

[14]  Dit-Yan Yeung,et al.  A learning approach to spam detection based on social networks , 2007 .

[15]  HuWei,et al.  Twitter spammer detection using data stream clustering , 2014 .

[16]  Claire Cardie,et al.  Finding Deceptive Opinion Spam by Any Stretch of the Imagination , 2011, ACL.

[17]  Ciro Cattuto,et al.  Social spam detection , 2009, AIRWeb '09.

[18]  Frank Stajano,et al.  Eight friends are enough: social graph approximation via public listings , 2009, SNS '09.

[19]  Dhiraj K. Pradhan,et al.  International Conference on Eco-friendly Computing and Communication Systems (ICECCS ) , 2012, ICECCS 2012.

[20]  Calton Pu,et al.  Observed Trends in Spam Construction Techniques: A Case Study of Spam Evolution , 2006, CEAS.

[21]  Zheyi Chen,et al.  Detecting spammers on social networks , 2015, Neurocomputing.

[22]  Ciro Cattuto,et al.  Evaluating similarity measures for emergent semantics of social tagging , 2009, WWW '09.

[23]  Kevin Borders,et al.  Social networks and context-aware spam , 2008, CSCW.

[24]  Alex Talevski,et al.  Web Spambot Detection Based on Web Navigation Behaviour , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[25]  Steven Myers,et al.  Prevalence and mitigation of forum spamming , 2011, 2011 Proceedings IEEE INFOCOM.

[26]  Ling Liu,et al.  Socialtrust: tamper-resilient trust establishment in online communities , 2008, JCDL '08.

[27]  Alexander K. Seewald,et al.  An evaluation of Naive Bayes variants in content-based learning for spam filtering , 2007, Intell. Data Anal..

[28]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, CCS '10.

[29]  Virgílio A. F. Almeida,et al.  Identifying video spammers in online social networks , 2008, AIRWeb '08.

[30]  Philip S. Yu,et al.  Review spam detection via temporal pattern discovery , 2012, KDD.

[31]  Eugene Agichtein,et al.  A few bad votes too many?: towards robust ranking in social media , 2008, AIRWeb '08.

[32]  Michael Sirivianos,et al.  Aiding the Detection of Fake Accounts in Large Scale Social Online Services , 2012, NSDI.

[33]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[34]  Vidyasagar Potdar,et al.  Toward spam 2.0: An evaluation of Web 2.0 anti-spam methods , 2009, 2009 7th IEEE International Conference on Industrial Informatics.

[35]  Vidyasagar Potdar,et al.  Spammer and hacker, two old friends , 2009, 2009 3rd IEEE International Conference on Digital Ecosystems and Technologies.

[36]  Mourad Debbabi,et al.  Spam campaign detection, analysis, and investigation , 2015, Digit. Investig..

[37]  Ali Selamat,et al.  Improved email spam detection model with negative selection algorithm and particle swarm optimization , 2014, Appl. Soft Comput..

[38]  Muhammad Abulaish,et al.  A generic statistical approach for spam detection in Online Social Networks , 2013, Comput. Commun..

[39]  Jeffrey O. Kephart,et al.  SpamGuru: An Enterprise Anti-Spam Filtering System , 2004, CEAS.

[40]  Gang Wang,et al.  Northeastern University , 2021, IEEE Pulse.

[41]  P. Lalitha,et al.  New Filtering Approaches for Phishing Email , 2013 .

[42]  Adam Thomason Blog Spam: A Review , 2007, CEAS.

[43]  David Mandell Freeman,et al.  Using naive bayes to detect spammy names in social networks , 2013, AISec.

[44]  Fisher Cf Two old friends. , 1969 .

[45]  Lluís Màrquez i Villodre,et al.  Boosting Trees for Anti-Spam Email Filtering , 2001, ArXiv.

[46]  Vern Paxson,et al.  Detecting and Analyzing Automated Activity on Twitter , 2011, PAM.

[47]  Calton Pu,et al.  A social-spam detection framework , 2011, CEAS '11.

[48]  Chen-Nee Chuah,et al.  Unveiling facebook: a measurement study of social network based applications , 2008, IMC '08.

[49]  Aoying Zhou,et al.  Towards online review spam detection , 2014, WWW.

[50]  Alexandros Asthenidis,et al.  Social Networks as an Attack Platform: Facebook Case Study , 2009, 2009 Eighth International Conference on Networks.

[51]  Virgílio A. F. Almeida,et al.  Comparative Graph Theoretical Characterization of Networks of Spam , 2005, CEAS.

[52]  Nazanin Firoozeh,et al.  Definition of spam 2.0: New spamming boom , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[53]  Steve Hanna,et al.  A survey of mobile malware in the wild , 2011, SPSM '11.

[54]  Vangelis Metsis,et al.  Spam Filtering with Naive Bayes - Which Naive Bayes? , 2006, CEAS.

[55]  Erdong Chen,et al.  Facebook immune system , 2011, SNS '11.

[56]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .