Spammer Detection and Fake User Identification on Social Networks

Social networking sites engage millions of users around the world. The users’ interactions with these social sites, such as Twitter and Facebook have a tremendous impact and occasionally undesirable repercussions for daily life. The prominent social networking sites have turned into a target platform for the spammers to disperse a huge amount of irrelevant and deleterious information. Twitter, for example, has become one of the most extravagantly used platforms of all times and therefore allows an unreasonable amount of spam. Fake users send undesired tweets to users to promote services or websites that not only affect legitimate users but also disrupt resource consumption. Moreover, the possibility of expanding invalid information to users through fake identities has increased that results in the unrolling of harmful content. Recently, the detection of spammers and identification of fake users on Twitter has become a common area of research in contemporary online social Networks (OSNs). In this paper, we perform a review of techniques used for detecting spammers on Twitter. Moreover, a taxonomy of the Twitter spam detection approaches is presented that classifies the techniques based on their ability to detect: (i) fake content, (ii) spam based on URL, (iii) spam in trending topics, and (iv) fake users. The presented techniques are also compared based on various features, such as user features, content features, graph features, structure features, and time features. We are hopeful that the presented study will be a useful resource for researchers to find the highlights of recent developments in Twitter spam detection on a single platform.

[1]  Sagar Gharge,et al.  An integrated approach for malicious tweets detection using NLP , 2017, 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT).

[2]  Sonali Agarwal,et al.  Anomalous behavior detection in social networking , 2017, 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[3]  Daniela Moctezuma,et al.  Features combination for the detection of malicious Twitter accounts , 2016, 2016 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC).

[4]  Florence Sèdes,et al.  Leveraging time for spammers detection on Twitter , 2016, MEDES.

[5]  Muhammad Arshad Islam,et al.  A hybrid approach for spam detection for Twitter , 2017, 2017 14th International Bhurban Conference on Applied Sciences and Technology (IBCAST).

[6]  Marco Morana,et al.  Twitter analysis for real-time malware discovery , 2017, 2017 AEIT International Annual Conference.

[7]  Niloy Ganguly,et al.  Spammers' networks within online social networks: a case-study on Twitter , 2011, WWW.

[8]  Jun Zhang,et al.  A Performance Evaluation of Machine Learning-Based Streaming Spam Tweets Detection , 2015, IEEE Transactions on Computational Social Systems.

[9]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[10]  Cody Buntain,et al.  Automatically Identifying Fake News in Popular Twitter Threads , 2017, 2017 IEEE International Conference on Smart Cloud (SmartCloud).

[11]  Jun Zhang,et al.  Investigating the deceptive information in Twitter spam , 2017, Future Gener. Comput. Syst..

[12]  Rodolfo Zunino,et al.  Spam detection of Twitter traffic: A framework based on random forests and non-uniform feature sampling , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[13]  Louis Lei Yu,et al.  An Evaluation of the Effect of Spam on Twitter Trending Topics , 2013, 2013 International Conference on Social Computing.

[14]  Özlem Aktaş,et al.  Twitter fake account detection , 2017, 2017 International Conference on Computer Science and Engineering (UBMK).

[15]  Saini Jacob Soman A survey on behaviors exhibited by spammers in popular social media networks , 2016, 2016 International Conference on Circuit, Power and Computing Technologies (ICCPCT).

[16]  Manisha Sharma,et al.  Spam detection in social media using convolutional and long short term memory neural network , 2018, Annals of Mathematics and Artificial Intelligence.

[17]  Chong-kwon Kim,et al.  Follow spam detection based on cascaded social information , 2016, Inf. Sci..

[18]  Yu Wang,et al.  Statistical Features-Based Real-Time Detection of Drifted Twitter Spam , 2017, IEEE Transactions on Information Forensics and Security.

[19]  Rishabh Kaushal,et al.  Improving spam detection in Online Social Networks , 2015, 2015 International Conference on Cognitive Computing and Information Processing(CCIP).

[20]  Shu-Ching Chen,et al.  AAFA: Associative Affinity Factor Analysis for Bot Detection and Stance Classification in Twitter , 2017, 2017 IEEE International Conference on Information Reuse and Integration (IRI).

[21]  Mehrdad Jalali,et al.  Detecting spam tweets in Twitter using a data stream clustering algorithm , 2015, 2015 International Congress on Technology, Communication and Knowledge (ICTCK).

[22]  Hua Shen,et al.  Detecting Spammers on Twitter Based on Content and Social Interaction , 2015, 2015 International Conference on Network and Information Systems for Computers.

[23]  Douglas C. Creighton,et al.  Recognising User Identity in Twitter Social Networks via Text Mining , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[24]  Awais Ahmad,et al.  Towards ontology-based multilingual URL filtering: a big data problem , 2018, The Journal of Supercomputing.

[25]  Albert Y. Zomaya,et al.  Segregating Spammers and Unsolicited Bloggers from Genuine Experts on Twitter , 2018, IEEE Transactions on Dependable and Secure Computing.

[26]  P. Kumaraguru,et al.  $1.00 per RT #BostonMarathon #PrayForBoston: Analyzing fake content on Twitter , 2013, 2013 APWG eCrime Researchers Summit.

[27]  Mohamed Bouguessa,et al.  A model-based approach for identifying spammers in social networks , 2015, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[28]  Stefano Ceri,et al.  False News On Social Media: A Data-Driven Survey , 2019, SGMD.

[29]  Florence Sèdes,et al.  A Topic-Based Hidden Markov Model for Real-Time Spam Tweets Filtering , 2017, KES.

[30]  Sumeet Kumar,et al.  Diffusion of pro- and anti-false information tweets: the Black Panther movie case , 2018, Comput. Math. Organ. Theory.

[31]  Chiew Tong Lau,et al.  Real-Time Twitter Content Polluter Detection Based on Direct Features , 2015, 2015 2nd International Conference on Information Science and Security (ICISS).

[32]  Arkaitz Zubiaga,et al.  Making the Most of Tweet-Inherent Features for Social Spam Detection on Twitter , 2015, #MSM.

[33]  Paolo Gastaldo,et al.  A machine learning approach for Twitter spammers detection , 2014, 2014 International Carnahan Conference on Security Technology (ICCST).

[34]  Wanlei Zhou,et al.  Twitter spam detection: Survey of new approaches and comparative study , 2017, Comput. Secur..