Antisocial Behavior on the Web: Characterization and Detection

Web platforms enable unprecedented speed and ease in transmission of knowledge, and allow users to communicate and shape opinions. However, the safety, usability and reliability of these platforms are compromised by the prevalence of online antisocial behavior, for e.g. 40% of users have experienced online harassment. This is present in the form of antisocial users, such as trolls, sockpuppets and vandals, and misinformation, such as hoaxes, rumors and fraudulent reviews. This tutorial presents the state-of-the-art research spanning two aspects of antisocial behavior: characterization of their behavioral properties, and development of algorithms for identifying and predicting them. The tutorial first discusses antisocial users --- trolls, sockpuppets and vandals. We present the causes, community effects, and linguistic, social and temporal characteristics of trolls. Then we discuss the types of sockpuppets, i.e. multiple accounts of the same user, and their behavioral characteristics in Wikipedia and online discussion forums. Vandals make destructive edits on Wikipedia and we discuss the properties of vandals and vandalism edits. In each case, detection and prediction algorithms of the antisocial user are also discussed. The second part of the tutorial discusses about misinformation --- hoaxes, rumors and fraudulent reviews. We present the characteristics and impact of hoaxes on Wikipedia, followed by the spread and evolution of rumors on social media. Then, we discuss the algorithms to identify fake reviews and reviewers from their characteristics, and the camouflage and coordination among sophisticated fraudsters. Again, in each case, we present the detection algorithms, using textual, temporal, sentiment, network structure and rating patterns. Finally, the tutorial concludes with future research avenues.

[1]  John Suler,et al.  The Online Disinhibition Effect , 2004, Cyberpsychology Behav. Soc. Netw..

[2]  Leman Akoglu,et al.  Collective Opinion Spam Detection: Bridging Review Networks and Metadata , 2015, KDD.

[3]  Mahmoud Fouz,et al.  Why rumors spread so quickly in social networks , 2012, Commun. ACM.

[4]  Christos Faloutsos,et al.  BIRDNEST: Bayesian Inference for Ratings-Fraud Detection , 2015, SDM.

[5]  Christian Bauckhage,et al.  The slashdot zoo: mining a social network with negative edges , 2009, WWW.

[6]  Jure Leskovec,et al.  Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes , 2016, WWW.

[7]  Siu-Ming Yiu,et al.  Sockpuppet Detection in Online Discussion Forums , 2011, 2011 Seventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[8]  Amos Azaria,et al.  The DARPA Twitter Bot Challenge , 2016, Computer.

[9]  V. S. Subrahmanian,et al.  VEWS: A Wikipedia Vandal Early Warning System , 2015, KDD.

[10]  Zhan Bu,et al.  A sock puppet detection algorithm on virtual spaces , 2013, Knowl. Based Syst..

[11]  Richard Gilbert,et al.  The psychological functions of avatars and alt(s): A qualitative study , 2014, Comput. Hum. Behav..

[12]  Anupam Joshi,et al.  Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy , 2013, WWW.

[13]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[14]  Pascal Neis,et al.  Towards Automatic Vandalism Detection in OpenStreetMap , 2012, ISPRS Int. J. Geo Inf..

[15]  N. Hara,et al.  Beyond vandalism: Wikipedia trolls , 2010, J. Inf. Sci..

[16]  Shivakant Mishra,et al.  Towards understanding cyberbullying behavior in a semi-anonymous social network , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[17]  Arjun Mukherjee,et al.  Spotting fake reviewer groups in consumer reviews , 2012, WWW.

[18]  Christos Faloutsos,et al.  CatchSync: catching synchronized behavior in large directed graphs , 2014, KDD.

[19]  J. Golbeck Online Harassment , 2020, Human Resource Management International Digest.

[20]  M. De Domenico,et al.  The Anatomy of a Scientific Rumor , 2013, Scientific Reports.

[21]  Edgar R. Weippl,et al.  Fake identities in social media: A case study on the sustainability of the Facebook business model , 2012, J. Serv. Sci. Res..

[22]  Charles Seife Virtual Unreality: Just Because the Internet Told You, How Do You Know It's True? , 2014 .

[23]  Laurent Vercouter,et al.  Detection of Multiple Identity Manipulation in Collaborative Projects , 2016, WWW.

[24]  Prashant Bordia,et al.  Problem Solving in Social Interactions on the Internet: Rumor As Social Cognition , 2004 .

[25]  V. S. Subrahmanian,et al.  Accurately detecting trolls in Slashdot Zoo via decluttering , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[26]  Luca de Alfaro,et al.  Detecting Wikipedia Vandalism using WikiTrust - Lab Report for PAN at CLEF 2010 , 2010, CLEF.

[27]  V. S. Subrahmanian,et al.  Using sentiment to detect bots on Twitter: Are humans more opinionated than bots? , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[28]  Sherali Zeadally,et al.  Multiple Account Identity Deception Detection in Social Media Using Nonverbal Behavior , 2014, IEEE Transactions on Information Forensics and Security.

[29]  Michael Sirivianos,et al.  Aiding the Detection of Fake Accounts in Large Scale Social Online Services , 2012, NSDI.

[30]  Justin W. Patchin,et al.  Bullying, Cyberbullying, and Suicide , 2010, Archives of suicide research : official journal of the International Academy for Suicide Research.

[31]  Richard L. Gilbert,et al.  Multiple Personality Order: Physical and Personality Characteristics of the Self, Primary Avatar and Alt , 2011 .

[32]  Philip Calvert,et al.  Web of Deception: Misinformation on the Internet , 2002 .

[33]  Andrea Ballatore,et al.  Defacing the Map: Cartographic Vandalism in the Digital Commons , 2014, ArXiv.

[34]  David Wiener Negligent Publication of Statements Posted on Electronic Bulletin Boards: Is There Any Liability Left After Zeran? , 1999 .

[35]  Hyun Ah Song,et al.  FRAUDAR: Bounding Graph Fraud in the Face of Camouflage , 2016, KDD.

[36]  V. S. Subrahmanian,et al.  Predicting human behavior: The next frontiers , 2017, Science.

[37]  Arjun Mukherjee,et al.  What Yelp Fake Review Filter Might Be Doing? , 2013, ICWSM.

[38]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[39]  Paolo Rosso,et al.  Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features , 2011, CICLing.

[40]  Filippo Menczer,et al.  The rise of social bots , 2014, Commun. ACM.

[41]  Georgios Zervas,et al.  Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud , 2015, Manag. Sci..

[42]  Michael S. Bernstein,et al.  Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions , 2017, CSCW.

[43]  Jure Leskovec,et al.  How Community Feedback Shapes User Behavior , 2014, ICWSM.

[44]  R. Stuart Geiger,et al.  The work of sustaining order in wikipedia: the banning of a vandal , 2010, CSCW '10.

[45]  Whitney Phillips,et al.  LOLing at tragedy: Facebook trolls, memorial pages and resistance to grief online , 2011, First Monday.

[46]  Hakim Hacid,et al.  Towards multiple identity detection in social networks , 2012, WWW.

[47]  References , 1971 .

[48]  Yi Li,et al.  In a World That Counts: Clustering and Detecting Fake Social Engagement at Scale , 2015, WWW.

[49]  V. S. Subrahmanian,et al.  Tutorial: Identifying Malicious Actors on Social Media , 2016, ASONAM.

[50]  V. S. Subrahmanian,et al.  An Army of Me: Sockpuppets in Online Discussion Communities , 2017, WWW.

[51]  Paul S. Piper,et al.  Better Read That Again: Web Hoaxes and Misinformation , 2000 .

[52]  D. Paulhus,et al.  Trolls just want to have fun , 2014 .

[53]  Christos Faloutsos,et al.  FairJudge: Trustworthy User Prediction in Rating Platforms , 2017, ArXiv.

[54]  J. Malbon Taking Fake Online Consumer Reviews Seriously , 2013 .

[55]  Huan Liu,et al.  Social Spammer Detection with Sentiment Information , 2014, 2014 IEEE International Conference on Data Mining.

[56]  Jure Leskovec,et al.  Antisocial Behavior in Online Discussion Communities , 2015, ICWSM.

[57]  Tieniu Tan,et al.  Information Credibility Evaluation on Social Media , 2016, AAAI.

[58]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[59]  Thamar Solorio,et al.  A Case Study of Sockpuppet Detection in Wikipedia , 2013 .

[60]  Yavuz Akbulut,et al.  Cyberbullying Victimization among Turkish Online Social Utility Members , 2010, J. Educ. Technol. Soc..

[61]  Michael Luca Reviews, Reputation, and Revenue: The Case of Yelp.Com , 2016 .