Studying Anti-Social Behaviour on Reddit with Communalytic

This chapter presents Communalytic, a new social media research tool for studying subreddits (i.e., groups) on Reddit. It is an easy-to-use, web-based tool that can collect, analyze and visualize publicly available data from Reddit. In addition to collecting data, Communalytic can assess the toxicity of Reddit posts and replies using a machine learning API. The resulting anti-social scores from the toxicity analysis are then added as weights to each tie in a "who replies to whom" communication network, allowing researchers to visually identify and study toxic exchanges happening within a subreddit. The chapter consists of two parts. The first introduces our methodology and Communalytic’s main functionalities. The second presents a case study of a public subreddit called r/metacanada, which is popular among the Canadian alt-right and was selected due to its polarizing nature. The case study demonstrates how Communalytic can support researchers studying toxicity in online communities. Specifically, having access to this additional layer of information about the nature of the communication ties among group members allowed us to provide a more nuanced description of the group dynamics.
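To make the idea of a toxicity-weighted "who replies to whom" network concrete, the following is a minimal sketch and not Communalytic's actual implementation. It assumes each post already carries a toxicity score between 0 and 1 (in practice this would come from a machine learning API). The `Post` record, the function names, the choice to average scores per tie, the decision to skip self-replies, and the 0.7 threshold are all illustrative assumptions.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Post:
    """Hypothetical record for a Reddit post or reply (illustrative fields)."""
    post_id: str
    author: str
    parent_id: Optional[str]  # None for a top-level submission
    toxicity: float           # assumed pre-computed score in [0, 1]


Edge = Tuple[str, str]  # (replier, author being replied to)


def build_reply_network(posts: List[Post]) -> Dict[Edge, float]:
    """Return directed ties weighted by the mean toxicity of the replies.

    Averaging is an assumption made for this sketch; other aggregations
    (e.g. the maximum score) would be equally plausible.
    """
    by_id = {p.post_id: p for p in posts}
    scores: Dict[Edge, List[float]] = defaultdict(list)
    for post in posts:
        parent = by_id.get(post.parent_id) if post.parent_id else None
        # Skip top-level posts, replies to uncollected posts, and self-replies.
        if parent is None or parent.author == post.author:
            continue
        scores[(post.author, parent.author)].append(post.toxicity)
    return {edge: sum(vals) / len(vals) for edge, vals in scores.items()}


def toxic_ties(network: Dict[Edge, float], threshold: float = 0.7) -> List[Tuple[Edge, float]]:
    """List ties whose weight meets the (assumed) threshold, most toxic first."""
    flagged = [(edge, w) for edge, w in network.items() if w >= threshold]
    return sorted(flagged, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    sample = [
        Post("t1", "alice", None, 0.05),
        Post("t2", "bob", "t1", 0.90),
        Post("t3", "alice", "t2", 0.10),
        Post("t4", "bob", "t1", 0.70),
    ]
    network = build_reply_network(sample)
    for (source, target), weight in toxic_ties(network):
        print(f"{source} -> {target}: mean toxicity {weight:.2f}")
```

The resulting edge list could then be exported to a network visualization tool, where the weight can drive edge colour or thickness so that toxic exchanges stand out.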
