Racism is a virus: anti-asian hate and counterspeech in social media during the COVID-19 crisis

The spread of COVID-19 has sparked racism and hate on social media targeted towards Asian communities. However, little is known about how racial hate spreads during a pandemic and the role of counterspeech in mitigating this spread. In this work, we study the evolution and spread of anti-Asian hate speech through the lens of Twitter. We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months, containing over 206 million tweets, and a social network with over 127 million nodes. By creating a novel hand-labeled dataset of 3,355 tweets, we train a text classifier to identify hateful and counterspeech tweets that achieves an average macro-F1 score of 0.832. Using this dataset, we conduct longitudinal analysis of tweets and users. Analysis of the social network reveals that hateful and counterspeech users interact and engage extensively with one another, instead of living in isolated polarized communities. We find that nodes were highly likely to become hateful after being exposed to hateful content in the year 2020. Notably, counterspeech messages discourage users from turning hateful, potentially suggesting a solution to curb hate on web and social media platforms. Data and code is available at http://claws.cc.gatech.edu/covid.

[1]  Kristina Lerman,et al.  COVID-19: The First Public Coronavirus Twitter Dataset , 2020, ArXiv.

[2]  Jean-Gabriel Young,et al.  Countering hate on social media: Large scale classification of hate and counter speech , 2020, ALW.

[3]  P. Alam,et al.  R , 1823, The Herodotus Encyclopedia.

[4]  P. Alam ‘E’ , 2021, Composites Engineering: An A–Z Guide.

[5]  Animesh Mukherjee,et al.  Analyzing the hate and counter speech accounts on Twitter , 2018, ArXiv.

[6]  Rui Zhao,et al.  Automatic detection of cyberbullying on social networks based on bullying features , 2016, ICDCN.

[7]  Matthew Beatty Graph-Based Methods to Detect Hate Speech Diffusion on Twitter , 2020, 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[8]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[9]  Jabra Zarka,et al.  Coronavirus Goes Viral: Quantifying the COVID-19 Misinformation Epidemic on Twitter , 2020, Cureus.

[10]  Sebastien Feve,et al.  Review of programs to counter narratives of violent extremism , 2013 .

[11]  Tanushree Mitra,et al.  Many Faced Hate: A Cross Platform Study of Content Framing and Information Sharing by Online Hate Groups , 2020, CHI.

[12]  Jeremy Blackburn,et al.  "You Know What to Do" , 2018, Proc. ACM Hum. Comput. Interact..

[13]  Animesh Mukherjee,et al.  Thou shalt not hate: Countering Online Hate Speech , 2018, ICWSM.

[14]  Alex Nikolov,et al.  Nikolov-Radivchev at SemEval-2019 Task 6: Offensive Tweet Classification with BERT and Ensembles , 2019, *SEMEVAL.

[15]  Ping Liu,et al.  Forecasting the presence and intensity of hostility on Instagram using linguistic and social features , 2018, ICWSM.

[16]  B. Parekh The Content and Context of Hate Speech: Is There a Case for Banning Hate Speech? , 2012 .

[17]  V. S. Subrahmanian,et al.  An Army of Me: Sockpuppets in Online Discussion Communities , 2017, WWW.

[18]  Scott A. Hale,et al.  Detecting East Asian Prejudice on Social Media , 2020, ALW.

[19]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[20]  Marco Guerini,et al.  CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech , 2019, ACL.

[21]  Alexa Alice Joubin Anti-Asian Racism during COVID-19 Pandemic, GW Today, April 20, 2020 , 2020 .

[22]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[23]  P. Alam ‘N’ , 2021, Composites Engineering: An A–Z Guide.

[24]  Guido Caldarelli,et al.  Echo Chambers: Emotional Contagion and Group Polarization on Facebook , 2016, Scientific Reports.

[25]  Jiebo Luo,et al.  In the Eyes of the Beholder: Sentiment and Topic Analyses on Social Media Use of Neutral and Controversial Terms for COVID-19 , 2020, ArXiv.

[26]  Reima Al-Jarf Combating the Covid-19 Hate and Racism Speech on Social Media , 2021 .

[27]  Jacob Eisenstein,et al.  Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers , 2018, CD@KDD.

[28]  Shivakant Mishra,et al.  Prediction of cyberbullying incidents in a media-based social network , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[29]  Matthew Leighton Williams,et al.  Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making , 2015 .

[30]  Angela R. Gover,et al.  Anti-Asian Hate Crime During the COVID-19 Pandemic: Exploring the Reproduction of Inequality , 2020, American journal of criminal justice : AJCJ.

[31]  Kristie B. Hadden,et al.  2020 , 2020, Journal of Surgical Orthopaedic Advances.

[32]  Rahul Goel,et al.  The Social Dynamics of Language Change in Online Networks , 2016, SocInfo.

[33]  P. Alam ‘G’ , 2021, Composites Engineering: An A–Z Guide.

[34]  Sérgio Nunes,et al.  A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[35]  P. Alam ‘S’ , 2021, Composites Engineering: An A–Z Guide.

[36]  Susan Benesch Countering Dangerous Speech: New Ideas for Genocide Prevention , 2014 .

[37]  Fabrício Benevenuto,et al.  A Measurement Study of Hate Speech in Social Media , 2017, HT.

[38]  Giovanni Vigna,et al.  Peer to Peer Hate: Hate Speech Instigators and Their Targets , 2018, ICWSM.

[39]  Melanie Coates Covid-19 and the rise of racism , 2020, BMJ.

[40]  Lucia Kováčová,et al.  Countering Hate Speech on Facebook: The Case of the Roma Minority in Slovakia , 2018, Social Science Computer Review.

[41]  Savvas Zannettou,et al.  "And We Will Fight For Our Race!" A Measurement Study of Genetic Testing Conversations on Reddit and 4chan , 2019, ICWSM.

[42]  Giovanni Luca Ciampaglia,et al.  The spread of low-credibility content by social bots , 2017, Nature Communications.

[43]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[44]  Filippo Menczer,et al.  BotOrNot: A System to Evaluate Social Bots , 2016, WWW.

[45]  The Fear of COVID-19 Scale: Development and Initial Validation , 2020, International Journal of Mental Health and Addiction.

[46]  Panagiotis Karampelas,et al.  Detecting Hate Speech Within the Terrorist Argument: A Greek Case , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[47]  Greg Miller,et al.  Social distancing prevents infections, but it can have unintended consequences , 2020 .

[48]  Shivakant Mishra,et al.  International Conference on Advances in Social Networks Analysis and Mining ( ASONAM ) Are They Our Brothers ? Analysis and Detection of Religious Hate Speech in the Arabic Twittersphere , 2018 .

[49]  Michael Herz,et al.  The Content and Context of Hate Speech: Rethinking Regulation and Responses , 2012 .

[50]  N. F. Johnson,et al.  Hate multiverse spreads malicious COVID-19 content online beyond individual platform control , 2020, 2004.00673.

[51]  Bernard J. Jansen,et al.  Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media , 2018, ICWSM.

[52]  Emily K. Vraga,et al.  A first look at COVID-19 information and misinformation sharing on Twitter , 2020, ArXiv.

[53]  Jun-Ming Xu,et al.  Learning from Bullying Traces in Social Media , 2012, NAACL.

[54]  P. Alam ‘A’ , 2021, Composites Engineering: An A–Z Guide.

[55]  Kevin Munger Tweetment Effects on the Tweeted: Experimentally Reducing Racist Harassment , 2017 .

[56]  Hao Chen,et al.  The Use of Deep Learning Distributed Representations in the Identification of Abusive Text , 2019, ICWSM.

[57]  Derek Ruths,et al.  Vectors for Counterspeech on Twitter , 2017, ALW@ACL.

[58]  P. Alam ‘L’ , 2021, Composites Engineering: An A–Z Guide.

[59]  M. Barreto,et al.  Xenophobia in the time of pandemic: othering, anti-Asian attitudes, and COVID-19 , 2020, Politics, Groups, and Identities.

[60]  Dirk Hovy,et al.  Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.

[61]  Tanushree Mitra,et al.  Framing Hate with Hate Frames: Designing the Codebook , 2018, CSCW Companion.

[62]  Yulan He,et al.  Approaches to Automated Detection of Cyberbullying: A Survey , 2020, IEEE Transactions on Affective Computing.

[63]  Atefeh Zandifar,et al.  Iranian mental health during the COVID-19 epidemic , 2020, Asian Journal of Psychiatry.

[64]  P. Alam,et al.  H , 1887, High Explosives, Propellants, Pyrotechnics.

[65]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[66]  Joel R. Tetreault,et al.  Do Characters Abuse More Than Words? , 2016, SIGDIAL Conference.

[67]  G. Stringhini,et al.  “Go eat a bat, Chang!”: On the Emergence of Sinophobic Behavior on Web Communities in the Face of COVID-19 , 2021, WWW.

[68]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[69]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[70]  Savvas Zannettou,et al.  A Quantitative Approach to Understanding Online Antisemitism , 2018, ICWSM.

[71]  P. Alam ‘W’ , 2021, Composites Engineering.

[72]  Jing Zhou,et al.  Hate Speech Detection with Comment Embeddings , 2015, WWW.

[73]  Walter Quattrociocchi,et al.  Echo Chambers on Facebook , 2016 .

[74]  Walter Daelemans,et al.  Automatic detection of cyberbullying in social media text , 2018, PloS one.

[75]  Björn Ross,et al.  Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis , 2016, ArXiv.

[76]  Heyam H. Al-Baity,et al.  Detection of Hate Speech in COVID-19–Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach , 2020, Journal of Medical Internet Research.

[77]  Cindy D. Kam Infectious Disease, Disgust, and Imagining the Other , 2019, The Journal of Politics.

[78]  Emilio Ferrara,et al.  What types of COVID-19 conspiracies are populated by Twitter bots? , 2020, First Monday.

[79]  A. Johnson,et al.  Stigmatization and prejudice during the COVID-19 pandemic , 2020 .

[80]  Michael S. Bernstein,et al.  Anyone Can Become a Troll: Causes of Trolling Behavior in Online Discussions , 2017, CSCW.

[81]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[82]  Vasudeva Varma,et al.  Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[83]  Thomas de Quincey [C] , 2000, The Works of Thomas De Quincey, Vol. 1: Writings, 1799–1820.

[84]  Aristides Gionis,et al.  Political Discourse on Social Media: Echo Chambers, Gatekeepers, and the Price of Bipartisanship , 2018, WWW.

[85]  Animesh Mukherjee,et al.  Spread of Hate Speech in Online Social Media , 2018, WebSci.

[86]  Nicola Montemurro,et al.  The emotional impact of COVID-19: From medical staff to common people , 2020, Brain, Behavior, and Immunity.

[87]  Munmun De Choudhury,et al.  Prevalence and Psychological Effects of Hateful Speech in Online College Communities , 2019, WebSci.

[88]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[89]  Fabrício Benevenuto,et al.  Analyzing the Targets of Hate in Online Social Media , 2016, ICWSM.

[90]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[91]  Jeremy Blackburn,et al.  "Go eat a bat, Chang!": An Early Look on the Emergence of Sinophobic Behavior on Web Communities in the Face of COVID-19 , 2020, ArXiv.

[92]  Manlio De Domenico,et al.  Assessing the risks of 'infodemics' in response to COVID-19 epidemics. , 2020, Nature human behaviour.

[93]  Mai ElSherief,et al.  Hate Lingo: A Target-based Linguistic Analysis of Hate Speech in Social Media , 2018, ICWSM.

[94]  Kristina Lerman,et al.  Tracking Social Media Discourse About the COVID-19 Pandemic: Development of a Public Coronavirus Twitter Data Set , 2020, JMIR public health and surveillance.

[95]  Michael Wiegand,et al.  A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.

[96]  P. Alam ‘K’ , 2021, Composites Engineering.

[97]  Matthew Costello,et al.  On Analyzing COVID-19-related Hate Speech Using BERT Attention , 2020, 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA).