Characterizing usage of explicit hate expressions in social media

ABSTRACT Social media platforms provide an inexpensive communication medium that allows anyone to publish content and anyone interested to access it. However, this same openness provides space for discourses that are harmful to certain groups of people. Examples include bullying, offensive content, and hate speech. Of these, hate speech is increasingly recognized as a serious problem by authorities in many countries. In this paper, we provide a first-of-its-kind systematic, large-scale measurement and analysis study of explicit expressions of hate speech in online social media. We aim to understand the abundance of hate speech in online social media, the most common hate expressions, the effect of anonymity on hate speech, the sensitivity of hate speech, and the most hated groups across regions. To achieve these objectives, we gather traces from two social media systems: Whisper and Twitter. We then develop and validate a methodology to identify hate speech on both systems. Our results identify forms of hate speech and unveil a set of important patterns, providing a broader understanding of online hate speech and offering directions for detection and prevention approaches.
