Characterizing and Detecting Hateful Users on Twitter

Most current approaches to characterizing and detecting hate speech focus on content posted in Online Social Networks (OSNs). They struggle to collect and annotate hateful speech because OSN text is incomplete and noisy and because hate speech is subjective. These limitations are often addressed with constraints that oversimplify the problem, such as considering only tweets containing hate-related words. In this work we partially address these issues by shifting the focus towards users. We develop and employ a robust methodology to collect and annotate hateful users which does not depend directly on a lexicon and in which users are annotated given their entire profile. This results in a sample of Twitter's retweet graph containing 100,386 users, of which 4,972 were annotated. We also collect the users who were banned in the three months that followed the data collection. We show that hateful users differ from normal ones in terms of their activity patterns, word usage, and network structure. We obtain similar results when comparing the neighbors of hateful users with the neighbors of normal users, and suspended users with active users, which increases the robustness of our analysis. We observe that hateful users are densely connected, and thus formulate the hate speech detection problem as a task of semi-supervised learning over a graph, exploiting the network of connections on Twitter. We find that a node embedding algorithm, which exploits the graph structure, outperforms content-based approaches for the detection of both hateful users (95% AUC vs. 88% AUC) and suspended users (93% AUC vs. 88% AUC). Altogether, we present a user-centric view of hate speech, paving the way for better detection and understanding of this relevant and challenging issue.
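To make the idea of semi-supervised learning over a graph concrete, here is a minimal sketch. It is not the node embedding algorithm the abstract refers to; it swaps in plain label propagation, where each unlabeled user's score is repeatedly replaced by the average of its neighbors' scores while the few annotated users keep their labels. The function name `propagate_labels`, the toy edge list, and the seed labels are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def propagate_labels(edges, seeds, iterations=20):
    """Spread seed labels over a graph by repeated neighbor averaging.

    edges: iterable of (user, user) pairs, treated as undirected here
           (an assumption; a retweet graph is directed in practice).
    seeds: dict mapping annotated users to a score (1.0 hateful, 0.0 normal).
    Returns a dict mapping every user to a score in [0, 1].
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Unlabeled users start at the neutral score 0.5.
    scores = {node: seeds.get(node, 0.5) for node in neighbors}

    for _ in range(iterations):
        updated = {}
        for node, nbrs in neighbors.items():
            if node in seeds:
                # Annotated users keep their label fixed.
                updated[node] = seeds[node]
            else:
                updated[node] = sum(scores[n] for n in nbrs) / len(nbrs)
        scores = updated
    return scores


# Toy graph: two triangles (a, b, c) and (d, e, f) joined by the edge c-d.
edges = [
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("c", "d"),
    ("d", "e"), ("e", "f"), ("d", "f"),
]
# Only two users are annotated: "a" as hateful, "f" as normal.
seeds = {"a": 1.0, "f": 0.0}

scores = propagate_labels(edges, seeds)
for user in sorted(scores):
    print(user, round(scores[user], 3))
```

In this toy graph, users in the same dense cluster as the annotated hateful user end up with scores above 0.5, and those in the other cluster end up below it. This reflects the intuition behind using the graph: if hateful users are densely connected, a handful of annotations can inform predictions for their neighbors.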

Virgílio A. F. Almeida | Wagner Meira Jr. | Manoel Horta Ribeiro | Pedro H. Calais | Yuri A. Santos
