论文信息 - Detecting Online Hate Speech: Approaches Using Weak Supervision and Network Embedding Models - 字舞流文

Detecting Online Hate Speech: Approaches Using Weak Supervision and Network Embedding Models

The ubiquity of social media has transformed online interactions among individuals. Despite positive effects, it has also allowed anti-social elements to unite in alternative social media environments (eg. this http URL) like never before. Detecting such hateful speech using automated techniques can allow social media platforms to moderate their content and prevent nefarious activities like hate speech propagation. In this work, we propose a weak supervision deep learning model that - (i) quantitatively uncover hateful users and (ii) present a novel qualitative analysis to uncover indirect hateful conversations. This model scores content on the interaction level, rather than the post or user level, and allows for characterization of users who most frequently participate in hateful conversations. We evaluate our model on 19.2M posts and show that our weak supervision model outperforms the baseline models in identifying indirect hateful interactions. We also analyze a multilayer network, constructed from two types of user interactions in Gab(quote and reply) and interaction scores from the weak supervision model as edge weights, to predict hateful users. We utilize the multilayer network embedding methods to generate features for the prediction task and we show that considering user context from multiple networks help achieving better predictions of hateful users in Gab. We receive up to 7% performance gain compared to single layer or homogeneous network embedding models.

Arunkumar Bagavathi | Elaheh Raisi | Siddharth Krishnan | Michael Ridenhour | A. Bagavathi | Elaheh Raisi | S. Krishnan | Michael Ridenhour

[1] Nitesh V. Chawla,et al. metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[2] Nicholas Worby,et al. Twitter, Gab, and Racism: The Case of the Soros Myth , 2018, SMSociety.

[3] Animesh Mukherjee,et al. Spread of Hate Speech in Online Social Media , 2018, WebSci.

[4] Wenwu Zhu,et al. Structural Deep Network Embedding , 2016, KDD.

[5] Mai ElSherief,et al. Hate Lingo: A Target-based Linguistic Analysis of Hate Speech in Social Media , 2018, ICWSM.

[6] Ryan Wesslen,et al. Shouting into the Void: A Database of the Alternative Social Media Platform Gab , 2019, ICWSM.

[7] Joel R. Tetreault,et al. Abusive Language Detection in Online User Content , 2016, WWW.

[8] M. Serrano,et al. The interconnected wealth of nations: Shock propagation on global trade-investment multiplex networks , 2019, Scientific Reports.

[9] Tomas Mikolov,et al. Efficient Large-Scale Multi-Modal Classification , 2018, AAAI.

[10] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[11] Virgílio A. F. Almeida,et al. Characterizing and Detecting Hateful Users on Twitter , 2018, ICWSM.

[12] Arunkumar Bagavathi,et al. Multi-Net: A Scalable Multiplex Network Embedding Framework , 2018, COMPLEX NETWORKS.

[13] Mingzhe Wang,et al. LINE: Large-scale Information Network Embedding , 2015, WWW.

[14] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[15] Jing Zhou,et al. Hate Speech Detection with Comment Embeddings , 2015, WWW.

[16] Jure Leskovec,et al. node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[17] Reid McIlroy-Young,et al. From "Welcome New Gabbers" to the Pittsburgh Synagogue Shooting: The Evolution of Gab , 2019, ICWSM.

[18] Sérgio Nunes,et al. A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[19] Jure Leskovec,et al. Inductive Representation Learning on Large Graphs , 2017, NIPS.

[20] Philip S. Yu,et al. Heterogeneous Information Network Embedding for Recommendation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[21] Gianluca Stringhini,et al. What is Gab: A Bastion of Free Speech or an Alt-Right Echo Chamber , 2018, WWW.

[22] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.

[23] Bert Huang,et al. Weakly Supervised Cyberbullying Detection Using Co-Trained Ensembles of Embedding Models , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).