Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation

This work addresses the challenge of hate speech detection in Internet memes, and attempts using visual information to automatically detect hate speech, unlike any previous work of our knowledge. Memes are pixel-based multimedia documents that contain photos or illustrations together with phrases which, when combined, usually adopt a funny meaning. However, hate memes are also used to spread hate through social networks, so their automatic detection would help reduce their harmful societal impact. Our results indicate that the model can learn to detect some of the memes, but that the task is far from being solved with this simple architecture. While previous work focuses on linguistic hate speech, our experiments indicate how the visual modality can be much more informative for hate speech detection than the linguistic one in memes. In our experiments, we built a dataset of 5,020 memes to train and evaluate a multi-layer perceptron over the visual and language representations, whether independently or fused. The source code and mode and models are available this https URL .

[1]  Yuzhou Wang,et al.  Locate the Hate: Detecting Tweets against Blacks , 2013, AAAI.

[2]  T. Massaro,et al.  Equality and Freedom of Expression: The Hate Speech Dilemma , 1991 .

[3]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[4]  Shervin Malmasi,et al.  Detecting Hate Speech in Social Media , 2017, RANLP.

[5]  Samuel Walker,et al.  Hate Speech: The History of an American Controversy , 1994 .

[6]  Jing Zhou,et al.  Hate Speech Detection with Comment Embeddings , 2015, WWW.

[7]  Fabrício Benevenuto,et al.  A Measurement Study of Hate Speech in Social Media , 2017, HT.

[8]  Shih-Fu Chang,et al.  Multimodal Social Media Analysis for Gang Violence Prevention , 2018, ICWSM.

[9]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[10]  Sérgio Nunes,et al.  A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[11]  Jefersson Alex dos Santos,et al.  A Benchmark Methodology for Child Pornography Detection , 2018, 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI).

[12]  Sasan Karamizadeh,et al.  Methods of Pornography Detection: Review , 2018, ICCMS.

[13]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[14]  Guido Caldarelli,et al.  ROME 2019: Workshop on Reducing Online Misinformation Exposure , 2019, SIGIR.

[15]  Richard Delgado,et al.  The Harm in Hate Speech , 2013 .