论文信息 - HateClassify: A Service Framework for Hate Speech Identification on Social Media

HateClassify: A Service Framework for Hate Speech Identification on Social Media

It is indeed a challenge for the existing machine learning approaches to segregate the hateful content from the one that is merely offensive. One prevalent reason for low accuracy of hate detection with the current methodologies is that these techniques treat hate classification as a multiclass problem. In this article, we present the hate identification on the social media as a multilabel problem. To this end, we propose a CNN-based service framework called “HateClassify” for labeling the social media contents as the hate speech, offensive, or nonoffensive. Results demonstrate that the multiclass classification accuracy for the CNN-based approaches particularly sequential CNN (SCNN) is competitive and even higher than certain state-of-the-art classifiers. Moreover, in the multilabel classification problem, sufficiently high performance is exhibited by the SCNN among other CNN-based techniques. The results have shown that using multilabel classification instead of multiclass classification, hate speech detection is increased up to 20%.

[1] Jun-Ming Xu,et al. Learning from Bullying Traces in Social Media , 2012, NAACL.

[2] Joel R. Tetreault,et al. Abusive Language Detection in Online User Content , 2016, WWW.

[3] Matthew Leighton Williams,et al. Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making , 2015 .

[4] Carolyn Penstein Rosé,et al. Detecting offensive tweets via topical feature discovery over a large scale twitter corpus , 2012, CIKM.

[5] Julia Hirschberg,et al. Detecting Hate Speech on the World Wide Web , 2012 .

[6] Ying Chen,et al. Detecting Offensive Language in Social Media to Protect Adolescent Online Safety , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[7] Walter Daelemans,et al. Detection and Fine-Grained Classification of Cyberbullying Events , 2015, RANLP.

[8] Dirk Hovy,et al. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.

[9] Felice Dell'Orletta,et al. Hate Me, Hate Me Not: Hate Speech Detection on Facebook , 2017, ITASEC.

[10] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[11] Jing Zhou,et al. Hate Speech Detection with Comment Embeddings , 2015, WWW.

[12] Shivakant Mishra,et al. Analyzing Labeled Cyberbullying Incidents on the Instagram Social Network , 2015, SocInfo.

[13] Cornelia Caragea,et al. Content-Driven Detection of Cyberbullying on the Instagram Social Network , 2016, IJCAI.

[14] Wenpeng Yin,et al. Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms , 2017, TACL.

[15] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[16] Joel R. Tetreault,et al. Do Characters Abuse More Than Words? , 2016, SIGDIAL Conference.

[17] Mohammad S. Sorower. A Literature Survey on Algorithms for Multi-label Learning , 2010 .