Paper Information - One-step and Two-step Classification for Abusive Language Detection on Twitter

Automatic abusive language detection is a difficult but important task for online social media. Our research explores a two-step approach, which first classifies a tweet as abusive or not and then classifies abusive tweets into specific types, and compares it with a one-step approach that performs a single multi-class classification to detect sexist and racist language. On a public English Twitter corpus of 20 thousand tweets annotated for sexism and racism, our approach shows promising performance: an F-measure of 0.827 using HybridCNN in the one-step setting and 0.824 using logistic regression in the two-step setting.
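
The difference between the two setups can be illustrated with a minimal Python sketch. It shows only the control flow, not the authors' models: the label names ("none", "sexism", "racism"), the function names, and the keyword-based stand-in classifiers with placeholder tokens are all assumptions made for illustration. In practice each callable would presumably be a trained model such as logistic regression or HybridCNN.

```python
from typing import Callable

# Label names are assumptions; the abstract only names sexist and racist language.
Label = str  # one of "none", "sexism", "racism"


def one_step_predict(multiclass: Callable[[str], Label], tweet: str) -> Label:
    """One-step: a single multi-class classifier picks the final label directly."""
    return multiclass(tweet)


def two_step_predict(
    is_abusive: Callable[[str], bool],
    abuse_type: Callable[[str], Label],
    tweet: str,
) -> Label:
    """Two-step: detect abusive language first, then assign a specific type."""
    if not is_abusive(tweet):
        return "none"
    return abuse_type(tweet)


# Hypothetical stand-in classifiers built on placeholder tokens.
# They only exist to make the sketch runnable; they are not real models.
SEXIST_MARKERS = {"<sexist-term>"}
RACIST_MARKERS = {"<racist-term>"}


def toy_is_abusive(tweet: str) -> bool:
    tokens = set(tweet.lower().split())
    return bool(tokens & (SEXIST_MARKERS | RACIST_MARKERS))


def toy_abuse_type(tweet: str) -> Label:
    # Only called on tweets already flagged as abusive.
    tokens = set(tweet.lower().split())
    return "sexism" if tokens & SEXIST_MARKERS else "racism"


def toy_multiclass(tweet: str) -> Label:
    tokens = set(tweet.lower().split())
    if tokens & SEXIST_MARKERS:
        return "sexism"
    if tokens & RACIST_MARKERS:
        return "racism"
    return "none"


if __name__ == "__main__":
    tweet = "nice weather today"
    print(one_step_predict(toy_multiclass, tweet))
    print(two_step_predict(toy_is_abusive, toy_abuse_type, tweet))
```

One property this layout makes visible is that the two-step pipeline can mix models, for example a binary detector trained on a larger abusive-language corpus followed by a type classifier trained only on the abusive tweets.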

Pascale Fung | Ji Ho Park
