论文信息 - Automatic Detection of Hateful Comments in Online Discussion

Automatic Detection of Hateful Comments in Online Discussion

Making violent threats towards minorities like immigrants or homosexuals is increasingly common on the Internet. We present a method to automatically detect threats of violence using machine learning. A material of 24,840 sentences from YouTube was manually annotated as violent threats or not, and was used to train and test the machine learning model. Detecting threats of violence works quit well with an error of classifying a violent sentence as not violent of about 10% when the error of classifying a non-violent sentence as violent is adjusted to 5%. The best classification performance is achieved by including features that combine specially chosen important words and the distance between those in the sentence.

Hugo Lewi Hammer | Hugo Hammer

[1] Peter Dalgaard,et al. R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[2] L. Angeles,et al. The Muslim conspiracy theory and the Oslo massacre , 2011 .

[3] David Madigan,et al. Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[4] Richard A. Johnson,et al. Applied Multivariate Statistical Analysis , 1983 .

[5] Matthew J. Goodwin,et al. The New Radical Right: Violent and Non-Violent Movements in Europe , 2012 .

[6] Michael Bröning,et al. The Rise of Populism in Europe , 2016 .

[7] Hugo Hammer,et al. Detecting Threats of Violence in Online Discussions Using Bigrams of Important Words , 2014, 2014 IEEE Joint Intelligence and Security Informatics Conference.

[8] Hans van Halteren,et al. Shallow parsing for recognizing threats in Dutch tweets , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[9] Trevor Hastie,et al. Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[10] Julia Hirschberg,et al. Detecting Hate Speech on the World Wide Web , 2012 .

[11] Henry Lieberman,et al. Modeling the Detection of Textual Cyberbullying , 2011, The Social Mobile Web.

[12] Hans van Halteren,et al. N-Gram-Based Recognition of Threatening Tweets , 2013, CICLing.