Presenting a labelled dataset for real-time detection of abusive user posts

Social media sites facilitate users in posting their own personal comments online. Most support free format user posting, with close to real-time publishing speeds. However, online posts generated by a public user audience carry the risk of containing inappropriate, potentially abusive content. To detect such content, the straightforward approach is to filter against blacklists of profane terms. However, this lexicon filtering approach is prone to problems around word variations and lack of context. Although recent methods inspired by machine learning have boosted detection accuracies, the lack of gold standard labelled datasets limits the development of this approach. In this work, we present a dataset of user comments, using crowdsourcing for labelling. Since abusive content can be ambiguous and subjective to the individual reader, we propose an aggregated mechanism for assessing different opinions from different labellers. In addition, instead of the typical binary categories of abusive or not, we introduce a third class of 'undecided' to capture the real life scenario of instances that are neither blatantly abusive nor clearly harmless. We have performed preliminary experiments on this dataset using best practice techniques in text classification. Finally, we have evaluated the detection performance of various feature groups, namely syntactic, semantic and context-based features. Results show these features can increase our classifier performance by 18% in detection of abusive content.

[1]  Brian D. Davison,et al.  Detection of Harassment on Web 2.0 , 2009 .

[2]  Elizabeth F. Churchill,et al.  Automatic identification of personal insults on social news sites , 2012, J. Assoc. Inf. Sci. Technol..

[3]  Jun-Ming Xu,et al.  Learning from Bullying Traces in Social Media , 2012, NAACL.

[4]  Kelly Reynolds,et al.  Using Machine Learning to Detect Cyberbullying , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[5]  Franciska de Jong,et al.  Cyberbullying detection: a step toward a safer internet yard , 2012, WWW.

[6]  R. Ordelman,et al.  Improved cyberbullying detection using gender information , 2012 .

[7]  Dolf Trieschnigg,et al.  Experts and Machines against Bullies: A Hybrid Approach to Detect Cyberbullies , 2014, Canadian Conference on AI.

[8]  Ying Chen,et al.  Detecting Offensive Language in Social Media to Protect Adolescent Online Safety , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[9]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[10]  Elizabeth F. Churchill,et al.  Profanity use in online communities , 2012, CHI.

[11]  Ellen Riloff,et al.  Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , 2012, HLT-NAACL 2012.

[12]  Laura P. Del Bosque,et al.  Aggressive Text Detection for Cyberbullying , 2014, MICAI.

[13]  Henry Lieberman,et al.  Modeling the Detection of Textual Cyberbullying , 2011, The Social Mobile Web.

[14]  Pablo César,et al.  3rd International Workshop on Socially-Aware Multimedia (SAM'14) , 2014, ACM Multimedia.

[15]  April Kontostathis,et al.  Detecting the Presence of Cyberbullying Using Computer Software , 2011 .

[16]  Rui Zhao,et al.  Automatic detection of cyberbullying on social networks based on bullying features , 2016, ICDCN.

[17]  Rajeev R. Raje,et al.  Collaborative detection of cyberbullying behavior in Twitter data , 2015, 2015 IEEE International Conference on Electro/Information Technology (EIT).

[18]  Shivakant Mishra,et al.  A Comparison of Common Users across Instagram and Ask.fm to Better Understand Cyberbullying , 2014, 2014 IEEE Fourth International Conference on Big Data and Cloud Computing.

[19]  Kelly Reynolds,et al.  Detecting cyberbullying: query terms and techniques , 2013, WebSci.

[20]  Carolyn Penstein Rosé,et al.  Detecting offensive tweets via topical feature discovery over a large scale twitter corpus , 2012, CIKM.

[21]  Matthew Leighton Williams,et al.  Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making , 2015 .

[22]  Qianjia Huang,et al.  Cyber Bullying Detection Using Social and Textual Analysis , 2014, SAM '14.

[23]  K. K. Sahu,et al.  Normalization: A Preprocessing Stage , 2015, ArXiv.

[24]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[25]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[26]  Hao Chen,et al.  Harnessing the Power of Text Mining for the Detection of Abusive Content in Social Media , 2016, UKCI.