Use of Natural Language Processing to Identify Inappropriate Content in Text

The quick development of communication through new technology media such as social networks and mobile phones has improved our lives. However, this also produces collateral problems such as the presence of insults and abusive comments. In this work, we address the problem of detecting violent content on text documents using Natural Language Processing techniques. Following an approach based on Machine Learning techniques, we have trained six models resulting from the combinations of two text encoders, Term Frequency-Inverse Document Frequency and Bag of Words, together with three classifiers: Logistic Regression, Support Vector Machines and Naive Bayes. We have also assessed StarSpace, a Deep Learning approach proposed by Facebook and configured to use a Hit@1 accuracy. We evaluated these seven alternatives in two publicly available datasets from the Wikipedia Detox Project: Attack and Aggression. StarSpace achieved an accuracy of 0.938 and 0.937 in these datasets, respectively, being the algorithm recommended to detect violent content on text documents among the alternatives evaluated.

[1]  Hugo Lewi Hammer,et al.  Automatic Detection of Hateful Comments in Online Discussion , 2016, INISCOM.

[2]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[3]  Yiming Yang,et al.  High-performing feature selection for text classification , 2002, CIKM '02.

[4]  Marcin Mironczuk,et al.  A recent overview of the state-of-the-art elements of text classification , 2018, Expert Syst. Appl..

[5]  Jason Weston,et al.  StarSpace: Embed All The Things! , 2017, AAAI.

[6]  Diab M. Diab,et al.  Using differential evolution for fine tuning naïve Bayesian classifiers and its application for text classification , 2017, Appl. Soft Comput..

[7]  D. Cox The Regression Analysis of Binary Sequences , 2017 .

[8]  Akiko Aizawa,et al.  An information-theoretic perspective of tf-idf measures , 2003, Inf. Process. Manag..

[9]  Vasudeva Varma,et al.  Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[10]  Mohammad S. Hasan,et al.  An application of machine learning to detect abusive Bengali text , 2017, 2017 20th International Conference of Computer and Information Technology (ICCIT).

[11]  Abdolreza Abhari,et al.  Automatic classification of the emotional content of URL documents using NLP algorithms , 2015, SpringSim.

[12]  Vikas S. Chavan,et al.  Machine learning approach for detection of cyber-aggressive comments by peers on social media network , 2015, 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[13]  Mun Y. Yi,et al.  Explicit Content Detection in Music Lyrics Using Machine Learning , 2018, 2018 IEEE International Conference on Big Data and Smart Computing (BigComp).

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  Siddhartha R. Jonnalagadda,et al.  PDF text classification to leverage information extraction from publication reports , 2016, J. Biomed. Informatics.

[16]  Natasha Duarte,et al.  Mixed Messages? The Limits of Automated Social Media Content Analysis , 2018, FAT.

[17]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[18]  Theodore Chu,et al.  Comment Abuse Classification with Deep Learning , 2017 .

[19]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[20]  Houkuan Huang,et al.  Feature selection for text classification with Naïve Bayes , 2009, Expert Syst. Appl..