Improved cyberbullying detection using gender information

The rise of social networks is changing friendships, relationships and social communication, and these terms are acquiring new definitions. One may have hundreds of ‘friends’ without ever seeing their faces. Alongside this transition, there is increasing evidence that children and adolescents use online social applications for bullying. State-of-the-art studies in cyberbullying detection have mainly focused on the content of conversations while largely ignoring the characteristics of the actors involved. Social studies on cyberbullying reveal that the written language used by a harasser varies with the author’s characteristics, including gender. In this study we trained gender-specific text classifiers using a support vector machine model. We demonstrate that taking gender-specific language features into account improves the discrimination capacity of a classifier in detecting cyberbullying.
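
The gender-specific setup described above can be illustrated with a minimal sketch. It is not the study's implementation: the toy posts, the gender labels, the function names (`train_linear_svm`, `train_gender_specific`, `predict`) and the hyperparameters are all assumptions made for illustration, and the classifier is a small bias-free linear SVM trained by subgradient descent on the hinge loss over bag-of-words features. One model is trained per gender group, and each post is scored by the model that matches its author's gender.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberately simple assumption)."""
    return text.lower().split()


def train_linear_svm(samples, epochs=20, eta=0.1, lam=0.01):
    """Train a bias-free linear SVM on bag-of-words features.

    samples: list of (text, label) pairs, label +1 (bullying) or -1 (other).
    Uses subgradient descent on the L2-regularized hinge loss.
    """
    weights = {}
    for _ in range(epochs):
        for text, label in samples:
            tokens = tokenize(text)
            margin = label * sum(weights.get(tok, 0.0) for tok in tokens)
            # L2 regularization: shrink every weight slightly.
            for tok in weights:
                weights[tok] *= 1.0 - eta * lam
            # Hinge loss is active only when the margin is below 1.
            if margin < 1.0:
                for tok in tokens:
                    weights[tok] = weights.get(tok, 0.0) + eta * label
    return weights


def train_gender_specific(data):
    """Train one classifier per author gender.

    data: list of (text, gender, label) triples.
    Returns a dict mapping gender to that group's weight vector.
    """
    by_gender = defaultdict(list)
    for text, gender, label in data:
        by_gender[gender].append((text, label))
    return {g: train_linear_svm(s) for g, s in by_gender.items()}


def predict(models, text, gender):
    """Route the post to its author's gender model; True means bullying."""
    weights = models[gender]
    score = sum(weights.get(tok, 0.0) for tok in tokenize(text))
    return score > 0.0


# Invented toy data, for illustration only (not from the study's corpus).
TOY_DATA = [
    ("nobody likes you ugly", "female", +1),
    ("see you at practice", "female", -1),
    ("you are so fake", "female", +1),
    ("thanks for the notes", "female", -1),
    ("you are a loser", "male", +1),
    ("good game today", "male", -1),
    ("shut up idiot", "male", +1),
    ("nice goal in the match", "male", -1),
]

models = train_gender_specific(TOY_DATA)
print(predict(models, "nobody likes ugly", "female"))
print(predict(models, "good game today", "male"))
```

Because each group's model only sees that group's posts, a term that signals bullying in one group carries no weight in the other group's model. A practical system would also need a fallback for authors whose gender is unknown, which this sketch omits.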
