Sustainable cyberbullying detection with category-maximized relevance of harmful phrases and double-filtered automatic optimization

Abstract: We developed a supporting solution for “cyberbullying” prevention based on recent discoveries in Artificial Intelligence and Natural Language Processing. Cyberbullying, defined as using the Internet to humiliate and slander other people, has become a serious problem. In Japan, members of the Parent–Teacher Association manually perform Web monitoring to stop cyberbullying activities. Unfortunately, reading through the whole Web manually is an impossible task. Although the complexity of cyberbullying makes it a problem that technology alone cannot solve, we found that technology can make cyberbullying prevention more efficient. We developed a novel method for automatically detecting cyberbullying entries on the Internet. The method uses seed words from three categories to calculate a semantic orientation score and then maximizes the relevance of the categories. The proposed method outperformed baseline settings in both laboratory and real-world conditions. The developed system was deployed and tested in practice. After a year of testing, we observed a drop of more than 30 percentage points in its performance. We hypothesize about the reasons for the drop. To regain the lost performance and retain it in the future, we propose additional improvements, including automatic acquisition and filtering of seed words. The experimentally selected optimal improvements regained much of the lost performance.
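The abstract does not give the scoring formula, so the following is only a minimal sketch of the general idea, under stated assumptions: relevance between a phrase and a seed word is approximated with pointwise mutual information (PMI) over document co-occurrence in a toy corpus, relevance to a category is the mean PMI over that category's seeds, and the final score is the maximum over categories. The category names, placeholder seed tokens, smoothing constant, and all function names are hypothetical and are not taken from the paper.

```python
import math

# Toy corpus: each document is a set of tokens (placeholder tokens, not real data).
DOCS = [
    {"you", "are", "insult_a", "loser"},
    {"insult_a", "loser", "go", "away"},
    {"threat_a", "hurt", "you"},
    {"nice", "weather", "today"},
    {"threat_a", "hurt", "badly"},
    {"vulgar_a", "joke"},
]

# Hypothetical seed words for three hypothetical categories.
SEEDS = {
    "category_abusive": ["insult_a"],
    "category_violent": ["threat_a"],
    "category_obscene": ["vulgar_a"],
}


def pmi(phrase, seed, docs, smoothing=0.01):
    """PMI of phrase and seed, estimated from document-level co-occurrence."""
    n = len(docs)
    has_phrase = sum(1 for d in docs if phrase in d)
    has_seed = sum(1 for d in docs if seed in d)
    both = sum(1 for d in docs if phrase in d and seed in d)
    if has_phrase == 0 or has_seed == 0:
        return 0.0  # assumption: no occurrences means no evidence either way
    # Smoothing the joint count keeps the logarithm defined when both == 0.
    return math.log2((both + smoothing) * n / (has_phrase * has_seed))


def category_relevance(phrase, seeds_by_category, docs):
    """Mean PMI between the phrase and the seeds of each category."""
    return {
        category: sum(pmi(phrase, s, docs) for s in seeds) / len(seeds)
        for category, seeds in seeds_by_category.items()
    }


def max_category_score(phrase, seeds_by_category, docs):
    """Return the best-matching category and its relevance score."""
    scores = category_relevance(phrase, seeds_by_category, docs)
    best = max(scores, key=scores.get)
    return best, scores[best]


if __name__ == "__main__":
    for phrase in ("loser", "hurt", "weather"):
        print(phrase, max_category_score(phrase, SEEDS, DOCS))
```

Taking the maximum over categories, rather than pooling all seed words into one list, means a phrase that is strongly associated with a single category is not diluted by its weak association with the other two. In this toy setup a phrase would be flagged when its best category score exceeds a threshold chosen on held-out data; that threshold is likewise an assumption of the sketch.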
