A lexical database for public textual cyberbullying detection

Public textual cyberbullying has become one of the most prevalent issues associated with online safety of young people, particularly on social networks. To address this issue, we argue that the boundaries of what constitutes public textual cyberbullying needs to be first identified and a corresponding linguistically motivated definition needs to be advanced. Thus, we propose a definition of public textual cyberbullying that contains three necessary and sufficient elements: the personal marker, the dysphemistic element and the cyberbullying link between the previous two elements. Subsequently, we argue that one of the cornerstones in the overall process of mitigating the effects of cyberbullying is the design of a cyberbullying lexical database that specifies what linguistic and cyberbullying specific information is relevant to the detection process. In this vein, we propose a novel cyberbullying lexical database based on the definition of public textual cyberbullying. The overall architecture of our cyberbullying lexical database is determined semantically, and, in order to facilitate cyberbullying detection, the lexical entry encapsulates two new semantic dimensions that are derived from our definition: cyberbullying function and cyberbullying referential domain. In addition, the lexical entry encapsulates other semantic and syntactic information, such as sense and syntactic category, information that, not only aids the process of detection, but also allows us to expand the cyberbullying database using WordNet (Miller, 1993).

[1]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorisation: a survey , 1999 .

[2]  Peter K. Smith,et al.  Cyberbullying: another main type of bullying? , 2008, Scandinavian journal of psychology.

[3]  Narendra Shekokar,et al.  A Framework for Cyberbullying Detection in Social Network , 2015 .

[4]  Leonhard Lipka An outline of English lexicology , 1990 .

[5]  R. Ordelman,et al.  Improved cyberbullying detection using gender information , 2012 .

[6]  Erkki Sutinen,et al.  Antisocial Behavior corpus for harmful language detection , 2013, 2013 Federated Conference on Computer Science and Information Systems.

[7]  Pawel Dybala,et al.  Machine Learning and Affect Analysis Against Cyber-Bullying , 2010 .

[8]  Jun-Ming Xu,et al.  Learning from Bullying Traces in Social Media , 2012, NAACL.

[9]  D. Boyd Why Youth (Heart) Social Network Sites: The Role of Networked Publics in Teenage Social Life , 2007 .

[10]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[11]  Kenji Araki,et al.  Detecting Cyberbullying Entries on Informal School Websites Based on Category Relevance Maximization , 2013, IJCNLP.

[12]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[13]  Geoffrey K. Pullum,et al.  A Student's Introduction to English Grammar , 2021 .

[14]  Tatsuya Suda,et al.  A Framework to Identify Relationships among Students in School Bullying Using Digital Communication Media , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[15]  Sonia Livingstone,et al.  Risks and safety on the internet: the perspective of European children: full findings and policy implications from the EU Kids Online survey of 9-16 year olds and their parents in 25 countries , 2011 .

[16]  Brian D. Davison,et al.  Detection of Harassment on Web 2.0 , 2009 .

[17]  Katherine J. Miller,et al.  Adjectives in WordNet , 1990 .

[18]  Yorick Wilks,et al.  Making Preferences More Active , 1978, Artif. Intell..

[19]  Atsushi Tagami,et al.  A Study of Contact Network Generation for Cyber-bullying Detection , 2014, 2014 28th International Conference on Advanced Information Networking and Applications Workshops.

[20]  Karen Sullivan,et al.  Grammar in Metaphor: A Construction Grammar Account of Metaphoric Language , 2007 .

[21]  Christiane Fellbaum,et al.  English Verbs as a Semantic Net , 1990 .

[22]  Betty J. Birner Introduction to Pragmatics , 2012 .

[23]  Robert D. van Valin,et al.  An Introduction to Syntax , 2001 .

[24]  Kelly Reynolds,et al.  Using Machine Learning to Detect Cyberbullying , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[25]  Colette Langos,et al.  Cyberbullying: The Challenge to Define , 2012, Cyberpsychology Behav. Soc. Netw..

[26]  George A. Miller,et al.  Nouns in WordNet: A Lexical Inheritance System , 1990 .

[27]  Kelly Reynolds,et al.  Detecting cyberbullying: query terms and techniques , 2013, WebSci.

[28]  Xue Li,et al.  An Effective Approach for Cyberbullying Detection , 2013 .

[29]  Henry Lieberman,et al.  Common Sense Reasoning for Detection, Prevention, and Mitigation of Cyberbullying , 2012, TIIS.

[30]  Billy Henson Bullying beyond the schoolyard: Preventing and responding to cyberbullying , 2012 .

[31]  E. Menesini,et al.  Cyberbullying: Labels, Behaviours and Definition in Three European Countries , 2010, Australian Journal of Guidance and Counselling.

[32]  K. Burridge,et al.  Forbidden Words: Taboo and the Censoring of Language , 2006 .

[33]  Henry Lieberman,et al.  Modeling the Detection of Textual Cyberbullying , 2011, The Social Mobile Web.

[34]  Ying Chen,et al.  Detecting Offensive Language in Social Media to Protect Adolescent Online Safety , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.