Suicidal tendencies prediction in Greek poetry

Natural language processing (NLP) has been successfully used to predict a writer’s tendency of committing suicide, using various text types: suicide notes, micro-blog posts, lyrics and poems. This paper is an extended version of earlier work. We extend our previous work on text mining in Greek Poetry by employing more sophisticated approaches. More specifically we have applied (i) Deep Neural Networks (DNN), (ii) additional morphosyntactic and semantic features based on writers' emotions and Big Five personality traits and (iii) feature selection, for suicide prediction in Greek poetry. We extend previous research to Greek, i.e. a language that has not been tackled before in this setting, using both language-dependent (but easily portable across languages) and language-independent linguistic features in order to represent the poems of 13 Greek poets of the twentieth century. Our results differ significantly from previous literature. In general, our proposed DNN model offers promising results for suicide prediction, despite the fact that this task poses multiple difficulties, especially for a language with limited related research support.

[1]  Katia Kermanidis,et al.  Language Processing for Predicting Suicidal Tendencies: A Case Study in Greek Poetry , 2019, AIAI.

[2]  J. Pennebaker,et al.  Word Use in the Poetry of Suicidal and Nonsuicidal Poets , 2001, Psychosomatic medicine.

[3]  Nirvana Meratnia,et al.  Generic online animal activity recognition on collar tags , 2017, UbiComp/ISWC Adjunct.

[4]  K. Gnana Sheela,et al.  Review on Methods to Fix Number of Hidden Neurons in Neural Networks , 2013 .

[5]  B. Mwangi,et al.  Identifying a clinical signature of suicidality among patients with mood disorders: A pilot study using a machine learning approach. , 2016, Journal of affective disorders.

[6]  S. Stack,et al.  Suicide among artists. , 1997, The Journal of social psychology.

[7]  K. Petrie,et al.  Sense of coherence, self-esteem, depression and hopelessness as correlates of reattempting suicide. , 1992, The British journal of clinical psychology.

[8]  Lei Zhang,et al.  Using Linguistic Features to Estimate Suicide Probability of Chinese Microblog Users , 2014, HCC.

[9]  Pete Burnap,et al.  Machine Classification and Analysis of Suicide-Related Communication on Twitter , 2015, HT.

[10]  E. Durkheim Suicide: A Study in Sociology , 1897 .

[11]  P. Resnik,et al.  CLPsych 2019 Shared Task: Predicting the Degree of Suicide Risk in Reddit Posts , 2019, Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology.

[12]  Colin Cherry,et al.  Binary Classifiers and Latent Sequence Models for Emotion Detection in Suicide Notes , 2012, Biomedical informatics insights.

[13]  Bethany A. Teachman,et al.  Identification of Imminent Suicide Risk Among Young Adults using Text Messages , 2018, CHI.

[14]  Sung-Ah Kim,et al.  A Study on Performance Prediction in Early Design Stage of Apartment Housing Using Machine Learning , 2018 .

[15]  Alex B. Fine,et al.  Natural Language Processing of Social Media as Screening for Suicide Risk , 2018, Biomedical informatics insights.

[16]  R. Moos,et al.  Life Circumstances and Personal Resources as Predictors of the Ten-Year Course of Depression , 1998, American journal of community psychology.

[17]  K. Pajak,et al.  Escaping the World: Linguistic Indicators of Suicide Attempts in Poets , 2014 .

[18]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[19]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[20]  R. Gullison,et al.  Can Sustainable Management Save Tropical Forests , 1997 .

[21]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[22]  M. Sydes,et al.  Emotion Detection in Suicide Notes using Maximum Entropy Classification , 2012, Biomedical informatics insights.

[23]  Matthew Mulholland,et al.  Suicidal Tendencies: The Automatic Classification of Suicidal and Non-Suicidal Lyricists Using NLP , 2013, IJCNLP.

[24]  Philip Resnik,et al.  Expert, Crowdsourced, and Machine Assessment of Suicide Risk via Online Postings , 2018, CLPsych@NAACL-HTL.

[25]  A. Tellegen,et al.  An alternative "description of personality": the big-five factor structure. , 1990, Journal of personality and social psychology.

[26]  Michael ODonnell,et al.  Demonstration of the UAM CorpusTool for Text and Image Annotation , 2008, ACL.

[27]  Tatiana Litvinova,et al.  Identification of Suicidal Tendencies of Individuals based on the Quantitative Analysis of their Internet Texts , 2017, Computación y Sistemas.

[28]  Michelle Morales,et al.  An Investigation of Deep Learning Systems for Suicide Risk Assessment , 2019, Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology.

[29]  A. Leenaars,et al.  Suicide Note Classification Using Natural Language Processing: A Content Analysis , 2010, Biomedical informatics insights.

[30]  K. Jamison,et al.  Manic-depressive illness and creativity. , 1995, Scientific American.

[31]  Psychological issues and treatment strategies in popular musicians: a review, Part 2 * , 2000 .

[32]  Jingcheng Du,et al.  Extracting psychiatric stressors for suicide from social media using deep learning , 2018, BMC Medical Informatics and Decision Making.

[33]  Markus Hofmann,et al.  RapidMiner: Data Mining Use Cases and Business Analytics Applications , 2013 .