A Computational Approach to Feature Extraction for Identification of Suicidal Ideation in Tweets

Technological advancements in the World Wide Web and social networks in particular coupled with an increase in social media usage has led to a positive correlation between the exhibition of Suicidal ideation on websites such as Twitter and cases of suicide. This paper proposes a novel supervised approach for detecting suicidal ideation in content on Twitter. A set of features is proposed for training both linear and ensemble classifiers over a dataset of manually annotated tweets. The performance of the proposed methodology is compared against four baselines that utilize varying approaches to validate its utility. The results are finally summarized by reflecting on the effect of the inclusion of the proposed features one by one for suicidal ideation detection.

[1]  Xin Li,et al.  Topic Model for Identifying Suicidal Ideation in Chinese Microblog , 2015, PACLIC.

[2]  Rafael A. Calvo,et al.  CLPsych 2016 Shared Task: Triaging content in online peer-support forums , 2016, CLPsych@HLT-NAACL.

[3]  J. Friedman Stochastic gradient boosting , 2002 .

[4]  Lydia Denworth,et al.  Preventing Suicide. , 2018, Scientific American.

[5]  Paul R Duberstein,et al.  The impact of suicide on the family. , 2008, Crisis.

[6]  Stephen Wan,et al.  The Role of Features and Context on Suicide Ideation Detection , 2016, ALTA.

[7]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[8]  Nazli Goharian,et al.  Depression and Self-Harm Risk Assessment in Online Forums , 2017, EMNLP.

[9]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[10]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[11]  Xuanjing Huang,et al.  Recurrent Neural Network for Text Classification with Multi-Task Learning , 2016, IJCAI.

[12]  Kimberly Hoagwood,et al.  Development and natural history of mood disorders , 2002, Biological Psychiatry.

[13]  Mark Dredze,et al.  Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides , 2015, HT.

[14]  Véronique Hoste,et al.  Emotion detection in suicide notes , 2013, Expert Syst. Appl..

[15]  Carolyn Penstein Rosé,et al.  Detecting offensive tweets via topical feature discovery over a large scale twitter corpus , 2012, CIKM.

[16]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[17]  Tingshao Zhu,et al.  Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons , 2014, 2014 IEEE 11th Intl Conf on Ubiquitous Intelligence and Computing and 2014 IEEE 11th Intl Conf on Autonomic and Trusted Computing and 2014 IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops.

[18]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[19]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[20]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[21]  Glen Coppersmith,et al.  Exploratory Analysis of Social Media Prior to a Suicide Attempt , 2016, CLPsych@HLT-NAACL.

[22]  Michael D. Barnes,et al.  Tracking suicide risk factors through Twitter in the US. , 2014, Crisis.

[23]  K. Hawton,et al.  The Power of the Web: A Systematic Review of Studies of the Influence of the Internet on Self-Harm and Suicide in Young People , 2013, PloS one.

[24]  W W Zung,et al.  Suicide prevention by suicide detection. , 1979, Psychosomatics.

[25]  S. Hetrick,et al.  Social media and suicide prevention: a systematic review , 2016, Early intervention in psychiatry.

[26]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[27]  Kara Chan,et al.  Use of the internet and traditional media among young people , 2007 .

[28]  H. Christensen,et al.  Detecting suicidality on Twitter , 2015 .

[29]  Maria Liakata,et al.  Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records , 2016, CLPsych@HLT-NAACL.

[30]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[31]  Tadashi Takeshima,et al.  The Impact of Suicidality-Related Internet Use: A Prospective Large Cohort Study with Young and Middle-Aged Internet Users , 2014, PloS one.

[32]  Mark Dredze,et al.  Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media , 2016, CHI.

[33]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.