论文信息 - Suicide Risk Assessment with Multi-level Dual-Context Language and BERT

Suicide Risk Assessment with Multi-level Dual-Context Language and BERT

Mental health predictive systems typically model language as if from a single context (e.g. Twitter posts, status updates, or forum posts) and often limited to a single level of analysis (e.g. either the message-level or user-level). Here, we bring these pieces together to explore the use of open-vocabulary (BERT embeddings, topics) and theoretical features (emotional expression lexica, personality) for the task of suicide risk assessment on support forums (the CLPsych-2019 Shared Task). We used dual context based approaches (modeling content from suicide forums separate from other content), built over both traditional ML models as well as a novel dual RNN architecture with user-factor adaptation. We find that while affect from the suicide context distinguishes with no-risk from those with “any-risk”, personality factors from the non-suicide contexts provide distinction of the levels of risk: low, medium, and high risk. Within the shared task, our dual-context approach (listed as SBU-HLAB in the official results) achieved state-of-the-art performance predicting suicide risk using a combination of suicide-context and non-suicide posts (Task B), achieving an F1 score of 0.50 over hidden test set labels.

[1] D. Velting. Suicidal ideation and the five-factor model of personality , 1999 .

[2] Philip Resnik,et al. Expert, Crowdsourced, and Machine Assessment of Suicide Risk via Online Postings , 2018, CLPsych@NAACL-HTL.

[3] Alex B. Fine,et al. Natural Language Processing of Social Media as Screening for Suicide Risk , 2018, Biomedical informatics insights.

[4] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[5] Niranjan Balasubramanian,et al. Human Centered NLP with User-Factor Adaptation , 2017, EMNLP.

[6] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[7] Margaret L. Kern,et al. Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[8] P. Resnik,et al. CLPsych 2019 Shared Task: Predicting the Degree of Suicide Risk in Reddit Posts , 2019, Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology.

[9] D. Asch,et al. Facebook language predicts depression in medical records , 2018, Proceedings of the National Academy of Sciences.

[10] Diyi Yang,et al. Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[11] Maarten Sap,et al. Developing Age and Gender Predictive Lexica over Social Media , 2014, EMNLP.

[12] Maarten Sap,et al. DLATK: Differential Language Analysis ToolKit , 2017, EMNLP.

[13] Maarten Sap,et al. Towards Assessing Changes in Degree of Depression through Facebook , 2014, CLPsych@ACL.

[14] Jérôme Idier,et al. Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.

[15] Lyle H. Ungar,et al. Understanding and Measuring Psychological Stress using Social Media , 2018, ICWSM.

[16] Sharath Chandra Guntuku,et al. Detecting depression and mental illness on social media: an integrative review , 2017, Current Opinion in Behavioral Sciences.

[17] E. Mościcki,et al. Identification of suicide risk factors using epidemiologic studies. , 1997, The Psychiatric clinics of North America.

[18] Matthew K Nock,et al. Suicide and suicidal behavior. , 2008, Epidemiologic reviews.

[19] Lyle H. Ungar,et al. Modelling Valence and Arousal in Facebook posts , 2016, WASSA@NAACL-HLT.

[20] Saif Mohammad,et al. Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words , 2018, ACL.

[21] Mark Dredze,et al. Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media , 2016, CHI.