Detecting Anxiety through Reddit

Previous investigations into detecting mental illnesses through social media have predominately focused on detecting depression through Twitter corpora. In this paper, we study anxiety disorders through personal narratives collected through the popular social media website, Reddit. We build a substantial data set of typical and anxiety-related posts, and we apply N-gram language modeling, vector embeddings, topic analysis, and emotional norms to generate features that accurately classify posts related to binary levels of anxiety. We achieve an accuracy of 91% with vector-space word embeddings, and an accuracy of 98% when combined with lexicon-based features.

[1]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[2]  Cheri A. Levinson,et al.  Profiling Predicting Social Anxiety From Facebook Profiles , 2012 .

[3]  Moin Nadeem,et al.  Identifying Depression on Twitter , 2016, ArXiv.

[4]  Leonardo Max Batista Claudino,et al.  Beyond LDA: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter , 2015, CLPsych@HLT-NAACL.

[5]  Mark Dredze,et al.  Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides , 2015, HT.

[6]  J. Stockman Lifetime Prevalence of Mental Disorders in U.S. Adolescents: Results from the National Comorbidity Survey Replication–Adolescent Supplement (NCS-A) , 2012 .

[7]  R. Kessler,et al.  Prevalence, severity, and unmet need for treatment of mental disorders in the World Health Organization World Mental Health Surveys. , 2004, JAMA.

[8]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[9]  Maarten Sap,et al.  Mental Illness Detection at the World Well-Being Project for the CLPsych 2015 Shared Task , 2015, CLPsych@HLT-NAACL.

[10]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  K. Merikangas,et al.  Lifetime prevalence of mental disorders in U.S. adolescents: results from the National Comorbidity Survey Replication--Adolescent Supplement (NCS-A). , 2010, Journal of the American Academy of Child and Adolescent Psychiatry.

[13]  Ted Pedersen,et al.  Screening Twitter Users for Depression and PTSD with Lexical Decision Lists , 2015, CLPsych@HLT-NAACL.

[14]  Ryan L. Boyd,et al.  The Development and Psychometric Properties of LIWC2015 , 2015 .

[15]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[16]  Mark Dredze,et al.  Shared Task : Depression and PTSD on Twitter , 2015 .

[17]  Svetha Venkatesh,et al.  Affective and Content Analysis of Online Depression Communities , 2014, IEEE Transactions on Affective Computing.

[18]  Munmun De Choudhury,et al.  Mental Health Discourse on reddit: Self-Disclosure, Social Support, and Anonymity , 2014, ICWSM.

[19]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[20]  Alan R. Ellis,et al.  County-level estimates of mental health professional shortage in the United States. , 2009, Psychiatric services.