Screening Internet forum participants for depression symptoms by assembling and enhancing multiple NLP methods

Depression is a disease that can dramatically lower quality of life. Symptoms of depression can range from temporary sadness to suicide. Embarrassment, shyness, and the stigma of depression are some of the factors preventing people from getting help for their problems. Contemporary social media technologies like Internet forums or micro-blogs give people the opportunity to talk about their feelings in a confidential anonymous environment. However, many participants in such networks may not recognize the severity of their depression and their need for professional help. Our approach is to develop a method that detects symptoms of depression in free text, such as posts in Internet forums, chat rooms and the like. This could help people appreciate the significance of their depression and realize they need to seek help. In this work Natural Language Processing methods are used to break the textual information into its grammatical units. Further analysis involves detection of depression symptoms and their frequency with the help of words known as indicators of depression and their synonyms. Finally, similar to common paper-based depression scales, e.g., the CES-D, that information is incorporated into a single depression score. In this evaluation study, our depressive mood detection system, DepreSD (Depression Symptom Detection), had an average precision of 0.84 (range 0.72-1.0 depending on the specific measure) and an average F measure of 0.79 (range 0.72-0.9).

[1]  Stephan M. Winkler,et al.  On Text Preprocessing for Opinion Mining Outside of Laboratory Environments , 2012, AMT.

[2]  E. B. Wilson Probable Inference, the Law of Succession, and Statistical Inference , 1927 .

[3]  Michael Chau,et al.  A Hybrid System for Online Detection of Emotional Distress , 2012, PAISI.

[4]  Qi Li,et al.  User-level psychological stress detection from social media using deep neural network , 2014, ACM Multimedia.

[5]  Yair Neuman,et al.  Proactive screening for depression through metaphorical and automatic text analysis , 2012, Artif. Intell. Medicine.

[6]  Otto B. Walter,et al.  Development of a Computer-adaptive Test for Depression (D-CAT) , 2005, Quality of Life Research.

[7]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[8]  D. Ford,et al.  Screening the public for depression through the Internet. , 2001, Psychiatric services.

[9]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[10]  Mark Dredze,et al.  Quantifying Mental Health Signals in Twitter , 2014, CLPsych@ACL.

[11]  Alex Pentland,et al.  Proceedings of the 5th Annual ACM Web Science Conference , 2013 .

[12]  Markus Kreuzthaler,et al.  Navigating through Very Large Sets of Medical Records: An Information Retrieval Evaluation Architecture for Non-standardized Text , 2011, USAB.

[13]  Thomas Loew,et al.  Selbsthilfe und Beratung im Internet , 2004, Medizinische Klinik.

[14]  Sidney Greenbaum,et al.  The Oxford English Grammar , 1996 .

[15]  Andreas Holzinger,et al.  Quality-Based Knowledge Discovery from Medical Text on the Web , 2013, Quality Issues in the Management of Web Information.

[16]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[17]  T A Sheldon,et al.  Routinely administered questionnaires for depression and anxiety: systematic review , 2001, BMJ : British Medical Journal.

[18]  L. Radloff The CES-D Scale , 1977 .

[19]  A. Ciampi,et al.  Recognition of Depression by Non-psychiatric Physicians—A Systematic Literature Review and Meta-analysis , 2007, Journal of General Internal Medicine.

[20]  Robert C. Hsiung,et al.  The Best of Both Worlds: An Online Self-Help Group Hosted by a Mental Health Professional , 2000, Cyberpsychology Behav. Soc. Netw..

[21]  H. Wittchen,et al.  Size and burden of mental disorders in Europe—a critical review and appraisal of 27 studies , 2005, European Neuropsychopharmacology.

[22]  Joseph L. Annest,et al.  Suicide Among Adults Aged 35–64 Years — United States, 1999–2010 , 2013, MMWR. Morbidity and mortality weekly report.

[23]  Reijo Savolainen,et al.  Requesting and providing information in blogs and internet discussion forums , 2011, J. Documentation.

[24]  Eric Horvitz,et al.  Social media as a measurement tool of depression in populations , 2013, WebSci.

[25]  Lyle H. Ungar,et al.  Identifying potential adverse effects using the web: A new approach to medical hypothesis generation , 2011, J. Biomed. Informatics.

[26]  Isabelle Guyon,et al.  Automatic Capacity Tuning of Very Large VC-Dimension Classifiers , 1992, NIPS.

[27]  Michal Karpowicz,et al.  Opinion Mining on the Web 2.0 - Characteristics of User Generated Content and Their Impacts , 2013, CHI-KDD.

[28]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[29]  M. Åsberg,et al.  A New Depression Scale Designed to be Sensitive to Change , 1979, British Journal of Psychiatry.

[30]  Joseph L. Annest,et al.  Suicide Trends Among Persons Aged 10–24 Years — United States, 1994–2012 , 2015, MMWR. Morbidity and mortality weekly report.

[31]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[32]  A. Beck,et al.  An inventory for measuring depression. , 1961, Archives of general psychiatry.

[33]  W W Zung,et al.  Self-rating depression scale in an outpatient clinic. Further validation of the SDS. , 1965, Archives of general psychiatry.

[34]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[35]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[36]  B. Locke,et al.  Suicide ideation: its relation to depression, suicide and suicide attempt. , 1979, Suicide & life-threatening behavior.

[37]  Henry Lieberman,et al.  Stacked Generalization Learning to Analyze Teenage Distress , 2014, ICWSM.