RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses

Self-reported diagnosis statements have been widely used to study language related to mental health in social media. However, existing research has largely ignored the temporality of mental health diagnoses. In this work, we introduce RSDD-Time: a new dataset of 598 manually annotated self-reported depression diagnosis posts from Reddit that include temporal information about the diagnosis. Annotations indicate whether a mental health condition is present and how recently the diagnosis occurred. Furthermore, we include the exact temporal spans that relate to the date of diagnosis. Because a person's mental health state is not static, this information is valuable to computational methods that examine mental health through social media. We also test several baseline classification and extraction approaches, and the results suggest that extracting temporal information from self-reported diagnosis statements is challenging.
