Identifying Medical Self-Disclosure in Online Communities

Self-disclosure in online health conversations may offer a host of benefits, including earlier detection and treatment of medical issues that may have otherwise gone unaddressed. However, research analyzing medical self-disclosure in online communities is limited. We address this shortcoming by introducing a new dataset of health-related posts collected from online social platforms, categorized into three groups (No Self-Disclosure, Possible Self-Disclosure, and Clear Self-Disclosure) with high inter-annotator agreement (_k_=0.88). We make this data available to the research community. We also release a predictive model trained on this dataset that achieves an accuracy of 81.02%, establishing a strong performance benchmark for this task.

[1]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[2]  ZhangWen,et al.  A comparative study of TF*IDF, LSI and multi-words for text classification , 2011 .

[3]  A. Joinson,et al.  Self-disclosure, Privacy and the Internet , 2009 .

[4]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[5]  Burt L. Monroe,et al.  Fightin' Words: Lexical Feature Selection and Evaluation for Identifying the Content of Political Conflict , 2008, Political Analysis.

[6]  Mark Dredze,et al.  From ADHD to SAD: Analyzing the Language of Mental Health on Twitter through Self-Reported Diagnoses , 2015, CLPsych@HLT-NAACL.

[7]  Anna Cinzia Squicciarini,et al.  Detection and Analysis of Self-Disclosure in Online News Commentaries , 2019, WWW.

[8]  A. Joinson Self‐disclosure in computer‐mediated communication: The role of self‐awareness and visual anonymity , 2001 .

[9]  Ronald E. Rice,et al.  Mediated disclosure on Twitter: The roles of gender and identity in boundary impermeability, valence, disclosure, and stage , 2013, Comput. Hum. Behav..

[10]  Glen Coppersmith,et al.  Exploratory Analysis of Social Media Prior to a Suicide Attempt , 2016, CLPsych@HLT-NAACL.

[11]  Hae-Chang Rim,et al.  Some Effective Techniques for Naive Bayes Text Classification , 2006, IEEE Transactions on Knowledge and Data Engineering.

[12]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[13]  I. Altman,et al.  Social penetration: The development of interpersonal relationships , 1973 .

[14]  Thomas Wolf,et al.  DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter , 2019, ArXiv.

[15]  N. Bridges Therapist's self-disclosure: Expanding the comfort zone. , 2001 .

[16]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[17]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[18]  J. Greist,et al.  A computer interview for psychiatric patient target symptoms. , 1973, Archives of general psychiatry.

[19]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[20]  R. Kellner Psychotherapy in Psychosomatic Disorders: A Survey of Controlled Studies , 1975 .

[21]  Alice H. Oh,et al.  Self-Disclosure and Relationship Strength in Twitter Conversations , 2012, ACL.

[22]  L. Alden,et al.  Anxiety and self-disclosure: toward a motivational model. , 1993, Journal of personality and social psychology.

[23]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[24]  Robert E. Kraut,et al.  Modeling Self-Disclosure in Social Networking Sites , 2016, CSCW.

[25]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[26]  L. Tidwell,et al.  Computer-Mediated Communication Effects on Disclosure, Impressions, and Interpersonal Evaluations: Getting to Know One Another a Bit at a Time , 2002 .

[27]  W. Meissner The Problem of Self-Disclosure in Psychoanalysis , 2002, Journal of the American Psychoanalytic Association.

[28]  Björn W. Schuller,et al.  Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework , 2010, Cognitive Computation.

[29]  P. Barglow,et al.  Self-disclosure in psychotherapy. , 2005, American journal of psychotherapy.

[30]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[31]  O. Bruss,et al.  Tell Me More: Online Versus Face-to-Face Communication and Self-Disclosure , 2010 .

[32]  Cindy K. Chung,et al.  Expressive Writing, Emotional Upheavals, and Health. , 2007 .

[33]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[34]  John Cromby,et al.  Emotional inhibition: A discourse analysis of disclosure , 2012, Psychology & health.

[35]  Munmun De Choudhury,et al.  Detecting and Characterizing Mental Health Related Self-Disclosure in Social Media , 2015, CHI Extended Abstracts.

[36]  Isabelle Augenstein,et al.  emoji2vec: Learning Emoji Representations from their Description , 2016, SocialNLP@EMNLP.

[37]  George Forman,et al.  BNS feature scaling: an improved representation over tf-idf for svm text classification , 2008, CIKM '08.