Emotional and Linguistic Cues of Depression from Social Media

Health outcomes in modern society are often shaped by peer interactions. A growing share of these interactions happens online and can affect a range of mental and behavioral health outcomes. Guided by relevant social and psychological research, we conduct an observational study of the interactions between clinically depressed users and their ego-networks, contrasted with a control group of non-depressed users and their ego-networks. Specifically, we examine whether linguistic and emotional signals in social media exchanges can be used to detect symptomatic cues of depression. We observe significant deviations in the behavior of depressed users relative to the control group: reduced and nocturnal online activity, reduced active and passive network participation, increased negative sentiment or emotion, distinct linguistic styles (e.g., self-focused pronoun usage), a highly clustered and tightly knit neighborhood structure, and little to no exchange of influence between depressed users and their ego-networks over time. Based on these observations, we describe an approach to extract relevant features and show that a classifier built on these features can predict depression with an F-score of 90%.
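To make two of the cues above concrete, the following is a minimal sketch of how self-focused pronoun usage and nocturnal activity might be turned into per-user features. It is an illustration only, not the feature extraction used in the study: the pronoun list, the 21:00 to 06:00 night window, and the function names `self_focus_ratio` and `nocturnal_fraction` are all assumptions introduced here.

```python
import re
from datetime import datetime

# Hypothetical pronoun list; the lexicon used in the study is not given here.
FIRST_PERSON = {"i", "me", "my", "mine", "myself"}


def self_focus_ratio(posts):
    """Fraction of word tokens that are first-person singular pronouns."""
    tokens = [t for p in posts for t in re.findall(r"[a-z']+", p.lower())]
    if not tokens:
        return 0.0
    return sum(t in FIRST_PERSON for t in tokens) / len(tokens)


def nocturnal_fraction(timestamps, start_hour=21, end_hour=6):
    """Fraction of posts made at night.

    The night window (start_hour to end_hour, wrapping past midnight) is an
    assumed definition, and timestamps are assumed to be in the user's
    local time.
    """
    if not timestamps:
        return 0.0
    night = sum(ts.hour >= start_hour or ts.hour < end_hour for ts in timestamps)
    return night / len(timestamps)


posts = ["I feel tired", "my day"]
times = [
    datetime(2020, 1, 1, 23),
    datetime(2020, 1, 1, 12),
    datetime(2020, 1, 2, 3),
    datetime(2020, 1, 2, 15),
]
features = [self_focus_ratio(posts), nocturnal_fraction(times)]
```

A feature vector of this kind, extended with sentiment and network measures, could then be passed to any standard classifier; which classifier and which full feature set the study used is not stated in this abstract.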
