Predicting and Characterizing the Health of Individuals and Communities through Language Analysis of Social Media

A large and growing fraction of the global population uses social media, through which users share their thoughts, feelings, and behaviors, predominantly through text. To quantify the expression of psychological constructs in language, psychology has evolved a set of “closed-vocabulary” methods using pre-determined dictionaries. Advances in natural language processing have made possible the development of “openvocabulary” methods to analyze text in data-driven ways, and machine learning algorithms have substantially improved prediction performances. The first chapter introduces these methods, comparing traditional methods of text analysis with newer methods from natural language processing in terms of their relative ability to predict and elucidate the language correlates of age, gender and the personality of Facebook users (N = 65,896). The second and third chapters discuss the use of social media to predict depression in individuals (the most prevalent mental illness). The second chapter reviews the literature on detection of depression through social media and concludes that no study to date has yet demonstrated the efficacy of this approach to screen for clinician-reported depression. In the third chapter, Facebook data was collected and connected to patients’ medical records (N = 683), and prediction models based on Facebook data were able to forecast the occurrence of depression with fair accuracy–about as well as self-report screening surveys. The fourth chapter applies both sets of methods to geotagged Tweets to predict county-level mortality rates of atherosclerotic heart disease mortality (the leading cause of death in the U.S.) across 1,347 counties, capturing 88% of the U.S. population. In this study, a Twitter model outperformed a model combining ten other leading demographic, socioeconomic and health risk factors. Across both depression and heart disease, associated language profiles identified fine-grained psychological determinants (e.g., loneliness emerged as a risk factor for depression, and optimism showed a protective association with heart disease). In sum, these studies demonstrate that large-scale text analysis is a valuable tool for psychology with implications for public health, as it allows for the unobtrusive and cost-effective monitoring of disease risk and psychological states of individuals and large populations. Degree Type Dissertation Degree Name Doctor of Philosophy (PhD) Graduate Group Psychology First Advisor Martin E. Seligman

[1]  H. Murray,et al.  A METHOD FOR INVESTIGATING FANTASIES: THE THEMATIC APPERCEPTION TEST , 1935 .

[2]  H. Freed,et al.  The thematic apperception test. , 1946, Diseases of the nervous system.

[3]  E. L. Kelly Clinical versus statistical prediction: A theoretical analysis and review of the evidence. , 1955 .

[4]  J. M. Kittross The measurement of meaning , 1959 .

[5]  C. Osgood On understanding and creating sentences. , 1963 .

[6]  David C. McClelland,et al.  A cross-cultural study of folk-tale content and drinking. , 1966, Sociometry.

[7]  A. Strauss,et al.  The Discovery of Grounded Theory , 1967 .

[8]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[9]  Sho Tin Chen,et al.  National Institute of Mental Health , 2020, Definitions.

[10]  C. Martindale An experimental simulation of literary change. , 1973 .

[11]  C. Martindale Romantic Progression: The Psychology of Literary History , 1975 .

[12]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[13]  J. Pierce An introduction to information theory: symbols, signals & noise , 1980 .

[14]  M. Seligman,et al.  The Attributional Style Questionnaire1 , 1982 .

[15]  C. Peterson,et al.  Attributions and depressive mood shifts: a case study using the symptom-context method. , 1983, Journal of abnormal psychology.

[16]  R. Weber Computer-aided content analysis: A short primer , 1984 .

[17]  M. Ganguli,et al.  Assessing depression in primary medical and psychiatric practices. , 1985, Archives of general psychiatry.

[18]  P. Stone,et al.  Verbal Style and the Presidency: A Computer-Based Analysis. , 1986 .

[19]  R. Weber Basic Content Analysis , 1986 .

[20]  J. Rowe,et al.  Human aging: usual and successful. , 1987, Science.

[21]  S. T. Dumais,et al.  Using latent semantic analysis to improve access to textual information , 1988, CHI '88.

[22]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[23]  Martha E. Francis,et al.  Putting Stress into Words: The Impact of Writing on Physiological, Absentee, and Self-Reported Emotional Well-Being Measures , 1992, American journal of health promotion : AJHP.

[24]  Charles P. Smith Motivation and personality: Name Index , 1992 .

[25]  P. Costa,et al.  Revised NEO Personality Inventory (NEO-PI-R) and NEO-Five-Factor Inventory (NEO-FFI) , 1992 .

[26]  G. Altmann,et al.  Quantitative text analysis , 1993 .

[27]  L. Kirmayer,et al.  Somatization and the recognition of depression and anxiety in primary care. , 1993, The American journal of psychiatry.

[28]  J. Coyne,et al.  Nondetection of depression by primary care physicians reconsidered. , 1995, General hospital psychiatry.

[29]  L. Gottschalk,et al.  Computerized measurement of the content analysis of natural language for use in biomedical and neuropsychiatric research. , 1995, Computer methods and programs in biomedicine.

[30]  C. Mulrow,et al.  Case-Finding Instruments for Depression in Primary Care Settings , 1995, Annals of Internal Medicine.

[31]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[32]  J. Pennebaker Writing About Emotional Experiences as a Therapeutic Process , 1997 .

[33]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[34]  S. Passik,et al.  Oncologists' recognition of depression in their patients with cancer. , 1998, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[35]  W. Bucci,et al.  Linking verbal and non-verbal representations: computer analysis of referential activity. , 1999, The British journal of medical psychology.

[36]  G. Simon,et al.  An international study of the relation between somatic symptoms and depression. , 1999, The New England journal of medicine.

[37]  A. Rush,et al.  Methods to improve diagnostic accuracy in a community mental health setting. , 2000, The American journal of psychiatry.

[38]  P. Miller,et al.  Inpatient diagnostic assessments: 1. Accuracy of structured vs. unstructured interviews , 2001, Psychiatry Research.

[39]  R. Rosenthal,et al.  Meta-analysis: recent developments in quantitative methods for literature reviews. , 2001, Annual review of psychology.

[40]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[41]  S. Gosling,et al.  A room with a cue: personality judgments based on offices and bedrooms. , 2002, Journal of personality and social psychology.

[42]  R. Rugulies Depression as a predictor for coronary heart disease. a review and meta-analysis. , 2002, American journal of preventive medicine.

[43]  Olga V. Demler,et al.  The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). , 2003, JAMA.

[44]  I. Kawachi,et al.  Social capital and neighborhood mortality rates in Chicago. , 2003, Social science & medicine.

[45]  Michael B. W. Wolfe,et al.  Use of latent semantic analysis for predicting psychological phenomena: Two issues and proposed solutions , 2003, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[46]  W. Katon,et al.  Depression and pain comorbidity: a literature review. , 2003, Archives of internal medicine.

[47]  T. Farley,et al.  Why Is Poverty Unhealthy , 2003 .

[48]  T. Farley,et al.  Why is poverty unhealthy? Social and physical mediators. , 2003, Social science & medicine.

[49]  J. Pennebaker,et al.  The Secret Life of Pronouns , 2003, Psychological science.

[50]  D. Nease,et al.  Depression screening: a practical strategy. , 2003, The Journal of family practice.

[51]  Richard W. Kobylinski,et al.  Identifying physician-recognized depression from administrative data: consequences for quality measurement. , 2003, Health services research.

[52]  H. Möller,et al.  Identifying depression in primary care: a comparison of different methods in a prospective cohort study , 2003, BMJ : British Medical Journal.

[53]  J. Pennebaker,et al.  Psychological aspects of natural language. use: our words, our selves. , 2003, Annual review of psychology.

[54]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[55]  E. Isometsä,et al.  Depressive disorders in primary care: recurrent, chronic, and co-morbid , 2004, Psychological Medicine.

[56]  A. Sherwood,et al.  Depression as a Risk Factor for Coronary Artery Disease: Evidence, Mechanisms, and Treatment , 2004, Psychosomatic medicine.

[57]  R. Kessler,et al.  Prevalence, severity, and unmet need for treatment of mental disorders in the World Health Organization World Mental Health Surveys. , 2004, JAMA.

[58]  M. Berner [The treatment of major depression]. , 2004, Pharmazie in unserer Zeit.

[59]  Mehran Sahami,et al.  Evaluating similarity measures: a large-scale study in the orkut social network , 2005, KDD '05.

[60]  John F. Hurdle,et al.  Measuring diagnoses: ICD code accuracy. , 2005, Health services research.

[61]  Sheldon Cohen,et al.  Does positive affect influence health? , 2005, Psychological bulletin.

[62]  E. Diener,et al.  Handbook of Multimethod Measurement in Psychology , 2005 .

[63]  R. Kessler,et al.  Twelve-month use of mental health services in the United States: results from the National Comorbidity Survey Replication. , 2005, Archives of general psychiatry.

[64]  A. Leyland Socioeconomic gradients in the prevalence of cardiovascular disease in Scotland: the roles of composition and context , 2005, Journal of Epidemiology and Community Health.

[65]  P. O’Connor,et al.  Are Claims Data Accurate Enough to Identify Patients for Performance Measures or Quality Improvement? The Case of Diabetes, Heart Disease, and Depression , 2006, American journal of medical quality : the official journal of the American College of Medical Quality.

[66]  John A. Johnson,et al.  The international personality item pool and the future of public-domain personality measures ☆ , 2006 .

[67]  C. Camargo,et al.  A prospective study of depression among adult patients in an urban emergency department. , 2006, Primary care companion to the Journal of clinical psychiatry.

[68]  H. Murray,et al.  Explorations in Personality , 2007 .

[69]  F. Hustey,et al.  A depression screen and intervention for older ED patients. , 2007, The American journal of emergency medicine.

[70]  Philip J. Stone,et al.  The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information , 2007 .

[71]  C. Hewitt,et al.  Screening for Depression in Medical Settings with the Patient Health Questionnaire (PHQ): A Diagnostic Meta-Analysis , 2007, Journal of General Internal Medicine.

[72]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[73]  J. Coyne,et al.  Do ultra-short screening instruments accurately detect depression in primary care? A pooled analysis and meta-analysis of 22 studies. , 2007, The British journal of general practice : the journal of the Royal College of General Practitioners.

[74]  Cindy K. Chung,et al.  The development and psychometric properties of LIWC2007 , 2007 .

[75]  J. Alderson Judging the Frequency of English Words , 2007 .

[76]  L. Gauvin,et al.  Toward the next generation of research into small area effects on health: a synthesis of multilevel investigations published since July 1998 , 2007, Journal of Epidemiology & Community Health.

[77]  A. Ciampi,et al.  Recognition of Depression by Non-psychiatric Physicians—A Systematic Literature Review and Meta-analysis , 2007, Journal of General Internal Medicine.

[78]  Cindy K. Chung,et al.  The Psychological Functions of Function Words , 2007 .

[79]  Margaret L. Kern,et al.  Health benefits: Meta-analytically determining the impact of well-being on objective health outcomes , 2007 .

[80]  T. Sheldon,et al.  Screening and case-finding instruments for depression: a meta-analysis , 2008, Canadian Medical Association Journal.

[81]  M. Maramis Depression and pain. , 2008, The Journal of clinical psychiatry.

[82]  Carla J. Groom,et al.  Gender Differences in Language Use: An Analysis of 14,000 Text Samples , 2008 .

[83]  H. Quan,et al.  Assessing validity of ICD-9-CM and ICD-10 administrative data in recording clinical conditions in a unique dually coded database. , 2008, Health services research.

[84]  A. Steptoe,et al.  Positive Psychological Well-Being and Mortality: A Quantitative Review of Prospective Observational Studies , 2008, Psychosomatic medicine.

[85]  David A. Hanauer,et al.  Enhanced identification of eligibility for depression research using an electronic medical record search engine , 2009, Int. J. Medical Informatics.

[86]  Patrick F. Reidy An Introduction to Latent Semantic Analysis , 2009 .

[87]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[88]  J. Brownstein,et al.  Digital disease detection--harnessing the Web for public health surveillance. , 2009, The New England journal of medicine.

[89]  A. Steptoe,et al.  The association of anger and hostility with future coronary heart disease: a meta-analytic review of prospective evidence. , 2009, Journal of the American College of Cardiology.

[90]  B. Chaix,et al.  Neighbourhoods in eco-epidemiologic research: delimiting personal exposure areas. A response to Riva, Gauvin, Apparicio and Brodeur. , 2009, Social science & medicine.

[91]  Patty Kostkova,et al.  Early Warning and Outbreak Detection Using Social Networking Websites: The Potential of Twitter , 2009, eHealth.

[92]  G. Eysenbach Infodemiology and Infoveillance: Framework for an Emerging Set of Public Health Informatics Methods to Analyze Search, Communication and Publication Behavior on the Internet , 2009, Journal of medical Internet research.

[93]  A. Mitchell,et al.  Clinical diagnosis of depression in primary care: a meta-analysis , 2009, The Lancet.

[94]  A. D. Diez Roux,et al.  Neighborhoods and health , 2010, Annals of the New York Academy of Sciences.

[95]  J. Denollet,et al.  Anxiety and risk of incident coronary heart disease : A meta-analysis , 2010 .

[96]  D. Mozaffarian,et al.  Defining and Setting National Goals for Cardiovascular Health Promotion and Disease Reduction: The American Heart Association's Strategic Impact Goal Through 2020 and Beyond , 2010, Circulation.

[97]  Adam D. I. Kramer An unobtrusive behavioral model of "gross national happiness" , 2010, CHI.

[98]  C. Elie,et al.  Anxiety and depression are unrecognised in emergency patients admitted to the observation care unit , 2010, Emergency Medicine Journal.

[99]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[100]  J. Aucott,et al.  The utility of "Google Trends" for epidemiological research: Lyme disease as an example. , 2010, Geospatial health.

[101]  G. Eysenbach,et al.  Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak , 2010, PloS one.

[102]  Tal Yarkoni Personality in 100,000 Words: A large-scale analysis of personality and word use among bloggers. , 2010, Journal of research in personality.

[103]  Brian K. Zimerle Visible Language: Inventions of Writing in the Ancient East and Beyond , 2010 .

[104]  A. Denison,et al.  Accuracy of Death Certifications and the Implications for Studying Disease Burdens , 2010 .

[105]  C. Peng,et al.  Do Seasons Have an Influence on the Incidence of Depression? The Use of an Internet Search Engine Query Data as a Proxy of Human Affect , 2010, PloS one.

[106]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[107]  A. Przeworski,et al.  A review of technology-assisted self-help and minimal contact therapies for anxiety and depression: is human contact necessary for therapeutic efficacy? , 2011, Clinical psychology review.

[108]  Richard E. Lucas,et al.  Predictors of Regional Well-Being: A County Level Analysis , 2011 .

[109]  J. O’Keefe,et al.  Psychological Risk Factors and Cardiovascular Disease: Is it All in Your Head? , 2011, Postgraduate medicine.

[110]  Soo Jeong Youn,et al.  Using electronic medical records to determine the diagnosis of clinical depression , 2011, Int. J. Medical Informatics.

[111]  A. Mitchell,et al.  International comparison of clinicians' ability to identify depression in primary care: meta-analysis and meta-regression of predictors. , 2011, The British journal of general practice : the journal of the Royal College of General Practitioners.

[112]  Francisco Iacobelli,et al.  Large Scale Personality Classification of Bloggers , 2011, ACII.

[113]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[114]  D. K. Keefer,et al.  Toward the next generation of research on earthquake-induced landslides: Current issues and future challenges , 2011 .

[115]  A. Alwan Global status report on noncommunicable diseases 2010. , 2011 .

[116]  K. Noyes,et al.  Medicare beneficiaries with depression: comparing diagnoses in claims data with the results of screening. , 2011, Psychiatric services.

[117]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[118]  E. Ford,et al.  Proportion of the decline in cardiovascular mortality disease due to prevention versus treatment: public health versus clinical care. , 2011, Annual review of public health.

[119]  Connie St Louis,et al.  Can Twitter predict disease outbreaks? , 2012, BMJ : British Medical Journal.

[120]  Eric Gilbert,et al.  Phrases that signal workplace hierarchy , 2012, CSCW.

[121]  A. D. Diez Roux,et al.  A review of spatial methods in epidemiology, 2000-2010. , 2012, Annual review of public health.

[122]  David C. Atkins,et al.  Topic models: a novel method for modeling couple and family text data. , 2012, Journal of family psychology : JFP : journal of the Division of Family Psychology of the American Psychological Association.

[123]  C. Vögele,et al.  Perseverative Thinking in Depression and Anxiety , 2012, Front. Psychology.

[124]  L. Kubzansky,et al.  The heart's content: the association between positive psychological well-being and cardiovascular health. , 2012, Psychological bulletin.

[125]  Eric Horvitz,et al.  Social media as a measurement tool of depression in populations , 2013, WebSci.

[126]  Eduardo Blanco,et al.  Toward Personality Insights from Language Exploration in Social Media , 2013, AAAI Spring Symposium: Analyzing Microtext.

[127]  R. Mckee Ethical issues in using social media for health and health care research. , 2013, Health policy.

[128]  E. Diener,et al.  Social relations, health behaviors, and health outcomes: a survey and synthesis. , 2013, Applied psychology. Health and well-being.

[129]  J. Brownstein,et al.  Influenza A (H7N9) and the importance of digital epidemiology. , 2013, The New England journal of medicine.

[130]  Megha Agrawal,et al.  Characterizing Geographic Variation in Well-Being Using Tweets , 2013, ICWSM.

[131]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[132]  Henriette Cramer,et al.  Representation and communication: challenges in interpreting large social media datasets , 2013, CSCW.

[133]  R. Mojtabai Clinician-Identified Depression in Community Settings: Concordance with Structured-Interview Diagnoses , 2013, Psychotherapy and Psychosomatics.

[134]  Margaret L. Kern,et al.  Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[135]  A. Kaplan,et al.  Power and Society: A Framework for Political Inquiry. , 1951 .

[136]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[137]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[138]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[139]  Rui Fan,et al.  Anger Is More Influential than Joy: Sentiment Correlation in Weibo , 2013, PloS one.

[140]  Joseph DiGrazia,et al.  Twitter publics: how online political communities signaled electoral outcomes in the 2010 US house election , 2014 .

[141]  Dhiraj Murthy,et al.  Modeling virtual organizations with Latent Dirichlet Allocation: A case for natural language processing , 2014, Neural Networks.

[142]  Ryen W. White,et al.  Seeking Insights About Cycling Mood Disorders via Anonymized Search Logs , 2014, Journal of medical Internet research.

[143]  Maarten Sap,et al.  Towards Assessing Changes in Degree of Depression through Facebook , 2014, CLPsych@ACL.

[144]  Maarten Sap,et al.  Developing Age and Gender Predictive Lexica over Social Media , 2014, EMNLP.

[145]  H. Fan,et al.  Depression after heart failure and risk of cardiovascular and all-cause mortality: a meta-analysis. , 2014, Preventive medicine.

[146]  Xin Tu,et al.  Social Structure and Depression in TrevorSpace , 2014, CSCW.

[147]  S. Patten,et al.  Systematic review and assessment of validated case definitions for depression in administrative data , 2014, BMC Psychiatry.

[148]  Eric Horvitz,et al.  Characterizing and predicting postpartum depression from shared facebook data , 2014, CSCW.

[149]  Eyal Sagi,et al.  Automated text analysis in psychology: methods, applications, and future developments* , 2014, Language and Cognition.

[150]  Gregory J. Park,et al.  From "Sooo excited!!!" to "So proud": using language to study development. , 2014, Developmental psychology.

[151]  Margaret L. Kern,et al.  Personality, well-being, and health. , 2014, Annual review of psychology.

[152]  Mark Dredze,et al.  Quantifying Mental Health Signals in Twitter , 2014, CLPsych@ACL.

[153]  Mark Dredze,et al.  Measuring Post Traumatic Stress Disorder in Twitter , 2014, ICWSM.

[154]  G. Arbanas Diagnostic and Statistical Manual of Mental Disorders (DSM-5) , 2015 .

[155]  Leonardo Max Batista Claudino,et al.  Beyond LDA: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter , 2015, CLPsych@HLT-NAACL.

[156]  Gregory J. Park,et al.  Automatic personality assessment through social media language. , 2015, Journal of personality and social psychology.

[157]  Hiroyuki Ohsaki,et al.  Recognizing Depression from Twitter Activity , 2015, CHI.

[158]  Xiao Wang,et al.  World Cup 2014 in the Twitter World: A big data analysis of sentiments in U.S. sports fans' tweets , 2015, Comput. Hum. Behav..

[159]  Mark Dredze,et al.  From ADHD to SAD: Analyzing the Language of Mental Health on Twitter through Self-Reported Diagnoses , 2015, CLPsych@HLT-NAACL.

[160]  Maarten Sap,et al.  The role of personality, age, and gender in tweeting about mental illness , 2015, CLPsych@HLT-NAACL.

[161]  Ted Pedersen,et al.  Screening Twitter Users for Depression and PTSD with Lexical Decision Lists , 2015, CLPsych@HLT-NAACL.

[162]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[163]  Kevin A Padrez,et al.  Linking social media and medical record data: a study of adults presenting to an academic, urban emergency department , 2015, BMJ Quality & Safety.

[164]  L. Ungar,et al.  Data-Driven Content Analysis of Social Media , 2015 .

[165]  S. Gosling,et al.  Facebook as a research tool for the social sciences: Opportunities, challenges, ethical considerations, and practical guidelines. , 2015, The American psychologist.

[166]  Mark Dredze,et al.  Shared Task : Depression and PTSD on Twitter , 2015 .

[167]  M. Kosinski,et al.  Computer-based personality judgments are more accurate than those made by humans , 2015, Proceedings of the National Academy of Sciences.

[168]  U. Hegerl The "European Alliance Against Depression" - a four-level intervention programme against depression and suicidality , 2015 .

[169]  M. Yosef,et al.  Associations between depression and all-cause and cause-specific risk of death: a retrospective cohort study in the Veterans Health Administration. , 2015, Journal of psychosomatic research.

[170]  Mike Conway,et al.  Towards Developing an Annotation Scheme for Depressive Disorder Symptoms: A Preliminary Study using Twitter Data , 2015, CLPsych@HLT-NAACL.

[171]  Gregory J. Park,et al.  Psychological Language on Twitter Predicts County-Level Heart Disease Mortality , 2015, Psychological science.

[172]  Margaret L. Kern,et al.  Social Networking Sites, Depression, and Anxiety: A Systematic Review , 2016, JMIR mental health.

[173]  Lyle H. Ungar,et al.  Analyzing Personality through Social Media Profile Picture Choice , 2016, ICWSM.

[174]  M. Kosinski,et al.  Self-Monitoring and the Metatraits. , 2016, Journal of personality.

[175]  Moin Nadeem,et al.  Identifying Depression on Twitter , 2016, ArXiv.

[176]  Glen Coppersmith,et al.  Exploratory Analysis of Social Media Prior to a Suicide Attempt , 2016, CLPsych@HLT-NAACL.

[177]  Mark Dredze,et al.  Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media , 2016, CHI.

[178]  Laura J. Bierut,et al.  A content analysis of depression-related tweets , 2016, Comput. Hum. Behav..

[179]  Maria Liakata,et al.  Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records , 2016, CLPsych@HLT-NAACL.

[180]  M. Kosinski,et al.  A decade into Facebook: where is psychiatry in the digital age? , 2016, The lancet. Psychiatry.

[181]  Gregory J. Park,et al.  Women are Warmer but No Less Assertive than Men: Gender and Language on Facebook , 2016, PloS one.

[182]  Gregory J. Park,et al.  Gaining insights from social media language: Methodologies and challenges. , 2016, Psychological methods.

[183]  Kristy Hollingshead,et al.  Crazy Mad Nutters: The Language of Mental Health , 2016, CLPsych@HLT-NAACL.

[184]  Johannes Zimmermann,et al.  First-person Pronoun Use in Spoken Language as a Predictor of Future Depressive Symptoms: Preliminary Evidence from a Clinical Sample of Depressed Patients. , 2016, Clinical Psychology and Psychotherapy.

[185]  Mike Conway,et al.  Feature Studies to Inform the Classification of Depressive Symptoms from Twitter Data for Population Health , 2017, ArXiv.

[186]  Christopher M. Danforth,et al.  Forecasting the onset and course of mental illness with Twitter data , 2016, Scientific Reports.

[187]  D. Mohr,et al.  Personal Sensing: Understanding Mental Health Using Ubiquitous Sensors and Machine Learning. , 2017, Annual review of clinical psychology.

[188]  Nicholas S. Holtzman,et al.  A meta-analysis of correlations between depression and first person singular pronoun use , 2017 .

[189]  Dirk Hovy,et al.  Multitask Learning for Mental Health Conditions with Limited Social Media Data , 2017, EACL.

[190]  Munmun De Choudhury,et al.  A Social Media Based Index of Mental Well-Being in College Campuses , 2017, CHI.

[191]  Eric S. Kim,et al.  Positive Psychological Well-Being and Cardiovascular Disease: JACC Health Promotion Series. , 2018, Journal of the American College of Cardiology.