Listening to Mental Health Crisis Needs at Scale: Using Natural Language Processing to Understand and Evaluate a Mental Health Crisis Text Messaging Service

The current mental health crisis is a growing public health issue requiring a large-scale response that cannot be met with traditional services alone. Digital support tools are proliferating, yet most are not systematically evaluated, and we know little about their users and their needs. Shout is a free mental health text messaging service run by the charity Mental Health Innovations, which provides support for individuals in the UK experiencing mental or emotional distress and seeking help. Here we study a large data set of anonymised text message conversations and post-conversation surveys compiled through Shout. This data provides an opportunity to hear at scale from those experiencing distress; to better understand mental health needs for people not using traditional mental health services; and to evaluate the impact of a novel form of crisis support. We use natural language processing (NLP) to assess the adherence of volunteers to conversation techniques and formats, and to gain insight into demographic user groups and their behavioural expressions of distress. Our textual analyses achieve accurate classification of conversation stages (weighted accuracy = 88%), behaviours (1-hamming loss = 95%) and texter demographics (weighted accuracy = 96%), exemplifying how the application of NLP to frontline mental health data sets can aid with post-hoc analysis and evaluation of quality of service provision in digital mental health services.

[1]  Laura S. Abrams,et al.  Self-Harm Narratives of Urban and Suburban Young Women , 2003 .

[2]  Arman Cohan,et al.  Longformer: The Long-Document Transformer , 2020, ArXiv.

[3]  A. Sordi,et al.  “Pandemic fear” and COVID-19: mental health burden and strategies , 2020, Revista brasileira de psiquiatria.

[4]  S. Patten,et al.  A growing need for youth mental health services in Canada: examining trends in youth mental health from 2011 to 2018 , 2020, Epidemiology and Psychiatric Sciences.

[5]  Kimberly B. Roth,et al.  Engagement With Crisis Text Line Among Subgroups of Users Who Reported Suicidality. , 2019, Psychiatric services.

[6]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[7]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[8]  Angelique Jenney,et al.  Toxic Masculinity and Mental Health in Young Women , 2018 .

[9]  P. Lenca,et al.  Machine Learning and Natural Language Processing in Mental Health: Systematic Review , 2021, Journal of medical Internet research.

[10]  Robert Stewart,et al.  Applied natural language processing in mental health big data , 2020, Neuropsychopharmacology.

[11]  Kristen Olson,et al.  Survey Participation, Nonresponse Bias, Measurement Error Bias, and Total Bias , 2006 .

[12]  Kai Zou,et al.  EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks , 2019, EMNLP.

[13]  Vishakha Arya,et al.  Machine learning approaches to mental stress detection: a review , 2021 .

[14]  Mauricio Barahona,et al.  Data-driven unsupervised clustering of online learner behaviour , 2019, npj Science of Learning.

[15]  S. Gau,et al.  ADHD-related symptoms and attention profiles in the unaffected siblings of probands with autism spectrum disorder: focus on the subtypes of autism and Asperger’s disorder , 2017, Molecular Autism.

[16]  J. Pestian,et al.  A Controlled Trial Using Natural Language Processing to Examine the Language of Suicidal Adolescents in the Emergency Department. , 2016, Suicide & life-threatening behavior.

[17]  Kate Roberts,et al.  Attempting rigour and replicability in thematic analysis of qualitative research data; a case study of codebook development , 2019, BMC Medical Research Methodology.

[18]  N. Gale,et al.  Using the framework method for the analysis of qualitative data in multi-disciplinary health research , 2013, BMC Medical Research Methodology.

[19]  Lawrence Mosley,et al.  A balanced approach to the multi-class imbalance problem , 2013 .

[20]  R. Wiggins,et al.  Estimating the prevalence of disability in the community: the influence of sample design and response bias. , 1981, Journal of epidemiology and community health.

[21]  Ziqian Xie,et al.  Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction , 2020, npj Digital Medicine.

[22]  S. Velupillai,et al.  A natural language processing approach for identifying temporal disease onset information from mental healthcare text , 2021, Scientific Reports.

[23]  P. Lichtenstein,et al.  Predicting mental health problems in adolescence using machine learning techniques , 2020, PloS one.

[24]  A. D. de Vries,et al.  Gender dysphoria and autism spectrum disorder: A narrative review , 2016, International review of psychiatry.

[25]  Jure Leskovec,et al.  Large-scale Analysis of Counseling Conversations: An Application of Natural Language Processing to Mental Health , 2016, TACL.

[26]  Brita Elvevåg,et al.  What do we really know about blunted vocal affect and alogia? A meta-analysis of objective assessments , 2014, Schizophrenia Research.

[27]  René Veenstra,et al.  Evaluation of non-response bias in mental health determinants and outcomes in a large sample of pre-adolescents , 2005, European Journal of Epidemiology.

[28]  W. Gove,et al.  Response Bias in Surveys of Mental Health: An Empirical Investigation , 1977, American Journal of Sociology.

[29]  Goran Nenadic,et al.  Automatic Extraction of Mental Health Disorders From Domestic Violence Police Narratives: Text Mining Study , 2018, Journal of medical Internet research.

[30]  V. Braun,et al.  What can "thematic analysis" offer health and wellbeing researchers? , 2014, International journal of qualitative studies on health and well-being.

[31]  L. Yardley Dilemmas in qualitative health research , 2000 .

[32]  Colin Raffel,et al.  Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[33]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[34]  N. Martin,et al.  Participation bias in a sexuality survey: psychological and behavioural characteristics of responders and non-responders. , 1997, International journal of epidemiology.

[35]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[36]  Yoav Goldberg,et al.  Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets , 2019, EMNLP.

[37]  David C. Atkins,et al.  A Comparison of Natural Language Processing Methods for Automated Coding of Motivational Interviewing. , 2016, Journal of substance abuse treatment.

[38]  Richard E. Boyatzis,et al.  Transforming Qualitative Information: Thematic Analysis and Code Development , 1998 .

[39]  Lucy Yardley,et al.  Demonstrating validity in qualitative psychology , 2007 .

[40]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[41]  Leonard W. D'Avolio,et al.  Measuring Use of Evidence Based Psychotherapy for Posttraumatic Stress Disorder , 2013, Administration and Policy in Mental Health and Mental Health Services Research.

[42]  Katarzyna Musial,et al.  Transformer based Deep Intelligent Contextual Embedding for Twitter sentiment analysis , 2020, Future Gener. Comput. Syst..

[43]  S. Sigmon,et al.  Gender Differences in Self-Reports of Depression: The Response Bias Hypothesis Revisited , 2005 .

[44]  Enrique Baca-García,et al.  Novel Use of Natural Language Processing (NLP) to Predict Suicidal Ideation and Psychiatric Symptoms in a Text-Based Mental Health Intervention in Madrid , 2016, Comput. Math. Methods Medicine.

[45]  A. D. de Vries,et al.  Is There a Link Between Gender Dysphoria and Autism Spectrum Disorder? , 2018, Journal of the American Academy of Child and Adolescent Psychiatry.

[46]  T. B. Üstün,et al.  Age of onset of mental disorders: a review of recent literature , 2007, Current opinion in psychiatry.

[47]  S. Baron-Cohen,et al.  Elevated rates of autism, other neurodevelopmental and psychiatric diagnoses, and autistic traits in transgender and gender-diverse individuals , 2020, Nature Communications.

[48]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[49]  Taghi M. Khoshgoftaar,et al.  Text Data Augmentation for Deep Learning , 2021, Journal of Big Data.

[50]  Aaron D. Shaw,et al.  The Wikipedia Gender Gap Revisited: Characterizing Survey Response Bias with Propensity Score Estimation , 2013, PloS one.

[51]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[52]  Mauricio Barahona,et al.  Understanding learner behaviour in online courses with Bayesian modelling and time series characterisation , 2021, Scientific Reports.

[53]  C. Gillberg,et al.  Predicting Nonresponse Bias from Teacher Ratings of Mental Health Problems in Primary School Children , 2008, Journal of abnormal child psychology.

[54]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[55]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.