How to automatically turn patient experience free-text responses into actionable insights: a natural language programming (NLP) approach

Background Patient experience surveys often include free-text responses. Analysis of these responses is time-consuming and often underutilized. This study examined whether Natural Language Processing (NLP) techniques could provide a data-driven, hospital-independent solution to indicate points for quality improvement. Methods This retrospective study used routinely collected patient experience data from two hospitals. A data-driven NLP approach was used. Free-text responses were categorized into topics, subtopics (i.e. n-grams) and labelled with a sentiment score. The indicator ‘impact’, combining sentiment and frequency, was calculated to reveal topics to improve, monitor or celebrate. The topic modelling architecture was tested on data from a second hospital to examine whether the architecture is transferable to another hospital. Results A total of 38,664 survey responses from the first hospital resulted in 127 topics and 294 n-grams. The indicator ‘impact’ revealed n-grams to celebrate (15.3%), improve (8.8%), and monitor (16.7%). For hospital 2, a similar percentage of free-text responses could be labelled with a topic and n-grams. Between-hospitals, most topics (69.7%) were similar, but 32.2% of topics for hospital 1 and 29.0% of topics for hospital 2 were unique. Conclusions In both hospitals, NLP techniques could be used to categorize patient experience free-text responses into topics, sentiment labels and to define priorities for improvement. The model’s architecture was shown to be hospital-specific as it was able to discover new topics for the second hospital. These methods should be considered for future patient experience analyses to make better use of this valuable source of information.

[1]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[2]  Jenny King,et al.  A framework analysis of free-text data from the neonatal survey 2014 , 2017, Journal of Neonatal Nursing.

[3]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[4]  Salim Khan,et al.  Use of Sentiment Mining and Online NMF for Topic Modeling Through the Analysis of Patients Online Unstructured Comments , 2018, ICSH.

[5]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[6]  Andrea Esuli,et al.  Building automated survey coders via interactive machine learning , 2019, ArXiv.

[7]  Marco Moreno,et al.  Latent Dirichlet Allocation complement in the vector space model for Multi-Label Text Classification , 2015, Int. J. Comb. Optim. Probl. Informatics.

[8]  Andreas Holzinger,et al.  Use of Sentiment Analysis for Capturing Patient Experience From Free-Text Comments Posted Online , 2013, Journal of medical Internet research.

[9]  G. Elwyn,et al.  Experiencing patient-experience surveys: a qualitative study of the accounts of GPs. , 2011, The British journal of general practice : the journal of the Royal College of General Practitioners.

[10]  Wiebke Wagner,et al.  Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit , 2010, Lang. Resour. Evaluation.

[12]  Kent A. Spackman,et al.  SNOMED RT: a reference terminology for health care , 1997, AMIA.

[13]  Mukesh A. Zaveri,et al.  Automatic Classification of Unstructured Blog Text , 2013 .

[14]  M. Wells,et al.  Qualitative analysis of 6961 free-text comments from the first National Cancer Patient Experience Survey in Scotland , 2017, BMJ Open.

[15]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[16]  Derek Greene,et al.  Down the (White) Rabbit Hole: The Extreme Right and Online Recommender Systems , 2015 .

[17]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[18]  Charles Vincent,et al.  Incident reporting and patient safety , 2007, BMJ : British Medical Journal.

[19]  Mark Lycett,et al.  Identifying patient experience from online resources via sentiment analysis and topic modelling , 2016, BDCAT.

[20]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[21]  J. Hamming,et al.  The Association Between Complications, Incidents, and Patient Experience: Retrospective Linkage of Routine Patient Experience Surveys and Safety Data. , 2019, Journal of patient safety.

[22]  Mohan Tanniru,et al.  Seeking Intelligence from Patient Experience Using Text Mining: Analysis of Emergency Department Data , 2015, Inf. Syst. Manag..

[23]  M. Bracher,et al.  Exploring experiences of cancer care in Wales: a thematic analysis of free-text responses to the 2013 Wales Cancer Patient Experience Survey (WCPES) , 2016, BMJ Open.

[24]  Terje Aven,et al.  Risk assessment and risk management: Review of recent advances on their foundation , 2016, Eur. J. Oper. Res..

[25]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[26]  J. Hamming,et al.  The problem with using patient complaints for improvement , 2018, BMJ Quality & Safety.

[27]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[28]  Timothy Baldwin,et al.  Automatic Evaluation of Topic Coherence , 2010, NAACL.

[29]  M. Simon,et al.  Development and testing of a text-mining approach to analyse patients’ comments on their experiences of colorectal cancer care , 2015, BMJ Quality & Safety.

[30]  Ariyur Mahadevan Abirami,et al.  Sentiment analysis model to emphasize the impact of online reviews in healthcare industry , 2017, Online Inf. Rev..

[31]  A. McCallum,et al.  Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[32]  Fred L. Drake,et al.  Python 3 Reference Manual , 2009 .