Identifying patient experience from online resources via sentiment analysis and topic modelling

Positive patient experience is crucial for retaining patient loyalty and in understanding and acting upon limitations of the treatment or service provided. Online platforms, such as websites and forums, are excellent sources for collecting more reliable feedback, as they provide anonymity and ease of use to the patients. Information from online sources can be vast and unstructured, thereby making patient feedback analysis challenging. Recent advancements in text mining and sentiment analysis approaches can enable automated and granular analysis of patient feedback. In this paper, we present our research, for which we applied and evaluated text mining and machine learning models to a patient feedback database obtained from the NHS Choices website to predict the patient sentiment in the database. There were two iterations to our research. First, we applied a linguistic approach using machine learning and dictionary scoring algorithms to predict patient sentiment from patient feedback and the predicted sentiment was validated against the ratings provided by the patients in the database. Second, a topic modelling approach was applied to identify "themes" within patient feedback so as to understand better the nature of the associated sentiment score, thereby providing a richer understanding of patient opinion.

[1]  Kurt Hornik,et al.  Text Mining Infrastructure in R , 2008 .

[2]  References , 1971 .

[3]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[4]  I. Atherton,et al.  Instruments to measure patient experience of health care quality in hospitals: a systematic review protocol , 2014, Systematic Reviews.

[5]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[6]  Sherri L. LaVela,et al.  Evaluation and Measurement of Patient Experience , 2014 .

[7]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[8]  C. Lees Measuring patient experience. , 2014, Hospital peer review.

[9]  Nigel Collier,et al.  Uncovering text mining: A survey of current work on web-based epidemic intelligence , 2012, Global public health.

[10]  A. Coulter,et al.  The Point of Care Measures of patients ’ experience in hospital : purpose , methods and uses , 2009 .

[11]  M. K. White,et al.  Enhancing Patient Experience by Training Local Trainers in Fundamental Communication Skills , 2014 .

[12]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[13]  Andreas Holzinger,et al.  Use of Sentiment Analysis for Capturing Patient Experience From Free-Text Comments Posted Online , 2013, Journal of medical Internet research.

[14]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[15]  A. Darzi,et al.  Associations between Web-based patient ratings and objective measures of hospital quality. , 2012, Archives of internal medicine.

[16]  Jeffrey Heer,et al.  Termite: visualization techniques for assessing textual topic models , 2012, AVI.

[17]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[18]  S. LaVela,et al.  Defining Patient Experience , 2014 .

[19]  Xianggui Qu,et al.  Multivariate Data Analysis , 2007, Technometrics.

[20]  Ingo Feinerer Introduction to the tm Package Text Mining in R , 2007 .

[21]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[22]  Kenneth E. Shirley,et al.  LDAvis: A method for visualizing and interpreting topics , 2014 .

[23]  Yanchang Zhao R and Data Mining: Examples and Case Studies , 2012 .

[24]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[25]  Matthew P. Manary,et al.  The patient experience and health outcomes. , 2013, The New England journal of medicine.

[26]  N.D. du Preez,et al.  Leveraging unstructured information using topic modelling , 2008, PICMET '08 - 2008 Portland International Conference on Management of Engineering & Technology.

[27]  Susan T. Dumais,et al.  Partially labeled topic models for interpretable text mining , 2011, KDD.