Towards objective and reproducible study of patient-doctor interaction: Automatic text analysis based VR-CoDES annotation of consultation transcripts

While increasingly appreciated for its importance, the interaction between health care professionals (HCP) and patients is notoriously difficult to study, with both methodological and practical challenges. The former has been addressed by the so-called Verona coding definitions of emotional sequences (VR-CoDES) - a system for identifying and coding patient emotions and the corresponding HCP responses - shown to be reliable and informative in a number of independent studies in different health care delivery contexts. In the preset work we focus on the practical challenge of the scalability of this coding system, namely on making it easily usable more widely and on applying it on larger patient cohorts. In particular, VR-CoDES is inherently complex and training is required to ensure consistent annotation of audio recordings or textual transcripts of consultations. Following up on our previous pilot investigation, in the the present paper we describe the first automatic, computer based algorithm capable of providing coarse level coding of textual transcripts. We investigate different representations of patient utterances and classification methodologies, and label each utterance as either containing an explicit expression of emotional distress (a ‘concern’), an implicit one (a ‘cue’), or neither. Using a data corpus comprising 200 consultations between radiotherapists and adult female breast cancer patients we demonstrate excellent labelling performance.

[1]  Ognjen Arandjelović A New Framework for Interpreting the Outcomes of Imperfectly Blinded Controlled Clinical Trials , 2012, PloS one.

[2]  Hanneke C J M de Haes,et al.  Development of the Verona coding definitions of emotional sequences to code health providers' responses (VR-CoDES-P) to patient cues and concerns. , 2011, Patient education and counseling.

[3]  Galina Velikova,et al.  Discussion of emotional and social impact of cancer during outpatient oncology consultations , 2011, Psycho-oncology.

[4]  J. Turner,et al.  Emotional dimensions of chronic disease. , 2000, The Western journal of medicine.

[5]  Ognjen Arandelovic,et al.  Automatic knowledge extraction from EHRs , 2016 .

[6]  A. Pelosi,et al.  Predictors of chronic "postviral" fatigue , 1994, The Lancet.

[7]  Ognjen Arandjelovic,et al.  Object Matching Using Boundary Descriptors , 2012, BMVC.

[8]  J. Bensing,et al.  Cue-responding behaviours of oncology nurses in video-simulated interviews. , 2008, Journal of advanced nursing.

[9]  S. Linton,et al.  Catastrophizing moderates the effect of exposure in vivo for back pain patients with pain‐related fear , 2010, European journal of pain.

[10]  Svetha Venkatesh,et al.  Overcoming data scarcity of Twitter: Using tweets as bootstrap with application to autism-related topic content analysis , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[11]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[12]  J. Bensing,et al.  Nurse–patient communication in cancer care: does responding to patient's cues predict patient satisfaction with communication , 2009, Psycho-oncology.

[13]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[14]  Alan Apter,et al.  Psychiatric aspects of pediatric cancer. , 2003, Child and adolescent psychiatric clinics of North America.

[15]  Peter Salmon,et al.  Coding patient emotional cues and concerns in medical consultations: the Verona coding definitions of emotional sequences (VR-CoDES). , 2011, Patient education and counseling.

[16]  Svetha Venkatesh,et al.  Using Twitter to learn about the autism community , 2015, Social Network Analysis and Mining.

[17]  Joost Dekker,et al.  Predictors of multidisciplinary treatment outcome in fibromyalgia:a systematic review , 2013, Disability and rehabilitation.

[18]  Svetha Venkatesh,et al.  Data-mining twitter and the autism spectrum disorder: A Pilot study , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[19]  Richard L. Street,et al.  Patient-centered communication in cancer care: Promoting healing and reducing suffering , 2007 .

[20]  Ognjen Arandjelovic,et al.  Visualization of patient specific disease risk prediction , 2017, 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[21]  Ognjen Arandjelovic,et al.  Towards sophisticated learning from EHRs: Increasing prediction specificity and accuracy using clinically meaningful risk criteria , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[22]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[23]  Ognjen Arandjelovic Clinical Trial Adaptation by Matching Evidence in Complementary Patient Sub-groups of Auxiliary Blinding Questionnaire Responses , 2015, PloS one.