Expression recognition for severely demented patients in music reminiscence-therapy

Recognizing expressions in severely demented Alzheimer's disease (AD) patients is essential, since such patients have lost a substantial amount of their cognitive capacity, and some even their verbal communication ability (e.g., aphasia). This leaves patients dependent on clinical staff to assess their verbal and non-verbal language, in order to communicate important messages, as of the discomfort associated to potential complications of the AD. Such assessment classically requires the patients' presence in a clinic, and time consuming examination involving medical personnel. Thus, expression monitoring is costly and logistically inconvenient for patients and clinical staff, which hinders among others large-scale monitoring. In this work we present a novel approach for automated recognition of facial activities and expressions of severely demented patients, where we distinguish between four activity and expression states, namely talking, singing, neutral and smiling. Our approach caters to the challenging setting of current medical recordings of music-therapy sessions, which include continuous pose variations, occlusions, camera-movements, camera-artifacts, as well as changing illumination. Additionally and importantly, the (elderly) patients exhibit generally less profound facial activities and expressions in a range of intensities and predominantly occurring in combinations (e.g., talking and smiling). Our proposed approach is based on the extension of the Improved Fisher Vectors (IFV) for videos, representing a video-sequence using both, local, as well as the related spatio-temporal features. We test our algorithm on a dataset of over 229 video sequences, acquired from 10 AD patients, with promising results, which have sparked substantial interest in the medical community. The proposed approach can play a key role in assessment of different therapy treatments, as well as in remote large-scale healthcare-frameworks.

[1]  Zhiwei Luo,et al.  A virtual shopping test for realistic assessment of cognitive function , 2013, Journal of NeuroEngineering and Rehabilitation.

[2]  Alexandra König,et al.  Validation of an automatic video monitoring system for the detection of instrumental activities of daily living in dementia patients. , 2015, Journal of Alzheimer's disease : JAD.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Antitza Dantcheva,et al.  Can a Smile Reveal Your Gender? , 2016, 2016 International Conference of the Biometrics Special Interest Group (BIOSIG).

[5]  A C Hurley,et al.  Assessment of discomfort in advanced Alzheimer patients. , 1992, Research in nursing & health.

[6]  S. Demleitner [Communication without words]. , 1997, Pflege aktuell.

[7]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  François Brémond,et al.  Video Covariance Matrix Logarithm for Human Action Recognition in Videos , 2015, IJCAI.

[9]  Antitza Dantcheva,et al.  Gender Estimation Based on Smile-Dynamics , 2017, IEEE Transactions on Information Forensics and Security.

[10]  M P Lawton,et al.  Quality of Life in Alzheimer Disease , 1994, Alzheimer disease and associated disorders.

[11]  François Brémond,et al.  Human violence recognition and detection in surveillance videos , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[12]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Slawomir Bak,et al.  Representing visual appearance by video Brownian covariance descriptor for human action recognition , 2014, 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[14]  Antitza Dantcheva,et al.  Emotion facial recognition by the means of automatic video analysis , 2016 .

[15]  S. Koger,et al.  Is Music Therapy an Effective Intervention for Dementia?A Meta-Analytic Review of Literature. , 1999, Journal of music therapy.

[16]  Katherine B. Martin,et al.  Facial Action Coding System , 2015 .

[17]  I M Alba,et al.  The nurse's role in preventing circulatory complications in the patient with a fractured hip. , 1966, The Nursing clinics of North America.

[18]  S. Ashida The effect of reminiscence music therapy sessions on changes in depressive symptoms in elderly persons with dementia. , 2000, Journal of music therapy.

[19]  Cordelia Schmid,et al.  Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.

[20]  David G. Lowe,et al.  Spatially Local Coding for Object Recognition , 2012, ACCV.

[21]  S. Koger,et al.  The impact of music therapy on language functioning in dementia. , 2000, Journal of music therapy.

[22]  Chuan-Yu Chang,et al.  Personalized facial expression recognition in indoor environments , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[23]  Gwen Littlewort,et al.  Dynamics of Facial Expression Extracted Automatically from Video , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[24]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[25]  James M. Keller,et al.  Resident identification using kinect depth image data and fuzzy clustering techniques , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[26]  Hanne Mette Ochsner Ridder,et al.  The use of extemporizing in music therapy to facilitate communication in a person with dementia: An explorative case study , 2015 .

[27]  Florent Perronnin,et al.  Modeling the spatial layout of images beyond spatial pyramids , 2012, Pattern Recognit. Lett..

[28]  G L Odenheimer,et al.  Management of Patients with Alzheimer's Disease , 1991, Pharmacotherapy.

[29]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[31]  Luc Van Gool,et al.  Face Detection without Bells and Whistles , 2014, ECCV.