Detecting Dementia Through Interactive Computer Avatars

This paper proposes a new approach to the automatic detection of dementia. Although previous studies have detected dementia from speech and language attributes, most relied on picture descriptions, narratives, and cognitive tasks. We propose a computer avatar with spoken dialog functionality that asks spoken questions based on the Mini-Mental State Examination, the Wechsler Memory Scale-Revised, and other related neuropsychological questions. We recorded interactive spoken dialogues from 29 participants (14 with dementia and 15 healthy controls) and extracted various audiovisual features. Using these features, we trained two machine learning algorithms (support vector machines and logistic regression) to predict dementia. The support vector machines outperformed logistic regression and classified the participants into the two groups with a detection performance of 0.93, measured as the area under the receiver operating characteristic curve. We also identified several contributing features, e.g., the gap before speaking, variation in fundamental frequency, voice quality, and the ratio of smiling. We conclude that our system has the potential to detect dementia through spoken dialog and could assist health care workers. In addition, these findings could help medical personnel detect signs of dementia.
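As a reading aid for the reported metric, the area under the receiver operating characteristic curve can be computed as the probability that a randomly chosen positive case receives a higher classifier score than a randomly chosen negative case, with ties counted as one half. The following is a minimal sketch of that computation only. The helper name `roc_auc` and the labels and scores are illustrative assumptions, not the study's data or implementation.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of scores.

    labels: iterable of 1 (positive, e.g. dementia) or 0 (negative, e.g. control)
    scores: classifier outputs, where a higher score means "more likely positive"
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present to compute the AUC")

    # Count positive/negative pairs in which the positive case is ranked higher;
    # a tie contributes half a pair.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up scores for six hypothetical participants (not data from the paper).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)  # 8 of the 9 positive/negative pairs are ordered correctly
```

A value of 1.0 corresponds to perfect separation of the two groups and 0.5 to chance-level ranking, which is the scale on which the reported 0.93 should be read.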
