An Automatic Assessment System for Alzheimer’s Disease Based on Speech Using Feature Sequence Generator and Recurrent Neural Network

Alzheimer disease and other dementias have become the 7th cause of death worldwide. Still lacking a cure, an early detection of the disease in order to provide the best intervention is crucial. To develop an assessment system for the general public, speech analysis is the optimal solution since it reflects the speaker’s cognitive skills abundantly and data collection is relatively inexpensive compared with brain imaging, blood testing, etc. While most of the existing literature extracted statistics-based features and relied on a feature selection process, we have proposed a novel Feature Sequence representation and utilized a data-driven approach, namely, the recurrent neural network to perform classification in this study. The system is also shown to be fully-automated, which implies the system can be deployed widely to all places easily. To validate our study, a series of experiments have been conducted with 120 speech samples, and the score in terms of the area under the receiver operating characteristic curve is as high as 0.838.

[1]  Jürgen Schmidhuber,et al.  Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.

[2]  J. Hodges,et al.  Performance on the Boston Cookie theft picture description task in patients with early dementia of the Alzheimer's type: Missing information , 1996 .

[3]  K. Jellinger,et al.  Accuracy of clinical criteria for AD in the Honolulu-Asia Aging Study, a population-based study. , 2002, Neurology.

[4]  Margaret Forbes,et al.  AphasiaBank: Methods for studying discourse , 2011, Aphasiology.

[5]  A. Saykin,et al.  Differential Memory Test Sensitivity for Diagnosing Amnestic Mild Cognitive Impairment and Predicting Conversion to Alzheimer's Disease , 2009, Neuropsychology, development, and cognition. Section B, Aging, neuropsychology and cognition.

[6]  K. Meguro,et al.  Impaired verbal description ability assessed by the Picture Description Task in Alzheimer's disease. , 1998, Archives of gerontology and geriatrics.

[7]  K. Marder,et al.  Neuropsychological detection and characterization of preclinical Alzheimer's disease , 1995, Neurology.

[8]  Howard G. Birnbaum,et al.  Implications of early treatment among Medicaid patients with Alzheimer's disease , 2014, Alzheimer's & Dementia.

[9]  Gábor Gosztolya,et al.  A Speech Recognition-based Solution for the Automatic Detection of Mild Cognitive Impairment from Spontaneous Speech , 2018, Current Alzheimer research.

[10]  Alan D. Baddeley,et al.  Analysis of the episodic memory deficit in early Alzheimer's disease: Evidence from the doors and people test , 1996, Neuropsychologia.

[11]  Florian Metze,et al.  Comparison of Decoding Strategies for CTC Acoustic Models , 2017, INTERSPEECH.

[12]  Dong Wang,et al.  THCHS-30 : A Free Chinese Speech Corpus , 2015, ArXiv.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Tanja Schultz,et al.  Manual and Automatic Transcriptions in Dementia Detection from Speech , 2017, INTERSPEECH.

[15]  M. Swash,et al.  Word fluency in patients with early dementia of Alzheimer type. , 1988, The British journal of clinical psychology.

[16]  S. Brucki,et al.  Category fluency test: effects of age, gender and education on total scores, clustering and switching in Brazilian Portuguese-speaking subjects. , 2004, Brazilian journal of medical and biological research = Revista brasileira de pesquisas medicas e biologicas.

[17]  Dietrich Klakow,et al.  Testing the correlation of word error rate and perplexity , 2002, Speech Commun..

[18]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[19]  W. Kukull,et al.  Accuracy of the Clinical Diagnosis of Alzheimer Disease at National Institute on Aging Alzheimer Disease Centers, 2005–2010 , 2012, Journal of neuropathology and experimental neurology.

[20]  Semantic fluency in Alzheimer's, Parkinson's, and Huntington's disease : dissociation of storage and retrieval failures , 1993 .

[21]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[22]  Jennifer H Barnett,et al.  Early intervention in Alzheimer’s disease: a health economic study of the effects of diagnostic timing , 2014, BMC Neurology.

[23]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[24]  Chong Wang,et al.  Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[25]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[26]  L. Kessing,et al.  Diagnostic Evaluation of Dementia in the Secondary Health Care Sector , 2009, Dementia and Geriatric Cognitive Disorders.

[27]  Alexandra König,et al.  Speech-based automatic and robust detection of very early dementia , 2014, INTERSPEECH.

[28]  P. Scheltens,et al.  Atrophy of medial temporal lobes on MRI in "probable" Alzheimer's disease and normal ageing: diagnostic value and neuropsychological correlates. , 1992, Journal of neurology, neurosurgery, and psychiatry.

[29]  O. Almkvist,et al.  Neuropsychological features of mild cognitive impairment and preclinical Alzheimer's disease , 2003, Acta neurologica Scandinavica. Supplementum.

[30]  B. Croisile,et al.  Comparative Study of Oral and Written Picture Description in Patients with Alzheimer's Disease , 1996, Brain and Language.

[31]  V. Manera,et al.  Automatic speech analysis for the assessment of patients with predementia and Alzheimer's disease , 2015, Alzheimer's & dementia.

[32]  G. Arbanas Diagnostic and Statistical Manual of Mental Disorders (DSM-5) , 2015 .

[33]  K. Forbes-McKay,et al.  Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task , 2005, Neurological Sciences.

[34]  Heidi Christensen,et al.  An Avatar-Based System for Identifying Individuals Likely to Develop Dementia , 2017, INTERSPEECH.

[35]  20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, O-COCOSDA 2017, Seoul, South Korea, November 1-3, 2017 , 2017, O-COCOSDA.

[36]  P. Scheltens,et al.  Atrophy of medial temporal lobes on MRI in “probable” Alzheimer's disease and normal ageing: diagnostic value and neuropsychological correlates , 2012, Journal of Neurology, Neurosurgery & Psychiatry.