Multilingual word embeddings for the assessment of narrative speech in mild cognitive impairment

Abstract We analyze the information content of narrative speech samples from individuals with mild cognitive impairment (MCI), in both English and Swedish, using a combination of supervised and unsupervised learning techniques. We extract information units using topic models trained on word embeddings in monolingual and multilingual spaces, and find that the multilingual approach leads to significantly better classification accuracies than training on the target language alone. In many cases, we find that augmenting the topic model training corpus with additional clinical data from a different language is more effective than training on additional monolingual data from healthy controls. Ultimately we are able to distinguish MCI speakers from healthy older adults with accuracies of up to 63% (English) and 72% (Swedish) on the basis of information content alone. We also compare our method against previous results measuring information content in Alzheimer’s disease, and report an improvement over other topic-modeling approaches. Furthermore, our results support the hypothesis that subtle differences in language can be detected in narrative speech, even at the very early stages of cognitive decline, when scores on screening tools such as the Mini-Mental State Exam are still in the “normal” range.

[1]  B. Winblad,et al.  Accuracy of the Mini‐Mental Status Examination as a screening test for dementia in a Swedish elderly population , 1993, Acta neurologica Scandinavica.

[2]  Brian MacWhinney,et al.  The Talkbank Project , 2007 .

[3]  K. Forbes-McKay,et al.  Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task , 2005, Neurological Sciences.

[4]  P. Garrard,et al.  Connected speech as a marker of disease progression in autopsy-proven Alzheimer’s disease , 2013, Brain : a journal of neurology.

[5]  A. Hillis,et al.  Paucity vs. Verbosity: Another Analysis of Right Hemisphere Communication Deficits , 1985 .

[6]  Philip D. Harvey,et al.  Outcomes Assessment in Clinical Trials of Alzheimer's Disease and its Precursors: Readying for Short-term and Long-term Clinical Trial Needs. , 2017, Innovations in clinical neuroscience.

[7]  Sylvester Olubolu Orimaye,et al.  Predicting probable Alzheimer’s disease using linguistic deficits and biomarkers , 2017, BMC Bioinformatics.

[8]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[9]  Peter Garrard,et al.  Features and machine learning classification of connected speech samples from patients with autopsy proven Alzheimer's disease with and without additional vascular pathology. , 2014, Journal of Alzheimer's disease : JAD.

[10]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[11]  Colleen Richey,et al.  Aided diagnosis of dementia type through computer-based analysis of spontaneous speech , 2014, CLPsych@ACL.

[12]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[13]  J. Hodges,et al.  The effects of very early Alzheimer's disease on the characteristics of writing by a renowned author. , 2004, Brain : a journal of neurology.

[14]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[15]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[16]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[17]  V. Manera,et al.  Automatic speech analysis for the assessment of patients with predementia and Alzheimer's disease , 2015, Alzheimer's & dementia.

[18]  Language changes in bilingual individuals with Alzheimer's disease. , 2016, International journal of language & communication disorders.

[19]  H. Pai,et al.  To be semantically-impaired or to be syntactically-impaired: Linguistic patterns in Chinese-speaking persons with or without dementia , 2009, Journal of Neurolinguistics.

[20]  Felix Nensa,et al.  Clinical applications of PET/MRI: current status and future perspectives. , 2014, Diagnostic and interventional radiology.

[21]  Frank Rudzicz,et al.  Vector-space topic models for detecting Alzheimer’s disease , 2016, ACL.

[22]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[23]  S. Stevens The language of dementia in the elderly: a pilot study. , 1985, The British journal of disorders of communication.

[24]  Zhiyuan Liu,et al.  A Unified Model for Word Sense Representation and Disambiguation , 2014, EMNLP.

[25]  Markus Forsberg,et al.  Sparv : Språkbanken ’ s corpus annotation pipeline infrastructure , 2016 .

[26]  Ling Zheng,et al.  Longitudinal Verbal Fluency in Normal Aging, Preclinical, and Prevalent Alzheimer’s Disease , 2009, American journal of Alzheimer's disease and other dementias.

[27]  Annalena Venneri,et al.  The evolution of dysgraphia in Alzheimer’s disease , 2004, Brain Research Bulletin.

[28]  Meysam Asgari,et al.  Predicting mild cognitive impairment from spontaneous spoken utterances , 2017, Alzheimer's & dementia.

[29]  Veronika Vincze,et al.  Speaking in Alzheimer’s Disease, is That an Early Sign? Importance of Changes in Language Abilities in Alzheimer’s Disease , 2015, Front. Aging Neurosci..

[30]  N. Cercone,et al.  Automatic detection and rating of dementia of Alzheimer type through lexical analysis of spontaneous speech , 2005, IEEE International Conference Mechatronics and Automation, 2005.

[31]  B. Croisile,et al.  Comparative Study of Oral and Written Picture Description in Patients with Alzheimer's Disease , 1996, Brain and Language.

[32]  John Liu,et al.  sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings , 2015, ArXiv.

[33]  Peter Garrard,et al.  Semantic processing in connected speech at a uniformly early stage of autopsy-confirmed Alzheimer's disease. , 2013, Neuropsychology.

[34]  D R Wekstein,et al.  Linguistic ability in early life and cognitive function and Alzheimer's disease in late life. Findings from the Nun Study. , 1996, JAMA.

[35]  Samuel L. Smith,et al.  Offline bilingual word vectors, orthogonal transformations and the inverted softmax , 2017, ICLR.

[36]  A. Wimo,et al.  The global prevalence of dementia: A systematic review and metaanalysis , 2013, Alzheimer's & Dementia.

[37]  Hyunjoo Choi Performances in a Picture Description Task in Japanese Patients with Alzheimer's Disease and with Mild Cognitive Impairment , 2009 .

[38]  K. Blennow,et al.  The Gothenburg MCI study: Design and distribution of Alzheimer’s disease and subcortical vascular disease diagnoses from baseline to 6-year follow-up , 2016, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[39]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[40]  B. Miller,et al.  Machine learning approaches to diagnosis and laterality effects in semantic dementia discourse , 2014, Cortex.

[41]  Yogesan Kanagasingam,et al.  Innovative diagnostic tools for early detection of Alzheimer's disease , 2015, Alzheimer's & Dementia.

[42]  Brian Roark,et al.  Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[43]  Graeme Hirst,et al.  Longitudinal detection of dementia through lexical and syntactic changes in writing: a case study of three British novelists , 2011, Lit. Linguistic Comput..

[44]  Marianne Lind,et al.  Semi-spontaneous oral text production: Measurements in clinical practice , 2009, Clinical linguistics & phonetics.

[45]  Serguei V. S. Pakhomov,et al.  Computerized Analysis of Speech and Language to Identify Psycholinguistic Correlates of Frontotemporal Lobar Degeneration , 2010, Cognitive and behavioral neurology : official journal of the Society for Behavioral and Cognitive Neurology.

[46]  Cecilia Ovesdotter Alm Language as Sensor in Human-Centered Computing: Clinical Contexts as Use Cases , 2016, Lang. Linguistics Compass.

[47]  G. Kavé,et al.  Word retrieval in picture descriptions produced by individuals with Alzheimer’s disease , 2016, Journal of clinical and experimental neuropsychology.

[48]  Philip D. Harvey,et al.  Practice effects due to serial cognitive assessment: Implications for preclinical Alzheimer's disease randomized controlled trials , 2015, Alzheimer's & dementia.

[49]  Adrian Basarab,et al.  On the early diagnosis of Alzheimer's Disease from multimodal signals: A survey , 2016, Artif. Intell. Medicine.

[50]  D. Selkoe Alzheimer's disease. , 2011, Cold Spring Harbor perspectives in biology.

[51]  Giuseppe Carenini,et al.  Detecting Dementia through Retrospective Analysis of Routine Blog Posts by Bloggers with Dementia , 2017, BioNLP.

[52]  Brian Roark,et al.  Graph-Based Word Alignment for Clinical Language Evaluation , 2015, CL.

[53]  T. Bschor,et al.  Spontaneous Speech of Patients With Dementia of the Alzheimer Type and Mild Cognitive Impairment , 2001, International Psychogeriatrics.

[54]  Sylvester Olubolu Orimaye,et al.  Learning Predictive Linguistic Features for Alzheimer’s Disease and related Dementias using Verbal Utterances , 2014, CLPsych@ACL.

[55]  B. Reisberg,et al.  Current evidence for subjective cognitive impairment (SCI) as the pre-mild cognitive impairment (MCI) stage of subsequently manifest Alzheimer's disease , 2008, International Psychogeriatrics.

[56]  Mark Johnson,et al.  Idea density for predicting Alzheimer’s disease from transcribed speech , 2017, CoNLL.

[57]  Eva Björkner,et al.  Data Collection from Persons with Mild Forms of Cognitive Impairment and Healthy Controls - Infrastructure for Classification and Prediction of Dementia , 2017, NODALIDA.

[58]  F. Cuetos,et al.  Linguistic changes in verbal expression: A preclinical marker of Alzheimer's disease , 2007, Journal of the International Neuropsychological Society.

[59]  P. Wolf,et al.  Lexical retrieval in discourse: An early indicator of Alzheimer’s dementia , 2013, Clinical linguistics & phonetics.

[60]  R. Petersen,et al.  Mild cognitive impairment , 2006, The Lancet.

[61]  S. Folstein,et al.  "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. , 1975, Journal of psychiatric research.