Detecting Dementia through Retrospective Analysis of Routine Blog Posts by Bloggers with Dementia

We investigate if writers with dementia can be automatically distinguished from those without by analyzing linguistic markers in written text, in the form of blog posts. We have built a corpus of several thousand blog posts, some by people with dementia and others by people with loved ones with dementia. We use this dataset to train and test several machine learning methods, and achieve prediction performance at a level far above the base-

[1]  M. Prince,et al.  World Alzheimer Report 2015 - The Global Impact of Dementia: An analysis of prevalence, incidence, cost and trends , 2015 .

[2]  Romola S. Bucks,et al.  Analysis of spontaneous, conversational speech in dementia of Alzheimer type: Evaluation of an objective technique for analysing lexical performance , 2000 .

[3]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[4]  Giuseppe Carenini,et al.  Domain Adaptation for Detecting Mild Cognitive Impairment , 2017, Canadian Conference on AI.

[5]  Blanka Klimova,et al.  Speech and language impairments in dementia , 2016 .

[6]  Sylvester Olubolu Orimaye,et al.  Learning Predictive Linguistic Features for Alzheimer’s Disease and related Dementias using Verbal Utterances , 2014, CLPsych@ACL.

[7]  Marc Brysbaert,et al.  Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English , 2009, Behavior research methods.

[8]  Danielle S. McNamara,et al.  Psycholinguistic word information in second language oral discourse , 2011 .

[9]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[10]  Xiaofei Lu,et al.  Automatic analysis of syntactic complexity in second language writing , 2010 .

[11]  David A. Snowdon,et al.  Early life linguistic ability, late life cognitive function, and neuropathology: findings from the Nun Study , 2005, Neurobiology of Aging.

[12]  Michael A. Covington,et al.  Cutting the Gordian Knot: The Moving-Average Type–Token Ratio (MATTR) , 2010, J. Quant. Linguistics.

[13]  Brian Roark,et al.  Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Elissa D. Asp,et al.  When Language Breaks Down: Analysing Discourse in Clinical Contexts , 2010 .

[15]  Graeme Hirst,et al.  Longitudinal detection of dementia through lexical and syntactic changes in writing: a case study of three British novelists , 2011, Lit. Linguistic Comput..

[16]  T. Mitzner,et al.  Language decline across the life span: findings from the Nun Study. , 2001, Psychology and aging.

[17]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[18]  Colleen Richey,et al.  Aided diagnosis of dementia type through computer-based analysis of spontaneous speech , 2014, CLPsych@ACL.

[19]  Peter Garrard,et al.  Features and machine learning classification of connected speech samples from patients with autopsy proven Alzheimer's disease with and without additional vascular pathology. , 2014, Journal of Alzheimer's disease : JAD.

[20]  M. Brysbaert,et al.  Age-of-acquisition ratings for 30,000 English words , 2012, Behavior research methods.

[21]  J. Hodges,et al.  Performance on the Boston Cookie theft picture description task in patients with early dementia of the Alzheimer's type: Missing information , 1996 .

[22]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[23]  Graeme Hirst,et al.  Comparison of different feature sets for identification of variants in progressive aphasia , 2014, CLPsych@ACL.