The Effect of Heterogeneous Data for Alzheimer's Disease Detection from Speech

Speech datasets for identifying Alzheimer's disease (AD) are generally restricted to participants performing a single task, e.g. describing an image shown to them. As a result, models trained on linguistic features derived from such datasets may not be generalizable across tasks. Building on prior work demonstrating that same-task data of healthy participants helps improve AD detection on a single-task dataset of pathological speech, we augment an AD-specific dataset consisting of subjects describing a picture with multi-task healthy data. We demonstrate that normative data from multiple speech-based tasks helps improve AD detection by up to 9%. Visualization of decision boundaries reveals that models trained on a combination of structured picture descriptions and unstructured conversational speech have the least out-of-task error and show the most potential to generalize to multiple tasks. We analyze the impact of age of the added samples and if they affect fairness in classification. We also provide explanations for a possible inductive bias effect across tasks using model-agnostic feature anchors. This work highlights the need for heterogeneous datasets for encoding changes in multiple facets of cognition and for developing a task-independent AD detection model.

[1]  M. Basso,et al.  Verbal Fluency: Language or Executive Function Measure? , 2016, Applied neuropsychology. Adult.

[2]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[3]  Frank Rudzicz,et al.  On the importance of normative data in speech-based assessment , 2017, ArXiv.

[4]  Jonathan Baxter,et al.  A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..

[5]  Hyeran Lee,et al.  Speech Dysfluencies in Normal and Pathological Aging: A Comparison between Alzheimer Patients and Healthy Elderly Subjects , 2011, ICPhS.

[6]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[7]  Laurie M. Ryan,et al.  Obstacles and opportunities in Alzheimer's clinical trial recruitment. , 2014, Health affairs.

[8]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[9]  M. Rahgozar,et al.  Persuasive Discourse Impairments in Traumatic Brain Injury , 2015, Archives of trauma research.

[10]  Sebastian Ruder,et al.  An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[11]  Carlos Guestrin,et al.  Anchors: High-Precision Model-Agnostic Explanations , 2018, AAAI.

[12]  M. Prince,et al.  World Alzheimer report 2016: improving healthcare for people living with dementia: coverage, quality and costs now and in the future , 2016 .

[13]  K. Forbes-McKay,et al.  Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task , 2005, Neurological Sciences.

[14]  Xiaofei Lu,et al.  Automatic analysis of syntactic complexity in second language writing , 2010 .

[15]  Mohit Bansal,et al.  Detecting Linguistic Characteristics of Alzheimer’s Dementia by Interpreting Neural Models , 2018, NAACL.

[16]  Rich Caruana,et al.  Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.

[17]  M. Mehl,et al.  Natural language indicators of differential gene regulation in the human immune system , 2017, Proceedings of the National Academy of Sciences.

[18]  Frank Rudzicz,et al.  Automatically identifying trouble-indicating speech behaviors in alzheimer's disease , 2014, ASSETS.

[19]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[20]  J. Peña-Casanova,et al.  Discourse coherence and its relation with cognition in Alzheimer's disease , 2013 .

[21]  Iryna Gurevych,et al.  Multi-Task Learning for Argumentation Mining in Low-Resource Settings , 2018, NAACL.

[22]  Jianzhong Wang,et al.  Locally Linear Embedding , 2021, Unsupervised Learning Approaches for Dimensionality Reduction and Data Visualization.

[23]  Amy Beth Warriner,et al.  Norms of valence, arousal, and dominance for 13,915 English lemmas , 2013, Behavior Research Methods.