Isolating effects of age with fair representation learning when assessing dementia

One of the most prevalent symptoms among the elderly population, dementia, can be detected by classifiers trained on linguistic features extracted from narrative transcripts. However, these linguistic features are impacted in a similar but different fashion by the normal aging process. Aging is therefore a confounding factor, whose effects have been hard for machine learning classifiers to isolate. In this paper, we show that deep neural network (DNN) classifiers can infer ages from linguistic features, which is an entanglement that could lead to unfairness across age groups. We show this problem is caused by undesired activations of v-structures in causality diagrams, and it could be addressed with fair representation learning. We build neural network classifiers that learn low-dimensional representations reflecting the impacts of dementia yet discarding the effects of age. To evaluate these classifiers, we specify a model-agnostic score $\Delta_{eo}^{(N)}$ measuring how classifier results are disentangled from age. Our best models outperform baseline neural network classifiers in disentanglement, while compromising accuracy by as little as 2.56\% and 2.25\% on DementiaBank and the Famous People dataset respectively.

[1]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[2]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[3]  Victor H. Yngve,et al.  A model and an hypothesis for language structure , 1960 .

[4]  Roger B. Grosse,et al.  Isolating Sources of Disentanglement in Variational Autoencoders , 2018, NeurIPS.

[5]  Jekaterina Novikova,et al.  Semi-supervised classification by reaching consensus among modalities , 2018, ArXiv.

[6]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[7]  Arvind Narayanan,et al.  Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[8]  R. Marioni,et al.  Age-associated cognitive decline. , 2009, British medical bulletin.

[9]  ChoiChong-Ho,et al.  Input Feature Selection by Mutual Information Based on Parzen Window , 2002 .

[10]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[11]  C. Harada,et al.  Normal cognitive aging. , 2013, Clinics in geriatric medicine.

[12]  Frank Rudzicz,et al.  Using text and acoustic features to diagnose progressive aphasia and its subtypes , 2013, INTERSPEECH.

[13]  Amos J. Storkey,et al.  Censoring Representations with an Adversary , 2015, ICLR.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Sylvie Ratté,et al.  Computer-based evaluation of Alzheimer’s disease and mild cognitive impairment patients during a picture description task , 2018, Alzheimer's & dementia.

[16]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[17]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[18]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[19]  S. Folstein,et al.  "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. , 1975, Journal of psychiatric research.

[20]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Curry I. Guinn,et al.  Language Analysis of Speakers with Dementia of the Alzheimer's Type , 2012, AAAI Fall Symposium: Artificial Intelligence for Gerontechnology.

[23]  Jost Tobias Springenberg,et al.  Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks , 2015, ICLR.

[24]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[25]  Sameer Singh,et al.  Evaluation of an objective technique for analysing temporal variables in DAT spontaneous speech , 2001 .

[26]  Brian Roark,et al.  Syntactic complexity measures for detecting Mild Cognitive Impairment , 2007, BioNLP@ACL.

[27]  Toniann Pitassi,et al.  Learning Adversarially Fair and Transferable Representations , 2018, ICML.

[28]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[29]  Brian Roark,et al.  Fully Automated Neuropsychological Assessment for Detecting Mild Cognitive Impairment , 2012, INTERSPEECH.

[30]  Peter A. Heeman,et al.  Using linguistic indicators of difficulty to identify mild cognitive impairment , 2015, INTERSPEECH.

[31]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[32]  D. Murman The Impact of Age on Cognition , 2015, Seminars in Hearing.

[33]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[34]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[35]  Toniann Pitassi,et al.  Learning Fair Representations , 2013, ICML.

[36]  Jekaterina Novikova,et al.  Detecting cognitive impairments by agreeing on interpretations of linguistic features , 2018, NAACL.