Quantifying the Uncertainty of Parameters Measured in Spontaneous Speech of Speakers With Dementia.

Purpose Corpus analyses of spontaneous language fragments of varying length provide useful insights in the language change caused by brain damage, such as caused by some forms of dementia. Sample size is an important experimental parameter to consider when designing spontaneous language analyses studies. Sample length influences the confidence levels of analyses. Machine learning approaches often favor to use as much language as available, whereas language evaluation in a clinical setting is often based on truncated samples to minimize annotation labor and to limit any discomfort for participants. This article investigates, using Bayesian estimation of machine learned models, what the ideal text length should be to minimize model uncertainty. Method We use the Stanford parser to extract linguistic variables and train a statistic model to distinguish samples by speakers with no brain damage from samples by speakers with probable Alzheimer's disease. We compare the results to previously published models that used CLAN for linguistic analysis. Results The uncertainty around six individual variables and its relation to sample length are reported. The same model with linguistic variables that is used in all three experiments can predict group membership better than a model without them. One variable (concept density) is more informative when measured using the Stanford tools than when measured using CLAN. Conclusion For our corpus of German speech, the optimal sample length is found to be around 700 words long. Longer samples do not provide more information.

[1]  Stefan Th. Gries,et al.  What is Corpus Linguistics? , 2009, Lang. Linguistics Compass.

[2]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[3]  David Malvern,et al.  Lexical Diversity and Language Development , 2004 .

[4]  Erhard W. Hinrichs,et al.  Is it Really that Difficult to Parse German? , 2006, EMNLP.

[5]  Tetsuya Takiguchi,et al.  Detecting Abnormal Word Utterances in Children With Autism Spectrum Disorders , 2017, Perceptual and motor skills.

[6]  S. Folstein,et al.  "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. , 1975, Journal of psychiatric research.

[7]  Veronika Vincze,et al.  Speaking in Alzheimer’s Disease, is That an Early Sign? Importance of Changes in Language Abilities in Alzheimer’s Disease , 2015, Front. Aging Neurosci..

[8]  Dimitra Vergyri,et al.  Learning diagnostic models using speech and language measures , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  Carlo Caltagirone,et al.  The language of schizophrenia: An analysis of micro and macrolinguistic abilities and their neuropsychological correlates , 2008, Schizophrenia Research.

[10]  Bernadette Ska,et al.  Production of narratives: Picture sequence facilitates organizational but not conceptual processing in less educated subjects , 2001, Brain and Cognition.

[11]  V. Leirer,et al.  Development and validation of a geriatric depression screening scale: a preliminary report. , 1982, Journal of psychiatric research.

[12]  Kathleen C. Fraser,et al.  Automated classification of primary progressive aphasia subtypes from narrative speech transcripts , 2014, Cortex.

[13]  Linda B Smith,et al.  Quantity and Diversity: Simulating Early Word Learning Environments. , 2018, Cognitive science.

[14]  Roelien Bastiaanse,et al.  Spontaneous speech in aphasia: A correlational study , 1989, Brain and Language.

[15]  S. Cappa,et al.  Connected Speech in Neurodegenerative Language Disorders: A Review , 2017, Front. Psychol..

[16]  Michael A Covington,et al.  Automatic measurement of propositional idea density from part-of-speech tagging , 2008, Behavior research methods.

[17]  J. Becker,et al.  The natural history of Alzheimer's disease. Description of study cohort and accuracy of diagnosis. , 1994, Archives of neurology.

[18]  Sylvester Olubolu Orimaye,et al.  Learning Predictive Linguistic Features for Alzheimer’s Disease and related Dementias using Verbal Utterances , 2014, CLPsych@ACL.

[19]  Sumio Watanabe,et al.  Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory , 2010, J. Mach. Learn. Res..

[20]  Juliana Onofre DE Lira,et al.  Microlinguistic aspects of the oral narrative in patients with Alzheimer's disease , 2010, International Psychogeriatrics.

[21]  Stephen M. Wilson,et al.  A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function , 2018, PloS one.

[22]  R. Bastiaanse,et al.  Analysing the spontaneous speech of aphasic speakers , 2004 .

[23]  W. Huber,et al.  Computer-assisted analysis of spontaneous speech: quantification of basic parameters in aphasic and unimpaired language , 2012, Clinical linguistics & phonetics.

[24]  H. Akaike A new look at the statistical model identification , 1974 .

[25]  Brian Roark,et al.  Spoken Language Derived Measures for Detecting Mild Cognitive Impairment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Serguei V. S. Pakhomov,et al.  A computerized technique to assess language use patterns in patients with frontotemporal dementia , 2010, Journal of Neurolinguistics.

[27]  Wolfram Hinzen,et al.  A systematic linguistic profile of spontaneous narrative speech in pre-symptomatic and early stage Huntington's disease , 2017, Cortex.

[28]  Vitor C. Zimmerer,et al.  The language profile of formal thought disorder , 2018, npj Schizophrenia.

[29]  Christopher D. Manning,et al.  Parsing Three German Treebanks: Lexicalized and Unlexicalized Baselines , 2008 .

[30]  Vitor C. Zimmerer,et al.  Formulaic Language in People with Probable Alzheimer's Disease: A Frequency-Based Approach. , 2016, Journal of Alzheimer's disease : JAD.

[31]  Martina Piefke,et al.  Basic parameters of spontaneous speech as a sensitive method for measuring change during the course of aphasia. , 2008, International journal of language & communication disorders.