A "Verbal Thermometer" for Assessing Neurodegenerative Disease: Automated Measurement of Pronoun and Verb Ratio from Speech

Clinicians often use speech to characterize neurodegenerative disorders. Such characterizations require clinical judgment, which is subjective and can require extensive training. Quantitative Production Analysis (QPA) can be used to obtain objective quantifiable assessments of patient functioning. However, such human-based analyses of speech are costly and time consuming. Inexpensive off-the-shelf technologies such as speech recognition and part of speech taggers may avoid these problems. This study evaluates the ability of an automatic speech to text transcription system and a part of speech tagger to assist with measuring pronoun and verb ratios, measures based on QPA. Five participant groups provided spontaneous speech samples. One group consisted of healthy controls, while the remaining groups represented four subtypes of frontotemporal dementia. Findings indicated measurement of pronoun and verb ratio was robust despite errors introduced by automatic transcription and the tagger and despite these off-the-shelf products not having been trained on the language obtained from speech of the included population.

[1]  Lyndsey Nickels,et al.  Language in individuals with left hemisphere tumors: Is spontaneous speech analysis comparable to formal testing? , 2018, Journal of clinical and experimental neuropsychology.

[2]  R. Brookshire,et al.  Speech sample size and test-retest stability of connected speech measures for adults with aphasia. , 1994, Journal of speech and hearing research.

[3]  A. Kertesz The Western Aphasia Battery , 1982 .

[4]  Chien-Ju Hsu,et al.  Manual Versus Automated Narrative Analysis of Agrammatic Production Patterns: The Northwestern Narrative Language Analysis and Computerized Language Analysis. , 2018, Journal of speech, language, and hearing research : JSLHR.

[5]  K. McGraw,et al.  Forming inferences about some intraclass correlation coefficients. , 1996 .

[6]  Colleen Richey,et al.  Aided diagnosis of dementia type through computer-based analysis of spontaneous speech , 2014, CLPsych@ACL.

[7]  B. Miller,et al.  Machine learning approaches to diagnosis and laterality effects in semantic dementia discourse , 2014, Cortex.

[8]  S. Cappa,et al.  Connected Speech in Neurodegenerative Language Disorders: A Review , 2017, Front. Psychol..

[9]  Kathleen C. Fraser,et al.  Automated classification of primary progressive aphasia subtypes from narrative speech transcripts , 2014, Cortex.

[10]  Esref Adali,et al.  Disambiguating Main POS tags for Turkish , 2012, ROCLING.

[11]  P. Pressman,et al.  Diagnosis and Management of Behavioral Variant Frontotemporal Dementia , 2014, Biological Psychiatry.

[12]  Frank Rudzicz,et al.  Automatic speech recognition in the diagnosis of primary progressive aphasia , 2013, SLPAT.

[13]  Terry K Koo,et al.  A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. , 2016, Journal Chiropractic Medicine.

[14]  T. Hothorn,et al.  Simultaneous Inference in General Parametric Models , 2008, Biometrical journal. Biometrische Zeitschrift.

[15]  T. Santarius,et al.  Survey on current cognitive practices within the European Low-Grade Glioma Network: towards a European assessment protocol , 2017, Acta Neurochirurgica.

[16]  Curry I. Guinn,et al.  Language Analysis of Speakers with Dementia of the Alzheimer's Type , 2012, AAAI Fall Symposium: Artificial Intelligence for Gerontechnology.

[17]  Pawan Goyal,et al.  Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit , 2018, EMNLP.

[18]  Maria Luisa Gorno-Tempini,et al.  Connected speech production in three variants of primary progressive aphasia. , 2010, Brain : a journal of neurology.

[19]  Christopher D. Manning Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics? , 2011, CICLing.

[20]  Myrna F. Schwartz,et al.  The quantitative analysis of agrammatic production: Procedure and data , 1989, Brain and Language.

[21]  J. Tukey Comparing individual means in the analysis of variance. , 1949, Biometrics.

[22]  D. Cicchetti Guidelines, Criteria, and Rules of Thumb for Evaluating Normed and Standardized Assessment Instruments in Psychology. , 1994 .

[23]  Lyn S. Turkstra,et al.  Western Aphasia Battery , 2018 .

[24]  Kathleen C. Fraser,et al.  Sentence segmentation of aphasic speech , 2015, HLT-NAACL.

[25]  Anil Kumar,et al.  A review on Alzheimer’s disease pathophysiology and its management: an update , 2015, Pharmacological reports : PR.

[26]  R. Bastiaanse,et al.  Analysing the spontaneous speech of aphasic speakers , 2004 .

[27]  Peter Garrard,et al.  Features and machine learning classification of connected speech samples from patients with autopsy proven Alzheimer's disease with and without additional vascular pathology. , 2014, Journal of Alzheimer's disease : JAD.

[28]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[29]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.