Automatic evaluation of parkinson's speech - acoustic, prosodic and voice related cues

Articulation and phonation is affected in 70 % to 90 % of patients with Parkinson’s disease (PD). This study focuses on the question whether speech carries information about 1. PD being present at a speaker or not, and 2. estimating the severity of PD (if present). We first perform classification experiments focusing on the automatic detection of PD as a 2-class problem (PD vs. healthy speakers). The detection of severity is described as a 3-class task based on the Unified Parkinson’s Disease Rating Scale (UPDRS) ratings. We employ acoustic, prosodic and glottal features on different kinds of speech tests: various syllable repetition tasks, read sentences and texts, and monologues. Classification is performed in either case by SVMs. We report recognition results of 81.9 % when trying to differentiate between normally speaking persons and speakers with PD. With system fusion we achieved a recognition results of 59.1 % on the task of UPDRS classification. Index Terms: Parkinson’s Disease, pathologic speech, speech analysis

[1]  Elmar Nöth,et al.  Detection of persons with Parkinson's disease by acoustic, vocal, and prosodic analysis , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[2]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[3]  J. Logemann,et al.  Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of Parkinson patients. , 1978, The Journal of speech and hearing disorders.

[4]  Jessica E Huber,et al.  The intonation-syntax interface in the speech of individuals with Parkinson's disease. , 2011, Journal of speech, language, and hearing research : JSLHR.

[5]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[6]  Neha Singh,et al.  Advances in the treatment of Parkinson's disease , 2007, Progress in Neurobiology.

[7]  Viktor Zeißler Robuste Erkennung der prosodischen Phänomene und der emotionalen Benutzerzustände in einem multimodalen Dialogsystem , 2011 .

[8]  Houeto Jean-Luc [Parkinson's disease]. , 2022, La Revue du praticien.

[9]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[10]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[11]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[12]  E. Růžička,et al.  Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson's disease. , 2011, The Journal of the Acoustical Society of America.

[13]  J. Flanagan,et al.  Synthesis of voiced sounds from a two-mass model of the vocal cords , 1972 .

[14]  V. Fromkin,et al.  A characterization of the prosodic loss in Parkinson's disease , 1988, Brain and Language.

[15]  Elmar Nöth,et al.  Age and gender recognition based on multiple systems - early vs. late fusion , 2010, INTERSPEECH.

[16]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[17]  C. Coelho,et al.  Acoustic analysis of parkinsonian speech I: speech characteristics and L-Dopa therapy. , 2002, NeuroRehabilitation.