A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing

Abstract Literature evidences the existence of hypokinetic dysarthria in parkinsonian patients and, consequently, the objective characterization of the dysarthric signs associated to the articulatory aspect of speech can be used to detect Parkinson's Disease (PD) providing clinicians with new tools to support the clinical diagnosis. However, no work has analyzed in detail the importance of the different phonemes in the automatic detection of PD from the speech. This work proposes new approaches for this detection by using new classification schemes that allow to compare independently the different phonetic units of patients and controls employed during several speech tasks. Three different parkinsonian corpora were used allowing cross-validation and cross-corpora trials. The results of cross-validation trials (k-folds) provided accuracies between 81% and 94%, with AUC between 0.87 and 0.97 depending on the corpus, while cross-corpora trials yielded accuracies between 66% and 76% with AUC between 0.76 and 0.87. These results suggest that PD affects to the articulatory sequence as a whole, influencing more clearly phonetic units requiring a higher narrowing of the vocal tract. Additionally, text-dependent utterances are considered as the recommended speech task for the detection of PD in this type of schemes as these allow to compare more precisely the phonetic units of patients and controls. Lastly, this work discusses the existence of a glass ceiling in the accuracy of the systems for the automatic detection of PD using speech, concluding that this is below 95% for most of the cases.

[1]  Evžen Růžička,et al.  Automated analysis of connected speech reveals early biomarkers of Parkinson’s disease in patients with rapid eye movement sleep behaviour disorder , 2017, Scientific Reports.

[2]  Andrea Lee,et al.  Epidemiology of Parkinson Disease. , 2016, Neurologic clinics.

[3]  Gregory J. Snyder,et al.  Speech Rate Deficits in Individuals with Parkinson's Disease: A Review of the Literature , 2009 .

[4]  Pedro Gómez-Vilda,et al.  An integrated tool for the diagnosis of voice disorders , 2006 .

[5]  S. Furui,et al.  Cepstral analysis technique for automatic speaker verification , 1981 .

[6]  Juan Ignacio Godino-Llorente,et al.  Towards the identification of Idiopathic Parkinson’s Disease from the speech. New articulatory kinetic biomarkers , 2017, PloS one.

[7]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[8]  S. Skodda,et al.  Progression of Voice and Speech Impairment in the Course of Parkinson's Disease: A Longitudinal Study , 2013, Parkinson's disease.

[9]  S. Skodda,et al.  Impairment of Vowel Articulation as a Possible Marker of Disease Progression in Parkinson's Disease , 2012, PloS one.

[10]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease , 2014, LREC.

[11]  Jean-Pierre Martens,et al.  Automated Intelligibility Assessment of Pathological Speech Using Phonological Features , 2009, EURASIP J. Adv. Signal Process..

[12]  Anne Smith,et al.  Basic parameters of articulatory movements and acoustics in individuals with Parkinson's disease , 2012, Movement disorders : official journal of the Movement Disorder Society.

[13]  E. Růžička,et al.  Imprecise vowel articulation as a potential early marker of Parkinson's disease: effect of speaking task. , 2013, The Journal of the Acoustical Society of America.

[14]  I Litvan,et al.  Progression of dysarthria and dysphagia in postmortem-confirmed parkinsonian disorders. , 2001, Archives of neurology.

[15]  Yoav Ben-Shlomo,et al.  The accuracy of diagnosis of parkinsonian syndromes in a specialist movement disorder service. , 2002, Brain : a journal of neurology.

[16]  A. Aronson,et al.  Differential diagnostic patterns of dysarthria. , 1969, Journal of speech and hearing research.

[17]  Juan Ignacio Godino-Llorente,et al.  Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease , 2018, Appl. Soft Comput..

[18]  Kris Tjaden,et al.  Consonant Acoustics in Parkinson's Disease and Multiple Sclerosis: Comparison of Clear and Loud Speaking Conditions. , 2017, American journal of speech-language pathology.

[19]  Shimon Sapir,et al.  Articulatory changes in muscle tension dysphonia: evidence of vowel space expansion following manual circumlaryngeal therapy. , 2009, Journal of communication disorders.

[20]  Jennifer L. Spielman,et al.  Effects of intensive voice treatment (the Lee Silverman Voice Treatment [LSVT]) on vowel articulation in dysarthric individuals with idiopathic Parkinson disease: acoustic and perceptual findings. , 2007, Journal of speech, language, and hearing research : JSLHR.

[21]  Jesús Francisco Vargas-Bonilla,et al.  Towards an automatic monitoring of the neurological state of Parkinson's patients from speech , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  G. Zaccara,et al.  Markerless Analysis of Articulatory Movements in Patients With Parkinson's Disease. , 2016, Journal of voice : official journal of the Voice Foundation.

[23]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[24]  José B. Mariño,et al.  Albayzin speech database: design of the phonetic corpus , 1993, EUROSPEECH.

[25]  Jason A. Whitfield,et al.  Articulatory-acoustic vowel space: application to clear speech in individuals with Parkinson's disease. , 2014, Journal of communication disorders.

[26]  Pedro J. Moreno,et al.  A recursive algorithm for the forced alignment of very long audio segments , 1998, ICSLP.

[27]  Claudia Manfredi,et al.  Automatic identification of dysprosody in idiopathic Parkinson's disease , 2015, Biomed. Signal Process. Control..

[28]  G. Rizzo,et al.  Accuracy of clinical diagnosis of Parkinson disease , 2016, Neurology.

[29]  J. Hughes,et al.  Accuracy of clinical diagnosis of idiopathic Parkinson's disease: a clinico-pathological study of 100 cases. , 1992, Journal of neurology, neurosurgery, and psychiatry.

[30]  S. Robertson,et al.  Speech therapy in Parkinson's disease: a study of the efficacy ad long term effects of intensive treatment. , 1984, The British journal of disorders of communication.

[31]  J. Logemann,et al.  Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of Parkinson patients. , 1978, The Journal of speech and hearing disorders.

[32]  H. Kirshner,et al.  Swallowing and speech production in Parkinson's disease , 1986, Annals of neurology.

[33]  J. Logemann,et al.  Vocal Tract Control in Parkinson's Disease , 1981 .

[34]  Kris Tjaden,et al.  Vowel acoustics in Parkinson's disease and multiple sclerosis: comparison of clear, loud, and slow speaking conditions. , 2013, Journal of speech, language, and hearing research : JSLHR.

[35]  Yun Lei,et al.  A novel scheme for speaker recognition using a phonetically-aware deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  H. Ackermann,et al.  Articulatory deficits in parkinsonian dysarthria: an acoustic analysis. , 1991, Journal of neurology, neurosurgery, and psychiatry.

[37]  Giovanni Defazio,et al.  Assessment of voice and speech symptoms in early Parkinson’s disease by the Robertson dysarthria profile , 2016, Neurological Sciences.

[38]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[39]  Jirí Mekyska,et al.  Assessing progress of Parkinson's disease using acoustic analysis of phonation , 2015, 2015 4th International Work Conference on Bioinspired Intelligence (IWOBI).

[40]  Emily Q. Wang,et al.  Hemisphere-Specific Effects of Subthalamic Nucleus Deep Brain Stimulation on Speaking Rate and Articulatory Accuracy of Syllable Repetitions in Parkinson's Disease. , 2006, Journal of medical speech-language pathology.

[41]  Howard Poizner,et al.  Articulatory Consequences of Parkinson's Disease: Perspectives from Two Modalities , 1999, Brain and Cognition.

[42]  Evžen Růžička,et al.  Automatic Evaluation of Speech Rhythm Instability and Acceleration in Dysarthrias Associated with Basal Ganglia Dysfunction , 2015, Front. Bioeng. Biotechnol..

[43]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..