Spectral and cepstral analyses for Parkinson's disease detection in Spanish vowels and words

About 1% of people older than 65years suffer from Parkinson's disease PD and 90% of them develop several speech impairments, affecting phonation, articulation, prosody and fluency. Computer-aided tools for the automatic evaluation of speech can provide useful information to the medical experts to perform a more accurate and objective diagnosis and monitoring of PD patients and can help also to evaluate the correctness and progress of their therapy. Although there are several studies that consider spectral and cepstral information to perform automatic classification of speech of people with PD, so far it is not known which is the most discriminative, spectral or cepstral analysis. In this paper, the discriminant capability of six sets of spectral and cepstral coefficients is evaluated, considering speech recordings of the five Spanish vowels and a total of 24 isolated words. According to the results, linear predictive cepstral coefficients are the most robust and exhibit values of the area under the receiver operating characteristic curve above 0.85 in 6 of the 24 words.

[1]  S. Skodda,et al.  Vowel articulation in Parkinson's disease. , 2011, Journal of voice : official journal of the Voice Foundation.

[2]  Elmar Nöth,et al.  PEAKS - A system for the automatic evaluation of voice and speech disorders , 2009, Speech Commun..

[3]  M. Hoehn,et al.  Parkinsonism , 1967, Neurology.

[4]  A. Goberman,et al.  Acoustic analysis of clear versus conversational speech in individuals with Parkinson disease. , 2005, Journal of communication disorders.

[5]  O. Hornykiewicz Biochemical aspects of Parkinson's disease , 1998, Neurology.

[6]  Germán Castellanos-Domínguez,et al.  An improved method for voice pathology detection by means of a HMM-based feature space transformation , 2010, Pattern Recognit..

[7]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[8]  Yoshiyuki Horii,et al.  Pause and utterance durations and fundamental frequency characteristics of repeated oral readings by stutterers and nonstutterers , 1987 .

[9]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[10]  M. Lindstrom,et al.  Articulatory movements during vowels in speakers with dysarthria and healthy controls. , 2008, Journal of speech, language, and hearing research : JSLHR.

[11]  Jesús Francisco Vargas-Bonilla,et al.  Analysis of Speech from People with Parkinson's Disease through Nonlinear Dynamics , 2013, NOLISP.

[12]  Raymond D. Kent,et al.  Toward an acoustic typology of motor speech disorders , 2003, Clinical linguistics & phonetics.

[13]  Bridget Walsh,et al.  Linguistic complexity, speech production, and comprehension in Parkinson's disease: behavioral and physiological indices. , 2011, Journal of speech, language, and hearing research : JSLHR.

[14]  Robert M. Gray,et al.  Speech coding based upon vector quantization , 1980, ICASSP.

[15]  Dimitar D Deliyski,et al.  Influence of sampling rate on accuracy and reliability of acoustic voice analysis , 2005, Logopedics, phoniatrics, vocology.

[16]  Max A. Little,et al.  Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease , 2008, IEEE Transactions on Biomedical Engineering.

[17]  Raymond D. Kent,et al.  Acoustic studies of dysarthric speech: methods, progress, and potential. , 1999, Journal of communication disorders.

[18]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[19]  L. Ramig,et al.  Speech treatment for Parkinson’s disease , 2008, Expert review of neurotherapeutics.

[20]  L. Ramig,et al.  The Parkinson larynx: tremor and videostroboscopic findings. , 1996, Journal of voice : official journal of the Voice Foundation.

[21]  Christian Hacker,et al.  Revising Perceptual Linear Prediction (PLP) , 2005, INTERSPEECH.

[22]  L. Lisker,et al.  A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements , 1964 .

[23]  Hwang Soo Lee,et al.  On approximating line spectral frequencies to LPC cepstral coefficients , 2000, IEEE Trans. Speech Audio Process..

[24]  E. Růžička,et al.  Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson's disease. , 2011, The Journal of the Acoustical Society of America.

[25]  Elmar Nöth,et al.  Automatic evaluation of parkinson's speech - acoustic, prosodic and voice related cues , 2013, INTERSPEECH.

[26]  G. Stebbins,et al.  Factor structure of the unified Parkinson's disease rating scale: Motor examination section , 1998, Movement disorders : official journal of the Movement Disorder Society.

[27]  Raymond D. Kent,et al.  Acoustic and Intelligibility Characteristics of Sentence Production in Neurogenic Speech Disorders , 2000, Folia Phoniatrica et Logopaedica.

[28]  A. Hofman,et al.  Prevalence of Parkinson's disease in Europe: A collaborative study of population-based cohorts. Neurologic Diseases in the Elderly Research Group. , 2000, Neurology.

[29]  R. Iansek,et al.  Speech impairment in a large sample of patients with Parkinson's disease. , 1998, Behavioural neurology.

[30]  A. Lang,et al.  Parkinson's disease. First of two parts. , 1998, The New England journal of medicine.

[31]  B. Gerratt,et al.  Cinegraphic observations of laryngeal function in parkinson's disease , 1984, The Laryngoscope.

[32]  Jennifer L. Spielman,et al.  Formant centralization ratio: a proposal for a new acoustic measure of dysarthric speech. , 2010, Journal of speech, language, and hearing research : JSLHR.

[33]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease , 2014, LREC.

[34]  Jean‐Pierre A. Radley,et al.  Acoustic Properties of Stop Consonants , 1957 .

[35]  S. Skodda Aspects of speech rate and regularity in Parkinson's disease , 2011, Journal of the Neurological Sciences.

[36]  Jesús Francisco Vargas-Bonilla,et al.  Nonlinear Dynamics for Hypernasality Detection in Spanish Vowels and Words , 2012, Cognitive Computation.

[37]  Germán Castellanos-Domínguez,et al.  Automatic Detection of Pathological Voices Using Complexity Measures, Noise Parameters, and Mel-Cepstral Coefficients , 2011, IEEE Transactions on Biomedical Engineering.

[38]  W Poewe,et al.  Repetitive speech phenomena in Parkinson's disease , 2000, Journal of neurology, neurosurgery, and psychiatry.

[39]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[40]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..