Low-frequency components analysis in running speech for the automatic detection of parkinson's disease

This paper explores the analysis of low-frequency components of continuous speech signals from people with Parkinson’s disease, in order to detect changes in the spectrum that could be associated to the presence of tremor in the speech. Different time-frequency (TF) techniques are used for the characterization of the low frequency content of the speech signals, by paying special attention on the ability to work in non-stationary frameworks, due to the need for the analysis of long enough time segments, where the assumptions of stationary can not be met. The set of variables extracted from the TF representations includes centroids and the energy content of different frequency bands, along with entropy measures and nonlinear energy operators, which are used as features for the automatic detection of people with Parkinson’s disease vs healthy controls. The discrimination capability of the estimated features is evaluated using three different classification strategies: GMM, GMM-UBM, and SVM. Furthermore, the information provided by different TF techniques is combined using a second classification stage. The results show that the changes in the low frequency components are able to discriminate between people with Parkinson’s and healthy speakers with an accuracy of 77%, using one single sentence.

[1]  Mohammad Pooyan,et al.  An optimum algorithm in pathological voice quality assessment using wavelet-packet-based features, linear discriminant analysis and support vector machine , 2012, Biomed. Signal Process. Control..

[2]  M. Dougherty,et al.  Classification of speech intelligibility in Parkinson's disease , 2014 .

[3]  Pedro Gómez Vilda,et al.  Methodological issues in the development of automatic systems for voice pathology detection , 2006, Biomed. Signal Process. Control..

[4]  Miguel A. Ferrer,et al.  Using Nonlinear Features for Voice Disorder Detection , 2005 .

[5]  L. Ramig,et al.  The Parkinson larynx: tremor and videostroboscopic findings. , 1996, Journal of voice : official journal of the Voice Foundation.

[6]  Christopher G. Goetz Chairperson,et al.  Movement Disorder Society Task Force report on the Hoehn and Yahr staging scale: Status and recommendations The Movement Disorder Society Task Force on rating scales for Parkinson's disease , 2004 .

[7]  J. Sundberg,et al.  Perceptual and acoustic correlates of abnormal voice qualities. , 1980, Acta oto-laryngologica.

[8]  Boualem Boashash,et al.  An efficient real-time implementation of the Wigner-Ville distribution , 1987, IEEE Trans. Acoust. Speech Signal Process..

[9]  Jagadish Nayak,et al.  Identification of voice disorders using speech samples , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[10]  Yannis Stylianou,et al.  On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices , 2011, Logopedics, phoniatrics, vocology.

[11]  Aurobinda Routray,et al.  Vocal emotion recognition in five native languages of Assam using new wavelet features , 2009, Int. J. Speech Technol..

[12]  Kuldip K. Paliwal,et al.  Robust feature extraction using subband spectral centroid histograms , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[13]  Karthikeyan Umapathy,et al.  Discrimination of pathological voices using a time-frequency approach , 2005, IEEE Transactions on Biomedical Engineering.

[14]  Maria Markaki,et al.  Using modulation spectra for voice pathology detection and classification , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[15]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007, Biomedical engineering online.

[16]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease , 2014, LREC.

[17]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[18]  Evelyn Abberton,et al.  Hearing and phonetic criteria in voice measurement: Clinical applications , 2008, Logopedics, phoniatrics, vocology.

[19]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments from text-dependent running speech , 2009, Biomed. Signal Process. Control..

[20]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[21]  E. Růžička,et al.  Objectification of dysarthria in Parkinson's disease using Bayes theorem , 2011 .

[22]  Roman Cmejla,et al.  Acoustic analysis of voice and speech characteristics in early untreated Parkinson's disease , 2011, MAVEBA.

[23]  Gerald Matz,et al.  Wigner distributions (nearly) everywhere: time-frequency analysis of signals, systems, random processes, signal spaces, and frames , 2003, Signal Process..

[24]  Kuldip K. Paliwal Spectral subband centroids as features for speech recognition , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[25]  L. Sulica,et al.  Common Movement Disorders Affecting the Larynx: A Report from the Neurolaryngology Committee of the AAO-HNS , 2005, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[26]  P. Alm Stuttering and the basal ganglia circuits: a critical review of possible relations. , 2004, Journal of communication disorders.

[27]  G. Deuschl,et al.  The pathophysiology of parkinsonian tremor: a review , 2000, Journal of Neurology.