Time-frequency approach in continuous speech for detection of Parkinson's disease

In this paper low-frequency analysis is addressed in order to explore components of continuous speech signals, trying to making evident the changes in the spectrum, which could be associated to the tremor in speech of people with Parkinson's disease. Four time-frequency (TF) techniques based on Wigner-Ville distribution (WVD) are used for the characterization of the low frequency content of the speech signals. The set of features includes centroids and the energy content of different frequency bands, due to the assumptions of non-stationary was taken into a account using enough time frameworks. The discrimination capability of the estimated features is evaluated using a support vector machine (SVM). The results show that the low frequency components are able to discriminate between pathological and healthy speakers with an accuracy of 72%.

[1]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[2]  Maria Markaki,et al.  Using modulation spectra for voice pathology detection and classification , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007, Biomedical engineering online.

[4]  Jesús Francisco Vargas-Bonilla,et al.  New Spanish speech corpus database for the analysis of people suffering from Parkinson's disease , 2014, LREC.

[5]  Kuldip K. Paliwal Spectral subband centroids as features for speech recognition , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[6]  Karthikeyan Umapathy,et al.  Discrimination of pathological voices using a time-frequency approach , 2005, IEEE Transactions on Biomedical Engineering.

[7]  L. Sulica,et al.  Common Movement Disorders Affecting the Larynx: A Report from the Neurolaryngology Committee of the AAO-HNS , 2005, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[8]  Boualem Boashash,et al.  An efficient real-time implementation of the Wigner-Ville distribution , 1987, IEEE Trans. Acoust. Speech Signal Process..

[9]  Roman Cmejla,et al.  Acoustic analysis of voice and speech characteristics in early untreated Parkinson's disease , 2011, MAVEBA.

[10]  William J. Williams,et al.  Improved time-frequency representation of multicomponent signals using exponential kernels , 1989, IEEE Trans. Acoust. Speech Signal Process..

[11]  Christopher G. Goetz Chairperson,et al.  Movement Disorder Society Task Force report on the Hoehn and Yahr staging scale: Status and recommendations The Movement Disorder Society Task Force on rating scales for Parkinson's disease , 2004 .

[12]  Abdul Rahim Abdullah,et al.  Power quality analysis using smooth-windowed wigner-ville distribution , 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).

[13]  Mengdao Xing,et al.  New ISAR imaging algorithm based on modified Wigner–Ville distribution , 2009 .

[14]  Patrick Flandrin,et al.  Wigner-Ville spectral analysis of nonstationary processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[15]  Kuldip K. Paliwal,et al.  Robust feature extraction using subband spectral centroid histograms , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[16]  Yen Mei Chee,et al.  Adaptive windowed cross Wigner-Ville distribution as an optimum phase estimator for PSK signals , 2013, Digit. Signal Process..

[17]  P. Alm Stuttering and the basal ganglia circuits: a critical review of possible relations. , 2004, Journal of communication disorders.

[18]  E. Růžička,et al.  Objectification of dysarthria in Parkinson's disease using Bayes theorem , 2011 .

[19]  G. Deuschl,et al.  The pathophysiology of parkinsonian tremor: a review , 2000, Journal of Neurology.

[20]  O. Hornykiewicz Biochemical aspects of Parkinson's disease , 1998, Neurology.

[21]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[22]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[23]  Evelyn Abberton,et al.  Hearing and phonetic criteria in voice measurement: Clinical applications , 2008, Logopedics, phoniatrics, vocology.

[24]  Joerg F. Hipp,et al.  Time-Frequency Analysis , 2014, Encyclopedia of Computational Neuroscience.

[25]  Jagadish Nayak,et al.  Identification of voice disorders using speech samples , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[26]  Gerald Matz,et al.  Wigner distributions (nearly) everywhere: time-frequency analysis of signals, systems, random processes, signal spaces, and frames , 2003, Signal Process..

[27]  M. Dougherty,et al.  Classification of speech intelligibility in Parkinson's disease , 2014 .

[28]  Mohammad Pooyan,et al.  An optimum algorithm in pathological voice quality assessment using wavelet-packet-based features, linear discriminant analysis and support vector machine , 2012, Biomed. Signal Process. Control..

[29]  F. Taylor,et al.  The wigner distribution in speech processing applications , 1984 .

[30]  Pedro Gómez Vilda,et al.  Automatic detection of voice impairments from text-dependent running speech , 2009, Biomed. Signal Process. Control..

[31]  Kun Cai,et al.  Semi-Blind Fetal Electrocardiogram Extraction by Eliminating the Cross-Terms of the Wigner-Ville Representations , 2011, 2011 5th International Conference on Bioinformatics and Biomedical Engineering.