Multitaper perceptual linear prediction features of voice samples to discriminate healthy persons from early stage Parkinson diseased persons

The performance of multitaper perceptual linear prediction (PLP) features of speech samples to discriminate healthy and early stage Parkinson diseased subjects is investigated in this paper. The PLP features are conventionally obtained by computing the power spectrum using a single tapered Hamming window. This estimated spectrum exhibits large variance which can be reduced by computing the weighted average of power spectra obtained using a set of tapered windows, leading to multitaper spectral estimation. In this investigation, two multitaper techniques namely Sine wave taper and Thomson multitaper along with the conventional single taper windowing are investigated. Artificial Neural network is then used to classify the PLP features extracted by applying the three types of window tapers on the speech signals of healthy and early stage Parkinson affected people and their respective performances are compared. The results show more accuracy using the multitaper techniques when compared with the conventional single taper technique. It is seen that the accuracy obtained using Sine wave tapers as well as Thomson multitaper is maximum for five tapers. An improvement in the recognition accuracy by 7.5% using the Sine tapers and by 6.9% using the Thomson tapers is obtained when compared with the conventional method. An improvement in other performance measures like Equal error rate, False positive rate, False negative rate, Sensitivity and Specificity is also observed in the multitaper techniques.

[1]  Danial Taheri Far,et al.  Speech Analysis for Diagnosis of Parkinson’s Disease Using Genetic Algorithm and Support Vector Machine , 2014 .

[2]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[3]  D. Slepian,et al.  Prolate spheroidal wave functions, fourier analysis and uncertainty — II , 1961 .

[4]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[5]  Kurt S. Riedel,et al.  Minimum bias multiple taper spectral estimation , 2018, IEEE Trans. Signal Process..

[6]  Douglas D. O'Shaughnessy,et al.  Multiple windowed spectral features for emotion recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  O. Hornykiewicz Biochemical aspects of Parkinson's disease , 1998, Neurology.

[8]  Ahmed Hammouch,et al.  Discriminating Between Patients With Parkinson’s and Neurological Diseases Using Cepstral Analysis , 2016, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Tomi Kinnunen,et al.  Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification , 2010, IEEE Signal Processing Letters.

[10]  Mireia Díez,et al.  On the complementarity of short-time fourier analysis windows of different lengths for improved language recognition , 2014, INTERSPEECH.

[11]  Jesús Francisco Vargas-Bonilla,et al.  Perceptual Analysis of Speech Signals from People with Parkinson's Disease , 2013, IWINAC.

[12]  Haizhou Li,et al.  Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[14]  Maria Hansson,et al.  Optimal cepstrum estimation using multiple windows , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  D. Thomson,et al.  Spectrum estimation and harmonic analysis , 1982, Proceedings of the IEEE.

[16]  A. Benba,et al.  Voiceprints analysis using MFCC and SVM for detecting patients with Parkinson's disease , 2015, 2015 International Conference on Electrical and Information Technologies (ICEIT).

[17]  Douglas D. O'Shaughnessy,et al.  Multitaper MFCC and PLP features for speaker verification using i-vectors , 2013, Speech Commun..

[18]  E. Růžička,et al.  Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson's disease. , 2011, The Journal of the Acoustical Society of America.

[19]  Max A. Little,et al.  Novel Speech Signal Processing Algorithms for High-Accuracy Classification of Parkinson's Disease , 2012, IEEE Transactions on Biomedical Engineering.

[20]  G. W. Snedecor Statistical Methods , 1964 .

[21]  Maria Hansson,et al.  A multiple window method for estimation of peaked spectra , 1997, IEEE Trans. Signal Process..

[22]  Tomi Kinnunen,et al.  What else is new than the hamming window? robust MFCCs for speaker recognition via multitapering , 2010, INTERSPEECH.