论文信息 - Vocal Folds Disorder Detection using Pattern Recognition Methods

Vocal Folds Disorder Detection using Pattern Recognition Methods

Diagnosis of pathological voice is one of the most important issues in biomedical applications of speech technology. This study focuses on the classification of pathological voice using the HMM (hidden Markov model), the GMM (Gaussian mixture model) and a SVM (support vector machine), and then compares the results to work done previously using an ANN (artificial neural network). Speech data were collected from those without and those with vocal disorders. Normal and pathological speech data were mixed in out experiment. Six characteristic parameters (jitter, shimmer, NHR, SPI, APQ and RAP) were chosen. Then the pattern recognition methods (HMM, GMM and SVM) were used to distinguish the mixed data into categories of normal and pathological speech. We found that the GMM-based method can give us superior classification rates compared to the other classification methods.

Jianglin Wang | Cheolwoo Jo | Cheol-Woo Jo | Jianglin Wang

[1] Pedro Gómez Vilda,et al. Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors , 2004, IEEE Transactions on Biomedical Engineering.

[2] Dae-Hyun Kim,et al. Diagnosis of Pathological Speech Signals Using Wavelet Transform , 1998 .

[3] Tao Li,et al. Classification of pathological voice including severely noisy cases , 2004, INTERSPEECH.

[4] Steve Young,et al. The HTK book , 1995 .

[5] S. Narayanan,et al. A System for Automatic Detection of Pathological Speech , 2003 .

[6] R. Redner,et al. Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[7] Dae-Hyun Kim,et al. Screening of pathological voice from ARS using neural networks , 2001, MAVEBA.

[8] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9] Donald B. Rubin,et al. Max-imum Likelihood from Incomplete Data , 1972 .

[10] Vladimir Vapnik,et al. An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.