Paper Information - Feature Estimation for Vocal Fold Edema Detection Using Short-Term Cepstral Analysis

Feature Estimation for Vocal Fold Edema Detection Using Short-Term Cepstral Analysis

Digital signal processing techniques have been used for acoustic analysis in vocal quality assessment because the measurement procedures are simple and noninvasive. They are of special interest because they can provide an objective diagnosis of pathological voices and may be used as a complementary tool in laryngoscopic exams. Acoustic modeling of pathological voices is essential for discriminating between normal and pathological voices, and the reliability and effectiveness of that discrimination depend on appropriate acoustic feature extraction. This paper aims to specify and evaluate acoustic features for vocal fold edema through a parametric modeling approach based on the resonant structure of the human speech production mechanism, and a nonparametric approach related to the human auditory perception system. For this purpose, LPC coefficients, LPC-based cepstral coefficients, and mel-frequency cepstral coefficients are used. A distance classifier trained by vector quantization is used in the discrimination process.
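The parametric branch of the pipeline named in the abstract (LPC, LPC-based cepstral coefficients, and a vector-quantization distance classifier) can be illustrated with a minimal Python sketch. This is not the authors' implementation: all function names, the codebook size, and the iteration count are hypothetical, plain k-means stands in for whatever codebook design procedure the paper actually uses, and the mel-frequency branch is left out for brevity.

```python
import numpy as np

def lpc(frame, order):
    """LPC by the autocorrelation method with the Levinson-Durbin recursion.
    Returns predictor coefficients a[1..p] (s[n] ~ sum_k a[k] * s[n-k])
    and the final prediction error energy."""
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for model order i
        k = (r[i] - np.dot(a[1:i], r[i - 1:0:-1])) / err
        new_a = a.copy()
        new_a[i] = k
        new_a[1:i] = a[1:i] - k * a[i - 1:0:-1]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err

def lpc_to_cepstrum(a, n_ceps):
    """Cepstral coefficients c[1..n_ceps] of the all-pole model
    1 / (1 - sum_k a[k] z^-k), using the standard recursion."""
    p = len(a)
    c = np.zeros(n_ceps + 1)  # c[0] is unused
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]

def train_codebook(vectors, size, iters=20, seed=0):
    """Assumption: plain k-means as a stand-in for the codebook design step."""
    rng = np.random.default_rng(seed)
    codebook = vectors[rng.choice(len(vectors), size, replace=False)].copy()
    for _ in range(iters):
        d = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = d.argmin(axis=1)
        for j in range(size):
            members = vectors[nearest == j]
            if len(members):  # keep the old centroid if a cell is empty
                codebook[j] = members.mean(axis=0)
    return codebook

def avg_distortion(vectors, codebook):
    """Mean squared Euclidean distance from each vector to its nearest codeword."""
    d = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d.min(axis=1).mean()

def classify(vectors, codebooks):
    """Return the label whose codebook gives the lowest average distortion."""
    return min(codebooks, key=lambda label: avg_distortion(vectors, codebooks[label]))
```

In use, each voice recording would be split into short frames, each frame converted to a feature vector with `lpc` and `lpc_to_cepstrum`, one codebook trained per class (for example normal and edema), and an unknown recording assigned to the class whose codebook yields the smaller average distortion.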

Joseana Macêdo Fechine | Benedito G. Aguiar Neto | Silvana Cunha Costa | Menaka Muppa
